Senior Site Reliability Engineer

Vor 7 Tagen


Berlin, Deutschland On Ag Vollzeit

LOCATION: BERLIN

Technology
In short

In the dynamic landscape of On, the tech thrives much like a spirited runner: always moving, always improving. We are building technology that continues to supercharge the growth of On, helping to ignite the human spirit through movement. We're seeking a Site Reliability Engineer to ensure our digital platforms deliver exceptional performance, reliability, and scalability to support our global customer base.

As a Site Reliability Engineer (SRE) at On, you will play an important role in building and maintaining our cloud infrastructure to support our e-commerce platforms, customer-facing applications, and internal systems. You will work closely with engineering teams to improve reliability, optimize performance, and implement automation solutions.

Your Mission
  • System Reliability & Performance: Contribute to high availability (99.99%+ uptime), scalability, and performance of On's digital platforms through proactive optimization and robust infrastructure design.
  • Infrastructure Development: Build and maintain cloud-based infrastructure using Infrastructure-as-Code (IaC) tools.
  • Automation: Develop and implement automation solutions to streamline deployments, reduce toil, and enhance monitoring.
  • Incident Response: Lead incident resolution, perform troubleshooting, and root cause analyses towards minimizing downtime and improving system resilience.
  • Monitoring & Observability: Improve and maintain monitoring, logging, and alerting solutions to ensure proactive issue detection and resolution.
  • Collaboration: Partner with the SRE team and software engineers to identify opportunities, develop, and roll out major features.
  • Compliance & Security: Integrate security best practices into our systems and solutions.
Your story
  • Experience in site reliability engineering with a track record of managing complex, high-traffic systems.
  • Expertise in cloud platforms (GCP, AWS) and container orchestration (Kubernetes, GKE).
  • Proficiency in scripting and programming (e.g. in Python, Go) for automation and tooling.
  • Experience with CI/CD pipelines (ArgoCD, GitHub Actions) and IaC (Terraform).
  • Solid understanding of networking, load balancing, and DNS management.
  • Experience with observability and monitoring for cloud native environments.
  • Strong analytical skills with a proactive approach to resolving complex technical challenges.
  • Excellent communication skills, with the ability to explain technical concepts to diverse stakeholders.

Nice to Have:

  • Background with e-commerce platforms or high-traffic consumer applications.
  • Experience in platform engineering, dedicated to building solutions that enhance developer experience (DevEx) and boost software development efficiency.
  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
About the Team

You will join a skilled and dynamic team of cloud & site reliability engineers dedicated to transforming On's technological foundation. We are crafting scalable, resilient cloud solutions to power internal operations, enhance product performance, and support On's growth. As a key member of our team, you will shape our cloud infrastructure strategy, ensuring robust, efficient, and sustainable systems that drive innovation. Join us in Berlin, to make a lasting impact on On's digital future


Berlin

In a former 19th Century postal building by Berlin's river Spree, you'll find our future-focused, creative hub.

Köpenicker Str. 122,

10179

Germany

Gioia

HEAD OF OPERATIONS PROJECT PORTFOLIO MANAGEMENT

My journey has evolved for sure and, actually, change can lead to infinite possibilities. Those core values that were there when I joined are still here today and I really find them extraordinary every day.

Vivek

DIRECTOR OF ENGINEERING

We're closely connected to the whole business. Every day, we're making quick decisions to improve the customer experience. Because these live changes impact the whole business, we deeply respect individual opinion and what others think about the end customer experience.

Gioia

HEAD OF OPERATIONS PROJECT PORTFOLIO MANAGEMENT

My journey has evolved for sure and, actually, change can lead to infinite possibilities. Those core values that were there when I joined are still here today and I really find them extraordinary every day.

Vivek

DIRECTOR OF ENGINEERING

We're closely connected to the whole business. Every day, we're making quick decisions to improve the customer experience. Because these live changes impact the whole business, we deeply respect individual opinion and what others think about the end customer experience.

Gioia

HEAD OF OPERATIONS PROJECT PORTFOLIO MANAGEMENT

My journey has evolved for sure and, actually, change can lead to infinite possibilities. Those core values that were there when I joined are still here today and I really find them extraordinary every day.

Vivek

DIRECTOR OF ENGINEERING

We're closely connected to the whole business. Every day, we're making quick decisions to improve the customer experience. Because these live changes impact the whole business, we deeply respect individual opinion and what others think about the end customer experience.

What we offer

On is a place that is centered around growth and progress. We offer an environment designed to give people the tools to develop holistically – to stay active, to learn, explore and innovate. Our distinctive approach combines a supportive, team-oriented atmosphere, with access to personal self-care for both physical and mental well-being, so each person is led by purpose.

On is an Equal Opportunity Employer. We are committed to creating a work environment that is fair and inclusive, where all decisions related to recruitment, advancement, and retention are free of discrimination.

Build the better you

What to expect

We want to set everyone up for success, so here's the lowdown on how we hire. Our process is a two-way street – bringing you into our culture, while helping us learn how you think.

Our full process can last about eight weeks from application to offer, because we care about getting it right. These steps explain how we usually do things.

Before you get started, feel free to consider if you want to work with us. Strange question? Well, we give people a lot of space to navigate their day-to-day and that style isn't for everyone. We want you to be passionate about what you do and be sure this is the right fit. Because when skills and passion combine – it creates that 'Wow' moment.



  • Berlin, Berlin, Deutschland Glow Beauty On Demand Vollzeit

    About the opportunity We are seeking a Senior Site Reliability Engineer to join the Platform Engineering Domain in the AI Platform Team. The mission of Platform Engineering is to provide trusted, performant, self-service platforms that empower product teams to build 'the bank the world loves to use.' The AI Platform team contributes to this mission by...


  • Berlin, Deutschland Interhyp Gruppe Vollzeit

    Senior Site Reliability Engineer (m/w/d)Unser Antrieb ist es, Träume zu erfüllen. Für unsere Kund innen den vom eigenen Zuhause und für unsere Mitarbei tenden den Traum vom beruflichen Zuhause. Genau diese Leidenschaft hat uns zum größten Vermittler von privaten Bau finan zierungen in Deutschland gemacht und lässt uns auch niemals stillstehen. Wir...


  • Berlin, Berlin, Deutschland Kombo Vollzeit

    Senior Site Reliability Engineer (Database) @Kombo Berlin (On-site) · Full-timeTL;DRJoin Kombo as one of our first Database Reliability Engineer. You'll take ownership of our Postgres infrastructure, ensuring performance, scalability, and reliability as we grow.High impact, high autonomy, and the chance to shape Kombo's database reliability practices from...


  • State of Berlin, Deutschland N26 GmbH Vollzeit

    About the opportunityWe are seeking a Senior Site Reliability Engineer to join the Platform Engineering Domain in the AI Platform Team.The mission of Platform Engineering is to provide trusted, performant, self-service platforms that empower product teams to build 'the bank the world loves to use.' The AI Platform team contributes to this mission by creating...


  • Berlin, Berlin, Deutschland Vodafone Vollzeit

    Senior Expert Site Reliability Engineer (m/w/d) Stellen-ID: 274245Bei Vodafone arbeiten wir jeden Tag an einer besseren Zukunft. Für eine Welt, die besser vernetzt, inklusiver und nachhaltiger ist. Denn für uns ist Technologie nur so stark wie die Menschen, die sie nutzen. Sei dabei und lass uns gemeinsam die Welt von morgen gestalten.Was Dich...


  • Berlin, Berlin, Deutschland Glow Beauty On Demand Vollzeit

    About the opportunity We are seeking a Site Reliability Engineer to join the Observability group inside our Platform Engineering domain. Platform Engineering's goal is to provide easy to use, self-service platforms to enable other segments to easily build, deploy and monitor their business applications. And Observability's role in that part of the company...


  • Berlin, Berlin, Deutschland Hirefive Vollzeit 60.000 € - 120.000 € pro Jahr

     Site Reliability Engineer Our growing user base demands cheap, fast and highly available web hosting and we need youto make it possible Join us as a full-time Site Reliability Engineer. This position will offer you personal andprofessional development, startup insights, and the opportunity to be part of one of the mostinspiring deep-tech startups. You...


  • Berlin, Deutschland GEMA Vollzeit

    Für unser Team Solutions Engineering im Bereich GEMA Digital am Standort Berlin suchen wir zum nächstmöglichen Zeitpunkt eine/einen Senior Software Engineer / Site Reliability Engineer (m/w/d) in Vollzeit (40 Stunden/Woche). Du liebst Musik? Dann hilf mit, sie zu schützen! Bei der GEMA Digital gestalten wir die digitale Zukunft der Musiklizenzierung....


  • Berlin, Berlin, Deutschland IONOS SE Vollzeit

    Bei IONOS arbeitest Du bei dem führenden europäischen Anbieter von Cloud-Infrastruktur, Cloud-Services und Hosting-Dienstleistungen partnerschaftlich mit unterschiedlichen Teams zusammen. Wir bieten Dir eine Perspektive in einer der zukunftssichersten Branchen. Uns zeichnen offene Arbeitsstrukturen, Duz-Kultur und flache Hierarchien mit unvergleichlichem...


  • State of Berlin, Deutschland N26 GmbH Vollzeit

    About the opportunityWe are seeking a Senior Reliability Engineer to join the Platform Engineering Domain in the Scalability Team.The mission of Platform Engineering is to provide trusted, performant, self-service platforms that empower product teams to build "the bank the world loves to use." Scalability's part of this mission is to develop solutions for...