Site Reliability Engineering Lead
17 hours ago
Job ID: 481430
Posted since: 20-Oct-2025
Organization: Smart Infrastructure
Field of work: Customer Services
Company: Siemens AG
Experience level: Experienced Professional
Job type: Full-time
Work mode: Hybrid (Remote/Office)
Employment type: Permanent
Location(s): Karlsruhe, Baden-Württemberg, Germany
We are looking for a proactive and experienced Site Reliability Engineering Lead (f/m/d) with proven experience in SRE transformations to drive cloud operations and reliability engineering practices within our cloud ecosystem. This role is essential to ensuring the reliability, availability, security, and performance of our production environments. You will define and drive the CloudOps strategy, manage Level 3 operational support (incident triage, root cause analysis, and resolution), ensure our products meet their Service Level Agreements, and establish and monitor key reliability metrics.
What we offer you
- An attractive remuneration package
- Access to Siemens share plans
- 30 days of paid vacation and a variety of flexible work schedules that allow time off for you and your family
- 2 to 3 days of mobile working per week as a global standard
- Flexible training opportunities for both your professional and personal development that you can tailor to your interests
Our more than 300,000 team members each value different benefits, and we cannot list our entire benefits portfolio in this posting, so further information is provided separately.
The individual benefits are subject to regulatory, contractual, or corporate conditions.
How you'll make an impact
- Collaborate with all stakeholders to define the CloudOps and Site Reliability Engineering strategy and execution for cloud-hosted products.
- Manage Level 3 operational support, including incident triage, root cause analysis, and resolution, in close collaboration with R&D and service delivery teams (Bx tech support and cloud operations Level 1 and Level 2).
- Ensure the definition and tracking of Service Level Indicators (SLIs), Service Level Objectives (SLOs), Operational Level Agreements (OLAs), and Service Level Agreements (SLAs) to measure and improve system reliability and availability.
- Ensure KPIs such as uptime, latency, error rates, and incident resolution time are established and monitored.
- Operate and optimize cloud infrastructure with a focus on availability, performance, and cost-efficiency.
- Harmonize and enhance observability and alerting systems across the different products (e.g., CloudWatch, Datadog, Prometheus, Grafana).
- Lead Level 3 incident response and on-call coordination using PagerDuty, ensuring rapid mitigation and root cause analysis. Establish clear processes and escalation paths with L1 and L2.
- Collaborate with development and platform teams to ensure smooth deployment and operations.
- Maintain operational documentation and runbooks.
- Drive automation and continuous improvement in deployment, monitoring, and recovery processes.
Your defining qualities
Education
Master's degree in Computer Science, Engineering, or a related field.
Experience & Skills
- Long-term experience in CloudOps and Site Reliability Engineering.
- Long-term experience in managing operations for a SaaS product.
- Deep understanding of AWS services and cloud-native architectures.
- Profound experience with incident management and escalation processes (PagerDuty or similar).
- Proven experience with monitoring, logging, and alerting tools.
- Solid understanding of networking, security, and system administration in cloud environments.
- Experience with ITIL-based support processes and ticketing systems.
- Strong analytical and problem-solving skills.
Ways of working
On-call availability on weekends.
Languages
Excellent communication skills in English.
You are much more than your qualifications, and we believe in the potential of every single candidate. We look forward to getting to know you.
Your individual personality and perspective are important to us. We create a working environment that reflects the diversity of society and supports you in your personal and professional development. We want to get to know your authentic personality and create a better future together. As an equal-opportunity employer, we are happy to consider applications from individuals with disabilities.
About us
Ready to dive into the future?
At Siemens Smart Infrastructure Buildings, we focus on creating innovative solutions to make buildings smarter, safer, and more sustainable – in other words, simply better. Interested in being part of this journey? Join us and seize the opportunity to shape the future with us.