Aktuelle Jobs im Zusammenhang mit Senior Site Reliability Engineer - Berlin, Berlin - Xenon7

Senior Expert Site Reliability Engineer

vor 2 Wochen

Berlin, Berlin, Deutschland Vodafone Vollzeit

Senior Expert Site Reliability Engineer (m/w/d) Stellen-ID: Bei Vodafone arbeiten wir jeden Tag an einer besseren Zukunft. Für eine Welt, die besser vernetzt, inklusiver und nachhaltiger ist. Denn für uns ist Technologie nur so stark wie die Menschen, die sie nutzen. Sei dabei und lass uns gemeinsam die Welt von morgen gestalten.Was Dich erwartet:Du...
Senior Expert Site Reliability Engineer

vor 18 Stunden

Berlin, Berlin, Deutschland Vodafone Vollzeit

Senior Expert Site Reliability Engineer (m/w/d) Stellen-ID: 274245Bei Vodafone arbeiten wir jeden Tag an einer besseren Zukunft. Für eine Welt, die besser vernetzt, inklusiver und nachhaltiger ist. Denn für uns ist Technologie nur so stark wie die Menschen, die sie nutzen. Sei dabei und lass uns gemeinsam die Welt von morgen gestalten.Was Dich...
Senior Site Reliability Engineer

vor 2 Wochen

Berlin, Berlin, Deutschland Kombo Vollzeit

Senior Site Reliability Engineer (Database) @Kombo Berlin (On-site) · Full-timeTL;DRJoin Kombo as one of our first Database Reliability Engineer. You'll take ownership of our Postgres infrastructure, ensuring performance, scalability, and reliability as we grow.High impact, high autonomy, and the chance to shape Kombo's database reliability practices from...
Site Reliability Engineer

Vor 4 Tagen

Berlin, Berlin, Deutschland IONOS SE Vollzeit

Bei IONOS arbeitest Du bei dem führenden europäischen Anbieter von Cloud-Infrastruktur, Cloud-Services und Hosting-Dienstleistungen partnerschaftlich mit unterschiedlichen Teams zusammen. Wir bieten Dir eine Perspektive in einer der zukunftssichersten Branchen. Uns zeichnen offene Arbeitsstrukturen, Duz-Kultur und flache Hierarchien mit unvergleichlichem...
Senior Site Reliability Engineer

vor 1 Woche

Berlin, Berlin, Deutschland Scout24 SE Vollzeit

Why Scout24?Scout24 is home of ImmoScout24, Germany's #1 for real estate. With ImmoScout24 we have been revolutionizing the real estate market in Germany and Austria for more than 25 years. Our goal is to build a digital ecosystem that brings homeowners, seekers, and agents together. Finding the right home and property is one of the most important decisions...
Site Reliability Engineer

Vor 4 Tagen

Berlin, Berlin, Deutschland IONOS Vollzeit

Bei IONOS arbeitest Du bei dem führenden europäischen Anbieter von Cloud-Infrastruktur, Cloud-Services und Hosting-Dienstleistungen partnerschaftlich mit unterschiedlichen Teams zusammen. Wir bieten Dir eine Perspektive in einer der zukunftssichersten Branchen. Uns zeichnen offene Arbeitsstrukturen, Duz-Kultur und flache Hierarchien mit unvergleichlichem...
Site Reliability Engineer

vor 1 Woche

Berlin, Berlin, Deutschland Ionos En Vollzeit

At IONOS, the leading European provider of cloud infrastructure, cloud services and hosting services, you will work together with a wide range of teams. We are characterized by open structures, a friendly working culture and flat hierarchies with a strong team spirit. We firmly believe that work and fun are compatible, and offer you the right environment...
Site Reliability Engineer

vor 2 Wochen

Berlin, Berlin, Deutschland Blackfluo Vollzeit

Job DescriptionLocation: Full remote, EU timezone (CET +/- 2 hours)Start Date: As soon as possibleLanguages: English requiredWe are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring the reliability, performance, and scalability of...
Site Reliability Engineer

vor 1 Woche

Berlin, Berlin, Deutschland Wire Vollzeit

Who We AreWe are looking for a Site Reliability Engineer to complement our Customer Operations Team. In this role, you will support our customers as they deploy our product and it's dependencies. To accomplish this, you will build, improve and manage our automation and deployment infrastructure, to ensure the reliability and resilience of our product for our...
Site Reliability Engineer

vor 1 Woche

Berlin, Berlin, Deutschland 1KOMMA5˚ Vollzeit

1KOMMA5°We are looking for you as an addition to our tech-team in Berlin, Munich or Hamburg. 1KOMMA5° is building Germany's largest one-stop-shop for sale, installation and services related to solar, heat pumps, electricity and charging infrastructure. And they are all connected Be a part of our missionBecome a part of our mission and learn about our...

Senior Site Reliability Engineer

vor 3 Wochen

Berlin, Berlin, Deutschland Xenon7 Vollzeit

About us:

Where elite tech talent meets world-class opportunities

At Xenon7, we work with leading enterprises and innovative startups on exciting, cutting-edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and on-demand resources allows us to partner with clients on transformative initiatives, driving innovation and business growth. Whether it's empowering global organizations or collaborating with trailblazing startups, we are committed to delivering advanced, impactful solutions that meet today's most complex challenges.

About the Client:

Join one of Egypt's premier financial institutions, renowned for its extensive suite of banking services, including Institutional Banking, Personal Banking, and Islamic Banking. With a global presence through over 50 branches and correspondents, we serve a diverse and dynamic clientele. As we embark on a groundbreaking digital transformation journey, we are committed to leveraging the latest technologies to establish a state-of-the-art data architecture that will redefine our performance and service delivery.

Position Overview

The Senior Site Reliability Engineer is a technical leadership role responsible for designing, implementing, and maintaining highly available, scalable, and secure infrastructure for critical banking applications, including Mobile Banking and Internet Banking platforms on on-premise infrastructure. This role leads SRE initiatives, mentors junior engineers, drives continuous improvement in production support, and leads observability strategy using OpenShift, Kubernetes, Prometheus, Grafana, and ELK Stack on on-premise data center infrastructure.

Key Responsibilities

· Design and architect highly available and scalable OpenShift/Kubernetes infrastructure for banking applications on on-premise servers

· Lead and implement comprehensive monitoring and observability strategy using Prometheus and Grafana

· Design and oversee centralized logging infrastructure using ELK Stack (Elasticsearch, Logstash, Kibana)

· Lead SRE best practices implementation and adoption of production support standards across teams

· Mentor and coach junior SRE and DevOps engineers on OpenShift, Kubernetes, monitoring, and production support

· Define and implement Service Level Indicators (SLIs), Objectives (SLOs), and Agreements (SLAs) with measurable metrics

· Lead incident response strategy, post-incident reviews, and drive continuous improvement in production stability

· Architect and implement advanced alerting, monitoring dashboards, and visualization strategies using Prometheus and Grafana

· Design automation frameworks and tools to reduce operational toil and improve production efficiency

· Lead OpenShift/Kubernetes cluster upgrades, security patches, and infrastructure modernization on-premise

· Establish production support procedures, on-call rotation policies, and escalation frameworks

· Optimize system performance, cost, and resource utilization across containerized on-premise infrastructure

· Conduct capacity planning, performance optimization, and infrastructure scaling initiatives

· Lead technical architecture reviews and infrastructure design decisions for banking applications

· Manage on-premise data center resources and infrastructure planning

· Participate in 24/7 on-call rotation and escalation for critical production incidents

· Ensure compliance, security hardening, and disaster recovery procedures for financial systems

Qualifications

· BSc in Computer Science, Information Technology, Software Engineering, or related field

· years of hands-on SRE, DevOps, or Production Engineering experience

· years of experience leading SRE teams or managing production support operations

· years of hands-on experience managing OpenShift and Kubernetes infrastructure on on-premise infrastructure

· Expert-level experience with Prometheus for monitoring and alerting in production

· Expert-level experience with Grafana for creating comprehensive monitoring dashboards

· Advanced experience with ELK Stack (Elasticsearch, Logstash, Kibana) for logging and log analysis

· Proven experience designing and scaling production systems for high-traffic banking applications

· Deep expertise in Linux/Unix system administration and container networking

· Advanced knowledge of CI/CD automation and deployment strategies

· Hands-on experience with database management, tuning, and optimization on-premises

· Strong experience with infrastructure automation and Infrastructure as Code

· Proven 24/7 production support experience in mission-critical environments

· Experience managing on-premise data center infrastructure

· Proven leadership skills and ability to mentor junior engineers

· Excellent communication skills and ability to present to executive stakeholders

· Experience in financial services or banking sector is highly preferred

Amerika

Europa

Asien / Ozeanien

Afrika

Aktuelle Jobs im Zusammenhang mit Senior Site Reliability Engineer - Berlin, Berlin - Xenon7

Senior Site Reliability Engineer