Site Reliability/devops

vor 4 Wochen

Berlin, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit

Experience supporting cloud systems or other services. Proficient troubleshooting and anticipating problems that affect the performance, reliability, or availability of software systems
- Proficient executing standard operating procedures and following operational best practices

Your responsibilities will encompass overseeing the launch of the ESC in 2025, working closely with global AWS teams, and influencing the evolution of AWS services and technology. A typical day in this role involves collaborating with technology leaders, contributing to the enhancement of day-to-day operations, and ensuring improvements in availability, reliability, latency, performance, and efficiency of the ESC.
The overarching goal is to deliver scalable services and ensure a high-availability experience for EU customers. If you are an experienced professional ready for a challenging and impactful opportunity, we invite you to join our efforts in building a best-in-class development engineering and operations team that aligns with AWS' commitment to customer satisfaction and continual innovation.
European Sovereign Cloud (ESC) is a part of AWS Utility Computing (UC).
AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS. Within AWS UC, Managed Operations engineers engage with AWS customers who require specialized security solutions for their cloud services.
A day in the life
You’ll spend a majority of your time operating and improving one of the largest software systems. Over the course of a week, you will review the operational health of the services in your team’s care, and as soon as you figure out why there was an anomaly, you write up an actionable bug report. As a responsible engineer, you’ve learned never to make changes to production systems without a plan, so you reviewed then executed changes following a change management process to one of the production systems in your care. Later in the week, you help to resolve your team’s backlog of operational issues. You round off the week by writing a cool script that you shared with your team which helps get to root cause faster of a hard problem that you diagnosed earlier.
You will be required to occasionally participate in an “on-call” rotations to resolve incidents occurring out-of-hours.
Eligibility requirements
- Fluency in written and spoken English is required
- Successful applicants must have the legal right to work in Germany
- Amazon will provide relocation support for successful applicants relocating within the European Union

About the team
Diverse Experiences
Why AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why flexible work hours and arrangements are part of our culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
- Experience working cross-organizationally and leading strategic team efforts requiring work from multiple team members
- Experience with Infrastructure as Code, (such as CDK, CloudFormation, Puppet, Chef, Ansible, or similar)

m/w/d

Site Reliability Engineer

vor 2 Monaten

Berlin, Deutschland THRYVE Vollzeit

Job Overview:As a Mid-Level Site Reliability Engineer, you will play a crucial role in ensuring the availability, performance, and reliability of our systems hosted on Google Cloud Platform (GCP). You will collaborate with engineering teams to design, deploy, and maintain scalable infrastructure, automate workflows, and troubleshoot production issues to...
(Senior) Site Reliability Engineer

vor 7 Monaten

Berlin, Deutschland Paymenttools Vollzeit

Reliability ist dein zweiter Vorname?Wir bei Paymenttools, einer Tochtergesellschaft der REWE Group, revolutionieren die Zahlungslandschaft. Von Apple Pay bis PayPal - wir haben es uns zur Aufgabe gemacht, digitale Transaktionen in ganz Europa und darüber hinaus zu vereinfachen und zu sichern. Unser Mantra: #wesolvepayn. Wir verbinden modernste Technologie...
(Senior) Site Reliability Engineer

vor 7 Monaten

Berlin, Deutschland Paymenttools Vollzeit

Reliability ist dein zweiter Vorname? Wir bei Paymenttools, einer Tochtergesellschaft der REWE Group, revolutionieren die Zahlungslandschaft. Von Apple Pay bis PayPal - wir haben es uns zur Aufgabe gemacht, digitale Transaktionen in ganz Europa und darüber hinaus zu vereinfachen und zu sichern. Unser Mantra: #wesolvepayn. Wir verbinden modernste Technologie...
(Senior) Site Reliability Engineer

vor 7 Monaten

Berlin, Deutschland Paymenttools Vollzeit

Reliability ist dein zweiter Vorname? Wir bei Paymenttools, einer Tochtergesellschaft der REWE Group, revolutionieren die Zahlungslandschaft. Von Apple Pay bis PayPal - wir haben es uns zur Aufgabe gemacht, digitale Transaktionen in ganz Europa und darüber hinaus zu vereinfachen und zu sichern. Unser Mantra: #wesolvepayn. Wir verbinden modernste Technologie...
Site-Sicherheitsingenieur für Zahlungssysteme

vor 1 Monat

Berlin, Berlin, Deutschland Paymenttools Vollzeit

Beschreibung der PositionWir suchen einen erfahrenden Site Reliability Engineer, der unsere Zahlungssysteme zuverlässig, skalierbar, beobachtbar und sicher macht. Als SRE wirst du mit Produktteams zusammenarbeiten, um Infrastruktur, Tools und Prozesse zu entwickeln, zu implementieren und zu warten, die unsere geschäftskritischen Zahlungsanwendungen und...
Site Reliability Engineer and Microservices Expert

vor 1 Monat

Berlin, Berlin, Deutschland EGYM GmbH Vollzeit

About the RoleWe are seeking a skilled Site Reliability Engineer to join our international team in Munich or Berlin.The ideal candidate will be experienced in working with Cloud Providers (GCP, AWS), Microservices and Container Orchestration. Proficient coding skills in at least one or more programming languages (preferably Go or Java) and a solid...
Site Reliability Engineer

Vor 4 Tagen

Berlin, Deutschland STACKIT Vollzeit

Auch das Onboarding neuer Cloud-Nutzer und die Unterstützung bei Cloud-Migrationen fällt in diesen Bereich. Du verfügst über gute Kenntnisse in einer der folgenden Programmiersprachen: C, C++, Go (Golang), Rust, Java Du hast ein grundlegendes Verständnis der Prinzipien des Site Reliability Engineering (SRE), wie zum Beispiel Monitoring, Alarmierung,...
Senior Director

vor 2 Monaten

Berlin, Deutschland Delivery Hero Vollzeit

Job DescriptionWe are on the lookout for a Senior Director, Site Reliability Engineering (all genders) tolead our global Site Reliability Engineering (SRE) department. The department is part ofthe Tech Foundations tribe, with the mission to increase the leverage of DeliveryHero’s engineering organisations by reducing complexity through...
Site Reliability Engineer

vor 7 Monaten

Berlin, Deutschland BestSecret Vollzeit

About BestSecret Group We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for our members. With almost 100 years of experience behind us, and a major tech...
Sre / DevOps

Vor 3 Tagen

Berlin, Deutschland Deta Vollzeit

At Deta we are trying to push the limits of two critical technologies — personal and cloud computing — for developers around the world. As part of our platform engineering team, you will improve upon and ensure reliability, availability and scalability across all Deta products and services. You will also be responsible for working together with our...
Site Reliability Engineer

vor 4 Wochen

Berlin, Deutschland Solactive AG Vollzeit

Company Description Since its creation in 2007 in the financial city of Frankfurt am Main, Solactive AG has grown to one of the key players in the indexing space. The German multi-asset index provider focusses on tailor-made indices, offering to its clients a faster service, with greater flexibility and at a reasonable cost. Solactive AG and its subsidiaries...
Site Reliability Engineer

vor 4 Wochen

Berlin, Deutschland Solactive AG Vollzeit

Company Description Since its creation in 2007 in the financial city of Frankfurt am Main, Solactive AG has grown to one of the key players in the indexing space. The German multi-asset index provider focusses on tailor-made indices, offering to its clients a faster service, with greater flexibility and at a reasonable cost. Solactive AG and its subsidiaries...
Site Reliability Engineer

vor 4 Wochen

Berlin, Deutschland Solactive AG Vollzeit

Company Description Since its creation in 2007 in the financial city of Frankfurt am Main, Solactive AG has grown to one of the key players in the indexing space. The German multi-asset index provider focusses on tailor-made indices, offering to its clients a faster service, with greater flexibility and at a reasonable cost. Solactive AG and its subsidiaries...
Site Reliability Engineer

vor 4 Wochen

Berlin, Deutschland Solactive AG Vollzeit

Company Description Since its creation in 2007 in the financial city of Frankfurt am Main, Solactive AG has grown to one of the key players in the indexing space. The German multi-asset index provider focusses on tailor-made indices, offering to its clients a faster service, with greater flexibility and at a reasonable cost. Solactive AG and its subsidiaries...
DevOps Engineer

vor 1 Monat

Berlin, Deutschland TechNET CxO Vollzeit

DevOps Engineer Location: Berlin, Germany (Hybrid, three days in the office) Salary: Up to €120,000 per annumPosition: Full-time, Permanent Are you ready to transform healthcare through technology?Join my client, a leading organisation in the health sector, on a mission to make care more accessible and effective through innovative, technology-driven...
Principal DevOps Specialist

vor 2 Monaten

Berlin, Berlin, Deutschland Meisterwerk Vollzeit

We're seeking a highly skilled Principal DevOps Specialist to join our team at Meisterwerk. As the DevOps lead, you will be responsible for designing, implementing, and maintaining our infrastructure and deployment processes. Your expertise with cloud, Infrastructure as Code, and container orchestration will ensure the stability, security, and scalability of...
Senior Site Reliability Engineer

vor 4 Monaten

Berlin, Deutschland MongoDB Vollzeit

MongoDB’s mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB...
Site Reliability Engineer

vor 1 Woche

Berlin, Deutschland STACKIT Vollzeit

Du willst mit uns STACKITEERs die Cloud-Welt im Sturm erobern und mit uns die Zukunft Europas gestalten? Prima! Dann bist du bei STACKIT genau richtig. Unsere Vision ist ambitioniert: Ein unabhängiges Europa - digital, führend. Als Cloud- und Colocation-Provider bauen wir die sichere Infrastruktur dafür. Mit unseren Serverstandorten ausschließlich in...
Site Reliability Engineer

vor 3 Monaten

Berlin, Deutschland Schwarz Dienstleistungen Vollzeit

Du willst mit uns STACKITEERs die Cloud-Welt im Sturm erobern und mit uns die Zukunft Europas gestalten? Prima! Dann bist du bei STACKIT genau richtig. Unsere Vision ist ambitioniert: Ein unabhängiges Europa - digital, führend. Als Cloud- und Colocation-Provider bauen wir die sichere Infrastruktur dafür. Mit unseren Serverstandorten ausschließlich in...
Site Reliability Engineer

vor 3 Monaten

Berlin, Deutschland Schwarz Dienstleistungen Vollzeit

Du willst mit uns STACKITEERs die Cloud-Welt im Sturm erobern und mit uns die Zukunft Europas gestalten? Prima! Dann bist du bei STACKIT genau richtig. Unsere Vision ist ambitioniert: Ein unabhängiges Europa - digital, führend. Als Cloud- und Colocation-Provider bauen wir die sichere Infrastruktur dafür. Mit unseren Serverstandorten ausschließlich in...

Amerika

Europa

Asien / Ozeanien

Afrika

Site Reliability/devops