Site Reliability Engineer

vor 3 Wochen

Berlin, Deutschland Nooxit Vollzeit

Full-time (40 h), as soon as possible, permanent and based in Berlin or remotely in home office.

We’re seeking an experienced Site Reliability Engineer (SRE) with a solid foundation in Python, a passion for performance optimization, and a proactive approach to infrastructure management. In this role, you’ll work closely with development and operations teams to maintain, monitor, and improve the reliability of our systems, leveraging cutting-edge tools and methodologies to ensure peak performance.

Tasks

Design, implement, and optimize systems to improve the reliability, performance, and scalability of our services.
Build and maintain observability solutions using tools like Jaeger, Prometheus, and Grafana to enhance monitoring, tracing, and alerting across applications.
Collaborate with development teams to build, manage, and scale Kubernetes environments, ensuring high availability and robust service delivery.
Develop automation scripts and tools in Python to enhance system reliability and reduce manual intervention.
Diagnose and resolve incidents, conduct root-cause analysis, and implement measures to prevent recurrence.
Participate in on-call rotations, ensuring rapid response to system issues while continuously improving incident management processes.

Requirements

Proficiency in Python for scripting and automation.
Experience with tracing tools such as Jaeger or similar to troubleshoot and monitor complex distributed systems.
Experience with monitoring tools such as Prometheus or similar for collecting and alerting on metrics.
Experience with dashboarding tools such as Grafana or similar for creating visualizations that aid in system monitoring and diagnostics.
Experience working in Kubernetes environments, with an understanding of container orchestration, scaling, and resource management.

Preferred Qualifications (Optional):

Hands-on experience with CI/CD pipelines and DevOps practices.
Familiarity with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code tools like OpenTofu.

Benefits

Competitive salary
Flexible work hours and remote work opportunities.
A beautiful Gather remote office
An ambitious and helpful team
Opportunity to work with cutting-edge technologies and make a significant impact in a fast-growing startup environment

Are you interested?

Then apply right now by sending your CV If available, please include a Github link. A cover letter is not necessary.

If you have any questions, please contact us or just give us a call

Site Reliability Engineer

vor 1 Monat

Berlin, Berlin, Deutschland Schwarz Dienstleistungen Vollzeit

Schwarz Dienstleistungen sucht einen Site Reliability Engineer mit folgenden Qualifikationen:• Gute Kenntnisse in einer der folgenden Programmiersprachen: C, C++, Go (Golang), Rust, Java• Grundlegendes Verständnis der Prinzipien des Site Reliability Engineering (SRE), wie zum Beispiel Monitoring, Alarmierung, Fehlerbudgets, Fehleranalysen oder anderer...
Site Reliability Engineer

vor 3 Wochen

Berlin, Berlin, Deutschland Paymenttools Vollzeit

Stell dir vor, du bist ein wichtiger Teil des Teams, das die Zahlungssysteme von Paymenttools sicher und zuverlässig macht.Wir suchen nach erfahrener Site Reliability Engineer, der unsere Systeme optimalisiert, die Zuverlässigkeit erhöht und sicherstellt, dass unsere Zahlungssysteme immer verfügbar sind. Du wirst mit den Produktteams zusammenarbeiten, um...
Site Reliability Expert

vor 2 Wochen

Berlin, Berlin, Deutschland EGYM GmbH Vollzeit

About the RoleWe're seeking a highly skilled Site Reliability Engineer to join our team at EGYM GmbH. As a Site Reliability Engineer, you'll play a key role in ensuring the reliability and scalability of our cloud services.You'll be responsible for monitoring the availability and latency of our services, troubleshooting incidents, and collaborating with...
Site Reliability Engineer

vor 3 Wochen

Berlin, Deutschland THRYVE Vollzeit

Job Overview:As a Mid-Level Site Reliability Engineer, you will play a crucial role in ensuring the availability, performance, and reliability of our systems hosted on Google Cloud Platform (GCP). You will collaborate with engineering teams to design, deploy, and maintain scalable infrastructure, automate workflows, and troubleshoot production issues to...
Site Reliability Engineer and Microservices Expert

Vor 4 Tagen

Berlin, Berlin, Deutschland EGYM GmbH Vollzeit

About the RoleWe are seeking a skilled Site Reliability Engineer to join our international team in Munich or Berlin.The ideal candidate will be experienced in working with Cloud Providers (GCP, AWS), Microservices and Container Orchestration. Proficient coding skills in at least one or more programming languages (preferably Go or Java) and a solid...
(Senior) Site Reliability Engineer

vor 6 Monaten

Berlin, Deutschland Paymenttools Vollzeit

Reliability ist dein zweiter Vorname?Wir bei Paymenttools, einer Tochtergesellschaft der REWE Group, revolutionieren die Zahlungslandschaft. Von Apple Pay bis PayPal - wir haben es uns zur Aufgabe gemacht, digitale Transaktionen in ganz Europa und darüber hinaus zu vereinfachen und zu sichern. Unser Mantra: #wesolvepayn. Wir verbinden modernste Technologie...
(Senior) Site Reliability Engineer

vor 6 Monaten

Berlin, Deutschland Paymenttools Vollzeit

Reliability ist dein zweiter Vorname? Wir bei Paymenttools, einer Tochtergesellschaft der REWE Group, revolutionieren die Zahlungslandschaft. Von Apple Pay bis PayPal - wir haben es uns zur Aufgabe gemacht, digitale Transaktionen in ganz Europa und darüber hinaus zu vereinfachen und zu sichern. Unser Mantra: #wesolvepayn. Wir verbinden modernste Technologie...
(Senior) Site Reliability Engineer

vor 6 Monaten

Berlin, Deutschland Paymenttools Vollzeit

Reliability ist dein zweiter Vorname? Wir bei Paymenttools, einer Tochtergesellschaft der REWE Group, revolutionieren die Zahlungslandschaft. Von Apple Pay bis PayPal - wir haben es uns zur Aufgabe gemacht, digitale Transaktionen in ganz Europa und darüber hinaus zu vereinfachen und zu sichern. Unser Mantra: #wesolvepayn. Wir verbinden modernste Technologie...
Site Reliability Engineer

vor 3 Wochen

Berlin, Deutschland Nooxit Vollzeit

Full-time (40 h), as soon as possible, permanent and based in Berlin or remotely in home office. We’re seeking an experienced Site Reliability Engineer (SRE) with a solid foundation in Python, a passion for performance optimization, and a proactive approach to infrastructure management. In this role, you’ll work closely with development and operations...
Chief Site Reliability Officer

vor 3 Wochen

Berlin, Berlin, Deutschland Delivery Hero Vollzeit

OverviewDelivery Hero is a leading global food delivery marketplace, and we are seeking an experienced Chief Site Reliability Officer to lead our Site Reliability Engineering (SRE) department. This role will be based in Berlin, Germany, and will report to the leader of Developer Platform.
Site Reliability Engineer

vor 1 Monat

Berlin, Deutschland Schwarz IT Vollzeit

h1> Site Reliability Engineer - Platform Engineering - STACKIT Standort: Berlin Abteilung: IT - Cloud Services Level: Berufserfahrene Referenznummer: 42252-de_DE Du willst mit uns STACKITEERs die Cloud-Welt im Sturm erobern und mit uns die Zukunft Europas gestalten? Auch das Onboarding neuer Cloud-Nutzer und die Unterstützung bei...
Site Reliability Engineer

vor 3 Wochen

Berlin, Deutschland Schwarz IT Vollzeit

Site Reliability Engineer - Platform Engineering - STACKIT Standort: Berlin Abteilung: IT - Cloud Services Level: Berufserfahrene Referenznummer: 42252-de_DE Du willst mit uns STACKITEERs die Cloud-Welt im Sturm erobern und mit uns die Zukunft Europas gestalten? Prima! Dann bist du bei STACKIT genau richtig. Unsere Vision ist ambitioniert: Ein unabhängiges...
Site Reliability Engineer

vor 2 Wochen

Berlin, Berlin, Deutschland Nooxit Vollzeit

System Reliability Expert Sought for Cutting-Edge StartupNooxit is looking for an experienced Site Reliability Engineer to join our team in Berlin or remotely. The ideal candidate will have a solid foundation in Python, a passion for performance optimization, and a proactive approach to infrastructure management.The successful applicant will work closely...
Site Reliability Engineer

vor 3 Wochen

Berlin, Deutschland All the Top Bananas Vollzeit

Site Reliability Engineer - Platform Engineering - STACKITStandort: Berlin Abteilung: IT - Cloud Services Level: Berufserfahrene Referenznummer: 42252-de_DE Du willst mit uns STACKITEERs die Cloud-Welt im Sturm erobern und mit uns die Zukunft Europas gestalten? Prima! Dann bist du bei STACKIT genau richtig. Unsere Vision ist ambitioniert: Ein unabhängiges...
Site Reliability Engineer

vor 1 Monat

Berlin, Deutschland Schwarz IT Vollzeit

Site Reliability Engineer - Platform Engineering - STACKIT Standort: Berlin Abteilung: IT - Cloud Services Level: Berufserfahrene Referenznummer: 42252-de_DE Du willst mit uns STACKITEERs die Cloud-Welt im Sturm erobern und mit uns die Zukunft Europas gestalten? Prima! Dann bist du bei STACKIT genau richtig. Unsere Vision ist...
Site Reliability Engineer

vor 23 Stunden

Berlin, Deutschland Solactive AG Vollzeit

Company Description Since its creation in 2007 in the financial city of Frankfurt am Main, Solactive AG has grown to one of the key players in the indexing space. The German multi-asset index provider focusses on tailor-made indices, offering to its clients a faster service, with greater flexibility and at a reasonable cost. Solactive AG and its subsidiaries...
Site Reliability Engineer

vor 21 Stunden

Berlin, Deutschland Solactive AG Vollzeit

Company Description Since its creation in 2007 in the financial city of Frankfurt am Main, Solactive AG has grown to one of the key players in the indexing space. The German multi-asset index provider focusses on tailor-made indices, offering to its clients a faster service, with greater flexibility and at a reasonable cost. Solactive AG and its subsidiaries...
Site Reliability Engineer

vor 6 Monaten

Berlin, Deutschland BestSecret Vollzeit

About BestSecret Group We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for our members. With almost 100 years of experience behind us, and a major tech...
Site-Sicherheitsingenieur für Zahlungssysteme

Vor 5 Tagen

Berlin, Berlin, Deutschland Paymenttools Vollzeit

Beschreibung der PositionWir suchen einen erfahrenden Site Reliability Engineer, der unsere Zahlungssysteme zuverlässig, skalierbar, beobachtbar und sicher macht. Als SRE wirst du mit Produktteams zusammenarbeiten, um Infrastruktur, Tools und Prozesse zu entwickeln, zu implementieren und zu warten, die unsere geschäftskritischen Zahlungsanwendungen und...
Director of Site Reliability Engineering

vor 3 Wochen

Berlin, Deutschland Wikimedia Foundation Vollzeit

**Director of Site Reliability Engineering** We are strengthening the team and looking for a Director of Site Reliability Engineering (SRE) to lead our staff and ensure teams achieve our goals, towards our mission of providing the essential infrastructure for free knowledge. Wikimedia Foundation's SRE teams are responsible for ensuring our global top-10 web...

Amerika

Europa

Asien / Ozeanien

Afrika

Site Reliability Engineer