Site Reliability Engineer
1 week ago
Location: Hybrid - Cologne (Rheinauhafen) - 3 days in the office, 2 remote (Tue + Thu)
Team: Engineering · Reports to CTO

Keep the world awake: build reliability at scale

ilert helps thousands of DevOps & IT teams detect, fix, and communicate incidents faster. Our platform is mission-critical: customers rely on us 24/7 to keep their always-on businesses running. As a Site Reliability Engineer at ilert, you’ll own the reliability, performance, and scalability of our core platform across AWS, Kubernetes, Kafka, and more.

Tasks

Build & operate a highly available platform
• Run and evolve our AWS-based infrastructure
• Operate and optimize our self-managed Kafka and ClickHouse clusters and our observability stack
• Ensure resilience, disaster recovery, and capacity planning across the stack

Improve reliability & performance
• Build and maintain SLOs, SLIs, error budgets, and observability dashboards
• Debug production issues across layers (networking, Kubernetes, application, DB)
• Improve the performance of our ingestion pipeline

Automation & tooling
• Automate operations with Terraform, Helm, Kubernetes operators, and internal tooling
• Build tooling for safer deploys, blue/green rollouts, and automated verification
• Strengthen incident response workflows through deep collaboration with our AI SRE agent team

Security & compliance
• Implement best practices for workload isolation, secrets management, IAM, and auditability
• Support our ISO 27001 posture by automating controls and hardening our infrastructure

Cross-functional impact
• Partner with Backend, AI, and Product teams to design reliable services
• Participate in the on-call rotation
• Lead post-incident reviews and drive long-term reliability improvements

Requirements
• 3+ years of experience as an SRE, Platform Engineer, DevOps Engineer, or Infrastructure Engineer
• Strong hands-on experience with AWS, Kubernetes, Linux internals, networking, and performance tuning
• Experience operating self-managed distributed systems, ideally Kafka or ClickHouse
• Strong understanding of observability
• Experience automating infrastructure with Terraform and CI/CD systems
• Fluent English (our working language); German optional

Benefits
🚀 Product-centric - 100% focused on solving a mission-critical pain felt by every always-on business
🏡 Hybrid freedom - 2 days remote by default; gorgeous Rheinauhafen roof terrace when you’re in town
🕒 Focus > meetings - We time-box syncs, favour async docs and protect maker time
🌴 28 days off - plus public holidays
🚲 Commute perks - subsidised public transport

ilert is a SaaS company for alerting, on-call management, and status pages that helps companies operate always-on services and respond faster to incidents.
-
Site Reliability Engineer
5 days ago
Cologne Bonn Region, Germany · flaschenpost SE · Full-time
Exciting, up-and-coming, and damn fast: we are flaschenpost. We deliver the weekly grocery shop to our customers within 120 minutes. Together with over [...] colleagues, we are reinventing an entire industry. Since our founding in 2016, we have become an instant delivery service for beverages and groceries in nearly...
-
Network Security Engineer
4 days ago
Cologne, Germany · AlphaConsult Premium KG · Full-time
About us: The AlphaConsult Group, experts with 15 strong brands under one roof! As part of the AlphaConsult Group, AlphaConsult Premium KG specializes in high-quality staffing services for medium-sized companies and, with its strong range of services, covers all industries and occupational groups. Through a very...
-
Field Service Engineer
2 weeks ago
Cologne Bonn Region, Germany · Teledyne Technologies Incorporated · Full-time
Be visionary. Teledyne Technologies Incorporated provides enabling technologies for industrial growth markets that require advanced technology and high reliability. These markets include aerospace and defense, factory automation, air and water quality environmental monitoring, electronics design and development, oceanographic research, deepwater oil and gas...
-
Field Service Engineer
1 week ago
Cologne Bonn Region, Germany · Proclinical Staffing · Full-time
Keep labs running at their best: join our team as a Field Service Engineer and deliver expert support for cutting-edge diagnostic systems across Germany. Proclinical is seeking a Field Service Engineer to join a dynamic team in Germany. In this role, you will be responsible for the installation, maintenance, calibration, and repair of in-vitro diagnostics...
-
AI Product Engineer
2 days ago
Cologne, Germany · ilert GmbH · Full-time
Team: Product & Engineering • Reports to the CTO
Location: Hybrid - Cologne (Rheinauhafen) - 3 days in office, 2 days remote (Tue and Thu)
Shape the future of autonomous incident response
We’re on a mission to make downtime invisible. Thousands of DevOps and SRE teams rely on ilert to detect, resolve, and communicate incidents faster. As our first AI...
-
Artificial Intelligence Engineer
3 days ago
Cologne Bonn Region, Germany · AI Futures · Full-time
AI Engineer | DAX 40 | LLMs, RAG, NLP & Voice Agents | Cologne/Bonn (Hybrid)
AI Futures have been retained to appoint an AI Engineer for a €30bn+ DAX 40 organisation undergoing a large-scale transformation of its digital and AI capabilities. The company is building a next-generation AI platform, with a major focus on enterprise-wide conversational AI;...
-
Senior Azure Data Engineer
4 days ago
Cologne, Germany · novaCapta · Full-time
Azure as your playing field for innovation?
Senior Azure Data Engineer / Scientist (m/f/d) | Locations: Fürth, Augsburg, Cologne, Hamburg, Hanover, Munich, Seligenstadt or Stuttgart | Full- or part-time, permanent position | Starting immediately
As an Azure Data Engineer / Scientist (m/f/d), you will develop and...
-
Founding Sales Engineer
7 days ago
Cologne, Germany · ilert GmbH · Full-time
At ilert, we help companies keep their always-on digital services resilient and responsive. Our AI-first platform powers IT teams with comprehensive incident management capabilities to detect, respond to, and resolve incidents before customers ever notice. We’ve done the hardest part already: built something people love. Hundreds of customers have come on...
-
Cologne, Germany · Toyota Motorsport · Full-time
What we offer: Exciting projects and a place for technical freedom and innovation to get things moving. Attractive benefits packages, e.g. competitive remuneration, social benefits, 30 days annual holiday, car leasing, free on-site gym. A challenging, fulfilling workplace in a multi-national company within a familiar atmosphere - driven by fascination and...
-
Value Engineer
8 hours ago
Cologne, Germany · ONIQ GmbH · Full-time
About ONIQ: ONIQ is on a mission to transform manufacturing for a more sustainable and efficient future. With the IQA Lean Copilot, we are pioneering a new category in industrial tech - combining lean production principles with AI, process mining, and advanced data analytics. Our software empowers manufacturers to systematically identify inefficiencies,...