Site Reliability Engineer

vor 4 Wochen


Kirchheim bei München, Deutschland NetApp Vollzeit

Title: Site Reliability Engineer

Location:

Bangalore, Karnataka, IN, 560071

Requisition ID: 126262

About NetApp

We’re forward-thinking technology people with heart. We make our own rules, drive our own opportunities, and try to approach every challenge with fresh eyes. Of course, we can’t do it alone. We know when to ask for help, collaborate with others, and partner with smart people. We embrace diversity and openness because it’s in our DNA. We push limits and reward great ideas. What is your great idea?

"At NetApp, we fully embrace and advance a diverse, inclusive global workforce with a culture of belonging that leverages the backgrounds and perspectives of all employees, customers, partners, and communities to foster a higher performing organization." -George Kurian, CEO

Job Summary

As a Cloud Infrastructure/Site Reliability Engineer, you will be operating at the intersection of development and operations. Your role will involve engaging in and enhancing the lifecycle of cloud services - from design through deployment, operation, and refinement. You will be responsible for maintaining these services by measuring and monitoring their availability, latency, and overall system health. 
You will play a crucial role in sustainably scaling systems through automation and driving changes that improve reliability and velocity. As part of your responsibilities, you will administer cloud-based environments that support our SaaS/IaaS offerings, which are implemented on a microservices, container-based architecture (Kubernetes).
In addition, you will oversee a portfolio of customer-centric cloud services (SaaS/IaaS), ensuring their overall availability, performance, and security. You will work closely with both NetApp and cloud service provider teams, including those from Google, located across the globe in regions such as RTP, Reykjavík, Bangalore, Sunnyvale, Redmond, and more.
Due to the critical nature of the services we support, this position involves participation in a rotation-based on-call schedule as part of our global team. This role offers the opportunity to work in a dynamic, global environment, ensuring the smooth operation of vital cloud services. To be successful in this role, you should be a motivated self-starter and self-learner, possess strong problem-solving skills, and be someone who embraces challenges.

Job Requirements

• Incident Response and Troubleshooting: Address and perform root cause analysis (RCA) of complex live production incidents and cross-platform issues involving OS, Networking, and Database in cloud-based SaaS/IaaS environments. Implement SRE best practices for effective resolution.
• Analysis, and Infrastructure Maintenance: Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Stackdriver, ElasticSearch, Grafana, and SolarWinds. Develop strategies to enhance system and application performance, availability, and reliability. In addition, maintain and monitor the deployment and orchestration of servers, docker containers, databases, and general backend infrastructure.
• Document system knowledge as you acquire it, create runbooks, and ensure critical system information is readily accessible.
• Security Management: Stay updated with security protocols and proactively identify, diagnose, and resolve complex security issues.
• Automation and Efficiency: Identify tasks and areas where automation can be applied to achieve time efficiencies and risk reduction. Develop software for deployment automation, packaging, and monitoring visibility.
• Issue Tracking and Resolution: Use Atlassian Jira, Google Buganizer, and Google IRM to track and resolve issues based on their priority.
• Team Collaboration and Influence: Work in tandem with other Cloud Infrastructure Engineers and developers to ensure maximum performance, reliability, and automation of our deployments and infrastructure. Additionally, consult and influence developers on new feature development and software architecture to ensure scalability.
• Debugging, Troubleshooting, and Advanced Support: Undertake debugging and troubleshooting of service bottlenecks throughout the entire software stack. Additionally, provide advanced tier 2 and 3 support for NetApp's Cloud Data Services solutions.
• Directly influence the decisions and outcomes related to solution implementation: measure and monitor availability, latency, and overall system health.
• Proficiency in Linux/Unix and CORE OS.
• Demonstrated experience in scripting and infrastructure automation using tools such as Ansible, Python, Go or Ruby.
• Deep working knowledge of Containers, Kubernetes, and Serverless computing implementation.
• DevOps development methodologies.
• Familiarity with distributed systems design patterns using tools such as Kubernetes.
• Experience with cloud platforms such as AWS, Azure, or Google Cloud.

Education

A minimum of 5- 8 years of experience is required. 

A Bachelor of Science Degree in Computer Science, a master’s degree; or equivalent experience is required. 

Did you know…
Statistics show women apply to jobs only when they’re 100% qualified. But no one is 100% qualified. We encourage you to shift the trend and apply anyway We look forward to hearing from you.

Why NetApp?

In a world full of generalists, NetApp is a specialist. No one knows how to elevate the world’s biggest clouds like NetApp. We are data-driven and empowered to innovate. Trust, integrity, and teamwork all combine to make a difference for our customers, partners, and communities. 

We expect a healthy work-life balance. Our volunteer time off program is best in class, offering employees 40 hours of paid time off per year to volunteer with their favorite organizations. We provide comprehensive medical, dental, wellness, and vision plans for you and your family. We offer educational assistance, legal services, and access to discounts. We also offer financial savings programs to help you plan for your future.

If you run toward knowledge and problem-solving, join us. 


Job Segment: Cloud, Computer Science, Linux, Unix, Database, Technology


  • Reliability Engineer

    vor 1 Woche


    Wolfratshausen nahe München, Deutschland EagleBurgmann Germany GmbH & Co. KG Vollzeit

    „We will wow your world!” Das ist unser Versprechen, wenn es um Arbeiten bei Freudenberg geht. Als globaler Technologiekonzern machen wir die Welt nicht nur sauberer, gesünder und komfortabler, sondern bieten unseren 52.000 Mitarbeitenden auch ein vernetztes und vielfältiges Arbeitsumfeld, in dem sich alle individuell entfalten können. Lassen Sie sich...

  • IT-Site Engineer

    vor 2 Monaten


    München, Deutschland ZEISS Gruppe Vollzeit

    Ihre Rolle Als IT-Site Engineer bei ZEISS kümmern Sie sich täglich vor Ort um die IT-Infrastruktur, verschiedene standortnahe IT-Services und Anforderungen der Endanwender sowie externer Dienstleister. Es macht Ihnen Freude, Kolleginnen und Kollegen im persönlichen Kontakt die bestmögliche Erfahrung im Umgang mit ihrem hochmodernen ZEISS IT-Equipment zu...

  • IT-Site Engineer

    vor 8 Stunden


    München, Deutschland ZEISS Gruppe Vollzeit

    IT-Site Engineer sind Sie ein wichtiger Faktor für den reibungslosen IT-Infrastruktur Betrieb am eingesetzten Standort:Sie erbringen den Vorort Service & Support für die gesamte IT-Infrastruktur im Endanwender-Umfeld am zugeordneten ZEISS-StandortSie bearbeiten eigenständig die gemeldeten Sie führen eigenständig Tätigkeiten in der IT-Infrastruktur...

  • IT-Site Engineer

    vor 2 Wochen


    München, Deutschland ZEISS Gruppe Vollzeit

    IT-Site Engineer sind Sie ein wichtiger Faktor für den reibungslosen IT-Infrastruktur Betrieb am eingesetzten Standort:Sie erbringen den Vorort Service & Support für die gesamte IT-Infrastruktur im Endanwender-Umfeld am zugeordneten ZEISS-StandortSie bearbeiten eigenständig die gemeldeten Sie führen eigenständig Tätigkeiten in der IT-Infrastruktur...

  • IT-Site Engineer

    vor 3 Wochen


    München, Deutschland ZEISS Gruppe Vollzeit

    IT-Site Engineer sind Sie ein wichtiger Faktor für den reibungslosen IT-Infrastruktur Betrieb am eingesetzten Standort:Sie erbringen den Vorort Service & Support für die gesamte IT-Infrastruktur im Endanwender-Umfeld am zugeordneten ZEISS-StandortSie bearbeiten eigenständig die gemeldeten Sie führen eigenständig Tätigkeiten in der IT-Infrastruktur...

  • Reliability Engineer

    vor 2 Monaten


    München, Deutschland Sonoma Consulting Inc. Vollzeit

    VP of Engineering (75% development, 25% leadership) ~ Full-time Halo Group is a premier provider of IT talent. We place technology experts within the teams of the world’s leading companies to help them build innovative businesses that keep them one step closer to their customers and one step ahead of the competition. We offer a meaningful work...


  • München, Deutschland Focus Cloud Vollzeit

    Position: Technical Lead DevOps Engineer - Home Office (M/F/D)Location: Remote/Home OfficeAbout Us:My client are a global organization in the realm of AI and digital transformation, centered around training & education. They design digital products, automate routine work and design digital solutions for their customers. They are pioneers in the use of...


  • München, Deutschland Focus Cloud Vollzeit

    Position: Technical Lead DevOps Engineer - Home Office (M/F/D)Location: Remote/Home OfficeAbout Us:My client are a global organization in the realm of AI and digital transformation, centered around training & education. They design digital products, automate routine work and design digital solutions for their customers. They are pioneers in the use of...


  • München, Deutschland Reply Deutschland SE Vollzeit

    Du unterstützt bei der Implementierung von Cloud-Architekturen sowie bei der Migration bestehender lokaler Architekturen zu AWS/Azure/GCP Zudem implementierst und automatisierst du CI/CD-Pipelines auf Basis von GitLab CI, GitHub Actions, CircleCI, etc. Das Umsetzen des Cloud Cost Engineering für Systemlandschaften gehört zu deinen Aufgaben Du...

  • Senior Software Engineer

    vor 2 Monaten


    München, Deutschland Celonis Vollzeit

    We are looking for a Senior Software Engineer working primarily on the backend (Java, Spring Boot, kubernetes), to build new features for our Cloud Extractors Suite team. Software Engineers at Celonis are responsible for their services end to end, starting from design and development to deployment and maintenance. You'll be an integral part of our team's...


  • München, Deutschland Reply Vollzeit

    Als Entwicklungspartner unterstützt Liquid Reply seine Kunden in Bezug auf Container Orchestrierung wie Kubernetes, Cloud Native Development-Ansätzen sowie bei der Konzeption und Migration komplexer und schnelllebiger Anwendungen. Mit dem Schwerpunkt auf Multi-/Hybrid-Cloud-Implementierung, Site Reliability Engineering und Day-2 Betrieb befähigt Liquid...


  • München, Deutschland Contabo GmbH Vollzeit

    Your creative fieldWe are looking for a full-time, permanent Junior Software Engineer to start as soon as possible. We live remote-first, but you have the freedom to choose whether you want to work hybrid or completely on-site due to your proximity to one of our locations. As our new Junior Software Engineer, you will have a direct impact on the future of...

  • Implementation Engineer

    vor 2 Monaten


    München, Deutschland Tracie Healthcare Solutions GmbH Vollzeit

    Implementation Engineer (m/f/d) Linz or Munich, full-time or part-time Tracie Healthcare Solutions GmbH is a start-up for software solutions in the medical field. It is a corporate venture of the leading laboratory product manufacturer Greiner Bio-One (GBO). Our software provides process optimization for the pre-analytical phase of laboratory diagnostics...


  • München, Deutschland Hoffmann Group Vollzeit

    Senior) Analytics Engineer (m/f/d) Our motivated Business Intelligence team is looking for a (Senior) Analytics Engineer (m/f/d) at our Munich location. In our team, we are responsible for Business Intelligence in the Sales & Marketing area. We communicate openly and transparently, value teamwork and personal development, and together create a...


  • München, Deutschland Hoffmann Group Vollzeit

    Senior) Analytics Engineer (m/f/d) Our motivated Business Intelligence team is looking for a (Senior) Analytics Engineer (m/f/d) at our Munich location. In our team, we are responsible for Business Intelligence in the Sales & Marketing area. We communicate openly and transparently, value teamwork and personal development, and together create a...


  • München, Deutschland Celonis Vollzeit

    The Team: Our team is responsible for building the Celonis’ end-to-end Task Mining solution . Task Mining is the technology that allows businesses to capture user interaction (desktop) data, so they can analyze how people get work done, and how they can do it even better. We own all the related components, e.g. the desktop client, the related backend...

  • Senior Software Engineer

    vor 2 Monaten


    München, Deutschland Celonis Vollzeit

    The Role: We are looking for a Senior Software Engineer working primarily on the backend (Java, Spring Boot, kubernetes), to build new features for our Cloud Extractors Suite team. Software Engineers at Celonis are responsible for their services end to end, starting from design and development to deployment and maintenance. You'll be an integral part...


  • Kirchheim bei München, Deutschland Applied Materials Vollzeit

    Applied Materials is the world market leader for special systems and manufacturing processes in semiconductor, electronics and display technology. We not only provide the technology that powers nearly every new chip and advanced display in the world, but also our innovations shape the technology of the future. ~33,000 employees worldwide work in research...


  • Kirchheim bei München, Deutschland Applied Materials Vollzeit

    Applied Materials is the world market leader for special systems and manufacturing processes in semiconductor, electronics and display technology. We not only provide the technology that powers nearly every new chip and advanced display in the world, but also our innovations shape the technology of the future. ~33,000 employees worldwide work in research...


  • München, Deutschland Reply Deutschland SE Vollzeit

    Aufgaben Der Aufbau und die Implementierung von Cloud-Infrastrukturen sowie die Migration bestehender lokaler Architekturen zu AWS/Azure/GCP zählen zu deinen Aufgaben Du implementierst und automatisierst CI/CD-Pipelines & GitOps basierend auf GitLab CI, GitHub Actions, CircleCI, etc. Außerdem unterstützt du bei der Migration von Anwendungen auf...