Data Engineer – ML Training Infrastructure

2 weeks ago


Munich, Bavaria, Germany SpAItial AI Full-time

SpAItial is pioneering the development of a frontier 3D foundation model, pushing the boundaries of AI, computer vision, and spatial computing. Our mission is to redefine how industries, from robotics and AR/VR to gaming and movies, generate and interact with 3D content.

We're seeking a Data Engineer to build the pipelines and infrastructure that fuel our large-scale model training. As the first engineer focused on data, you'll shape the backbone of how we handle terabytes of multimodal training data (images, video, and 3D). This role is ideal for someone who thrives at the intersection of data systems and machine learning: designing reliable, scalable, and efficient ways to get high-quality data into cutting-edge training runs.

Responsibilities

  • Architect and manage data infrastructure for large-scale ML training datasets (e.g., Apache Iceberg, Parquet, Spark).
  • Build and operate ingestion pipelines for multimodal data (e.g., images, videos, 3D).
  • Design data loaders, caching, and serving strategies optimized for ML training.
  • Partner closely with ML researchers to ensure infrastructure scales with training demands.
  • Uphold code quality and best practices in testing, CI/CD, and reproducibility.

Key Qualifications

  • 3+ years of professional software/data engineering experience with production systems.
  • Proven experience in large-scale data processing for ML training (not just analytics/BI).
  • Hands-on experience with distributed data frameworks (e.g., Spark, Beam, Cloud SQL, Airflow) and modern data formats (Parquet, Iceberg).
  • Proficiency in cloud platforms (AWS, GCP, or Azure).
  • Strong Python development skills, including testing and code quality.
  • Experience building and maintaining CI/CD pipelines.

Preferred Qualifications

  • Familiarity with ML frameworks (e.g., PyTorch, TensorFlow).
  • Experience preparing multimodal datasets (images, video, 3D) for ML pipelines.

At SpAItial, we are committed to creating a diverse and inclusive workplace. We welcome applications from people of all backgrounds, experiences, and perspectives. We are an equal opportunity employer and ensure all candidates are treated fairly throughout the recruitment process.

