Applied ML Engineer
1 week ago
At Pruna, we're on a mission to make AI more efficient to build a better future.
While foundation model labs focus on scaling up, we aim to level the playing field by building AI models that are as accessible as possible.
After years of research on efficient ML, we decided that the best way to spread our impact was to take it into our own hands. Each of us cares deeply about empowering people to maximize their impact while minimizing their carbon footprint.
Role Description
As an Applied ML Engineer at Pruna AI, you will bridge the gap between cutting-edge research and real-world application. Your mission is to identify the most promising AI models released by the community and industry, apply a combination of internal and external efficiency methods to make them more efficient, and deploy them for end users.
You'll be at the forefront of operationalizing our research, ensuring that users can benefit from state-of-the-art models without the heavy costs of deployment. This is a hands-on role combining deep ML expertise with practical engineering skills.
What you'll do:
- Model Optimization
- Analyze newly released open-source models and assess the impact of optimizing and deploying them.
- Apply a combination of internal and external efficiency methods to make them more efficient.
- Benchmark performance vs. baseline models and ensure minimal accuracy/performance trade-offs.
- Generate clear reports that translate technical results into actionable insights for communication and go-to-market.
- Continuously improve deployed models as research and hardware evolve.
- Deployment & Delivery
- Package optimized models for deployment on the cloud.
- Ensure smooth integration into Pruna's SaaS platform and customer environments.
- Collaborate with the Software team to scale testing, deployment, and monitoring.
- Collaborate closely with the Research team to help identify and apply promising algorithms, and give feedback on what could improve current algorithms.
- Customer & Partner Engagement
- You will work closely with customers and users of our optimized models, whether to identify the most promising models or to understand the exact specifications required.
- You will stay in constant contact with users so that we can quickly iterate and improve our models to best fit industry and production use cases.
We would love to see:
Educational background or Experience
- B.Sc./M.Sc./Ph.D. in Computer Science, Machine Learning, or related fields, or equivalent industry experience.
- Demonstrated experience working with modern AI models (e.g., transformers, diffusion, multimodal architectures).
- Strong foundations in deep learning and applied ML.
- Expertise in PyTorch and Python.
- Familiarity with model deployment workflows (Cog, Litserve, vLLM, etc.).
- Experience taking ML models from research to production in real-world environments.
- Understanding of performance benchmarking, profiling, and hardware-aware optimization.
- Comfort with neocloud platforms (Replicate/Runpod/Modal) or legacy clouds (AWS/Azure/GCP), and with containerization (Cog, Docker, …).
- Strong understanding of benchmarking tools and frameworks for both quality and efficiency.
- Experience translating evaluation metrics into actionable engineering trade-offs.
- Strong sense of ownership and accountability.
- Ability to thrive in ambiguous, fast-moving environments.
- Clear communication skills to bridge research and customer needs.
- Passion for making AI both impactful and sustainable.
- Experience with compression methods (quantization, pruning, distillation, compilation).
- Knowledge of lower-level optimization frameworks (Triton, CUDA, C++).
- Prior experience in forward-deployed engineering or customer-facing ML roles.
Salary: We pay top market rates based on seniority and location, leveraging aggregated third-party data.
Benefits: Meal vouchers, health & wellness solutions, mobility, travel policy to visit fellow Pruners, and a remote stipend for your home workspace.
Recruitment Process
Our recruitment process consists of 4 interviews to check expectations, technical skills, and team/culture fit:
- Intro Call – Get to know each other. [~1 hour]
- Foundations – Problem-solving & ML/engineering fundamentals. [~1 hour]
- Challenge – Apply your skills on a representative task. [~2-3 hours preparation + 1 hour call]
- Meet the Team – Chat with Pruners and learn about day-to-day life. [~1 hour]
Accessibility note: We adapt the process to your needs to ensure equal opportunity for all applicants.
Our Values
- Decide Wisely – Rational, customer-focused decisions.
- Trust by Default – Transparency and collaboration.
- Foster Inclusion – Supportive, diverse workplace.
- Grow Together – Feedback and recognition.
- Learn Relentlessly – Adapt and innovate in a fast-moving landscape.