Agentic & Generative Edge AI Optimization Engineer - Hamburg or Munich - Long Term Contract
vor 23 Stunden
Agentic & Generative Edge AI Optimization Engineer
Initial 12 month freelance contract + possibility of extension
Location: Hamburg or Munich
3 days onsite / 2 days remote per week
40 hours/week
ASAP start
At our client's AI Competence Center, they are looking for an AI Engineer passionate about Generative AI and Agentic AI systems, someone who thrives on optimizing models for efficient on-device deployment. You will work on large language models (LLMs), large multimodal models (LMMs), and Vision-Language-Action (VLA) models, ensuring they run reliably and efficiently on their NPU-based platforms.
Your mission will be to translate cutting-edge research into production-ready solutions, focusing on model compression, system optimizations, and agentic capabilities such as function calling and tool orchestration. Experience with designing secure and reliable agentic workflows, including guardrails and safe tool invocation, is considered a strong plus.
If you are inspired by deeply understanding the inner workings of LLMs, designing system-level optimizations, and building agentic systems under resource constraints, then you'll want to join our client's growing AI Competence Center.
What You'll Do
:
• Optimize LLMs and multimodal models for on-device deployment
o Investigate, develop and apply advanced quantization (8-bit, 4-bit, mixed precision), pruning, and distillation techniques for deriving optimized models for our client's NPU targets.
• Accelerate inference performance
o Investigate, develop and implement system optimizations such as speculative decoding and other efficient decoding algorithms tailored for edge environments.
• Engineer agentic AI capabilities towards tiny agents
o Investigate methodologies for enhancing the performance of small language models towards enabling tiny agents at the edge, while ensuring these follow safety principles.
• Work with inference engines and deployment frameworks
o Deploy optimized models using Ollama, , ONNX Runtime, and TFLite for efficient NPU inference.
• Benchmark LLMs and agentic systems
o Design benchmarking pipelines for assessing the performance of Generative and Agentic AI systems on-device.
• Develop demonstrators and proof-of-concepts
o Build technology PoCs for our client's relevant use-cases such as industrial safety monitoring, in-cabin sensing, and other edge AI applications for showcasing key technologies.
• Move key technologies from research into product solutions
o Translate advanced optimization techniques and agentic AI features into production-ready implementations and collaborate with product teams to integrate these features into our client's SW/HW portfolio.
Your Profile
:
• MSc, PhD or EngD in a technical specialism, like Computer Science or equally relevant.
• 5+ years of experience in software/AI engineering with deep exposure to LLMs, VLMs, and systems performance.
• Experience with LLM quantization techniques (e.g., SmoothQuant, SpinQuant, QuaRoT), pruning (Wanda, SparseGPT, etc.) and other system optimizations like speculative decoding.
• Track-record experience in working with AI frameworks (PyTorch, TensorFlow, etc.), required.
• Experience with Agentic AI technologies and familiarity with existing frameworks (e.g., LangChain, Google ADK, SmolAgents, etc.)
• Understanding of safety and security considerations for agentic systems (e.g., guardrails, policy enforcement, secure function calling) is a plus.
• Understanding of AI toolchains, deployment, portability and inference engines (CUDA, TensorRT, TFLite, ONNX, Ollama, etc.) preferred.
• Affinity and experience with embedded systems, and NPU accelerators required.
• Experience with embedded software architecture, build systems, version control systems required.
• Broad experience with Operating systems GNU/Linux, embedded systems, development boards, and processors, and SW competencies required.
• Familiarity with setting up and maintaining related ML-Ops development environments (MLFlow, ClearML, etc.) required.
• Knowledge of build systems (YOCTO, OpenEmbedded, etc.) beneficial, working with cross-compilation toolchains for ARM preferred.
• Solid programming experience of C, C++, Python and Bash programming languages on Linux systems required.
• Excellent communication skills in English (verbal /written) required. Experience in working in/with multi-site and multi-cultural projects/teams preferred.
-
Agentic Edge AI Engineer
vor 20 Stunden
Munich, Bayern, Deutschland 5V Tech VollzeitAgentic Edge AI Optimization EngineerMunich (DE) OR Hamburg (DE)Hybrid-Remote FlexibilityInitial 12-months + option to join as an employeeCompetitive Salary / Rate depending on experience levelA leading semiconductor company is expanding a high-priority R&D program focused on Agentic and Generative AI at the edge. This team is taking cutting-edge research...
-
Artificial Intelligence Engineer
vor 15 Stunden
Munich, Bayern, Deutschland WHD Consulting Ltd. VollzeitAgentic & Generative Edge AI Optimization Engineer – Long Term Contract (Munich or Hamburg)My client are looking to recruit an AI Engineer passionate about Generative AI and Agentic AI systems, someone who thrives on optimizing models for efficient on-device deployment. You will work on large language models (LLMs), large multimodal models (LMMs), and...
-
Senior Generative AI Engineer
Vor 4 Tagen
Munich, Bayern, Deutschland Ähdus Technology GmbH VollzeitJob Title:Senior Generative AI EngineerLocation:Munich, GermanyEmployment Type:Full-TimeAbout the Role:We are looking for aSenior Generative AI Engineerto lead the design and development of cutting-edge generative AI systems that produce human-like content such as text, images, code, and audio. You will play a key role in building and deploying large-scale...
-
Senior AI Engineer
vor 2 Wochen
Munich, Bayern, Deutschland Manex AI GmbH Vollzeit 100.000 € - 150.000 € pro JahrDESCRIPTIONAs a Senior AI Engineer at Manex AI, you'll build ambitious, domain-specific products powered by modern LLM technologies, end to end. You ship fast, make it robust and scalable, and level up our agents. You'll be part of an ultra-smart tech team where every line of code matters, and every idea you bring can shape the product.OUR MISSIONManex AI is...
-
AI Engineer
Vor 3 Tagen
Munich, Bayern, Deutschland beroe x nnamu GmbH VollzeitYour tasksWe are looking for an experienced AI Engineer to join our growing AI team at Beroe X nnamu. You'll play a key role in developing intelligent, agentic AI systems using cutting-edge large language models (LLMs), multi-agent orchestration, and retrieval-augmented generation (RAG). This is a hands-on role combining software engineering, ML/NLP...
-
Embedded Software Engineer
vor 2 Wochen
Munich, Bayern, Deutschland WHD Consulting Ltd. Vollzeit 60.000 € - 100.000 € pro JahrEmbedded SW Engineer (ADAS / Computer Vision) – Munich – Long Term ContractMy client are recruiting for a skilled and experienced Embedded Software Engineer on a long terms contract / feeelance basis to their team in developing cutting-edge robotic and AI-based solutions for the automotive industry. In this mid-level role, you will play a key part in...
-
Senior Machine Learning Engineer
vor 2 Wochen
Munich, Bayern, Deutschland AI Futures Vollzeit 90.000 € - 120.000 € pro JahrSenior Machine Learning Engineer – Munich, Germany - HybridAbout Our Client:Our client is a leading AI-driven Financial Services technology company with significant venture funding and a strong presence in Europe. They are building cutting-edge solutions that leverage Artificial Intelligence to transform financial operations, compliance, and customer...
-
Spatial AI Research Engineer/Scientist
vor 23 Stunden
Munich, Bayern, Deutschland Huawei Research Center Germany & Austria VollzeitHuawei's Munich Research Center is responsible for advanced technology research, architectural development, design and strategic engineering of our products.We are seeking a highly motivated and skilled Spatial AI Research Engineer/Scientist to join our team dedicated to pushing the boundaries of artificial intelligence in physical and virtual environments....
-
Customer Impact Engineer
Vor 4 Tagen
Munich, Bayern, Deutschland Manex AI VollzeitWHAT SETS YOU UP FOR SUCCESSDegree in Industrial Engineering, Mechanical Engineering, or similar (Plus: PhD)2+ years of professional experience in consulting, automotive, manufacturing, or techSolid knowledge of data science and AI in industrial contextsExcellent communication and stakeholder management skillsBusiness-fluent in...
-
AI Engineer
vor 15 Stunden
Munich, Bayern, Deutschland KI performance GmbH VollzeitBecome our new (Senior)AI Engineer (m/f/d)Are you passionate about AI and cutting-edge technologies? Do you want to help companies leverage AI to create real business impact? Are you ambitious, solution-oriented, and eager to work at the intersection of data engineering, machine learning, and cloud technology? Then join our team at KI performanceYour Role...