Agentic & Generative Edge AI Optimization Engineer - Hamburg or Munich - Long Term Contract

Posted 23 hours ago


Munich, Bavaria, Germany | ConSol Partners | Full-time

Agentic & Generative Edge AI Optimization Engineer

Initial 12 month freelance contract + possibility of extension

Location: Hamburg or Munich

3 days onsite / 2 days remote per week

40 hours/week

ASAP start

Our client's AI Competence Center is looking for an AI Engineer who is passionate about Generative AI and Agentic AI systems and thrives on optimizing models for efficient on-device deployment. You will work on large language models (LLMs), large multimodal models (LMMs), and Vision-Language-Action (VLA) models, ensuring they run reliably and efficiently on the client's NPU-based platforms.

Your mission will be to translate cutting-edge research into production-ready solutions, focusing on model compression, system optimizations, and agentic capabilities such as function calling and tool orchestration. Experience with designing secure and reliable agentic workflows, including guardrails and safe tool invocation, is considered a strong plus.

If you are inspired by deeply understanding the inner workings of LLMs, designing system-level optimizations, and building agentic systems under resource constraints, then you'll want to join our client's growing AI Competence Center.

What You'll Do:


• Optimize LLMs and multimodal models for on-device deployment

o Investigate, develop, and apply advanced quantization (8-bit, 4-bit, mixed precision), pruning, and distillation techniques to derive optimized models for our client's NPU targets.


• Accelerate inference performance

o Investigate, develop and implement system optimizations such as speculative decoding and other efficient decoding algorithms tailored for edge environments.


• Engineer agentic AI capabilities towards tiny agents

o Investigate methodologies for improving the performance of small language models so that they can power tiny agents at the edge, while ensuring these agents follow safety principles.


• Work with inference engines and deployment frameworks

o Deploy optimized models using Ollama, ONNX Runtime, and TFLite for efficient NPU inference.


• Benchmark LLMs and agentic systems

o Design benchmarking pipelines for assessing the performance of Generative and Agentic AI systems on-device.


• Develop demonstrators and proof-of-concepts

o Build technology PoCs that showcase key technologies for our client's relevant use cases, such as industrial safety monitoring, in-cabin sensing, and other edge AI applications.


• Move key technologies from research into product solutions

o Translate advanced optimization techniques and agentic AI features into production-ready implementations and collaborate with product teams to integrate these features into our client's SW/HW portfolio.

Your Profile:


• MSc, PhD, or EngD in a technical discipline such as Computer Science or an equally relevant field.


• 5+ years of experience in software/AI engineering with deep exposure to LLMs, VLMs, and systems performance.


• Experience with LLM quantization techniques (e.g., SmoothQuant, SpinQuant, QuaRot), pruning (Wanda, SparseGPT, etc.), and other system optimizations such as speculative decoding.


• Proven track record of working with AI frameworks (PyTorch, TensorFlow, etc.) required.


• Experience with Agentic AI technologies and familiarity with existing frameworks (e.g., LangChain, Google ADK, SmolAgents, etc.)


• Understanding of safety and security considerations for agentic systems (e.g., guardrails, policy enforcement, secure function calling) is a plus.


• Understanding of AI toolchains, deployment, portability and inference engines (CUDA, TensorRT, TFLite, ONNX, Ollama, etc.) preferred.


• Affinity for and experience with embedded systems and NPU accelerators required.


• Experience with embedded software architecture, build systems, and version control systems required.


• Broad experience with GNU/Linux operating systems, embedded systems, development boards, and processors, along with the related software competencies, required.


• Familiarity with setting up and maintaining related MLOps development environments (MLflow, ClearML, etc.) required.


• Knowledge of build systems (Yocto, OpenEmbedded, etc.) beneficial; experience with cross-compilation toolchains for ARM preferred.


• Solid programming experience in C, C++, Python, and Bash on Linux systems required.


• Excellent communication skills in English (verbal/written) required. Experience working in or with multi-site and multicultural projects and teams preferred.

