ML-QA Engineer

vor 2 Wochen


Berlin, Berlin, Deutschland nnamu Vollzeit 90.000 € - 120.000 € pro Jahr

Your tasks
We are looking for a
ML-QA Engineer with strong automation experience testing intelligent/agentic applications
to join our growing engineering team. You will play a critical role in ensuring the quality and reliability of our platform by developing automated test frameworks, writing robust test cases, and collaborating closely with developers, product owners, and DevOps.

The role goes beyond traditional QA: You will be working with A
I-driven, agentic applications that can plan, act, and adapt.
Your job will be to make sure they behave correctly, safely, and predictably under real-world conditions.

Key Responsibilities

  • Design, develop, and maintain automated test frameworks and test scripts using Python
  • Integrate automated tests into CI/CD pipelines to ensure continuous quality and fast feedback
  • Collaborate with developers and product managers to identify, reproduce, and resolve defects early in the lifecycle
  • Define and document test plans, strategies, and acceptance criteria for both deterministic and agentic features
  • Verify observability and explainability of agent behavior (logs, traces, intermediate actions) to ensure compliance, transparency, and auditability
  • Perform scenario-based, exploratory, and chaos testing to validate agent decision-making under uncertainty, failure, and edge conditions
  • Test human-in-the-loop workflows, ensuring proper escalation, overrides, and user safety
  • Conduct root cause analysis of production issues, including agent misbehavior, and contribute to long-term fixes
  • Contribute to the development and enforcement of QA standards and best practices

Your profile
Must-Have Qualifications

  • 3+ years of experience in QA Engineering with a focus on test automation for web and distributed systems
  • Proficient in Python for developing test scripts, validation tools, and data validation utilities
  • Hands-on experience with testing frameworks such as PyTest, Selenium, or Playwright for end-to-end and UI testing.
  • Strong understanding of software testing methodologies, including unit, integration, system, and end-to-end testing.
  • Familiarity with REST APIs and tools like Postman or Swagger for API testing
  • Strong problem solving skills with the ability to think beyond "happy path" testing.
  • Understanding of data validation pipeline and data pipeline testing, ensuring integrity and reproducibility across datasets

Nice-to-Have Qualifications

  • Experience testing agentic or AI-driven applications (e.g., autonomous decision-making, multi-step workflows)
  • Experience with scenario-based or chaos testing frameworks
  • Familiarity with observability and monitoring tools (e.g., Datadog, Grafana, AWS CloudWatch) to validate agent behavior
  • Experience testing microservices or serverless architectures on AWS
  • Familiarity with CI/CD tools like GitHub Actions, Jenkins, or GitLab CI.
  • Knowledge of security and guardrail testing (role-based access, safe action validation)
  • Experience writing performance and load tests for AI interface endpoints using Locust or k6.

Qualities we Value:

  • Systems thinker: You see how parts connect and anticipate failure points
  • Exploratory mindset: Comfortable testing for unknown unknowns
  • Domain curiosity: Willingness to deeply understand how the agent is supposed to act in the real world
  • Ethical awareness: You care about safety, fairness, and unintended consequences in AI systems
  • Collaborative: You work closely with engineers, product, and operations to build trust in our platform

Why us?
At Beroe X nnamu GmbH
, we prioritize a balanced work-life experience.

Here's what we offer you:

  • 4-Day Work Week & 30 Days Paid Vacation – More time to recharge
  • Competitive Compensation – Fair salary and comprehensive benefits.
  • Monthly Benefits Allowance – €40/month in vouchers for fitness and other perks with Probonio
  • Flexible Work Arrangements – Work the way that suits you. Professional Development – Access to training and certifications.
  • Team Events – Bi-annual company events and monthly lunch get-togethers.
  • Work Abroad Flexibility – Remote work from the EU or selected non-EU countries for up to 8 weeks a year with travel insurance coverage

We operate on a hybrid model with offices in Berlin and Munich, offering a 32-hour, 4-day workweek.

This means:

  • In-Office Collaboration: Work from the office two days a week
  • Manage Your Own Hours: Flexibility to work around your needs as long as team goals are met.

Our Culture
We are committed to fostering a collaborative, innovative and inclusive work environment where everyone's ideas matter. We know that diverse teams lead to better outcomes and welcome applicants from all backgrounds.

About Us
At Beroe x nnamu GmbH,
we are committed to empowering procurement teams to make informed, strategic decisions that drive real impact. By integrating AI and game theory, our Software as a Service (SaaS) platform, nnamu.negotiations, delivers a groundbreaking approach to complex, yet autonomous negotiations.

nnamu.negotiations delivers additional total-value-of-ownership savings efficiently and effectively for buyers with no prior knowledge of game theory required. By combining our world class proprietary AI with an unmatched, unique database of over EUR 400bn actual game theory project data we help clients uncover incremental value in their negotiations, driving outcomes that were previously out of reach.

Beroe x nnamu GmbH is now a Beroe, Inc. group entity
. Beroe is a global leader in procurement intelligence. Beroe has been on procurement's leading edge since the company's founding in 2006, bringing a world of insights forward. The unique combination of Beroe's expertise, AI tools, and vast amounts of reliable data enable organizations to make smarter, faster, better procurement decisions. Not tomorrow, not today, but now. Selected by ProcureTech as one of the "most pioneering Analytics, Data and Intelligence solutions in 2024", Beroe helps thousands of organizations sift through the data noise, mitigate risk, face fewer surprises, and ultimately gain a competitive edge.

With nnamu's cutting-edge AI and Beroe's trusted market intelligence, we are unlocking game-changing possibilities for procurement teams worldwide

The Challenge We are Solving
Despite the proven potential of game theory to enhance negotiation outcomes significantly – potentially unlocking an incremental USD 1 Trillion of value – its application in business negotiations remains rare, inconsistent, and hard to scale. This gap presents a vast opportunity but also a challenge that many companies have yet to overcome. Beroe x nnamu GmbH is pioneering the use of AI-powered game theory to make complex negotiation strategies accessible, scalable, and incredibly effective for organizations worldwide.


  • ML Engineer

    vor 2 Wochen


    Berlin, Berlin, Deutschland Personio Vollzeit 120.000 € - 180.000 € pro Jahr

    Personio's intelligent HR platform helps small and medium-sized organizations unlock the power of people by making complicated, time-consuming tasks simple and efficient. Our growing team of 1,800+ Personios across Europe and the US are building user-friendly products that delight our 14,000+ customers and their 1.5 million employees. Ready to make an impact...

  • QA Engineer

    vor 2 Wochen


    Berlin, Berlin, Deutschland primehire Vollzeit 45.000 € - 60.000 € pro Jahr

    QA Engineer with a focus in Backend (Java or Kotlin) - German speaking - BerlinWe're searching for a QA Engineer to join our customer in the banking industry. They're building a brand-new trading platform and need someone with backend knowledge in either Java or Kotlin. Additionally, they're looking for someone who brings strong exposure in Selenium or...

  • QA Engineer

    vor 1 Woche


    Berlin, Berlin, Deutschland JetBrains Vollzeit 60.000 € - 90.000 € pro Jahr

    At JetBrains, code is our passion. Ever since we started, back in 2000, we have strived to make the strongest, most effective developer tools on earth. By automating routine checks and corrections, our tools speed up production, freeing developers to grow, discover, and create.JetBrains is looking for a QA Engineer to join the Kotlin Compiler team ).Kotlin's...

  • AI/ML Engineer

    vor 1 Woche


    Berlin, Berlin, Deutschland Melotech Vollzeit 60.000 € - 120.000 € pro Jahr

    Who we areMelotech is revolutionizing media and entertainment. We create art through technology for humans to enjoy. In just 18 months, our work has been heard, watched and loved for over 2 billion minutes worldwide.Founded by entrepreneur and investor Soheil Mirpour, we are backed by top VCs Cherry Ventures, Speedinvest and GFC, alongside world-class angels...


  • Berlin, Berlin, Deutschland Michael Page Vollzeit 40.000 € - 50.000 € pro Jahr

    Spannende HerausforderungSpannendes UnternehmenAbout Our ClientStart:ab sofortAuslastung:VollzeitEinsatzort:Remote (ggf. Meetings in Berlin 2-3x pro Quartak für Architecture Reviews)Projektdauer:3 Monate (option auf Verlängerung)Projektsprache:EnglischJob DescriptionAufgabenbeschreibungEntwicklung und Optimierung von Machine-Learning-Modellen mit...

  • Senior QA Engineer

    vor 1 Woche


    Berlin, Berlin, Deutschland Lumenaza Vollzeit 60.000 € - 90.000 € pro Jahr

    Das sind wir Wir sind ein Green-Tech Unternehmen und gestalten die Energiebranche nachhaltiger. Bei uns hast Du die Gelegenheit, in der Welt der dezentralisierten und erneuerbaren Energien aktiv mitzuwirken. Unsere Mission ist es, die Energiewende voranzutreiben – und dies tun wir mit Leidenschaft, Technologie und Innovation.Lumenaza lebt klare Werte:...

  • Senior QA Engineer

    Vor 2 Tagen


    Berlin, Berlin, Deutschland Lumenaza GmbH Vollzeit 75.000 € - 95.000 € pro Jahr

    Das sind wirWir sind ein Green-Tech Unternehmen und gestalten die Energiebranche nachhaltiger. Bei uns hast Du die Gelegenheit, in der Welt der dezentralisierten und erneuerbaren Energien aktiv mitzuwirken. Unsere Mission ist es, die Energiewende voranzutreiben – und dies tun wir mit Leidenschaft, Technologie und Innovation.Lumenaza lebt klare Werte:...


  • Berlin, Berlin, Deutschland Gallup Vollzeit 80.000 € - 120.000 € pro Jahr

    Build automation that fuels data accuracy, product excellence and global insights.As a senior QA automation engineer at Gallup, you'll be hands-on with building and scaling automation frameworks while guiding fellow engineers, shaping test architecture and ensuring data quality across systems. With deep technical expertise in test automation and quality...


  • Berlin, Berlin, Deutschland Gallup Vollzeit 60.000 € - 120.000 € pro Jahr

    Build automation that fuels data accuracy, product excellence and global insights.As a senior QA automation engineer at Gallup, you'll be hands-on with building and scaling automation frameworks while guiding fellow engineers, shaping test architecture and ensuring data quality across systems. With deep technical expertise in test automation and quality...

  • QA Engineer

    vor 5 Stunden


    Berlin, Berlin, Deutschland NeuroNation Vollzeit 60.000 € - 120.000 € pro Jahr

    About usNeuroNation is a leading brain training platform used by over 30 million people worldwide. We collaborate closely with scientists and universities to deliver engaging, medically certified digital health solutions. Our mobile and web applications have been recognized by Apple, Google, and major health insurance companies. In Germany, our medical app...