Inference Optimization Engineer

Vor 6 Tagen


Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit
About the Role

We are seeking a highly skilled AI Research Scientist/Inference Optimization Engineer to join our team at Amazon Web Services Development Center Germany GmbH. As a key member of our team, you will be responsible for optimizing foundation models for inference, working closely with our customers, product organizations, and the academic and research communities.

Key Responsibilities
  • Invent, implement, and deploy state-of-the-art machine learning algorithms and systems to improve the inference of foundation models.
  • Interact closely with our customers, product organizations, and the academic and research communities to advance the field of AI research and engineering.
  • Develop and maintain a deep understanding of modern deep learning architectures, including Transformers, and their applications in inference optimization.
  • Collaborate with cross-functional teams to design, develop, and deploy high-performance computing systems and infrastructure for AI research and engineering.
  • Stay up-to-date with the latest advancements in AI research and engineering, and apply this knowledge to drive innovation and improvement in our products and services.
Requirements
  • PhD or Master's degree in Computer Science, Electrical Engineering, Machine Learning, or a related field.
  • Experience in patents or publications at top-tier peer-reviewed conferences or journals.
  • Proficiency in programming languages such as Java, C++, Python, or related languages.
  • Experience with machine learning, mathematical optimization, parallel and distributed computing, and high-performance computing.
  • Solid technical understanding of modern deep learning architectures and frameworks such as PyTorch.
Preferred Qualifications
  • Experience with programming hardware accelerators (e.g., GPU, TPU, Neuron).
  • Experience with inference optimization of foundation models, including model compression techniques, architectural optimization, and system-level optimization.
  • Experience with inference engines (e.g., vLLM, TGI).
About Amazon Web Services

Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and continue to innovate, providing a robust suite of products and services to power businesses.

We value diversity, equity, and inclusion, and strive to create a workplace where everyone can thrive. We believe that a diverse and inclusive workplace is essential to our success, and we are committed to building a culture that reflects this value.

We offer a range of benefits and programs to support our employees, including mentorship and career growth opportunities, flexible work arrangements, and a comprehensive benefits package.



  • Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit

    About the RoleWe are seeking a highly skilled Senior Research Scientist to join our team at Amazon Web Services Development Center Germany GmbH. As a Senior Research Scientist, you will be responsible for developing and implementing cutting-edge machine learning algorithms and systems to optimize the inference of foundation models.Key ResponsibilitiesDesign...


  • Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit

    About the RoleWe are seeking a highly skilled Senior Research Scientist to join our team at Amazon Web Services Development Center Germany GmbH. As a Senior Research Scientist, you will be responsible for developing and implementing cutting-edge machine learning algorithms and systems to optimize the inference of foundation models.Key ResponsibilitiesDesign...

  • AI Research Scientist

    Vor 5 Tagen


    Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit

    About the RoleWe are seeking a highly skilled AI Research Scientist/Inference Optimization Engineer to join our team at Amazon Web Services Development Center Germany GmbH. As a key member of our team, you will be responsible for optimizing foundation models for inference, working closely with our customers, product organizations, and the academic and...

  • AI Research Scientist

    vor 1 Woche


    Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit

    About the RoleWe are seeking a highly skilled AI Research Scientist/Inference Optimization Engineer to join our team at Amazon Web Services Development Center Germany GmbH. As a key member of our team, you will be responsible for optimizing foundation models for inference, working closely with our customers, product organizations, and the academic and...


  • Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit

    Position: AWS AI Research & Engineering (AIRE) Scientist/Engineer AWS AI Research & Engineering (AIRE) is seeking talented scientists and engineers to enhance the optimization of foundational models for inference. At AIRE, we are dedicated to leveraging advanced techniques in compiler design, high-performance computing, and computer architecture to elevate...


  • Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit

    Position: AWS AI Research & Engineering (AIRE) Scientist/Engineer AWS AI Research & Engineering (AIRE) is seeking talented scientists and engineers to enhance the optimization of foundational models for inference. At AIRE, we are dedicated to leveraging advanced techniques in compiler design, high-performance computing, and computer architecture to elevate...


  • Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit

    Position: AWS AI Research Scientist II The AWS AI Research & Engineering (AIRE) team is on the lookout for innovative scientists and engineers dedicated to enhancing foundation models for inference. At AIRE, we focus on leveraging advanced techniques in compiler optimization, high-performance computing, and computer architecture to elevate the efficiency of...


  • Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit

    Position: AWS AI Research Scientist II The AWS AI Research & Engineering (AIRE) team is on the lookout for innovative scientists and engineers dedicated to enhancing foundation models for inference. At AIRE, we focus on leveraging advanced techniques in compiler optimization, high-performance computing, and computer architecture to elevate the efficiency of...


  • Tübingen, Baden-Württemberg, Deutschland Hartmetall-Werkzeugfabrik Paul Horn GmbH Vollzeit

    Job Opportunity at Hartmetall-Werkzeugfabrik Paul Horn GmbHWe are seeking a highly motivated Technical Sales Engineer to join our team at Hartmetall-Werkzeugfabrik Paul Horn GmbH. If you have a background in Mechanical Engineering and experience in the Oil and Gas industry, this could be the perfect opportunity for you.Your Key ResponsibilitiesProcess...


  • Tübingen, Baden-Württemberg, Deutschland CureVac Vollzeit

    About CureVacCureVac is a global biopharmaceutical company that specializes in the development and optimization of mRNA technology for medical purposes. With over 20 years of expertise, our focus is on creating innovative solutions for prophylactic vaccines, cancer immunotherapies, and protein-based therapies.Job SummaryWe are seeking an experienced IT...


  • Tübingen, Deutschland Amazon Web Services Development Center Germany GmbH Vollzeit

    AWS AI Research & Engineering (AIRE) is looking for scientists and engineers to work on optimizing foundation models for inference in Tuebingen, Germany. At AIRE, we actively work on applying compiler, high-performance computing, and computer architecture techniques, amongst others, to optimize the performance of foundation model execution, including...