Inference Optimization Engineer
Vor 6 Tagen
We are seeking a highly skilled AI Research Scientist/Inference Optimization Engineer to join our team at Amazon Web Services Development Center Germany GmbH. As a key member of our team, you will be responsible for optimizing foundation models for inference, working closely with our customers, product organizations, and the academic and research communities.
Key Responsibilities- Invent, implement, and deploy state-of-the-art machine learning algorithms and systems to improve the inference of foundation models.
- Interact closely with our customers, product organizations, and the academic and research communities to advance the field of AI research and engineering.
- Develop and maintain a deep understanding of modern deep learning architectures, including Transformers, and their applications in inference optimization.
- Collaborate with cross-functional teams to design, develop, and deploy high-performance computing systems and infrastructure for AI research and engineering.
- Stay up-to-date with the latest advancements in AI research and engineering, and apply this knowledge to drive innovation and improvement in our products and services.
- PhD or Master's degree in Computer Science, Electrical Engineering, Machine Learning, or a related field.
- Experience in patents or publications at top-tier peer-reviewed conferences or journals.
- Proficiency in programming languages such as Java, C++, Python, or related languages.
- Experience with machine learning, mathematical optimization, parallel and distributed computing, and high-performance computing.
- Solid technical understanding of modern deep learning architectures and frameworks such as PyTorch.
- Experience with programming hardware accelerators (e.g., GPU, TPU, Neuron).
- Experience with inference optimization of foundation models, including model compression techniques, architectural optimization, and system-level optimization.
- Experience with inference engines (e.g., vLLM, TGI).
Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and continue to innovate, providing a robust suite of products and services to power businesses.
We value diversity, equity, and inclusion, and strive to create a workplace where everyone can thrive. We believe that a diverse and inclusive workplace is essential to our success, and we are committed to building a culture that reflects this value.
We offer a range of benefits and programs to support our employees, including mentorship and career growth opportunities, flexible work arrangements, and a comprehensive benefits package.
-
Senior Research Scientist, AI Optimization
Vor 5 Tagen
Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH VollzeitAbout the RoleWe are seeking a highly skilled Senior Research Scientist to join our team at Amazon Web Services Development Center Germany GmbH. As a Senior Research Scientist, you will be responsible for developing and implementing cutting-edge machine learning algorithms and systems to optimize the inference of foundation models.Key ResponsibilitiesDesign...
-
Senior Research Scientist, AI Optimization
Vor 6 Tagen
Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH VollzeitAbout the RoleWe are seeking a highly skilled Senior Research Scientist to join our team at Amazon Web Services Development Center Germany GmbH. As a Senior Research Scientist, you will be responsible for developing and implementing cutting-edge machine learning algorithms and systems to optimize the inference of foundation models.Key ResponsibilitiesDesign...
-
AI Research Scientist
Vor 5 Tagen
Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH VollzeitAbout the RoleWe are seeking a highly skilled AI Research Scientist/Inference Optimization Engineer to join our team at Amazon Web Services Development Center Germany GmbH. As a key member of our team, you will be responsible for optimizing foundation models for inference, working closely with our customers, product organizations, and the academic and...
-
AI Research Scientist
vor 1 Woche
Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH VollzeitAbout the RoleWe are seeking a highly skilled AI Research Scientist/Inference Optimization Engineer to join our team at Amazon Web Services Development Center Germany GmbH. As a key member of our team, you will be responsible for optimizing foundation models for inference, working closely with our customers, product organizations, and the academic and...
-
Senior AI Research Scientist
Vor 5 Tagen
Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH VollzeitPosition: AWS AI Research & Engineering (AIRE) Scientist/Engineer AWS AI Research & Engineering (AIRE) is seeking talented scientists and engineers to enhance the optimization of foundational models for inference. At AIRE, we are dedicated to leveraging advanced techniques in compiler design, high-performance computing, and computer architecture to elevate...
-
Senior AI Research Scientist
vor 4 Wochen
Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH VollzeitPosition: AWS AI Research & Engineering (AIRE) Scientist/Engineer AWS AI Research & Engineering (AIRE) is seeking talented scientists and engineers to enhance the optimization of foundational models for inference. At AIRE, we are dedicated to leveraging advanced techniques in compiler design, high-performance computing, and computer architecture to elevate...
-
AI Research Scientist II
Vor 5 Tagen
Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH VollzeitPosition: AWS AI Research Scientist II The AWS AI Research & Engineering (AIRE) team is on the lookout for innovative scientists and engineers dedicated to enhancing foundation models for inference. At AIRE, we focus on leveraging advanced techniques in compiler optimization, high-performance computing, and computer architecture to elevate the efficiency of...
-
AI Research Scientist II
vor 4 Wochen
Tübingen, Baden-Württemberg, Deutschland Amazon Web Services Development Center Germany GmbH VollzeitPosition: AWS AI Research Scientist II The AWS AI Research & Engineering (AIRE) team is on the lookout for innovative scientists and engineers dedicated to enhancing foundation models for inference. At AIRE, we focus on leveraging advanced techniques in compiler optimization, high-performance computing, and computer architecture to elevate the efficiency of...
-
Technical Sales Engineer
vor 2 Wochen
Tübingen, Baden-Württemberg, Deutschland Hartmetall-Werkzeugfabrik Paul Horn GmbH VollzeitJob Opportunity at Hartmetall-Werkzeugfabrik Paul Horn GmbHWe are seeking a highly motivated Technical Sales Engineer to join our team at Hartmetall-Werkzeugfabrik Paul Horn GmbH. If you have a background in Mechanical Engineering and experience in the Oil and Gas industry, this could be the perfect opportunity for you.Your Key ResponsibilitiesProcess...
-
IT Infrastructure Architect
Vor 7 Tagen
Tübingen, Baden-Württemberg, Deutschland CureVac VollzeitAbout CureVacCureVac is a global biopharmaceutical company that specializes in the development and optimization of mRNA technology for medical purposes. With over 20 years of expertise, our focus is on creating innovative solutions for prophylactic vaccines, cancer immunotherapies, and protein-based therapies.Job SummaryWe are seeking an experienced IT...
-
Applied Scientist II, AI Research
vor 2 Monaten
Tübingen, Deutschland Amazon Web Services Development Center Germany GmbH VollzeitAWS AI Research & Engineering (AIRE) is looking for scientists and engineers to work on optimizing foundation models for inference in Tuebingen, Germany. At AIRE, we actively work on applying compiler, high-performance computing, and computer architecture techniques, amongst others, to optimize the performance of foundation model execution, including...