Member of Technical Staff
2 days ago
What if the future of generative AI isn't just better images or better text, but models that understand both—and use that understanding to create in ways neither modality could alone?
Our founding team pioneered Latent Diffusion and Stable Diffusion - breakthroughs that made generative AI accessible to millions. Today, our FLUX models power creative tools, design workflows, and products across industries worldwide.
Our FLUX models are best-in-class not only for their capability, but for ease of use in developing production applications. We top public benchmarks and compete at the frontier - and in most instances we're winning.
If you're relentlessly curious and driven by high agency, we want to talk.
With a team of ~50, we move fast and punch above our weight. From our labs in Freiburg - a university town in the Black Forest - and San Francisco, we're building what comes next.
But here's the frontier we're exploring: vision-language models that don't just caption images or generate from prompts, but truly understand the relationship between visual and linguistic information. Models that can enhance prompts intelligently, moderate content contextually, and unlock generative capabilities we haven't imagined yet. That's the research you'll lead.
What You'll Pioneer
You'll run cutting-edge projects in multimodal vision-language and large language models, integrating them into our media generation pipeline in ways that push beyond what either modality could achieve alone. This isn't about implementing existing VLMs; it's about developing novel approaches that make FLUX more powerful, more controllable, and more aligned with what creators actually need.
You'll be the person who:
- Leads the development and training of state-of-the-art multimodal vision-language models within the FLUX technology stack—not just applying existing architectures, but innovating on them
- Designs and implements specialized fine-tuning strategies for VLMs to address specific use cases and performance requirements that general-purpose models can't handle
- Develops and optimizes LLM implementations for prompt enhancement, content moderation, and novel applications that improve how people interact with generative models
- Drives innovation by integrating VLM/LLM capabilities into our media generation pipeline in creative ways that enhance generative capabilities
- Conducts research to creatively combine vision and language models—exploring questions about how these modalities can inform and improve each other
- Maintains cutting-edge knowledge of the latest developments in multimodal AI and LLM research, evaluating emerging models and architectures for potential integration
- Collaborates with cross-functional teams to implement and deploy models at scale, contributing to architectural decisions and technical roadmap planning
- Documents and shares research findings with the broader team, translating breakthroughs into practical improvements
Open questions you'll explore:
- How can vision-language models improve prompt understanding in ways that make generation more controllable and aligned with user intent?
- What's the right architecture for integrating VLMs into diffusion model workflows without creating computational bottlenecks?
- How do you fine-tune vision-language models for specialized creative tasks that weren't in the training data?
- Where can LLMs enhance the generative pipeline—prompt rewriting, content moderation, parameter suggestion—and where would they add more friction than value?
- What novel capabilities emerge when you deeply integrate vision and language understanding into generative workflows?
- How do you evaluate whether multimodal models are actually improving generation quality versus just adding complexity?
These aren't solved problems—they're research directions we're actively exploring.
Who Thrives Here
You've trained and fine-tuned large-scale vision-language models and understand the nuances of multimodal learning. You have strong intuitions about what makes VLMs work well, backed by either publications or practical projects that pushed the field forward. You're comfortable operating at the intersection of research and production, where models need to be both innovative and deployable.
You likely have:
- Demonstrated expertise in training and fine-tuning large-scale vision-language models—not just using pre-trained ones, but developing them
- Strong publication record or practical experience with relevant projects in multimodal AI research that shows you can push the frontier
- Proficiency in PyTorch or similar deep learning frameworks with deep understanding of their capabilities and limitations
- Experience with distributed training systems and large-scale model optimization—because VLMs don't fit on one GPU
- Track record of implementing and scaling AI models in production environments where research meets real-world constraints
We'd be especially excited if you:
- Have experience with diffusion models and generative AI architectures alongside autoregressive modeling—understanding how different paradigms can complement each other
- Bring a background in computer vision that informs your approach to multimodal models
- Contribute to open-source AI projects and understand the community
- Have worked in fast-paced startup environments where iteration speed matters
- Bring strong software engineering practices and system design skills
- Have experience with open-source VLM inference frameworks like vLLM
We're not just adding VLMs to our stack—we're exploring fundamental questions about how vision and language understanding can make generative models more powerful and more aligned with human intent. Every model you train teaches us something about multimodal learning. Every integration reveals new capabilities. Every research finding shapes where the field goes next. If that sounds more compelling than applying existing techniques, we should talk.
We're based in Europe and value depth over noise, collaboration over hero culture, and honest technical conversations over hype. Our models have been downloaded hundreds of millions of times, but we're still a ~50-person team learning what's possible at the edge of generative AI.