Composable Data Stack Engineer

9 hours ago


Berlin, Berlin, Germany · dltHub ScaleVector GmbH · Full-time
Job Description

We are seeking a skilled Composable Data Stack Engineer to join our team at dltHub / ScaleVector GmbH. As a key member of our core product team, you will collaborate directly with our CTO to design and implement high-performance data processing libraries.

About the Role

As a Composable Data Stack Engineer, you will be responsible for integrating query engines, transformation frameworks, and table formats with our library. You will work closely with our customers on commercial projects where dlt is combined with existing "modern data stack" infrastructure. You will also help maintain the open-source project alongside our team by reviewing PRs, resolving issues, and engaging with community contributors.

Requirements
  • You have experience in building data apps or products based on composable data stacks.
  • You are familiar with Pythonic data libraries, including duckdb, arrow, datafusion, lancedb, delta-rs, ibis, pyiceberg, sqlglot, kedro, and hamilton.
  • You have a degree in computer science, data science, or equivalent experience.
  • You are fluent in writing Python code, including Python typing, unit testing, and writing docstrings.
  • You are familiar with GitHub workflows, including pull requests, code reviews, and CI/CD services.
Nice to Have
  • You are based in Berlin and willing to work in our office regularly.
  • You have a hacker nature and enjoy optimizing code.
  • You have experience with DevOps, including CI systems, Docker, Kubernetes, and cloud platforms such as AWS, GCP, or DigitalOcean.
  • You have experience with machine learning, including toolsets, workflows, and practical applications.
About Us

dltHub / ScaleVector GmbH is a leading provider of open-source data processing libraries. We are backed by Foundation Capital, Dig Ventures, and technical founders from companies such as Datadog, Instana, Hugging Face, MotherDuck, Mesosphere, Matillion, Miro, and Rasa. Our mission is to integrate dlt fully with the emerging Pythonic Composable Data Stack ecosystem, creating a seamless experience for our users.


