Data Engineer

2 weeks ago


Leipzig, Saxony, Germany · Cyber Insight · Full-time

At Cyber Insight, we are building the next generation of AI-driven platforms for IT security and risk management. Our mission is to empower companies to gain deep insights into their IT landscapes and proactively mitigate risks in an increasingly complex digital world.

As a fast-growing startup, we combine expertise in cybersecurity, data engineering, and artificial intelligence to deliver solutions that automate risk assessments, predict potential threats, and help organizations stay ahead of evolving cyber risks. Our team thrives on innovation, collaboration, and a shared passion for making a real impact in the cybersecurity space.

We are looking for a hands-on Data Engineer who is passionate about building reliable, scalable, and secure data systems. You'll help shape the data architecture and pipelines that feed our AI models and risk assessment engines, including the crucial task of mapping vulnerabilities (CVEs) to specific software and system components.

Tasks

  • Design, build, and maintain data pipelines and ETL/ELT workflows across GCP and on-prem environments.

  • Ingest and process cybersecurity-relevant data sources such as CVE feeds, software inventories, vulnerability databases, and event logs.

  • Develop and maintain transformation logic and data models linking vulnerabilities (CVEs) to affected software and assets.

  • Implement and automate data validation, consistency checks, and quality assurance using tools like Great Expectations or Deequ.

  • Collaborate with AI and graph modeling teams to structure and prepare data for threat intelligence and risk quantification models.

  • Manage and optimize data storage using BigQuery, PostgreSQL, and Cloud Storage, ensuring scalability and performance.

  • Automate data workflows and testing through CI/CD pipelines (GitHub Actions, GCP Cloud Build, Jenkins).

  • Implement monitoring and observability for pipelines using Prometheus, Grafana, and OpenTelemetry.

  • Apply a security-focused mindset in data handling, ensuring safe ingestion, processing, and access control of sensitive datasets.

Requirements

  • 3+ years of experience in data engineering, backend data systems, or cybersecurity data processing.

  • Strong Python skills and experience with pandas, PySpark, or Dask for large-scale data manipulation.

  • Proven experience with data orchestration and transformation frameworks (Airflow, dbt, or Dagster).

  • Solid understanding of data modeling, data warehousing, SQL optimization, and ETL pipelines (Kafka).

  • Familiarity with CVE data structures, vulnerability databases and standards (e.g. NVD, CPE, CWE), or security telemetry.

  • Experience integrating heterogeneous data sources (APIs, CSV, JSON, XML, or event streams).

  • Knowledge of GCP data tools (BigQuery, Pub/Sub, Dataflow, Cloud Functions) or equivalent in Azure/AWS.

  • Experience with containerized environments (Docker, Kubernetes) and infrastructure automation (Terraform or Pulumi).

  • Understanding of data testing, validation, and observability practices in production pipelines.

  • A structured and security-aware approach to building data products that support AI-driven risk analysis.

Nice to Have

  • Experience working with graph databases (Neo4j, ArangoDB) or ontology-based data modeling.

  • Familiarity with ML pipelines (Vertex AI Pipelines, MLflow, or Kubeflow).

  • Understanding of software composition analysis (SCA) or vulnerability scanning outputs (e.g. Trivy, Syft).

  • Background in threat intelligence, risk scoring, or cyber risk quantification.

  • Experience in multi-cloud or hybrid setups (GCP, Azure, on-prem).

Benefits

  • Freedom to design and shape a modern, secure data platform from the ground up.

  • A collaborative startup environment where your work directly supports AI and cybersecurity products.

  • Flexible working hours and remote-friendly setup.

  • Exposure to cutting-edge technologies in AI, data engineering, and cyber risk analytics.

  • Competitive salary and benefits tailored to your experience.

We look forward to meeting you.

