Lead Data Engineer – Data Ingestion

1 week ago


Gdańsk, Pomerania, Poland Hapag-Lloyd AG Full-time 80 000 zł - 120 000 zł per year

We are looking for a hands-on Lead Data Engineer – Data Ingestion to guide the development and execution of our data ingestion processes into the Data Lakehouse. In this role, you will be responsible for designing scalable and reliable data pipelines, transforming source inputs into trusted Bronze-layer assets. You'll work closely with data owners, source system teams, and the Product Manager to ensure successful onboarding of new sources and to drive continuous improvements in data quality, automation, and performance. You'll lead a small team of engineers by example—actively coding, reviewing solutions, and setting engineering standards. This is a role for someone who is passionate about data integration and wants to shape the way data flows through our platform.

  • Design and maintain scalable data ingestion pipelines from internal and external sources including Kafka, APIs, SFTP, and file-based systems.

  • Establish and maintain best practices for ingesting and processing structured and semi-structured data (e.g., JSON, Avro, CSV).

  • Align ingestion pipelines with enterprise data architecture and naming conventions.

  • Manage batch and micro-batch processing; contribute to the future transition towards streaming ingestion.

  • Transform and validate ingested data into the Bronze layer within the Data Lakehouse.

  • Follow standardized onboarding scenarios and integrate new data sources in a consistent and governed way.

  • Apply schema and metadata standards using Unity Catalog and Collibra, and ensure proper lineage tracking.

  • Collaborate with the Product Manager and domain teams to scope and deliver ingestion use cases.

  • Ensure data quality and validation as part of the ingestion process.

  • Implement improvements in pipeline performance, cost efficiency, and automation.

  • Collaborate closely with teams using the ingested data to ensure usability and traceability.

  • Mentor team members, perform code reviews, and contribute to internal engineering standards and documentation.

  • Collaborate with the Data Governance team to ensure traceability, cataloging, and access control across new data domains.

  • Define and maintain onboarding playbooks and reusable ingestion templates.

  • Actively participate in backlog grooming, planning sessions, and technical refinement.

Qualifications

  • Minimum 6 years of experience in data engineering with a focus on ingestion and transformation.

  • Hands-on experience with Databricks (Spark), Python, Kafka, and cloud data processing on AWS (e.g., S3).

  • Experience with orchestration and workflow management (Airflow on Astronomer).

  • Strong SQL and Spark (preferably PySpark) skills.

  • Working knowledge of metadata governance via Unity Catalog and Collibra.

  • Familiarity with infrastructure-as-code tools (Terraform) and version control (GitLab).

  • Proven experience building reliable and performant ingestion pipelines.

  • Ability to collaborate with stakeholders, drive improvements, and document processes clearly.

  • Comfortable in an agile, product-oriented environment.

  • Experience working with semi-structured data and schema evolution techniques.

  • Knowledge of distributed data systems and challenges related to ingestion at scale.

  • Ability to lead technical conversations with external vendors or source system teams.

  • Understanding of testing frameworks for data pipelines and CI/CD validation.

  • Familiarity with tools for data profiling, schema validation, and automated lineage capture.

We offer
Area North is a newly established, diversified Hapag-Lloyd Area covering 8 countries with a mix of own offices and agencies. The Area's main office recently moved to new, modern premises at Alchemia in Gdańsk, which hosts other international companies and provides an agile international environment that fits well with the Hapag-Lloyd strategy.

We offer:

  • Private medical care (Medicover)
  • Private life insurance (Unum)
  • Attractive annual bonus (depending on company performance results)
  • Group life insurance and employee capital plan (PPK)
  • Cafeteria benefit system (cinema tickets, vouchers etc.)
  • Charity and volunteer initiatives
  • Modern and well-connected office (Alchemia complex in Gdańsk Oliwa)
  • Internal learning management system
  • Flexible working hours and home office possibility (hybrid work model)
  • Green commuting allowance

About Us
With a fleet of 313 modern container ships and a total transport capacity of 2.5 million TEU, Hapag-Lloyd is one of the world's leading liner shipping companies. In the Liner Shipping segment, the Company has around 14,000 employees and 400 offices in 140 countries. Hapag-Lloyd has a container capacity of 3.7 million TEU – including one of the largest and most modern fleets of reefer containers. A total of 133 liner services worldwide ensure fast and reliable connections between more than 600 ports on all the continents. In the Terminal & Infrastructure segment, Hapag-Lloyd has equity stakes in 21 terminals in Europe, Latin America, the United States, India and North Africa. Around 3,000 employees are assigned to the Terminal & Infrastructure segment and provide complementary logistics services at selected locations in addition to the terminal activities.


