Data Engineer Databricks

2 days ago


Katowice, śląskie, Poland Addepto Full-time 15 zł - 120 zł

Addepto is a leading AI consulting and data engineering company that builds scalable, ROI-focused AI solutions for some of the world’s largest enterprises and pioneering startups, including Rolls Royce, Continental, Porsche, ABB, and WGU. With our exclusive focus on Artificial Intelligence and Big Data, we help organizations unlock the full potential of their data through systems designed for measurable business impact and long-term growth.


Beyond client projects, we have developed our own product offerings born from real-life client insights and challenges. We are also actively releasing open-source solutions to the community, transforming practical experience into tools that benefit the broader AI ecosystem. This commitment to scalable innovation, proven ROI delivery, and knowledge sharing has earned us recognition by Forbes as one of the top 10 AI consulting companies worldwide.


As a Data Engineer, you will have the exciting opportunity to work with a team of technology experts on challenging projects across various industries, leveraging cutting-edge technologies. Here are some of the projects we are seeking talented individuals to join:

  • Design and development of a universal data platform for global aerospace companies. This Azure- and Databricks-powered initiative combines diverse enterprise and public data sources. The data platform is in the early stages of development, which covers architecture and process design and leaves freedom in technology selection.

  • Data platform transformation for an energy management association. This project addresses critical data management challenges, boosting user adoption, performance, and data integrity. The team is implementing a comprehensive data catalog, leveraging Databricks and Apache Spark/PySpark, for simplified data access and governance. Secure integration solutions and enhanced data quality monitoring, utilizing Delta Live Tables tests, have established trust in the platform. The intermediate result is a user-friendly, secure, and data-driven platform that serves as a basis for further development of ML components.

  • Design of data transformation and downstream DataOps pipelines for a global car manufacturer. This project aims to build a data processing system for both real-time streaming and batch data. We’ll handle data for business uses like process monitoring, analysis, and reporting, while also exploring LLMs for chatbots and data analysis. Key tasks include data cleaning, normalization, and optimizing the data model for performance and accuracy.


Your main responsibilities:

  • Design scalable data processing pipelines for streaming and batch processing using Big Data technologies like Databricks, Airflow and/or Dagster.

  • Contribute to the development of CI/CD and MLOps processes.

  • Develop applications to aggregate, process, and analyze data from diverse sources.

  • Collaborate with the Data Science team on Machine Learning projects, including text/image analysis and predictive model building.

  • Develop and organize data transformations using Databricks/DBT and Apache Airflow.

  • Translate business requirements into technical solutions and ensure optimal performance and quality.


What you’ll need to succeed in this role:

  • At least 3 years of commercial experience implementing, developing, or maintaining Big Data systems.

  • Strong programming skills in Python: writing clean code and object-oriented design.

  • Experience in designing and implementing data governance and data management processes.

  • Familiarity with Big Data technologies like Airflow or Dagster, Databricks, Spark and DBT.

  • Experience implementing and deploying solutions in cloud environments (with a preference for Azure).

  • Knowledge of how to build and deploy Power BI reports and dashboards for data visualization.

  • Excellent understanding of dimensional data and data modeling techniques.

  • Excellent communication skills and consulting experience with direct interaction with clients.

  • Ability to work independently and take ownership of project deliverables.

  • Master’s or Ph.D. in Computer Science, Data Science, Mathematics, Physics, or a related field.


Discover our perks & benefits:

  • Work in a supportive team of passionate enthusiasts of AI & Big Data.

  • Engage with top-tier global enterprises and cutting-edge startups on international projects.

  • Enjoy flexible work arrangements, allowing you to work remotely or from modern offices and coworking spaces.

  • Accelerate your professional growth through career paths, knowledge-sharing initiatives, language classes, and sponsored training or conferences, including a partnership with Databricks, which offers industry-leading training materials and certifications.

  • Choose your preferred form of cooperation, B2B or a contract of mandate, and enjoy 20 fully paid days off.

  • Participate in team-building events and utilize the integration budget.

  • Celebrate work anniversaries, birthdays, and milestones.

  • Access medical and sports packages, eye care, and well-being support services, including psychotherapy and coaching.

  • Get full work equipment for optimal productivity, including a laptop and other necessary devices.

  • With our backing, you can boost your personal brand by speaking at conferences, writing for our blog, or participating in meetups.

  • Experience a smooth onboarding with a dedicated buddy, and start your journey in our friendly, supportive, and autonomous culture.


Are you interested in Addepto, and would you like to join us?


Get in touch. We are looking forward to receiving your application. Would you like to know more about us?

Visit our website (career page) and social media (Facebook, LinkedIn, Instagram).


  • Databricks Tech Lead

    3 weeks ago


    Katowice, śląskie, Poland KPMG Full-time

    Databricks Tech Lead - Data & Cloud Team. Location: Katowice. Technologies we use. Required: Databricks, Data Lake, Azure Data Factory, Azure DevOps, CI/CD, Python, SQL, BI. About the project: The Data & Cloud team delivers services to our clients in the broad area of data analytics, data platform modeling, and Business Intelligence....

  • Databricks Tech Lead

    2 hours ago


    Katowice, śląskie, Poland KPMG Full-time

    Databricks Tech Lead - Data & Cloud Team. Location: Katowice. Technologies we use. Required: Databricks, Data Lake, Azure Data Factory, Azure DevOps, CI/CD, Python, SQL, BI. About the project: The Data & Cloud team delivers services to our clients in the broad area of data analytics, data platform modeling, and Business Intelligence....

  • Senior Data Engineer

    4 weeks ago


    Katowice, śląskie, Poland Sopra Steria Full-time 16 zł

    Company Description: Sopra Steria is one of the largest players in the tech industry in Europe, known for its consulting, digital services, and software development. We operate in nearly 30 countries and employ more than 55,000 people. The Polish branch, as the Global Delivery Center, has operated in Katowice since 2007 and has been growing ever since....

  • Data Engineer

    2 days ago


    Katowice, śląskie, Poland Reply Polska Sp. z o. o. Full-time 18 zł - 480 zł

    Responsibilities: Implement data-facing backend features: analyze, design, and code services that read/write Delta tables, respecting schema evolution and time travel; document APIs and data contracts. Validate feasibility of new requirements within the existing medallion architecture; propose changes to bronze/silver/gold layers and indexing/partitioning where...

  • Senior Data Engineer

    2 weeks ago


    Katowice, śląskie, Poland Michael Page Full-time

    Senior Data Engineer. Location: Katowice. Technologies we use. Expected: Azure SQL. About the project: A global beverages and infusions company with over a century of heritage, known for its widely recognized tea brands and strong focus on sustainability, innovation, and responsible sourcing. Operating across multiple markets, the organization combines tradition...


  • Katowice, śląskie, Poland Relativity Full-time 15 zł - 83 zł

    We are building a specialized team focused on enabling advanced analytics and reporting capabilities across our internal data ecosystem. This team will design and maintain data platforms that integrate modern lakehouse technologies, distributed compute frameworks, and cloud-native services to support diverse analytical use cases and...

  • Senior Data Engineer

    2 hours ago


    Katowice, śląskie, Poland Sii Sp. z o.o. Full-time

    Senior Data Engineer (f/m/x). Location: Katowice. Technologies we use. Required: ETL, Azure Data, Apache Spark, Data Vault, Terraform, SQL, Python, Scala. Nice to have: Java, Scala. About the project: The project involves developing a modern cloud data platform that supports analytics, reporting, and machine learning solutions. Data is processed in batch mode and...

  • Cloud Data Engineer

    2 hours ago


    Katowice, śląskie, Poland Sii Sp. z o.o. Full-time

    Cloud Data Engineer (f/m/x). Location: Katowice. Technologies we use. Required: Spark/PySpark, Microsoft Azure, ETL, SQL, Python, Databricks. Nice to have: Snowflake, Apache Kafka, Apache Airflow. About the project: Do you want to develop your skills in cloud technologies? Join our specialized unit of experts in the field of data processing and...

  • Cloud Data Engineer

    7 days ago


    Katowice, śląskie, Poland Sii Sp. z o.o. Full-time

    Cloud Data Engineer. Location: Katowice. Technologies we use. Required: Microsoft Azure, ETL, SQL, Python, Databricks, Spark, PySpark. Nice to have: Snowflake, Apache Kafka, Apache Airflow. About the project: Do you want to develop your skills in cloud technologies? Join our specialized unit of experts in the field of data processing and analysis...


  • Katowice, śląskie, Poland Reply Polska Sp. z o. o. Full-time 9 zł

    Responsibilities: Data System Design: Design and implement robust, scalable data processing systems: this involves selecting appropriate storage technologies, designing schemas, and planning integration strategies. Data Integration and ETL Development: Develop and maintain pipelines for data transformation, integration, and ETL processes. Ensure data quality...