Lead Data Engineer Spark

4 weeks ago


Kraków, Małopolskie, Poland Addepto Full-time 21 zł

Addepto is a leading AI consulting and data engineering company that builds scalable, ROI-focused AI solutions for some of the world’s largest enterprises and pioneering startups, including Rolls Royce, Continental, Porsche, ABB, and WGU. With our exclusive focus on Artificial Intelligence and Big Data, we help organizations unlock the full potential of their data through systems designed for measurable business impact and long-term growth.

Beyond client projects, we have developed our own product offerings born from real-life client insights and challenges. We are also actively releasing open-source solutions to the community, transforming practical experience into tools that benefit the broader AI ecosystem. This commitment to scalable innovation, proven ROI delivery, and knowledge sharing has earned us recognition by Forbes as one of the top 10 AI consulting companies worldwide.

As a Lead Data Engineer, you will have the exciting opportunity to work with a team of technology experts on challenging projects across various industries, leveraging cutting-edge technologies. Here are some of the projects we are seeking talented individuals to join:

  • Design and development of a platform for managing vehicle data for a global automotive company. This project develops a shared platform for processing massive car data streams. It ingests terabytes of data daily, using both streaming and batch pipelines for near real-time insights. The platform transforms raw data for data analysis and Machine Learning. This empowers teams to build real-world applications like digital support and smart infotainment, and unlocks data-driven solutions for car maintenance and anomaly detection across the organization.
  • Design and development of a universal data platform for global aerospace companies. This Azure and Databricks powered initiative combines diverse enterprise and public data sources. The data platform is at an early stage of development, covering the design of architecture and processes and giving freedom in technology selection.

This role represents a gradual shift away from hands-on coding towards a more strategic focus on system design, business consultation, and creative problem-solving. It offers an opportunity to engage more deeply with architecture-level decisions, collaborate closely with clients, and contribute to building innovative data-driven solutions from a broader perspective.

Your main responsibilities:

  • Design and develop scalable data management architectures, infrastructure, and platform solutions for streaming and batch processing using Big Data technologies like Apache Spark, Hadoop, and Iceberg.
  • Design and implement data management and data governance processes and best practices.
  • Contribute to the development of CI/CD and MLOps processes.
  • Develop applications to aggregate, process, and analyze data from diverse sources.
  • Collaborate with the Data Science team on data analysis and Machine Learning projects, including text/image analysis and predictive model building.
  • Develop and organize data transformations using DBT and Apache Airflow.
  • Translate business requirements into technical solutions and ensure optimal performance and quality.

What you'll need to succeed in this role:

  • 5+ years of proven commercial experience in implementing, developing, or maintaining Big Data systems.
  • Strong programming skills in Python or Java/Scala: writing clean code, OOP design.
  • Experience in designing and implementing data governance and data management processes.
  • Familiarity with Big Data technologies like Spark, Cloudera, Airflow, NiFi, Docker, Kubernetes, Iceberg, Trino, or Hudi.
  • Proven expertise in implementing and deploying solutions in cloud environments (with a preference for AWS).
  • Excellent understanding of dimensional data and data modeling techniques.
  • Excellent communication skills and consulting experience with direct interaction with clients.
  • Ability to work independently and take ownership of project deliverables.
  • Master’s or Ph.D. in Computer Science, Data Science, Mathematics, Physics, or a related field.
  • Fluent English (C1 level) is a must.

Discover our perks & benefits:

  • Work in a supportive team of passionate enthusiasts of AI & Big Data.
  • Engage with top-tier global enterprises and cutting-edge startups on international projects.
  • Enjoy flexible work arrangements, allowing you to work remotely or from modern offices and coworking spaces.
  • Accelerate your professional growth through career paths, knowledge-sharing initiatives, language classes, and sponsored training or conferences, including a partnership with Databricks, which offers industry-leading training materials and certifications.
  • Choose from various employment options: B2B, employment contracts, or contracts of mandate.
  • Make use of 20 fully paid days off available for B2B contractors and individuals under contracts of mandate.
  • Participate in team-building events and utilize the integration budget.
  • Celebrate work anniversaries, birthdays, and milestones.
  • Access medical and sports packages, eye care, and well-being support services, including psychotherapy and coaching.
  • Get full work equipment for optimal productivity, including a laptop and other necessary devices.
  • With our backing, you can boost your personal brand by speaking at conferences, writing for our blog, or participating in meetups.
  • Experience a smooth onboarding with a dedicated buddy, and start your journey in our friendly, supportive, and autonomous culture.

Are you interested in Addepto and would like to join us? Get in touch. We are looking forward to receiving your application.

Would you like to know more about us? Visit our website (career page) and social media (Facebook, LinkedIn, Instagram).


  • Data Engineer

    1 week ago


    Kraków, Małopolskie, Poland Caspian One Full-time 37 zł - 800 zł

    A leading global investment bank needs a Senior Spark Data Engineer to join an agile team responsible for building a strategic integration layer between Deal stores and Operations/Regulatory systems. This role is central to a multi-year program aimed at modernizing infrastructure and leveraging cloud technologies to improve scalability, resilience, and...

  • Data Engineer Spark

    4 weeks ago


    Kraków, Małopolskie, Poland Addepto Full-time 15 zł - 120 zł

    Addepto is a leading AI consulting and data engineering company that builds scalable, ROI-focused AI solutions for some of the world’s largest enterprises and pioneering startups, including Rolls Royce, Continental, Porsche, ABB, and WGU. With our exclusive focus on Artificial Intelligence and Big Data, we help organizations unlock the full potential of...


  • Kraków, Małopolskie, Poland Grid Dynamics Poland Full-time 20 zł - 160 zł

    We're seeking a highly skilled Big Data Engineer to design and build critical financial technology solutions. This role is central to developing a tool that empowers financial advisors to manage client opportunities based on defined business rules. Essential functions: Design, develop, and deploy Spark-based solutions for cleaning, transforming, and analyzing...


  • Kraków, Małopolskie, Poland Grid Dynamics Poland Full-time 26 zł - 880 zł

    Our customer is one of the world’s largest technology companies, based in Silicon Valley with operations all over the world. In this project, we are working on the bleeding edge of Big Data technology to develop a high-performance data analytics platform that handles petabyte-scale datasets. We are looking for an experienced Big Data...

  • Data Engineer

    1 week ago


    Kraków, Małopolskie, Poland Caspian One Full-time 30 zł

    Location: Krakow (Hybrid)
    Rate: Up to 2000 PLN/day
    Duration: 6-month rolling contract
    Overview: Excellent opportunity to work with a highly reputable financial services company in Poland! We’re seeking a skilled Spark Software Engineer to join a dynamic agile team focused on building the strategic backbone between Dealstores and Operations/Regulatory systems....


  • Kraków, Małopolskie, Poland Kolomolo Full-time

    Lead Data Platform Engineer (Databricks / Data Mesh Architecture). Location: European Union (Remote or Hybrid). Full-Time. Join the Future of Digital Tech with Kolomolo. At Kolomolo, we don’t just follow trends - we set them. As a global supplier of IT services and digital modernization solutions, we help businesses embrace cutting-edge technology to optimize...

  • Data Engineer PySpark

    4 weeks ago


    Kraków, Małopolskie, Poland Hirexa Full-time 18 zł - 900 zł

    Job Title: Data Engineer. Location: Krakow, Poland (Hybrid). Employment Type: Contract. About Hirexa Solutions: Hirexa Solutions is a leading player in the recruitment ecosystem across the United States, United Kingdom, Europe, and India. As the fastest-growing next-generation provider of technology talent, we empower our clients to become resourceful, achieve...

  • Data Test Engineer

    4 weeks ago


    Kraków, Małopolskie, Poland Connectis Full-time

    Data Test Engineer. Location: Kraków. Technologies we use. Required: Python, SAS, Linux, Hadoop, Teradata, PostgreSQL. Nice to have: Hive, Spark, Databricks, HDFS, Shell Scripting. About the project: For our client, a global organization in the financial sector, we are looking for an experienced person for the position of Data Test Engineer. The project involves migrating a Big Data environment from...


  • Kraków, Małopolskie, Poland Relativity Full-time 19 zł - 333 zł

    We are building a specialized team focused on enabling advanced analytics and reporting capabilities across our internal data ecosystem. As a Tech/Team Lead, you will combine deep technical expertise with leadership skills to guide a team in designing and maintaining data platforms that integrate modern lakehouse technologies, distributed...


  • Kraków, Małopolskie, Poland N-iX Full-time 8 zł

    #4151 Join our team to work on enhancing a robust data pipeline that powers our SaaS product, ensuring seamless contextualization, validation, and ingestion of customer data. Collaborate with product teams to unlock new user experiences by leveraging data insights. Engage with domain experts to analyze real-world engineering data and build data quality...