AI Model Optimization and Tool Development Engineer

1 tydzień temu


Warszawa, Mazovia, Polska 42dot Pełny etat 60 000 zł - 120 000 zł rocznie
We are looking for the best

42dot is seeking an AI Model Optimization and Tool Development Engineer (NPU) to focus on optimizing the autonomous driving stack and on-device large language models (LLMs). This role involves developing AI model optimization techniques for NPUs and building toolchains to ensure efficient execution. The engineer will be responsible for optimizing deep learning models for hardware accelerators, designing and developing toolchains that enhance performance, and supporting the advancement of AI technologies such as autonomous driving and LLMs through hardware-aware optimizations. This position plays a crucial role in bridging AI models with hardware accelerators, ensuring seamless integration and optimal efficiency.

Responsibilities

  • AI Model Porting and Optimization

    • Port AI models for LLM and autonomous driving stacks to NPU hardware and optimize their performance. Improve inference speed by utilizing techniques such as model compression (quantization, pruning, etc.), operator fusion, and memory optimization.

  • Toolchain Development & Compiler Engineering

    • Design and implement toolchains for porting AI models to NPUs. Integrate with deep learning frameworks such as TensorFlow and PyTorch to provide an efficient workflow. Develop tools for NPU-specific code generation, profiling, and debugging.

  • Optimization of Autonomous Driving and LLM Stacks

    • Optimize AI modules required for autonomous driving (e.g., object detection, path planning) to ensure compatibility and real-time execution performance. Enhance memory efficiency and speed through LLM inference optimization. Apply model parallelization and distributed execution techniques in multimodal AI stacks.

  • Performance Analysis and Improvement

    • Analyze AI model runtime performance and identify bottlenecks. Implement techniques to maximize hardware utilization.

  • Research and Adoption of New Technologies

    • Study the latest advancements in AI model optimization and NPU-related technologies. Experiment with and adopt new techniques to maximize NPU performance.

Qualifications

  • Bachelor's or Master's degree in Computer Science, AI, or a related field

  • At least 3 years of experience in AI model optimization and hardware acceleration

  • Familiarity with compiler technologies such as LLVM and MLIR

  • Experience optimizing AI models using NPUs, GPUs, or ASICs

  • Proficiency in deep learning frameworks and model conversion tools such as TensorFlow Lite, ONNX, and PyTorch

  • Expertise in model compression and optimization techniques, including quantization, pruning, and lazy evaluation

  • Proficiency in programming languages such as CUDA, C++, and Python, with experience in writing hardware-accelerated code

  • Strong understanding of memory management and parallel computing techniques

Preferred Qualifications

  • Experience with autonomous driving stacks, including SLAM, path planning, and object recognition

  • Optimization experience for on-device AI/LLM applications

  • Experience in AI optimization for embedded systems

  • Contributions to open-source AI optimization projects

Interview Process

  • Application Review → Coding Test → First Interview (~1 hour) → Second Interview (~3 hours) → Final Selection

  • The interview process may vary depending on the position and is subject to change based on the schedule and circumstances.

  • Applicants will be individually notified of the interview schedule and results via the email provided in their application.

Additional Information

  • In accordance with fair hiring practices, do not include any personal information unrelated to your job qualifications (e.g., Social Security Number, family relations, marital status, age, photo, physical condition, place of birth, etc.) in your resume.

  • All documents must be submitted in PDF format and under 30MB in size.

  •  If you experience issues uploading your resume, please send it along with the job posting URL to

  • We strongly encourage applications from U.S. veterans and candidates eligible for employment preference under applicable laws.

  • Qualified individuals with disabilities are encouraged to apply and will receive consideration under the Americans with Disabilities Act (ADA).

  • 42dot does not accept unsolicited resumes and will not pay fees for any such submissions. Equal Opportunity Statement

  • 42dot is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees, regardless of race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, or veteran status.

※ Please review the following information before applying.

  • How to work in 42dot, About 42dot Way →



  • Warszawa, Mazovia, Polska Tenstorrent Pełny etat

    Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high...

  • AI Engineer

    1 tydzień temu


    Warszawa, Mazovia, Polska Siena AI Pełny etat 60 000 zł - 120 000 zł rocznie

    Meet SienaSiena is the first intelligence layer for customer experience. We're creating an operating system of AI agents that learn, remember, and act across every customer touchpoint—from support conversations to shopping experiences to voice and social media interactions.Siena doesn't just automate support; it powers shopping agents, builds persistent...

  • AI Backend Engineer

    1 tydzień temu


    Warszawa, Mazovia, Polska Movate Pełny etat

    Movateis a part of a global information and communication technology company. We strongly believe that our People are the reason for our success. Discover the concept of 'SMALLPORATION' and join our team"We don't just offer jobs, we provide careers."So, if you are eager to thrive with us, take a look at the offer belowA talented team of backend developers is...

  • OS Tool Software Engineer

    1 tydzień temu


    Warszawa, Mazovia, Polska 42dot Pełny etat 104 000 zł - 130 878 zł rocznie

    We are looking for the bestJoin our dynamic team as a seasoned System Software Engineer, where you'll play a pivotal role in revolutionizing vehicle computing systems Your expertise will be the driving force behind developing cutting-edge tools for distributed messaging systems, a key element in our trailblazing autonomous driving platform. As an OS Tool...

  • AI Data Engineer

    1 tydzień temu


    Warszawa, Mazovia, Polska Fetcherr Pełny etat 60 000 zł - 120 000 zł rocznie

    Fetcherr, experts in deep learning, e-commerce, and digitization, is disrupting traditional systems with its cutting-edge AI technology. At its core is the Large Market Model (LMM), an adaptable AI engine that forecasts demand and market trends with precision, empowering real-time decision-making. Specializing initially in the airline industry, Fetcherr aims...

  • AI Data Engineer

    8 godzin temu


    Warszawa, Mazovia, Polska Fetcherr Pełny etat

    Fetcherr, experts in deep learning, e-commerce, and digitization, is disrupting traditional systems with its cutting-edge AI technology. At its core is the Large Market Model (LMM), an adaptable AI engine that forecasts demand and market trends with precision, empowering real-time decision-making. Specializing initially in the airline industry, Fetcherr aims...

  • Senior AI Engineer

    1 tydzień temu


    Warszawa, Mazovia, Polska TeaCode Pełny etat 100 000 zł - 120 000 zł rocznie

    Pay rate: zł/hType of agreement:B2B contractType of work:100% remoteWe are looking for an experiencedSenior AI Engineerto design, develop, and implement advanced machine learning algorithms and systems.This role focuses on building robust, scalable AI solutions, including LLM-based (Large Language Model) pipelines, and taking them from proof of concept to...

  • Senior AI Solution Engineer

    1 tydzień temu


    Warszawa, Mazovia, Polska Semantive Pełny etat 60 000 zł - 120 000 zł rocznie

    We are an independent multi-cloud services company that helps enterprises and SMB companies build new foundations for future growth through successful cloud transformation and breadth of expertise in applying cloud technologies to unlock new possibilities. Today's digital and technological revolution is an important part of our business - we are not only...

  • OS Tool Software Engineer

    1 tydzień temu


    Warszawa, Mazovia, Polska 42dot Pełny etat 60 000 zł - 120 000 zł rocznie

    We are looking for the bestJoin our dynamic team as a seasoned System Software Engineer, where you'll play a pivotal role in revolutionizing vehicle computing systems Your expertise will be the driving force behind developing cutting-edge tools for distributed messaging systems, a key element in our trailblazing autonomous driving platform. As an OS Tool...


  • Warszawa, Mazovia, Polska AI Clearing Pełny etat

    Company DescriptionAI Clearing, headquartered in Austin, Texas, is a leading global provider of an AI-powered platform for the infrastructure construction industry. Established in 2020, the company offers the only computer vision foundational model and agentic platform specifically for the construction sector. AI Clearing partners with general contractors...