QA Engineer Tester AI LLM
2 tygodni temu
About the Role
We’re looking for a QA Tester with strong AI literacy and data validation skills to test how AI agents behave and ensure that their outputs are stored accurately and reliably in backend systems. This is a manual, exploratory testing role — ideal for someone who can combine curiosity about AI behavior with practical knowledge of data flows, relational databases, and result traceability.
You won’t be writing automation scripts, but you will need to understand how AI agents operate, how their outputs are used, and how to verify correctness across both the UI and backend.
What You’ll Do
Manually test AI-driven workflows that generate content, complete tasks, or make decisions
Assess AI behavior by checking for:
Consistency and repeatability
Hallucinations, inaccuracy, or bias
Relevance and task alignment
Evaluate data integrity:
Trace AI-generated data from the interface to the backend
Use SQL to validate how outputs are stored, structured, or logged
Compare AI intent/output to the resulting records in the database
Reproduce and report subtle, fuzzy, or probabilistic issues with structured documentation
Collaborate with engineers, AI designers, and product owners to define quality criteria across system layers
Must-Have Skills
2+ years of manual QA experience, ideally in exploratory or context-driven testing environments
Practical understanding of LLMs and AI tools (ChatGPT, Claude, etc.)
Basic to intermediate knowledge of SQL (joins, filters, aggregations, subqueries)
Experience validating data pipelines, audit logs, or relational integrity
Able to detect both UI anomalies and backend data discrepancies
Clear written and verbal communication for reporting behavior-based bugs
Familiarity with testing non-deterministic or AI-powered systems
Nice-to-Have
Understanding of prompt engineering and how LLM behavior can shift with input changes
Familiarity with AI agent architectures (e.g., LangChain, ReAct, RAG systems)
Experience working with BI tools (e.g., Metabase, Redash) or data validation frameworks
Background in content moderation, safety testing, or AI/UX evaluation
-
Ekspert AI LLM
4 tygodni temu
Warszawa, mazowieckie, Polska Transition Technologies PSC Pełny etatFirma Algomine, należąca do grupy TTPSC, poszukuje Data ScientistPoszukujemy Eksperta AI/LLM, który dołączy do naszego zespołu, aby wspierać rozwój innowacyjnych rozwiązań opartych na sztucznej inteligencji i najnowszych modelach językowych. Szukamy osoby z szeroką wiedzą techniczną i architektoniczną, potrafiącej łączyć kompetencje...
-
AI LLM Developer
2 tygodni temu
Warszawa, mazowieckie, Polska RITS Professional Services Pełny etat 23 zł - 520 złAI/LLM Engineer (NLP, Generative AI, Python)Lokalizacja: 100% zdalnie lub hybrydowo (Warszawa)Forma współpracy: B2BStawka: 140 (mid) –200 PLN (senior)/h nettoStart: ASAPWymiar: Full-timeCzas trwania: projekt długoterminowyO projekcieDołącz do zespołu pracującego nad zaawansowaną platformą opartą na sztucznej inteligencji i dużych modelach...
-
AI Developer – LLM RAG on-premise
4 tygodni temu
Warszawa, mazowieckie, Polska 1dea Pełny etat 21 zł - 840 złDla jednego z dużych klientów poszukujemy osoby do roli:AI EngineerWarunki zaangażowania: Obszar: consultingLokalizacja: Warszawa / hybryda Start: ASAP (akceptujemy kandydatury z max 1 msc okresem wypowiedzenia)Stawka (ustalana indywidualnie): B2B do 150 zł net + vat Zaangażowanie: B2B (outsourcing z 1dea), full-time, długofalowo Prosimy o przesyłanie...
-
QA Automation Engineer with AI experience
3 tygodni temu
Warszawa, mazowieckie, mazowieckie, Polska ACAISOFT POLAND Sp. z o.o. Pełny etatQA Automation Engineer with AI experienceMiejsce pracy: WarszawaTechnologies we useExpectedGitHub CopilotCursorClaude CodeAbout the projectYou will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.In this role,...
-
AI Engineer MLOps Engineer Python LLM Kubernetes GCP
4 tygodni temu
Warszawa, mazowieckie, Polska RITS Professional Services Pełny etat 25 zł - 200 złStanowisko: AI Engineer / MLOps Engineer (Python, LLM, Kubernetes, GCP)Lokalizacja: Warszawa, ul. Chmielna 89 (praca w biurze raz w tygodniu)Model pracy: hybrydowyForma współpracy: B2B 150-170zł/h + VATPlanowany start: 20.10.2025Czas współpracy: ponad 12 miesięcyCo oferujemyDługofalową współpracę (ponad 12 miesięcy)Pracę przy nowoczesnych...
-
Warszawa, mazowieckie, Polska RITS Professional Services Pełny etat 18 zł - 480 złAI Engineer (Python / LLM / LangChain) – Mid+ / Senior – 100% zdalnieNajważniejsze informacjeTryb pracy: Hybryda, raz w tygodniu z biura w Warszawie lub Gdańsku Forma zatrudnienia: B2B Poziom doświadczenia: Mid/Senior (min. 3 lata komercyjnego doświadczenia) Wynagrodzenie (B2B): 110-135zł/h System poleceń: dostępny (szczegóły podczas rozmowy)...
-
Senior AI Engineer – Python Agents
7 dni temu
Warszawa, mazowieckie, Polska emagine Polska Pełny etat 30 zł - 240 złIndustry: Public transportWorking model: 100% remote (occasional travel to Denmark possible)Workload: Full-timeProject length: 6 months (+ extension) Type of assignment: B2B up to 200 PLN net/hourStart: 1 month noticeBYOD: yesThe Senior AI Engineer role is focused on developing and enhancing large language model (LLM) infrastructure within the GenAI...
-
Automated QA Engineer
4 tygodni temu
Warszawa, mazowieckie, Polska Transition Technologies MS Pełny etatWe are looking for an Automated QA Engineer to join our team who will focus on end-to-end quality assurance.Your responsibilities:● RAG Quality Evaluation: Design and curate datasets for RAG evaluation, develop and execute tests for RAG benchmarking, evaluating the accuracy and relevance of AI-generated content and information retrieval.● Data-Driven...
-
Automated QA Engineer
7 dni temu
Warszawa, mazowieckie, Polska Transition Technologies MS Pełny etatWe are looking for an Automated QA Engineer to join our team who will focus on end-to-end quality assurance.Your responsibilities:● RAG Quality Evaluation: Design and curate datasets for RAG evaluation, develop and execute tests for RAG benchmarking, evaluating the accuracy and relevance of AI-generated content and information retrieval.● Data-Driven...
-
Senior AI Engineer –LLM Trainer
4 tygodni temu
Warszawa, mazowieckie, Polska Upvanta sp. z o.o. Pełny etat 21 zł - 840 złAbout the role:You will focus on training and fine-tuning large language models (LLMs) to support our product that generates HTML from prompts, inspired by email-like instructions. Our stack leverages diffusion models and pre-trained LLMs.This is a hands-on role where your work will directly influence the product and shape the roadmap after launch.What...