Aktualne oferty pracy związane z Site Reliability Engineering Lead - Kraków - Andersen


  • Kraków, Polska HSBC Service Delivery (Polska) Sp. z o.o. Pełny etat

    Site Reliability Engineering Lead Miejsce pracy: Kraków Your responsibilities - Analyse incident and change data to identify patterns, root causes, and systemic risks. - Define and track service health metrics (MTTR, failure rates, change success, etc.). - Partner with product and support teams to implement reliability improvements. - Build and maintain...


  • Kraków, Lesser Poland Andersen Lab Pełny etat

    SummaryAndersen is hiring a Site Reliability Engineering Lead in India to drive reliability and performance for large-scale digital insurance platforms, enhancing integrations, optimizing cloud systems, and ensuring stable, high-quality service delivery.The customer is a well-established global organization providing financial protection and risk-management...


  • Kraków, Lesser Poland AVSystem Pełny etat 80 000 zł - 120 000 zł rocznie

    We build, test, launch, and operate the complex, high-stakes systems for our global telco customers. Your mission is to ensure the reliability, efficiency, and performance of our core products (like UMP, CEM, BSAP, and DHCP) across both our cloud and complex on-premise deployments.This is no small task. Our products handle hundreds of millions of devices in...

  • Site Reliability

    2 tygodni temu


    Kraków, Lesser Poland Canonical - Jobs Pełny etat 80 000 zł - 120 000 zł rocznie

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...


  • Kraków, małopolskie, Polska Motorola Solutions Pełny etat 18 zł

    The Emergency Call Routing team is responsible for SaaS solutions that provide geospatial and traditional call-routing capabilities to communities, regions, and states. These systems are ultra-highly available, providing the service to route any caller dialing 9-1-1 (or 1-1-2, etc.) to the appropriate public safety answering point (PSAP) as quickly as...


  • Kraków, Lesser Poland VGW Pełny etat

    Senior Site Reliability EngineerVGW is an interactive entertainment company, harnessing technology and creativity to deliver world-class, free-to-play online social games. We have an exciting opportunity to join our Engineering team in Poland and are currently looking for a Senior Site Reliability Engineer to join the team.You'll focus on ensuring the...


  • Kraków, Lesser Poland VGW Pełny etat

    Senior Site Reliability EngineerVGW is an interactive entertainment company, harnessing technology and creativity to deliver world-class, free-to-play online social games. We have an exciting opportunity to join our Engineering team in Poland and are currently looking for a Senior Site Reliability Engineer to join the team.You'll focus on ensuring the...


  • Kraków, Lesser Poland Allvue Systems Pełny etat 70 000 zł - 142 000 zł rocznie

    About AllvueWe are Allvue Systems, the leading provider of software solutions for the Private Capital and Credit markets. Whether a client wants an end-to-end technology suite, or independently focused modules, Allvue helps eliminate the boundaries between systems, information, and people. We're looking for ambitious, smart, and creative individuals to join...

  • Senior Site Reliability

    2 tygodni temu


    Kraków, Lesser Poland Canonical - Jobs Pełny etat 60 000 zł - 120 000 zł rocznie

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...


  • Kraków, Lesser Poland Motorola Solutions Pełny etat 60 000 zł - 120 000 zł rocznie

    Company OverviewAt Motorola Solutions, we believe that everything starts with our people. We're a global close-knit community, united by the relentless pursuit to help keep people safer everywhere. Our critical communications, video security and command center technologies support public safety agencies and enterprises alike, enabling the coordination that's...

Site Reliability Engineering Lead

2 tygodni temu


Kraków, Polska Andersen Pełny etat

Andersen is hiring a Site Reliability Engineering Lead to drive reliability and performance for large-scale digital insurance platforms, enhancing integrations, optimizing cloud systems, and ensuring stable, high-quality service delivery.  The customer is a well-established global organization providing financial protection and risk-management services across various markets. With a diverse portfolio and teams operating in multiple regions, the company supports businesses and individuals through reliable, scalable solutions.  The project focuses on enhancing large-scale digital platforms, improving cloud performance, optimizing integrations, and modernizing systems to support efficient service delivery and ongoing expansion.   Responsibilities:   Piloting SRE adoption, assessing current Digital Applications architecture, and implementing highly reliable, fault-tolerant design patterns.  Defining Critical User Journeys (CUJs), SLOs/SLIs, and error budgets, ensuring alignment with business and user experience.  Maintaining a prioritized toil backlog to drive automation and operational efficiency.  Coaching production support teams on SRE principles and practices.  Working with Regional and Global Digital teams to support SRE rollout and adoption.  Preparing and delivering training sessions and materials, fostering continuous improvement.  Providing recommendations on system architecture, fault tolerance, and disaster recovery.  Delivering uptime, performance, and availability targets through SLIs, SLOs, SLAs, and error budgets.  Monitoring risks to ensure service reliability and minimizing disruptions.  Embedding CUJ-level metrics and telemetry into all relevant services.  Implementing observability platforms, ensuring full monitoring coverage.  Building actionable dashboards, alerts, and reports using standard observability tools (including OpenTelemetry).  Automating deployments, failover, scaling, and remediation processes.  Eliminating manual work by promoting automation, improved tooling, and optimized workflows.  Leading incident response during outages and conducting root cause analysis.  Developing automated remediation for common failure scenarios.  Participating in on-call rotations and conducting blameless post-mortems with corrective actions.  Must-haves:   Experience in infrastructure teams for 15+ years.  Strong technical background with solid knowledge of software development principles, application production support, SDLC best practices, and Agile methodology.  Hands-on SRE experience with a strong understanding of SLOs, SLIs, error budgets, incident management, and conducting blameless post-mortems.  Strong ability to analyze and understand application architectures and identify areas for improvement.  Experience working with monitoring, logging, and observability tools to assess and improve application performance.  Proficiency in scripting and automation tools, including Python, Bash, and Terraform, to reduce toil and enhance operational efficiency.  Strong incident response and troubleshooting skills with the ability to perform effective root cause analysis.  Excellent communication and collaboration skills, enabling effective interaction with cross-functional teams and clear explanation of technical concepts.  Ability to coach and mentor team members in SRE practices and support the development of a reliability-focused culture.  Practical experience working in Agile teams and applying Agile development practices.  Proactive mindset focused on continuous improvement to increase system reliability and performance.  Level of English – from Intermediate+ and above.  Reasons why this job would be interesting to you:   Experience in teamwork with leaders in FinTech, Healthcare, Retail, Telecom, and others. Andersen cooperates with such businesses as Samsung, Siemens, Johnson & Johnson, BNP Paribas, Ryanair, Mercedes, TUI, Verivox, Allianz, T-Systems, etc..  The opportunity to change the project and/or develop expertise in an interesting business domain.  Job conditions – you can work both fully remotely and from the office or can choose a hybrid variant.  Guarantee of professional, financial, and career growth The company has introduced systems of mentoring and adaptation for each new employee.  The opportunity to earn up to an additional 1,000 USD per month, depending on the level of expertise, which will be included in the annual bonus, by participating in the company's activities.  Access to the corporate training portal, where the entire knowledge base of the company is collected and which is constantly updated.  Bright corporate life (parties / pizza days / PlayStation / fruits / coffee / snacks / movies).  Certification compensation (AWS, PMP, etc).  Referral program.  English courses.  Private health insurance and compensation for sports activities.  Your personal data is protected in accordance with GDPR regulations. Learn more: Join us