Site Reliability Engineer with Microsoft Azure

7 dni temu


Kraków, Lesser Poland Andersen Lab Pełny etat 60 000 zł - 120 000 zł rocznie

Summary

Andersen is hiring a Site Reliability Engineer with Microsoft Azure in India to drive reliability and performance for large-scale digital insurance platforms, enhancing integrations, optimizing cloud systems, and ensuring stable, high-quality service delivery.

The customer is a well-established global organization providing financial protection and risk-management services across various markets. With a diverse portfolio and teams operating in multiple regions, the company supports businesses and individuals through reliable, scalable solutions.

The project focuses on enhancing large-scale digital platforms, improving cloud performance, optimizing integrations, and modernizing systems to support efficient service delivery and ongoing expansion.

Responsibilities

  • Ensuring high availability, performance, and scalability of cloud infrastructure through proactive monitoring, automation, and continuous improvement.
  • Designing and maintaining resilient Azure-based infrastructure using IaC (Terraform).
  • Implementing end-to-end observability with telemetry, CUJ-level metrics, dashboards, alerts, and real-time performance insights.
  • Monitoring Critical User Journeys with product and business teams to maintain a reliable user experience.
  • Conducting load testing, capacity planning, and performance tuning to prepare systems for traffic growth and spikes.
  • Managing SLIs, SLOs, SLAs, and error budgets across critical services.
  • Implementing next-generation cloud reliability and fault-tolerance solutions, including disaster recovery improvements.
  • Identifying risks and preventing service disruptions through proactive reliability engineering.
  • Automating deployments, scaling, failover, and remediation to reduce manual toil and operational bottlenecks.
  • Leading incident response, participating in on-call rotations, conducting root cause analysis, and delivering blameless post-mortems.
  • Creating and maintaining runbooks, documentation, and operational guidelines.
  • Collaborating with engineering and global teams on reliability best practices; mentoring junior SREs and supporting SRE hiring.

Requirements

  • Experience as an SRE in cloud and infrastructure teams for 6+ years.
  • Extensive experience with Microsoft Azure cloud services and infrastructure management for a minimum of 5+ years.
  • Strong technical background with solid knowledge of software development principles, application production support, SDLC best practices, and Agile methodology.
  • Hands-on SRE experience with a strong understanding of SLOs, SLIs, error budgets, incident management, and conducting blameless post-mortems.
  • Strong ability to analyze and understand application architectures and identify areas for improvement.
  • Experience working with monitoring, logging, and observability tools to assess and improve application performance.
  • Proficiency in scripting and automation tools, including Python, Bash, and Terraform, to reduce toil and enhance operational efficiency.
  • Strong incident response and troubleshooting skills with the ability to perform effective root cause analysis.
  • Excellent communication and collaboration skills for working with cross-functional teams and clearly explaining technical concepts.
  • Ability to coach and mentor team members in SRE practices and foster a culture of reliability.
  • Practical experience applying Agile development practices and working in Agile teams.
  • Proactive mindset focused on continuous improvement to increase system reliability and performance.
  • Level of English – from Intermediate+ and above.

Desired skills

  • Additional certifications in cloud computing, DevOps, or SRE practices.
  • Microsoft Azure certifications such as Azure Administrator, Azure DevOps Engineer, or Azure Solutions Architect.

Reasons to join us

  • Experience in teamwork with leaders in FinTech, Healthcare, Retail, Telecom, and others. Andersen cooperates with such businesses as Samsung, Siemens, Johnson & Johnson, BNP Paribas, Ryanair, Mercedes, TUI, Verivox, Allianz, T-Systems, etc..
  • The opportunity to change the project and/or develop expertise in an interesting business domain.
  • Job conditions – you can work both fully remotely and from the office or can choose a hybrid variant.
  • Guarantee of professional, financial, and career growth The company has introduced systems of mentoring and adaptation for each new employee.
  • The opportunity to earn up to an additional 1,000 USD per month, depending on the level of expertise, which will be included in the annual bonus, by participating in the company's activities.
  • Access to the corporate training portal, where the entire knowledge base of the company is collected and which is constantly updated.
  • Bright corporate life (parties / pizza days / PlayStation / fruits / coffee / snacks / movies).
  • Certification compensation (AWS, PMP, etc).
  • Referral program.
  • English courses.
  • Private health insurance and compensation for sports activities.

Join us



  • Kraków, Lesser Poland Andersen Lab Pełny etat

    SummaryAndersen is hiring a Site Reliability Engineer with Microsoft Azure in India to drive reliability and performance for large-scale digital insurance platforms, enhancing integrations, optimizing cloud systems, and ensuring stable, high-quality service delivery.The customer is a well-established global organization providing financial protection and...

  • Site Reliability Engineer

    2 tygodni temu


    Kraków, Lesser Poland Andersen Lab Pełny etat 65 000 zł - 125 000 zł rocznie

    SummaryThe international IT сompany Andersen invites a Site Reliability Engineer to join our dynamic and highly skilled professional team.Andersen is a pre-IPO software development company that provides a full cycle of services, following project management standards and best practices. For over 18 years, we have been helping enterprises and middle-sized...


  • Kraków, Lesser Poland AVSystem Pełny etat 80 000 zł - 120 000 zł rocznie

    We build, test, launch, and operate the complex, high-stakes systems for our global telco customers. Your mission is to ensure the reliability, efficiency, and performance of our core products (like UMP, CEM, BSAP, and DHCP) across both our cloud and complex on-premise deployments.This is no small task. Our products handle hundreds of millions of devices in...


  • Kraków, Lesser Poland VGW Pełny etat

    Senior Site Reliability EngineerVGW is an interactive entertainment company, harnessing technology and creativity to deliver world-class, free-to-play online social games. We have an exciting opportunity to join our Engineering team in Poland and are currently looking for a Senior Site Reliability Engineer to join the team.You'll focus on ensuring the...


  • Kraków, Lesser Poland VGW Pełny etat

    Senior Site Reliability EngineerVGW is an interactive entertainment company, harnessing technology and creativity to deliver world-class, free-to-play online social games. We have an exciting opportunity to join our Engineering team in Poland and are currently looking for a Senior Site Reliability Engineer to join the team.You'll focus on ensuring the...


  • Kraków, Lesser Poland Aras Corporation Pełny etat 80 000 zł - 120 000 zł rocznie

    Aras is a leader in product lifecycle management (PLM) and digital thread solutions. As one of the fastest growing PLM companies, our technology enables the rapid delivery of flexible solutions built on a powerful digital thread backbone and a low-code development platform.Our platform and PLM applications connect users in all disciplines to critical product...


  • Kraków, Lesser Poland Motorola Solutions Pełny etat 60 000 zł - 120 000 zł rocznie

    Company OverviewAt Motorola Solutions, we believe that everything starts with our people. We're a global close-knit community, united by the relentless pursuit to help keep people safer everywhere. Our critical communications, video security and command center technologies support public safety agencies and enterprises alike, enabling the coordination that's...

  • Site Reliability

    2 tygodni temu


    Kraków, Lesser Poland Canonical - Jobs Pełny etat 80 000 zł - 120 000 zł rocznie

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...


  • Kraków, Lesser Poland Motorola Solutions Pełny etat 85 000 zł - 130 000 zł rocznie

    Company OverviewAt Motorola Solutions, we believe that everything starts with our people. We're a global close-knit community, united by the relentless pursuit to help keep people safer everywhere. Our critical communications, video security and command center technologies support public safety agencies and enterprises alike, enabling the coordination that's...


  • Kraków, Lesser Poland Andersen Lab Pełny etat

    SummaryAndersen is hiring a Site Reliability Engineer in India to drive reliability and performance for large-scale digital insurance platforms, enhancing integrations, optimizing cloud systems, and ensuring stable, high-quality service delivery.The customer is a well-established global organization providing financial protection and risk-management services...