Site Reliability Engineer

3 weeks ago


Gdańsk, Trójmiasto, Poland Hard Rock Digital Full-time 31 zł

Location: Poland only, fully remote
Job Type: B2B, full time

Overview
Hard Rock Digital is a team focused on becoming the best online sportsbook, casino, and social gaming company in the world. We care about each customer's interaction, experience, behaviour, and insight and strive to ensure we’re always acting authentically. Rooted in the kindred spirits of the Seminole Tribe of Florida, the new Hard Rock Digital taps a brand known all over the world as the leader in gaming, entertainment, and hospitality. We’re taking that foundation of success and bringing it to the digital space.

What’s the position?
We are looking for a skilled Site Reliability Engineer (SRE) to maintain and improve the reliability, scalability, and performance of our Java-based application. You will be responsible for managing and monitoring the applications and infrastructure, using the Grafana stack (Grafana, Loki, Prometheus) to ensure a high level of observability, and implementing robust monitoring, alerting, and logging solutions.
Key Responsibilities:

Application Reliability & Performance:
  • Ensure the availability, reliability, and performance of a high-traffic Java-based application in a distributed environment.
  • Troubleshoot and resolve complex issues in production and non-production environments.
  • Participate in both pre- and post-deployment performance testing and monitoring efforts to improve application performance.
  • Optimize Java application performance, ensuring efficient resource utilization and scaling.

Monitoring & Observability:
  • Deploy and manage the Grafana stack (Grafana, Prometheus, Loki) to provide real-time monitoring, logging, and alerting.
  • Implement and refine observability strategies to enhance application and infrastructure visibility.
  • Create and maintain dashboards, alerts, and logs for comprehensive monitoring of system health and performance.

Incident Management & Root Cause Analysis:
  • Support the operations team’s incident response efforts, participate in post-mortems, and identify root causes of issues to prevent recurrence.
  • Document and share lessons learned from incidents, contributing to a culture of continuous improvement.

Collaboration & Cross-functional Support:
  • Work closely with developers, architects, and other engineers to design and implement solutions that improve application reliability.
  • Collaborate closely with DevOps and NOC teams to support the application platform.
  • Communicate SRE practices and principles to technical and non-technical stakeholders.
  • Provide feedback and insights on application performance, potential improvements, and observability metrics.

Requirements

What are we looking for?
The ideal candidate will have:
  • Degree in computer science or a related field, or equivalent work experience
  • 2-3 years in SRE, DevOps, or similar infrastructure roles
  • Experience managing large-scale, high-availability production systems
  • Track record of incident response and post-mortem processes
  • Experience with capacity planning and performance optimization
  • 1+ years hands-on experience managing production Kubernetes clusters
  • Deep understanding of k8s architecture, networking, storage, and security
  • Experience with cluster scaling (Karpenter), upgrades, and multi-cluster management
  • Proficiency with kubectl, Helm, and Kubernetes operators
  • Container orchestration and troubleshooting knowledge
  • Expertise with the Grafana stack for dashboards, alerting, and visualization
  • Hands-on experience with Grafana Alloy for telemetry data collection
  • Proficiency in PromQL
  • Experience with Loki for log aggregation and analysis
  • Experience building comprehensive monitoring and alerting strategies
  • Hands-on experience managing Java-based applications in large-scale, distributed environments, with a focus on JVM tuning and application optimization
  • Cloud platform expertise (AWS, GCP, or Azure)
  • Familiarity with infrastructure as code (IaC) tools like Terraform/Terragrunt or Ansible
  • ArgoCD proficiency for GitOps workflows and continuous deployment
  • Scripting abilities in Bash, Python, or Go
  • Experience with CI/CD pipelines and automation tools
  • Configuration management and deployment automation
  • Strong troubleshooting skills, with a proactive approach to diagnosing and resolving performance bottlenecks
  • Proven experience in on-call rotations, incident response, and root cause analysis
  • Strong communication skills (both written and verbal), positive attitude, and ability to receive constructive feedback



  • Gdańsk, Trójmiasto, Poland eSky.pl Full-time 21 zł - 840 zł

    A passion for travel is what unites the entire eSky Group (eSky, eDestinos, Thomas Cook). Our platform grew out of many years of experience in the travel industry combined with a love of modern technology. We build solutions that inspire people to explore the world, and together we turn that inspiration into experiences. Do...

  • Site Reliability Engineer

    4 weeks ago


    Gdańsk, Trójmiasto, Poland Fibertide Full-time 20 zł

    Fibertide employs engineers with strong mathematical and computer science backgrounds. We design, maintain and develop large-scale cloud systems for companies from the U.S. and Europe. Our customers include both ambitious startups and established businesses that require assistance improving their technologies for large userbases, big datasets and rapid...

  • Senior DevOps Engineer

    1 week ago


    Gdańsk, Trójmiasto, Poland KUBO Full-time

    For our client – a global technology company building and operating large-scale data and analytics platforms – we are looking for a Senior DevOps Engineer. You’ll join an international team supporting cloud-based solutions used across multiple business areas. Your focus will be on improving reliability, automation, and deployment processes within a...

  • Middle DevOps Engineer

    1 week ago


    Gdańsk, Trójmiasto, Poland Ciklum Full-time 3 zł - 600 zł

    Salary range: B2B - 25-28 E/h + VAT. Ciklum is looking for a Middle DevOps Engineer to join our team full-time in Poland. We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants,...


  • Gdańsk, Trójmiasto, Poland Kubo Full-time 25 zł - 200 zł

    We are collaborating with a global aviation technology company to help them find a skilled Senior Software Engineer (C++) who will join their growing engineering team in Poland. The role focuses on developing advanced software solutions used daily by flight crews and ground operations teams worldwide. Take a look at the details below, and if you think this...


  • Gdańsk, Trójmiasto, Poland Ciklum Full-time 5 zł - 100 zł

    Salary range: B2B - 35-40 E/h + VAT. Ciklum is looking for a Senior JavaScript Engineer to join our team full-time in Poland. We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers,...


  • Gdańsk, Trójmiasto, Poland Ciklum Full-time 3 zł - 500 zł

    Salary range: B2B - 24-27 E/h + VAT. Ciklum is looking for a Middle JavaScript Engineer to join our team full-time in Poland. We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants,...

  • Senior QA Engineer

    4 weeks ago


    Gdańsk, Trójmiasto, Poland ERGO Technology & Services Full-time

    About Us: ERGO Technology & Services S.A. (ET&S S.A.) was established in January 2021 following the integration of ERGO Digital IT and Atena into one entity, leveraging both companies’ strengths and best practices. As a part of ERGO Technology & Services Management AG, the technology holding of ERGO Group AG, we support millions of internal and external...

  • IT Operations Engineer

    4 weeks ago


    Gdańsk, Trójmiasto, Poland emagine Polska Full-time 24 zł - 360 zł

    Industry: Banking
    Location: hybrid model 2-3x/week (Gdańsk/Gdynia)
    Remuneration: up to 155 zł/h net + VAT
    Type of contract: B2B
    Duration: Long-term
    Introduction & Summary: We are seeking a skilled Developer in the role of an IT Operations Engineer to support a highly available application delivering data to multiple downstream systems. The candidate must possess a...

  • Expert AI Engineer

    4 weeks ago


    Gdańsk, Trójmiasto, Poland Ciklum Full-time 7 zł - 200 zł

    Salary range: B2B - 50-52 E/h + VAT. Ciklum is looking for an Expert AI Engineer to join our team full-time in Poland. We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants, analysts...