Staff Site Reliability Engineer

5 dni temu

Warsaw, Polska VISA Pełny etat

Company Description Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose – to uplift everyone, everywhere by being the best way to pay and be paid. Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa. Job Description Hadoop/Big-Data: Sound knowledge on managing large scale Hadoop platforms including monitoring the platform, debugging issues, and tuning the performance of the cluster. In-depth knowledge of the Hadoop ecosystem, including Zookeeper, HDFS, Yarn, HIVE, SPARK, Trino and Kafka. Proven experience in debugging issues on both Hadoop platform and applications. Familiarity with security tools such as Kerberos, Ranger, and active directory integrations. Experience on Cloud technologies preferably AWS EMR. Knowledge on Kubernetes, AI, MLOPS will be advantageous. Collaboration and Teamwork: Collaborate closely with L-3 teams to review new use cases and implement cluster hardening techniques, ensuring the development of robust and reliable platforms. Foster cross-team collaboration, building and maintaining strong relationships with customer teams, user communities, architects, and engineering teams. Work jointly on key deliverables to ensure production scalability and stability. Automation: Hands-on Experience with automations using Ansible, Shell, python, or any programming languages. The ability to automate the manual tasks is key in this role. Observability: knowledge on observability tools like Grafana, opera, Prometheus and Splunk. Linux: understanding of Linux, networking, CPU, memory, and storage. Programming Languages: Knowledge of and ability to code or program in one of python, Java or a widely used coding language. Communication: Excellent interpersonal skills, along with superior verbal and written communication abilities. This position is not ideal for a Hadoop developer. This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager. Qualifications Basic Qualifications: As a Staff Site Reliability Engineer, you will play a key role in maintaining and supporting Visa's Data Platform, ensuring the reliability and performance of critical Big Data systems. You will drive innovation for our partners and clients globally by working on open-source Big Data clusters, optimizing their availability, efficiency, and scalability. Education & Experience: Master's degree in Math, Science, Engineering, Computer Science, Information Systems, or a related field; OR Bachelor's degree in Math, Science, Engineering, Computer Science, Information Systems, or a related field, AND a minimum of five years of relevant experience; OR A minimum of five years of experience working with Hadoop systems. Preferred Qualifications: Experience in Big Data SRE and Engineering across open-source platforms such as Hadoop, Kafka, HBase, and Spark, with strong troubleshooting and debugging skills. Proven ability to conduct effective root cause analysis of major production incidents, document findings, and implement high-availability solutions for critical services. Expertise in capacity planning, system expansions, and timely upgrades to mitigate scaling challenges, while automating repetitive tasks to reduce manual effort and prevent errors. Ability to fine-tune alerting and set up observability tools to proactively identify and resolve performance issues, collaborating with Level-3 teams on use case reviews and cluster hardening. Strong documentation skills to create standard operating procedures and platform utilization guidelines, ensuring consistency and efficiency in operations. Proficiency in leveraging DevOps tools and industry best practices, including incident, problem, and change management disciplines. Commitment to ensuring Hadoop platform performance meets service-level agreements, with experience in security remediation, automation, and self-healing implementations. Experience in developing automation tools and reports to streamline processes, using technologies such as Shell scripting, Ansible, Python, or other programming languages. Additional Information Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.

Staff Site Reliability Engineer

3 dni temu

Warsaw, Polska Visa Technology Europe sp. z o.o. Pełny etat

Staff Site Reliability Engineer - (Hadoop) Miejsce pracy: Warszawa Technologies we use Expected Kubernetes AWS Operating system Linux Your responsibilities Hadoop/Big-Data: •Sound knowledge on managing large scale Hadoop platforms including monitoring the platform, debugging issues, and tuning the performance of the cluster. •In-depth knowledge of the...
Staff Software Engineer

2 tygodni temu

Warsaw, Polska Google Pełny etat

Staff Software Engineer - Site Reliability Engineering Miejsce pracy: Warszawa Technologies we use Operating system - Windows About the project Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our...
Staff Software Engineer

2 tygodni temu

Warsaw, Polska Google Pełny etat

Staff Software Engineer - Site Reliability Engineering Miejsce pracy: Warszawa Technologies we use Operating system Windows About the project Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally...
Site Reliability Engineer

3 dni temu

Warsaw, Polska VISA Pełny etat

Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network,...
Site Reliability Engineer Azure

5 dni temu

Warsaw, Polska Connectis Pełny etat

Wspólnie z naszym Partnerem, renomowaną amerykańską firmą specjalizującą się w produkcji artykułów konsumenckich, poszukujemy doświadczonego specjalisty na stanowisko Site Reliability Engineer (Azure). Projekt dotyczy wdrożenia i rozwoju jednolitej platformy observability, która zbiera metryki, logi i trace z całej infrastruktury i aplikacji w...
Senior Site Reliability Engineer

4 tygodni temu

Warsaw, Polska TQLO SPÓŁKA Z OGRANICZONĄ ODPOWIEDZIALNOŚCIĄ Pełny etat

Nasz Klient to międzynarodowa organizacja rozwijająca nowoczesną, wysokodostępną platformę digital obsługiwaną przez miliony użytkowników. Projekt koncentruje się na budowie i utrzymaniu skalowalnej infrastruktury chmurowej, automatyzacji procesów, poprawie niezawodności oraz wdrażaniu dobrych praktyk Site Reliability Engineering (SRE). Szukamy...
Senior Site Reliability Engineer

3 dni temu

Warsaw, Polska VISA Pełny etat

Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network,...
Senior Site Reliability Engineer

4 tygodni temu

Warsaw, Polska DCG Pełny etat

As a recruitment company, DCG understands that every business is powered by experienced professionals. Our management style and partnership approach enable us to meet your needs and provide continuous support. Due to our ongoing growth and the large number of recruitment projects we undertake for our partners, we are currently looking for: Senior Site...
Data Site Reliability Engineer

4 tygodni temu

Warsaw, Polska Cyclad Pełny etat

Data Site Reliability Engineer Miejsce pracy: Warszawa Technologies we use Expected PostgreSQL Kafka TimescaleDB MongoDB GitOps Optional Kubernetes Google Cloud Platform About the project In Cyclad we work with top international IT companies in order to boost their potential in delivering outstanding, cutting edge technologies that shape the world of the...
Staff Software Engineer Site Reliability Engineering

3 dni temu

Warsaw, Polska Google Pełny etat

Minimum qualifications: Bachelor's degree in Computer Science, a related field, or equivalent practical experience. 8 years of experience with software development in one or more programming languages. 3 years of experience leading projects. 3 years of experience designing, analyzing, and troubleshooting distributed systems. Preferred qualifications:...

Ameryka

Europa

Azja / Oceania

Afryka

Staff Site Reliability Engineer