Site Reliability Engineer

METRO Global Solution Center IN

10 years

Pune, Maharashtra, India

Posted: 5 hours ago

Skills Required

reliability, service, finance, support, transactions, drive, efficiency, code, automation, stability, GCP, Docker, Kubernetes, automate, provisioning, Terraform, Kustomize, Helm, monitoring, Datadog, analysis, development, GitHub, database, PostgreSQL, networking, Linux, software engineering, orchestration, configuration management, scripting, troubleshooting, coding, tooling, communication, Agile, Scrum

Work Mode

On-site

Job Type

Full Time

Job Description

About us:

Metro Global Solution Center (MGSC) is the internal solution partner for METRO, a €29.8 billion international wholesaler with operations in 32 countries through 625 stores and a team of 91,000 people globally. METRO operates in a further 10 countries with its Food Service Distribution (FSD) business and is thus active in a total of 34 countries. MGSC is present in Pune (India), Düsseldorf (Germany), and Szczecin (Poland). We provide HR, Finance, IT, and Business Operations support to 31 countries, speak 24+ languages, and process over 18,000 transactions a day. We are setting tomorrow’s standards for customer focus, digital solutions, and sustainable business models. For over 10 years, we have been providing services and solutions from our two locations in Pune and Szczecin. This has allowed us to gain extensive experience in how best to serve our internal customers with high quality and passion. We believe that we can add value, drive efficiency, and satisfy our customers.

Website: https://www.metro-gsc.in
Company Size: 600-650
Headquarters: Pune, Maharashtra, India
Type: Privately Held
Inception: 2011

Role Overview

We are seeking a Senior Site Reliability Engineer with strong experience in building and maintaining scalable, resilient systems. The ideal candidate will have hands-on expertise in cloud-native technologies, infrastructure as code, observability, and automation, with a focus on Google Cloud Platform (GCP).

Key Responsibilities

Ensure the stability and reliability of cloud-native applications deployed on GCP, containerized with Docker and orchestrated via Kubernetes.
Define, implement, and monitor SLOs, SLAs, and SLIs to measure system performance and user experience.
Automate infrastructure provisioning using Terraform and manage Kubernetes configurations with Kustomize and Helm.
Develop and maintain monitoring and alerting systems using Datadog and GCP-native tools.
Conduct incident analysis and postmortems to drive continuous improvement.
Collaborate with development teams to integrate reliability practices into CI/CD pipelines using GitHub Actions.
Manage and troubleshoot database systems, particularly PostgreSQL and Cassandra.
Apply networking knowledge and Linux system administration skills to troubleshoot and optimize system connectivity and performance.

Qualifications

Education

Bachelor’s or Master’s degree in Computer Science, Software Engineering, or equivalent practical experience.

Work Experience & Skills

5+ years of experience in Site Reliability Engineering.
Proven experience designing and operating elastic, resilient systems in cloud environments.
Strong understanding of GCP, Kubernetes, and container orchestration.
Proficiency in infrastructure as code and configuration management tools (Terraform, Helm, Kustomize).
Experience with monitoring and observability tools (Datadog, GCP Monitoring).
Solid scripting skills in Bash and familiarity with automation frameworks.
Experience with CI/CD pipelines, especially using GitHub Actions.
Familiarity with networking fundamentals and troubleshooting.
Strong coding skills and ability to develop reliability-focused tooling.
Excellent communication skills in English (written and spoken).

Other Requirements

Strong problem-solving skills and a process-oriented mindset.
Ability to work independently and collaboratively in a fast-paced environment.
Passion for clean code, automation, and continuous improvement.

Nice-to-Have

Familiarity with monitoring tools (e.g., Datadog, Prometheus, GCP Monitoring).
Experience working in Agile/Scrum teams.
