Posted: 2 weeks ago | Platform: LinkedIn

Work Mode

On-site

Job Type

Full Time

Job Description

Company Size: Large-scale / Global
Experience Required: 1 - 3 years
Working Days: 5 days/week
Office Location: Bellandur, Bengaluru; Maharashtra, Pune

Role & Responsibilities

We are looking for a dedicated Site Reliability Engineer (SRE) - Cloud Ops to join our team. In this role, you will play a key part in ensuring the stability and scalability of our cloud infrastructure. You will be responsible for monitoring, troubleshooting, and resolving infrastructure and application alerts, managing pipelines, and addressing environment-related issues in a dynamic 24/7 operational environment.

Key Responsibilities

Infrastructure Monitoring and Alert Response: Proactively monitor infrastructure and application alerts, ensuring prompt resolution to maintain uptime and performance.
Shift-Based Operations: Work in a 24/7 environment with flexible availability for rotational shifts.
Cloud Environment Management: Manage and resolve environment-related issues, focusing on stability and efficiency.
Pipeline Management: Oversee CI/CD pipelines and ensure smooth deployment of updates and releases.
Operational Tasks: Execute day-to-day operational activities, including incident management, change management, and maintaining operational excellence.
Tool Management: Utilize tools like Kubernetes, PagerDuty, and GCP Cloud to support operational activities.

Ideal Candidate

B.E/B.Tech graduate with 2+ years of experience in Site Reliability, Cloud Ops
Monitoring and Alerting Expertise: In-depth knowledge of monitoring tools (Prometheus, Grafana, ELK), alert systems, and resolving related issues promptly.
Kubernetes: Hands-on experience with Kubernetes for orchestration and container management.
PagerDuty: Proficiency in setting up and managing alerting systems.
Cloud Fundamentals: Basic understanding of GCP (Google Cloud Platform) services and operations.
Incident Management: Strong problem-solving skills and experience in handling critical incidents under pressure.
DevOps Processes: Basic knowledge of CI/CD pipelines, automation, and infrastructure-as-code practices.

Skills: monitoring tools (prometheus, grafana, elk), incident management, infrastructure, management, gcp (google cloud platform), cd, infrastructure as code, gcp, ci, kubernetes, pagerduty, automation, application, ci/cd pipelines, pipelines, cloud, basic
