Lead Cloud Operations & Reliability

6 - 10 years

0 Lacs

Posted:1 day ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Cloud Operations Lead at a leading IT R&D organization in Kolkata, your role is crucial in ensuring the stability, performance, and security of cloud-based systems. You will drive operational excellence through proactive monitoring, incident management, automation, and capacity planning. Your responsibilities will include: - Managing day-to-day operations across production, staging, and development cloud environments within an R&D context - Ensuring high availability of services through robust monitoring, alerting, and incident response processes - Leading root cause analyses (RCA) and post-mortem reviews to drive continuous improvement - Implementing observability practices including logging, tracing, and metrics for proactive issue detection - Overseeing patch management and maintenance to ensure systems remain secure and up-to-date In addition, you will be responsible for: - Developing and maintaining automation scripts for provisioning, scaling, and monitoring cloud resources - Optimizing cloud usage through rightsizing, reserved instances, and cost governance (FinOps) - Standardizing operational runbooks and playbooks to streamline processes and reduce manual effort You will need to meet the following qualifications: - Bachelor's degree in Computer Science, IT, or a related field - 5-8 years of experience in cloud operations, SRE, or IT infrastructure - 2+ years in a leadership role managing operational teams, preferably in an R&D environment Your technical skills should include expertise in at least one major cloud platform (AWS, Azure, GCP), hands-on experience with monitoring and observability tools (CloudWatch, Datadog, New Relic, Prometheus), and strong knowledge of Infrastructure as Code (Terraform, CloudFormation, ARM templates). You should also have experience with incident management frameworks (ITIL, SRE principles, PagerDuty/On-Call rotations), familiarity with container orchestration (Kubernetes, ECS, AKS, GKE), and understanding of cloud security best practices and compliance frameworks. Your soft skills should include proven ability to lead and inspire teams in a fast-paced R&D environment, strong problem-solving, decision-making, and communication skills, and a collaborative mindset to work effectively with technical and business stakeholders. Preferred qualifications include cloud certifications (AWS SysOps, Azure Administrator, Google Cloud DevOps Engineer, or equivalent), experience managing multi-cloud environments, knowledge of FinOps and cost governance frameworks, and familiarity with ITIL processes or formal service management frameworks. Your success will be measured by system uptime, incident response time, cost efficiency, level of automation, and team performance.,

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You