CloudOps Engineer | Hyderabad

8 years

0 Lacs

Posted:1 day ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Contractual

Job Description

Cloud Operations Engineer

Key Responsibilities

Operational Excellence & SRE

  • Drive Site Reliability Engineering (SRE) practices, including SLIs, SLOs, SLAs, error budgets, and automation of operational tasks.
  • Manage incident response, root cause analysis, and post-incident reviews to strengthen platform resilience.
  • Build and optimize observability and monitoring frameworks (CloudWatch, Grafana, Loki, Tempo, Prometheus).
  • Implement self-healing systems and automated recovery where possible.
  • Oversee OS patching to ensure no outstanding vulnerabilities, maintaining compliance with security standards.

Hands-on Cloud & Systems Engineering

  • Provision, manage, and troubleshoot AWS services such as EC2, ECS, EKS, Lambda, ELB, S3, EFS, RDS, VPC, and IAM.
  • Administer Linux and Windows operating systems, including hardening, patching, and vulnerability remediation.
  • Troubleshoot complex issues across infrastructure, applications, networks, and operating systems.
  • Deploy and manage container-based workloads (ECS, EKS, Docker).
  • Automate operations using Infrastructure-as-Code (CloudFormation, Terraform) and scripting (Python, Ansible, Bash, PowerShell).
  • Implement and optimize GitLab CI/CD pipelines for operational automation.
  • Support cloud security, IAM, encryption, and compliance standards.

Basic Qualifications

  • 8+ years of experience in cloud operations, engineering, or SRE roles.
  • Strong hands-on expertise with AWS services (EC2, ECS, EKS, Lambda, ELB, S3, EFS, VPC, IAM).
  • Solid experience with Linux and Windows operating systems, including hardening and patching.
  • Proficiency with scripting languages (Python, Ansible, Bash, PowerShell).
  • Hands-on experience in container-based deployments (ECS, EKS, Docker).
  • Proven ability in infrastructure and application troubleshooting.
  • Deep knowledge of SRE principles, including monitoring, incident management, and SLIs/SLOs/SLAs.
  • Strong expertise in GitLab CI/CD and automation frameworks (CloudFormation, Terraform).
  • Working knowledge of cloud security, IAM, and encryption practices.
  • Excellent problem-solving, debugging, and communication skills.

Preferred Qualifications

  • AWS certifications: Solutions Architect - Professional, DevOps Engineer - Professional, or SysOps Administrator.
  • Experience with observability and monitoring tools (CloudWatch, Grafana, Loki, Tempo, Prometheus).
  • Familiarity with multi-cloud or hybrid-cloud operations (AWS and OCI).
  • Experience managing high-scale, high-availability, mission-critical environments.
  • Proven track record of implementing automation, SRE practices, and operational process improvements

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You