Principal Site Reliability Engineer

Cubic Transportation

12 - 18 years

25 - 40 Lacs

hyderabad

Posted:2 months ago| Platform:

Apply

Skills Required

docker site reliability engineering aws observability kubernetes logging reliability engineering continuous delivery sre continuous development ci/cd devops jenkins configuration management terraform continuous integration ansible alerts reliability monitoring

Work Mode

Work from Office

Job Type

Full Time

Job Description

Principal Site Reliability Engineer

Experience: 12 to 18 Years

Location: Hyderabad

Notice Period: Immediate to 30 Days

Key Responsibilities

Design, deploy, and maintain scalable, secure applications and infrastructure in cloud or hybrid environments
Implement and manage robust
monitoring, alerting, and observability systems
Automate recurrent operational tasks using
scripts (e.g., Python) and Infrastructure-as-Code tools (e.g., Terraform)
Collaborate with engineers to build highly available, reliable, deployable systems, establishing guardrails around SLOs, SLIs, and error budgets
Own incident response by participating in on-call rotations, conducting RCAs, and implementing preventive measures and self-healing solutions
Conduct performance tuning, capacity planning, and efficient disaster recovery design for strong Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO)
Reduce manual toil in security compliance and patching processes through automation
Support project teams in troubleshooting and resolving operational issues across development, testing, and production environments
Provide guidance and operational support during project rollouts and infrastructure changes to ensure reliability and uptime
Collaborate with senior stakeholders, internal and external, to communicate technical concepts, resolve problems, and influence decision-making on technical matters
Work closely with the product team to stay informed about evolving system design, business logic, and transaction flows to ensure reliability and operational readiness across services
Identify and address organization-wide gaps in the SRE domain and develop implementable solutions that contribute to reliability and operational excellence

Required Qualifications

Bachelors degree in Computer Science, Engineering, or equivalent
12+ years as an
SRE, DevOps,
or related role managing large-scale solutions or platforms
Proficient in
scripting (PowerShell, Python, Go, Bash)
and solid understanding of coding/development principles
Hands-on experience with
cloud platforms (AWS, GCP, Azure) and container orchestration (Docker, Kubernetes)
Experienced with
monitoring, logging, alerting, and observability tools
Familiar with
CI/CD pipelines and infrastructure tooling (e.g., Jenkins, GitLab CI/CD, Argo CD)
Proficiency in Agile methodologies, such as SCRUM
Strong problem-solving and debugging skills, especially in high-pressure, production-critical environments
Strong collaboration and communication skills

Desired Qualifications

Experience with Terraform and other Infrastructure-as-Code tools
SRE-specific certifications from AWS, GCP, or Azure
Experience shaping and scaling SRE practices
Experience mentoring teams and fostering a strong reliability culture across the organization

More Jobs at Cubic Transportation

System Test Engineer

Hyderabad

3 - 7 yrs

INR 7 - 8 Lacs

Senior Software Test Engineer

Hyderabad

5 - 10 yrs

INR 15 - 17 Lacs

Senior Systems Test Engineer

Hyderabad

5.0 - 10.0 yrs

INR 11 - 13 Lacs

Senior Software Engineer

Hyderabad

1.0 - 2.0 yrs

INR 8 - 12 Lacs

Procurement Buyer

Hyderabad

6.0 - 8.0 yrs

INR 7 - 11 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.