Jobs

Interviews
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home

Jobs

Home
>
Jobs in Trivandrum
>
UST
>
Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

UST

5 - 7 years

25 - 27 Lacs

Trivandrum

Posted:2 months ago| Platform:

Apply

Skills Required

Service level Networking Infrastructure Vulnerability Operations Distribution system Monitoring SQL Capacity planning Python

Work Mode

Work from Office

Job Type

Full Time

Job Description

Design and implement high-availability systems, ensuring systems are reliable, performant, and scalable. Establish and enforce Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs). Perform root cause analysis for system failures, providing insights and ensuring preventative measures are in place. Proactively manage incidents, ensuring timely resolution and effective post-mortem analysis to prevent recurrence. Automate infrastructure provisioning, deployment pipelines, and operational processes. Build, maintain, and optimize CI/CD pipelines with platforms such as GitLab (preferred), GitHub Actions, Jenkins, etc. Develop and manage Infrastructure as Code (IaC) using tools like Terraform, AWS CDK, and CloudFormation. Champion the adoption of automation and DevOps best practices across teams. Implement and manage enterprise observability tools such as Datadog, Dynatrace (preferred), or Grafana for monitoring, ing, and performance tracking. Establish proactive monitoring and ing systems to ensure the health of applications and infrastructure. Create and maintain robust incident response processes and manage on-call rotations, ensuring efficient handling of incidents. Optimize system performance and capacity planning, ensuring efficient resource utilization. Implement horizontal scaling strategies to ensure systems can handle increasing load. Collaborate with development teams to improve application resilience, optimize performance, and manage system health. Manage and optimize infrastructure in a major cloud platform (AWS, GCP, or Azure). Work with cloud infrastructure tools like Terraform and AWS CDK to provision and manage cloud resources. Implement infrastructure automation and ensure the infrastructure is scalable, reliable, and secure. Ensure security best practices are followed in infrastructure, code, and deployment processes. Conduct regular vulnerability assessments and work with teams to remediate identified risks. Ensure compliance with industry standards and organizational security requirements. Act as a technical leader, providing guidance and mentorship to junior SREs and other team members. Collaborate across development, operations, and product teams to drive a DevOps culture focused on automation, reliability, and efficiency. Advocate for a culture of ownership, continuous improvement, and shared responsibility across teams. Strong experience in a previous SRE role, with a proven track record in maintaining highly available and scalable systems. Expertise in one or more programming languages such as Python, Go, or Java. Deep understanding of distributed systems, networking, and operating systems. Hands-on experience with cloud platforms (AWS, GCP, Azure). Proficiency with enterprise observability tools, such as Datadog, Dynatrace, or Grafana. Extensive experience with CI/CD platforms, such as GitLab (preferred), GitHub Actions, or Jenkins. Experience with cloud infrastructure and automation tools, such as Terraform, AWS CDK, or similar IaC frameworks. Solid understanding of containerization and orchestration tools (e.g., Docker, Kubernetes). Strong knowledge of database management (SQL and NoSQL).

More Jobs at UST

UCC Engineer (Collab)

Trivandrum

5 - 7 yrs

INR 0 - 0 Lacs

Specialist I - Cloud Infrastructure Services - Network Engineer

Trivandrum

12 - 18 yrs

INR 0 - 0 Lacs

Application Packaging - SCCM, Release management

Trivandrum

8 - 12 yrs

INR 0 - 0 Lacs

Lead I - Cloud Infrastructure Services

Trivandrum

5 - 7 yrs

INR 0 - 0 Lacs

SQL Database Engineering

Trivandrum

12 - 15 yrs

INR 0 - 0 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start Service Level Interview Now

My Connections UST

Download Chrome Extension (See your connection in the UST )

Download Now

UST

www.ust.com

IT Services and IT Consulting

Aliso Viejo CA

10001 Employees

1845 Jobs

Key People

Kris Canekeratne

Co-Founder & CEO
Sandeep Reddy

President
Baskar Subramanian

Co-Founder & Chief Strategy Officer
Lynn C. Mclean

Chief Financial Officer

Login to

Please Verify Your Phone or Email

Confirm Action

Search

Profile

Upskill and Grow with AI

Senior Site Reliability Engineer (SRE)