Site Reliability Engineer II

5 years

15 - 16 Lacs

Posted: 2 days ago | Platform: LinkedIn

Work Mode

On-site

Job Type

Full Time

Job Description

Site Reliability Engineer II, 689

CTC:

Up to 16 LPA

Location:

Chennai (Onsite – Hybrid: 4 days in office)

Type:

Full-time

Job Summary

We are seeking a hands-on Software Engineer with strong SRE exposure to ensure the availability, reliability, and performance of large-scale cloud systems. The role focuses heavily on automation, proactive monitoring, incident management, and performance optimization across distributed environments.

Key Responsibilities

  • Automate manual infrastructure and operational tasks to improve reliability and efficiency.
  • Monitor and manage production environments; identify and resolve issues proactively.
  • Build tooling for access monitoring, session logging, reliability dashboards, and distributed system operations.
  • Collaborate with engineering teams to enhance on-call processes, incident response, and post-mortem analysis.
  • Perform capacity planning, performance optimization, and reliability improvements.
  • Maintain monitoring, alerting, and observability systems for proactive health checks.
  • Improve system stability, security, and performance using data-driven insights.
  • Create and maintain technical documentation, diagrams, and knowledge-sharing materials.

Must-Have Skills

  • 5+ years total IT experience, with 4+ years in Java and/or Python development
  • Strong fundamentals in software engineering and automation
  • Hands-on experience with:
    • Dynatrace (monitoring & observability)
    • GCP (cloud services)
    • Monitoring tools
    • Automation frameworks
    • Testing practices
  • Experience building APIs and working with front-end/back-end components
  • Strong analytical, problem-solving, and communication skills
  • Ability to work onsite (hybrid: 4 days in office)

Nice-to-Have Skills

  • Exposure to SRE practices (incident management, post-mortem, SLIs/SLOs)
  • Knowledge of AI in Operations (AIOps)
  • Understanding of Observability, Cloud Engineering, DevOps
  • System Design fundamentals
  • Application Support experience

Skills: Dynatrace, monitoring tools, automation frameworks, GCP
