Site Reliability Engineer

5 years


Posted: 20 hours ago | Platform: LinkedIn


Work Mode

On-site

Job Type

Full Time

Job Description

Job Title: Site Reliability Engineer

Location: Hyderabad

Experience: 5+ years

Employment Type: Full-Time


About the Role

We are looking for a Site Reliability Engineer (SRE) to support scalable, reliable, and high-performance cloud-native applications on Microsoft Azure.

As an SRE, you will:

  • Design, implement, and maintain our observability stack using OpenTelemetry standards.
  • Ensure the availability, performance, and scalability of production systems.
  • Collaborate with development teams to embed reliability practices, automate operational tasks, and respond to incidents quickly and effectively.


Key Responsibilities

1. Observability & OpenTelemetry (OTEL)

  • Build and manage an observability platform with OpenTelemetry for distributed tracing, metrics, and logs.
  • Instrument applications (Java, Python, Node.js) for end-to-end telemetry.
  • Configure OTEL Collectors to export telemetry to Prometheus, Grafana, Jaeger, Loki, Tempo, Azure Monitor, and Application Insights.
  • Develop custom instrumentation and semantic conventions.
  • Establish robust alerting and anomaly detection using Azure Monitor, Prometheus Alertmanager, etc.
  • Create dashboards (Grafana, Azure Dashboards) for real-time insights.
  • Continuously enhance observability by adopting best practices and new OTEL features.

2. Azure SRE Responsibilities

  • Reliability & Performance: Monitor systems, identify bottlenecks, and implement scaling and optimization strategies.
  • Incident Response: Participate in on-call rotations, lead incident resolution, conduct RCA, and maintain runbooks/playbooks.
  • Automation & IaC: Automate infrastructure and operational tasks using Azure DevOps, Terraform, Azure Bicep, PowerShell, or Bash.
  • CI/CD Integration: Embed reliability and observability checks into CI/CD pipelines.
  • Capacity Planning: Analyze usage patterns, plan for scalability, and optimize Azure resource costs.
  • Security & Compliance: Apply security best practices and ensure compliance.
  • Collaboration: Mentor development teams on SRE and observability practices.


Required Skills & Experience

  • 5+ years in SRE, DevOps, or a related infrastructure role.
  • Strong expertise with OpenTelemetry (instrumentation, collection, processing).
  • Hands-on with Azure cloud services (Monitor, Log Analytics, Application Insights).
  • Proficient with Infrastructure as Code (Terraform, Azure Bicep, ARM).
  • Skilled in scripting/automation (Python, PowerShell, Bash).
  • Experience with Docker and Kubernetes/AKS.
  • Familiarity with observability backends: Grafana, Loki, Tempo, Prometheus, Jaeger.
  • Deep understanding of distributed systems and microservices.
  • Excellent problem-solving, analytical, and communication skills.


Preferred Qualifications

  • Azure certifications (AZ-104, AZ-400).
  • Experience with chaos engineering.
  • Knowledge of SLOs, SLIs, and error budgets.
  • Familiarity with database monitoring (PostgreSQL, Azure SQL).
  • Experience in high-availability or regulated environments.


Education

  • Bachelor’s degree in Computer Science, IT, or a related technical field (or equivalent practical experience).
