Site Reliability Engineer (SRE) / DevOps Engineer

6 years

0 Lacs

Posted:3 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Company Description

first dedicated SRE/DevOps Engineer



Role Description

mid-level Site Reliability Engineer / DevOps Engineer


first SRE hire


Key Responsibilities:

·      Design, implement, and manage Azure cloud infrastructure using IaC tools (Terraform or ARM/Bicep).

Azure DevOps

Python

·      Optimize infrastructure for cost, security, and performance.

Azure Kubernetes Service (AKS)

·      Manage microservices deployment patterns including rolling, blue/green, and canary releases.

·      Implement best practices around service mesh, ingress, autoscaling, and observability.

·      Maintain container build standards and optimize Docker images.

·      Set up monitoring, logging, tracing, and alerting using tools like Prometheus, Grafana, Loki, ELK, Data Dog or Azure equivalents.

·      Enable SLO/SLI tracking and error-budget management.

·      Conduct root cause analysis (RCAs), performance tuning, and post-incident reviews.

on-call rotation

·      Implement DevSecOps practices within CI/CD and infrastructure.

·      Manage secret storage, code scanning, image scanning, role-based access, and network security.

SOC1, SOC2, and GDPR

·      Work with the engineering team to embed secure coding and deployment standards.

·      Collaborate with Java, Python, and React teams to ensure production deployability and system health.

·      Support PostgreSQL administration (preferably Azure Database for PostgreSQL) including:

o Backup/restore automation

o Performance tuning

o Monitoring & scaling

·      Introduce reliability-focused engineering practices and tools across teams.

·      Help establish a strong SRE foundation for incident management, capacity planning, and automation.

Document best practices, runbooks, and operational playbooks


Qualifications


·      3–6 years of experience in DevOps/SRE roles.

Azure

Kubernetes

Python

·      Solid understanding of distributed systems and cloud-native design.

Java, Python, and React-based

Azure DevOps

  •  PostgreSQL administration experience.
  • Expertise in Site Reliability Engineering and a strong understanding of maintaining system reliability and scaling infrastructure
  • Familiarity with tools for automation, CI/CD pipelines, and monitoring systems
  • Ability to collaborate effectively within cross-functional teams and adapt to dynamic environments
  • Bachelor’s degree in computer science, Information Technology, or related technical discipline is preferred.

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Java Skills

Practice Java coding challenges to boost your skills

Start Practicing Java Now

RecommendedJobs for You