Staff Site Reliability Engineer (SRE)

5 - 9 years

0 Lacs

Posted: 1 day ago | Platform: Shine

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview: As a Staff SRE at The Modern Data Company, you will be joining the CTO Office to contribute to enhancing the reliability, scalability, and efficiency of platforms in multi-cloud environments. Your role will involve shaping observability, automation, and operational excellence by collaborating with platform engineering and product teams to establish robust monitoring systems.

Key Responsibilities:
- Design and maintain observability stacks (Prometheus, Thanos, Grafana) to monitor real-time system health and alerting.
- Implement APM and log analytics to reduce MTTR and proactively manage system performance.
- Ensure high availability and scalability across cloud regions by working closely with product and platform teams.
- Automate operational processes such as incident response, infrastructure provisioning, and deployments using scripting and Infrastructure-as-Code.
- Establish resilient CI/CD pipelines to support rapid and safe deployments.
- Optimize infrastructure usage and cost efficiency across AWS, Azure, and GCP.
- Lead incident response, conduct Root Cause Analysis (RCA), and drive systemic fixes across teams.
- Define and document playbooks, reliability standards, and best practices.

Qualifications Required:
- 5+ years of experience in SRE or DevOps roles with production ownership.
- Profound knowledge of observability tools such as Prometheus, Thanos, Grafana, and logging frameworks.
- Strong expertise in Kubernetes, Docker, Terraform, and cloud-native architectures.
- Demonstrated experience in automating workflows using Python, Bash, or similar scripting languages.
- Hands-on experience with AWS, Azure, and/or GCP.
- Problem-solving mindset with a focus on automation and scalability.
- Effective communication skills and a continuous learning and collaborative approach.

Please note that no additional details about the company were provided in the job description.
