Site Reliability Engineer

7 - 12 years

15 - 25 Lacs

Posted:7 hours ago| Platform: Naukri logo

Apply

Work Mode

Hybrid

Job Type

Full Time

Job Description

Site Reliability Engineer

Primary Skills (Must-Have)

  • AWS (core services: EC2, EKS, Lambda, Redshift, S3, IAM, VPC)
  • CI/CD (Jenkins, GitHub Actions, AWS CodePipeline)
  • Infrastructure as Code (Terraform, CloudFormation)
  • Kubernetes (EKS) and container orchestration

Secondary Skills (Good-to-Have)

  • AWS Systems Manager, Dataiku platform operations
  • Experience with platform patching, upgrades, and maintenance

Tools & Platforms

  • Data Warehousing & Processing

    : Snowflake, Redshift, Apache Airflow, dbt
  • CI/CD & Deployment

    : Jenkins, GitHub Actions, AWS CodePipeline, Terraform
  • Cloud & Event Processing

    : AWS Lambda, API Gateway, SNS/SQS, Kafka, Step Functions
  • Monitoring & Logging

    : DataDog, AWS CloudWatch, Prometheus, Splunk
  • Incident Management

    : PagerDuty, Opsgenie, AWS Health Dashboard
  • Collaboration & Code Review

    : GitHub, Jira, Confluence

Key Responsibilities

Data Pipeline Reliability & Observability

  • Maintain highly available, fault-tolerant infrastructure for ETL jobs and real-time data processing
  • Implement monitoring of Airflow DAGs, Snowflake queries, and AWS data workflows
  • Automate health checks, error handling, and self-healing for data pipelines

Infrastructure & Cloud Automation

  • Deploy and manage AWS-based infrastructure with Terraform & CloudFormation
  • Optimize Kubernetes (EKS) clusters for scale and cost efficiency
  • Support scaling and reliability for Redshift, Snowflake, and storage solutions

Performance, Monitoring & Incident Response

  • Build real-time monitoring, logging, and alerting with DataDog, CloudWatch, and Prometheus
  • Define & track SLOs/SLIs to improve data platform uptime
  • Perform RCA, post-mortems, and security audits after incidents

Security & Compliance

  • Ensure compliance with GDPR, CCPA, SOC 2 across data pipelines
  • Apply AWS security best practices (IAM, KMS, Shield, WAF)
  • Secure API Gateways, data access policies, and encryption standards

Collaboration & Leadership

  • Partner with data engineers, analytics, and DevOps teams to improve reliability
  • Participate in DR (Disaster Recovery) planning and security compliance reviews
  • Promote best practices in automation, observability, and cost optimization

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Sonata Software logo
Sonata Software

Information Technology and Services

Bangalore

RecommendedJobs for You

chennai, tamil nadu, india