Data Analytics

3 years

0 Lacs

Posted: 1 day ago | Platform: LinkedIn

Work Mode: On-site

Job Type: Full Time

Job Description

Job Title:

Location:

Employment Type:


As an SRE in the Analytics domain, you will bridge the gap between data engineering, observability, and reliability operations — helping teams make data-driven decisions to improve availability, performance, and operational excellence.

Key Responsibilities

  • Design, build, and maintain data pipelines and analytics platforms that deliver actionable insights into system performance, availability, and reliability.
  • Develop dashboards and visualizations (Grafana, Tableau, Looker, etc.) to monitor key SRE metrics such as SLOs, SLIs, error budgets, and capacity trends.
  • Partner with SREs, DevOps, and product teams to analyze incident data, automate reporting, and identify recurring reliability issues.
  • Support data ingestion and aggregation from multiple sources (CloudWatch, Prometheus, ELK, Datadog, Splunk, etc.) into a unified analytics layer.
  • Build and maintain automation scripts and data models for performance and reliability insights.
  • Drive post-incident analysis and trend reporting to inform long-term reliability improvements.
  • Collaborate with business stakeholders to translate operational data into measurable reliability KPIs.
  • Ensure data quality, governance, and availability for all reliability-related analytics systems.


Qualifications

Required Skills:

  • Bachelor's or master's degree in computer science or a related field.
  • 3+ years of experience as an SRE, DevOps Engineer, or Data Engineer in large-scale, cloud-native environments.
  • Strong experience with AWS, GCP, or Azure platforms.
  • Hands-on experience with data analytics tools (e.g., Python, SQL, Pandas, PySpark).
  • Familiarity with monitoring and observability stacks (e.g., Prometheus, Grafana, ELK, Datadog).
  • Strong understanding of SRE concepts: SLIs, SLOs, error budgets, and incident management.
  • Excellent problem-solving, analytical, and communication skills.


Preferred Skills:

  • Experience with data visualization tools (Grafana, Power BI, Tableau, Looker).
  • Familiarity with machine learning for anomaly detection in reliability data.
  • Experience automating reporting and dashboards using AWS Glue, Athena, or Lambda.
  • Knowledge of CI/CD pipelines and Infrastructure as Code (Terraform, CloudFormation).
  • Familiarity with ITSM/CMDB systems and integrating reliability analytics into operational workflows.
