Product Analyst

3 - 6 years

9 - 14 Lacs

Posted:5 hours ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

As a junior SRE / Observability Engineer, you will be part of the Atlas Platform Engineering team and will:
  • Participate in the creation and maintenance of observability standards and best practices
  • Participate in the review of the current observability platform.
  • Contribute to expand the observability stack across multiple clouds, regions, and clusters, managing all observability data.
  • Participate in the implementation of monitoring solutions for complex distributed systems.
  • Participate in the evaluation of new capabilities in the observability stack
  • Assist teams in creating clear, informative, and actionable dashboards to improve system visibility.
  • Automate monitoring and alerting processes, including enrichment strategies and ML-driven anomaly detection where applicable.
  • Work closely with R&D and product development teams.
  • Support the definition of SLI (service level indicators) and SLO (service level objectives) for the Atlas services.
  • Participate in the emergency response process
  • Participate in RCAs (root cause analysis)
  • Help to automate repetitive tasks and reduce toil.
Qualifications:
 
People and communication qualifications
  • Be a strong team player
  • Have good collaboration and communication skills
  • Problem-solving and analytical thinking
  • Be curious about how systems work
Technical qualifications - general:
  • Familiarity with cloud platforms (Ideally Azure)
  • Familiarity with Kubernetes and Istio as the architecture on which the observability and Atlas services run, and how they integrate and scale.
  • Knowledge of common programming languages and debugging techniques
  • Linux and scripting languages (Bash, Python, Golang).
  • Interest in DevOps and SRE principles.
Technical qualifications - observability
  • Understanding of observability principles (metrics, logs, traces)
  • Experience with unified observability platforms that enable the use of data (metrics, logs, and traces) to determine quickly if their application or service is operating as desired.
Technical qualifications SRE
  • Understanding of the Google SRE principles
  • SLIs and SLOs, Error Budget, Emergency response, Toil reduction, RCA
  • Experience in incident response

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
PTC logo
PTC

Software Development

Boston Massachusetts

RecommendedJobs for You

pune, maharashtra, india

gurugram, haryana, india