Posted: 4 hours ago
Hybrid
Full Time
Reliability Engineering
• Define and measure SLOs, SLIs, and error budgets for key services.
• Reduce operational load through automation and intelligent alerting.
• Lead and participate in blameless post-incident reviews; turn learnings into improvements.
System Design & Scalability
• Write tools and services in Python, Go, Java, or similar languages (any scripting language) to automate deployment, monitoring, and recovery.
• Build self-healing and auto-scaling systems.
• Maintain high-quality documentation and runbooks through code generation and automation.
Observability & Incident Response
• Develop deep insight into system performance through metrics, tracing, and logs.
• Improve mean time to detect (MTTD) and mean time to recover (MTTR).
• Participate in on-call rotations and continuously refine playbooks and tooling.
Macquarie