10.0 - 12.0 years

40.0 - 50.0 Lacs P.A.

Pune

Posted:1 week ago| Platform: Naukri logo

Apply Now

Skills Required

JavaDevOpsSreDynatraceSpringazureDataDogKubernetes

Work Mode

Work from Office

Job Type

Full Time

Job Description

Key Responsibilities: Collaborate with U.S.-based counterparts to define and monitor service SLOs, SLAs, and key performance indicators. Lead root cause analysis, blameless postmortems, and reliability improvements across environments. Review application code (primarily Java/Spring) to assist in identifying defects and systemic performance issues. Automate deployment pipelines, recovery workflows, and runbook processes to minimize manual intervention. Build and manage dashboards, alerts, and health checks using tools like DataDog, Azure Monitor, Prometheus, and Grafana. Contribute to architectural decisions with a lens on performance and operability. Guide and mentor offshore team members in incident response and production readiness. Participate in 24x7 support rotations aligned with EST coverage expectations. Required Experience & Skills: 10+ years in SRE, DevOps, or platform engineering experience, ideally supporting U.S. enterprise systems. Strong hands-on experience with Java/Spring Boot applications, with the ability to assist in code-level troubleshooting. Cloud infrastructure knowledge (Azure preferred) and container orchestration (Kubernetes). Proficient with logging/monitoring stacks (DataDog, ELK, Azure Monitor, Dynatrace, Splunk). Experience with ServiceNow (SNOW) for ITSM processes. Experience with Terraform or ARM templates, CI/CD automation, and scripting (Python, Bash). Familiarity with Salesforce systems highly preferred. Excellent communication skills and outstanding problem-solving ability in distributed environments. Demonstrated history of improving stability, availability, and delivery velocity for large-scale platforms.

Environmental Services
Baden Ontario +4

RecommendedJobs for You