Deputy Director - DPA SRE Principal Engineer

11 - 15 years

0 Lacs

Posted:17 hours ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a DPA SRE Principal Engineer at PepsiCo, your role is crucial in driving new shift left activities to apply Site Reliability Engineering (SRE) and quality assurance principles within the application design & architecture. You will be responsible for ensuring a high level of resiliency during operations and continuously improving through design throughout the software development lifecycle. Your main purpose is to provide a delightful customer experience for the user of the global consumer, commercial, supply chain, and enablement functions in the PepsiCo digital products application portfolio. **Key Responsibilities:** - Ensure ecosystem availability and performance in production environments, proactively preventing P1, P2, and potential P3 incidents. - Engage and influence product and engineering teams to embed reliability and operability into new services by defining and enforcing events, logging, monitoring, and observability standards. - Lead the team in diagnosing anomalies prior to any user impact and drive necessary remediations across the end-to-end ecosystem availability, performance, and consumption of the cloud architected application ecosystem. - Collaborate with Engineering & support teams, participate in escalations and postmortems, and empower customer-facing support teams with SRE insights and tooling. - Continuously optimize L2/support operations work via SRE workflow automation and drive AI Ops adoption across teams. - Be the key architect for the SRE orchestration platform design with inputs from various teams. **Qualifications:** - 11-15 years of work experience evolving to a SRE engineer with 3-5 years of experience in continuously improving and transforming IT operations ways of working. - Bachelors degree in Computer Science, Information Technology, or a related field. - Proven experience in designing events diagnostics, performance measures, and alert solutions to meet SLA/SLO/SLIs. - Strong expertise in SRE and IT Service Management processes with a track record for improving service offerings and proactively resolving incidents. - Hands-on experience in Python, SQL/No-SQL databases, monitoring tools like AppDynamics, ELK Stack, Grafana, Splunk, Dynatrace, Kafka, and other SRE Ops toolsets. - Firm understanding of cloud architecture for distributed environments and proficiency in front-end and back-end technologies. - Prior experience in shaping transformation and developing SRE solutions would be a plus.,

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You