At PwC, our people in infrastructure focus on designing and implementing robust, secure IT systems that support business operations. They enable the smooth functioning of networks, servers, and data centres to optimise performance and minimise downtime. Those in cloud operations at PwC will focus on managing and optimising cloud infrastructure and services to enable seamless operations and high availability for clients. You will be responsible for monitoring, troubleshooting, and implementing industry leading practices for cloud-based systems.
Driven by curiosity, you are a reliable, contributing member of a team. In our fast-paced environment, you are expected to adapt to working with a variety of clients and team members, each presenting varying challenges and scope. Every experience is an opportunity to learn and grow. You are expected to take ownership and consistently deliver quality work that drives value for our clients and success as a team. As you navigate through the Firm, you build a brand for yourself, opening doors to more opportunities.
Skills
Examples of the skills, knowledge, and experiences you need to lead and deliver value at this level include but are not limited to:
- Apply a learning mindset and take ownership for your own development.
- Appreciate diverse perspectives, needs, and feelings of others.
- Adopt habits to sustain high performance and develop your potential.
- Actively listen, ask questions to check understanding, and clearly express ideas.
- Seek, reflect, act on, and give feedback.
- Gather information from a range of sources to analyse facts and discern patterns.
- Commit to understanding how the business works and building commercial awareness.
- Learn and apply professional and technical standards (e.g. refer to specific PwC tax and audit guidance), and uphold the Firm's code of conduct and independence requirements.
Job Title: Site Reliability Engineer (SRE) – Associate
Location: Bangalore (Hybrid)
Department: Managed Services – Core Automation Team
Job Overview
We’re seeking a Senior Associate with deep hands-on experience in scripting, automation, and RPA to help build intelligent, resilient systems across Managed Services. You’ll work at the intersection of platform reliability and automation—developing scripts, automating runbooks, and integrating low-code/no-code solutions to eliminate manual work and improve operational efficiency. This role is ideal for someone who thrives in solving real-world production challenges with code, automation, and curiosity.
Key Responsibilities
- Automate repetitive infrastructure and application support activities using scripting (Python, Bash, PowerShell) and RPA/low-code platforms.
- Develop and maintain scripts and reusable components to drive system configuration, monitoring, and auto-remediation.
- Build self-healing workflows to identify and resolve issues proactively—minimizing human intervention.
- Integrate observability and alerting tools with automation pipelines to enable real-time anomaly detection and resolution.
- Leverage low-code/no-code automation platforms (e.g., Power Automate, UiPath, Automation Anywhere) to streamline manual business processes.
- Collaborate with operations, engineering, and platform teams to build reliable automation frameworks and support scaled delivery.
- Use GenAI and AI-driven tools to enhance decision automation and support proactive operations management.
- Create and maintain runbooks and documentation that evolve into automation-first playbooks.
- Continuously analyze operational inefficiencies and develop automation to close gaps.
Required Skills And Qualifications
- 2+ years of hands-on experience in Site Reliability Engineering, Automation Engineering, or RPA roles.
- Strong scripting proficiency in Python, Bash, and PowerShell for infrastructure and application automation.
- Practical experience with low-code/no-code platforms and RPA tools like UiPath, Power Automate, Automation Anywhere, or similar.
- Solid understanding of automation across monitoring, alerting, configuration management, and incident response.
- Exposure to log aggregation tools (e.g., Elastic Stack, Splunk) for troubleshooting and automation triggers.
- Experience building self-healing systems and integrating with event-based automation platforms.
- Familiarity with cloud environments (AWS, Azure, GCP) and integrating automation across hybrid infrastructure.
- Experience applying GenAI/AI-driven solutions to automate operations and support predictive monitoring.
- Strong analytical and root cause analysis skills for solving recurring issues via automation.
- Ability to work independently and collaborate effectively in cross-functional teams.
Desired Skills And Qualifications
- Experience working in a Managed Services or enterprise support environment with a focus on automation maturity.
- Understanding of ITIL/ITSM processes and how automation can improve service quality and consistency.
- Exposure to containerized environments (e.g., Docker, Kubernetes) and automation of application deployments.
- Experience with observability platforms like Datadog, Prometheus, or AppDynamics is a plus.
- Strong communication and stakeholder engagement skills to align automation initiatives with business needs.
Education Requirements
- Bachelor’s degree in Computer Science, IT, Engineering, or a related technical field.
- Certifications in RPA platforms, cloud technologies, or scripting/automation tools are a plus.