About NetApp
NetApp is the intelligent data infrastructure company, turning a world of disruption into opportunity for every customer. No matter the data type, workload or environment, we help our customers identify and realize new business possibilities. And it all starts with our people.If this sounds like something you want to be part of, NetApp is the place for you. You can help bring new ideas to life, approaching each challenge with fresh eyes. Of course, you won't be doing it alone. At NetApp, we're all about asking for help when we need it, collaborating with others, and partnering across the organization - and beyond.
Job Summary
As an SRE Engineer at NetApp India R&D division, you will be responsible for the development, reliability, automation, and operations of AI-driven services across cloud and on-prem environments. You will be part of a highly skilled technical team named NetApp Active IQ, contributing to cutting-edge reliability engineering practices while enabling Generative AI (GenAI) innovation.Your focus will be on applying SRE principles to GenAI-powered services — ensuring they are scalable, fault-tolerant, highly available, and meet strict SLAs. You will bridge development and operations by building automation, observability, and self-healing capabilities for AI-driven workloads.This position requires an individual to be creative, team-oriented, technology savvy, driven to produce results and demonstrates the ability to working across teamsJob Requirements
- Design, develop, and maintain SRE automation tools for monitoring, deployment, and scaling of AI/GenAI workloads.
- Implement observability platforms (metrics, tracing, logging) tailored for GenAI services running on both Cloud and Onprem.
- Collaborate with engineering and data science teams to productionize GenAI models at scale.
- Build fault-tolerant infrastructure for AI pipelines using Kubernetes, Docker, and cloud-native tools.
- Drive capacity planning, incident management, and postmortem analysis with a focus on continuous reliability improvement.
- Implement CI/CD with automated testing and validation pipelines.
- Develop self-recovery mechanisms for critical services to minimize downtime.
- Ensure security, compliance, and resilience in AI-based applications and microservices.
- Interact with Active IQ engineering teams across geographies to leverage expertise and contribute to the tech community.
EducationTypically requires no previous professional experience.At NetApp, we embrace a hybrid working environment designed to strengthen connection, collaboration, and culture for all employees. This means that most roles will have some level of in-office and/or in-person expectations, which will be shared during the recruitment process.
Equal Opportunity Employer:
NetApp is firmly committed to Equal Employment Opportunity (EEO) and to compliance with all laws that prohibit employment discrimination based on age, race, color, gender, sexual orientation, gender identity, national origin, religion, disability or genetic information, pregnancy, and any protected classification.
Why NetApp?
We are all about helping customers turn challenges into business opportunity. It starts with bringing new thinking to age-old problems, like how to use data most effectively to run better - but also to innovate. We tailor our approach to the customer's unique needs with a combination of fresh thinking and proven approaches.We enable a healthy work-life balance. Our volunteer time off program is best in class, offering employees 40 hours of paid time off each year to volunteer with their favourite organizations. We provide comprehensive benefits, including health care, life and accident plans, emotional support resources for you and your family, legal services, and financial savings programs to help you plan for your future. We support professional and personal growth through educational assistance and provide access to various discounts and perks to enhance your overall quality of life.If you want to help us build knowledge and solve big problems, let's talk.