Flex is the diversified manufacturing partner of choice that helps market-leading brands design, build and deliver innovative products that improve the world. A career at Flex offers the opportunity to make a difference and invest in your growth in a respectful, inclusive, and collaborative environment. If you are excited about a role but don't meet every bullet point, we encourage you to apply and join us to create the extraordinary.
To support our extraordinary teams who build great products and contribute to our growth, we’re looking to add a Senior DevOps Engineer + Site Reliability Engineer (SRE) – IT, located in Chennai.
As a Sr. Site Reliability Engineer (SRE) on the Factory Applications team, you will guide the reliability strategy for “Brix”, a cloud-native, containerized, microservices-based platform powering global shop floor systems. You’ll lead a team of SREs, drive automation and performance initiatives, and collaborate cross-functionally to ensure scalable, resilient, and secure operations.
This role reports to the Senior Manager.
What a typical day looks like:
- Leadership & Strategy
  - Lead and mentor a team of SREs, fostering a culture of ownership, learning, and continuous improvement.
  - Define and drive the SRE roadmap aligned with business and technical goals.
  - Champion best practices in reliability engineering across development and operations teams.
- Technical Execution
  - Architect and implement scalable infrastructure using Infrastructure as Code.
  - Oversee monitoring, alerting, and observability systems to ensure platform health and performance.
  - Lead incident response and postmortem processes, ensuring root cause analysis and long-term fixes.
  - Collaborate with developers to integrate automated testing and CI/CD pipelines.
  - Optimize system performance through metric analysis and proactive tuning.
- Collaboration & Communication
  - Act as a liaison between engineering, operations, and business stakeholders.
  - Maintain clear documentation and knowledge sharing across teams.
  - Support global teams, including rotational night shift coverage as needed.
The experience we’re looking to add to our team:
- Bachelor’s or master’s degree in computer science, information technology, or a related field (or equivalent work experience).
- 7 to 12+ years of experience in information technology or a related field.
- Proven experience in DevOps and SRE methodologies.
- Expertise in Docker, Kubernetes, and cloud-native architecture.
- Solid programming skills in C#, TypeScript, Python, or Go.
- Proficiency in Unix/Linux environments and shell scripting.
- Experience with monitoring tools (Prometheus, Grafana) and test automation frameworks.
- Strong analytical and problem-solving abilities.
- Demonstrated leadership and project management capabilities.
Good to have:
- Advanced knowledge of CI/CD pipelines and Git workflows.
- Familiarity with configuration formats (YAML, JSON).
- Experience leading technical teams and driving cross-functional initiatives.
What you will get for the great work you provide:
Flex is an Equal Opportunity Employer and employment selection decisions are based on merit, qualifications, and abilities. We do not discriminate based on: age, race, religion, color, sex, national origin, marital status, sexual orientation, gender identity, veteran status, disability, pregnancy status, or any other status protected by law. We're happy to provide reasonable accommodations to those with a disability for assistance in the application process. Please email accessibility@flex.com and we'll discuss your specific situation and next steps (NOTE: this email does not accept or consider resumes or applications. This is only for disability assistance. To be considered for a position at Flex, you must complete the application process first).