Flex is the diversified manufacturing partner of choice that helps market-leading brands design, build and deliver innovative products that improve the world.
As a Site Reliability Engineer (SRE) on the Factory Applications team, you'll help maintain and scale Brix - a cloud-native, containerized, microservices-based platform used to build global shop floor systems. Your focus will be on automation, reliability, and performance.
What a typical day looks like:
- Automate infrastructure and application management using code.
- Build monitoring systems to track uptime, performance, and health.
- Develop automated regression tests and coordinate with developers on failures.
- Create processes for high availability and fast error recovery.
- Define and analyze metrics for performance tuning and troubleshooting.
- Collaborate with development and operations teams to improve service reliability.
- Use Infrastructure as Code to automate recurring tasks.
- Maintain documentation and debug full-stack issues.
Required Skills
- Strong grasp of DevOps and SRE principles.
- Experience with Docker and Kubernetes.
- Comfortable in Unix/Linux shell environments.
- Willingness to work night shifts (rotational or dedicated).
Preferred Skills
- Monitoring tools: Prometheus, Grafana, alerting systems.
- Infrastructure automation: Ansible, Chef, Terraform.
- Scripting: Unix shell, curl, jq, sed, etc.
- Programming: C#, TypeScript, Python and/or Go.
- Configuration: YAML and JSON.
- Version control: Git and branching strategies.
The experience we're looking to add to our team:
- Diploma or Bachelor's degree in Computer Science or related field.
- Minimum 2 years in software development, engineering, or system operations.
- Strong English communication skills.
- Analytical mindset with both logical and creative problem-solving abilities.
- Self-driven and well-organized.
Personal Qualities We Value
- Reliability: You take ownership and follow through on commitments.
- Curiosity: You're eager to learn new technologies and improve existing systems.
- Resilience: You thrive in fast-paced environments and adapt quickly to change.
- Collaboration: You work well across teams and cultures, sharing knowledge generously.
- Integrity: You uphold high standards of professionalism and ethical conduct.
What you'll receive for the great work you provide:
- Health Insurance
- Paid Time Off