Jobs
Interviews

648 Sre Jobs - Page 5

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

2.0 - 4.0 years

14 - 18 Lacs

Pune

Hybrid

So, what’s the role all about? As a Site Reliability Engineer (SRE) for our large and regionally distributed SaaS platform, your primary responsibilities will be to improve the reliability and availability of our mission-critical cloud-based services. How will you make an impact? Essential Duties and Responsibilities: Observability and Monitoring: Create new dashboards and metrics to provide comprehensive observability into the health and performance of development teams' applications, including SLI/SLO metrics. Work with development teams to ensure proper monitoring is set up and enabled for their services. Identify evolutionary improvements to the observability and monitoring solutions. Reliability Consulting and Automation: Consult with development teams on SRE services and best practices to help them improve the reliability of their applications. Create automation and tooling to reduce toil and manual intervention. Incident and Problem Management: Assist other teams in data and performance analysis to identify the root causes of issues and recommend automation actions. Knowledge Sharing and Mentoring: Review the work of other SREs and provide training and guidance to help them improve their skills. Communicate effectively with both technical and non-technical peers and customers. Process and Documentation: Follow established processes when performing work or help document and create processes, as necessary. Document troubleshooting steps and results in appropriate locations for historical access. Ensure compliance with policies, procedures, and standards. Implement or coordinate remediation required by audits and assessments, and document, as necessary. Time Estimation: Estimate the time required to complete activities and projects. Have you got what it takes? 4+ years programming/scripting experience with any of the following: (Go, Python, .Net (C#), Node) 4+ years of experience working within public or private cloud environments 4+ years of SRE/DevOps/Observability or related experience 4+ years of AWS Experience with Agile, Jira, GitHub, monitoring, automation, dashboarding You will have an advantage if you also have: Kubernetes + certification, Grafana , AWS, Azure, DevOps experience. What’s in it for you? Join an ever-growing, market disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NICE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NICEr! Enjoy NICE-FLEX! At NICE, we work according to the NICE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere. Requisition ID:7547 Reporting into: Manager, Cloud Operations Role Type: Individual Contributor

Posted 2 weeks ago

Apply

5.0 - 8.0 years

18 - 20 Lacs

Noida, Madurai, Chennai

Hybrid

1. Expertise on Observability/SRE tools, platforms, and standards, including ELK Stack, Grafana, Prometheus, Loki, Victoria Metrics, Telegraf 2. Familiarity with modern logging frameworks and best practices: Opentelemetry, Kafka etc. 3. Experience with data visualization tools like Grafana, Kibana to create informative and actionable dashboards, reports, and alerts. 4. Proficiency in scripting languages like Python, Bash, or PowerShell is valuable for automating data collection, analysis, and visualization processes. 5. Good to have Experience in Monitoring Tools SCOM, Opensearch.

Posted 2 weeks ago

Apply

6.0 - 11.0 years

5 - 15 Lacs

Bengaluru

Hybrid

Design, implement, and manage scalable, secure, and resilient cloud infrastructure on AWS. Maintain and enhance Kubernetes clusters, including deployment, monitoring, scaling, and troubleshooting. Implement Infrastructure as Code (IaC) using tools

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Faridabad

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Ghaziabad

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Chittoor

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Greater Noida

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Gurugram

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Mandya

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Hassan

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Navi Mumbai

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Mysuru

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Pune

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Nashik

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Khammam

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Bengaluru

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Mumbai

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Nizamabad

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Thane

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Vijayawada

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Karimnagar

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

5.0 - 10.0 years

27 - 42 Lacs

Warangal

Work from Office

F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5XC Product. Primary Responsibilities F5xc SRE: Play the role of a hands-on SRE Engineer focused on automation and toil-reduction and participate in Ops cycles to support our product. Perform oncall support function on a rotation basis, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products Mentor junior team members to support their professional growth and development Easy-to-Use Automation: Continue to grow the infra-automation (k8s, ArgoCD, Helm Charts, Golang services, AWS, GCP, Terraform) with a focus on ease of configuration Environment Stability using Observability: Create and continue to evolve existing Observability (metrics & alerts) and participate in regular monitoring of infrastructure for stability. Collaborative Engagement: Collaborate closely with application owners and SRE team members as part of roadmap execution and continuous improvement of existing systems. Scale & Resilient systems: Design & deploy systems/infra which is highly available and resilient for the configured failure domains. Design systems using strong security principles with security by default. Knowledge, Skills and Abilities Hands-on programming experience in any one language python,golang + shell scripting. Hands-on terraform expertise. Strong networking fundamentals and experience dealing with different layers of the networking stack. SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. Experience in upgrading workloads for SaaS Services without downtime. Oncall Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. GipOps: Experience with helm charts/kustomizations and gitops tools like ArgoCD/FluxCD. CI/CD: Experience working with/designing functional CI/CD systems. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Experience with Disaster Recovery and Migration is a plus Qualifications Typically, requires at least 8+ years of related experience with a bachelors degree, 6+ year and a masters degree, or a PhD with 4+ year of experience or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

Apply

7.0 - 11.0 years

17 - 22 Lacs

Mandya

Work from Office

Position Summary F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where Why This Role is Unique: Our SaaS is hybrid running across public cloud and a global network of 50+ PoPs , delivering terabits of capacity . Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers , ensuring reliability, security, and performance at a massive scale. What Youll Do: Be the Force Behind Observability & Stability Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack , spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy , cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. Own & Automate Operations Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, Terraform . Improve networking observability , ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence Participate in on-call rotations , ensuring rapid incident response across our cloud + edge stack. Drive incident response automation , reducing MTTR and increasing system resilience . Ensure security, compliance, and best practices in observability & automation . Collaborate & Mentor Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability . What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation . Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability . Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge .If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment , we want to talk to you!The About The Role is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise Strong experience with Logs, Metrics, and Alerts , with a focus on high-fidelity alerting and automation . Automation & Infrastructure as Code Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation . Cloud & Hybrid SaaS Experience Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure . Incident Response & Reliability Engineering Strong on-call experience , with a track record of reducing MTTR through automation Kubernetes Mastery Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability Familiarity with monitoring routers, switches, and firewalls in a global PoP environment . Data & Analytics in Observability Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.) . Security & Compliance Awareness Understanding of secure-by-design principles for monitoring & alerting . Mentorship & Collaboration Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers . High Availability Disaster Recovery Experience with HA/DR and Migration Qualifications Typically, it requires at least 18 years of related experience with a bachelors degree, 15 years and a masters degree, or a PhD with 12 years experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where youll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

Apply

7.0 - 11.0 years

17 - 22 Lacs

Mysuru

Work from Office

Position Summary F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where Why This Role is Unique: Our SaaS is hybrid running across public cloud and a global network of 50+ PoPs , delivering terabits of capacity . Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers , ensuring reliability, security, and performance at a massive scale. What Youll Do: Be the Force Behind Observability & Stability Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack , spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy , cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. Own & Automate Operations Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, Terraform . Improve networking observability , ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence Participate in on-call rotations , ensuring rapid incident response across our cloud + edge stack. Drive incident response automation , reducing MTTR and increasing system resilience . Ensure security, compliance, and best practices in observability & automation . Collaborate & Mentor Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability . What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation . Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability . Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge .If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment , we want to talk to you!The About The Role is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise Strong experience with Logs, Metrics, and Alerts , with a focus on high-fidelity alerting and automation . Automation & Infrastructure as Code Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation . Cloud & Hybrid SaaS Experience Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure . Incident Response & Reliability Engineering Strong on-call experience , with a track record of reducing MTTR through automation Kubernetes Mastery Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability Familiarity with monitoring routers, switches, and firewalls in a global PoP environment . Data & Analytics in Observability Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.) . Security & Compliance Awareness Understanding of secure-by-design principles for monitoring & alerting . Mentorship & Collaboration Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers . High Availability Disaster Recovery Experience with HA/DR and Migration Qualifications Typically, it requires at least 18 years of related experience with a bachelors degree, 15 years and a masters degree, or a PhD with 12 years experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where youll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

Apply

7.0 - 11.0 years

17 - 22 Lacs

Hassan

Work from Office

Position Summary F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where Why This Role is Unique: Our SaaS is hybrid running across public cloud and a global network of 50+ PoPs , delivering terabits of capacity . Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers , ensuring reliability, security, and performance at a massive scale. What Youll Do: Be the Force Behind Observability & Stability Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack , spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy , cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. Own & Automate Operations Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, Terraform . Improve networking observability , ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence Participate in on-call rotations , ensuring rapid incident response across our cloud + edge stack. Drive incident response automation , reducing MTTR and increasing system resilience . Ensure security, compliance, and best practices in observability & automation . Collaborate & Mentor Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability . What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation . Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability . Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge .If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment , we want to talk to you!The About The Role is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise Strong experience with Logs, Metrics, and Alerts , with a focus on high-fidelity alerting and automation . Automation & Infrastructure as Code Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation . Cloud & Hybrid SaaS Experience Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure . Incident Response & Reliability Engineering Strong on-call experience , with a track record of reducing MTTR through automation Kubernetes Mastery Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability Familiarity with monitoring routers, switches, and firewalls in a global PoP environment . Data & Analytics in Observability Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.) . Security & Compliance Awareness Understanding of secure-by-design principles for monitoring & alerting . Mentorship & Collaboration Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers . High Availability Disaster Recovery Experience with HA/DR and Migration Qualifications Typically, it requires at least 18 years of related experience with a bachelors degree, 15 years and a masters degree, or a PhD with 12 years experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where youll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies