1633 Grafana Jobs - Page 17

7.0 - 11.0 years

17 - 22 Lacs

Mysuru

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where

Why This Role is Unique: Our SaaS is hybrid, running across public cloud and a global network of 50+ PoPs, delivering terabits of capacity. Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers, ensuring reliability, security, and performance at a massive scale.

What You'll Do:

Be the Force Behind Observability & Stability: Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack, spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy, cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility.

Own & Automate Operations: Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, and Terraform. Improve networking observability, ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements.

Lead Incident Response & Operational Excellence: Participate in on-call rotations, ensuring rapid incident response across our cloud + edge stack. Drive incident response automation, reducing MTTR and increasing system resilience. Ensure security, compliance, and best practices in observability & automation.

Collaborate & Mentor: Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability.

What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation. Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability. Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge.

If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment, we want to talk to you!

The About The Role section is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change.

Must-Have:

Observability & Alerting Expertise: Strong experience with Logs, Metrics, and Alerts, with a focus on high-fidelity alerting and automation.

Automation & Infrastructure as Code: Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation.

Cloud & Hybrid SaaS Experience: Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure.

Incident Response & Reliability Engineering: Strong on-call experience, with a track record of reducing MTTR through automation.

Kubernetes Mastery: Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments.

Nice-to-Have:

Networking & Edge Observability: Familiarity with monitoring routers, switches, and firewalls in a global PoP environment.

Data & Analytics in Observability: Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.).

Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting.

Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers.

High Availability & Disaster Recovery: Experience with HA/DR and migration.

Qualifications: Typically, it requires at least 18 years of related experience with a bachelor's degree, 15 years and a master's degree, or a PhD with 12 years' experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization.

Environment:

Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged.

Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth.

Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Hassan

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Faridabad

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Ghaziabad

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Chittoor

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Gurugram

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Pune

Work from Office

Position Summary

F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where

Why This Role is Unique:

Our SaaS is hybrid, running across public cloud and a global network of 50+ PoPs, delivering terabits of capacity. Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers, ensuring reliability, security, and performance at a massive scale.

What You'll Do:

Be the Force Behind Observability & Stability
Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack, spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy, cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility.

Own & Automate Operations
Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, and Terraform. Improve networking observability, ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements.

Lead Incident Response & Operational Excellence
Participate in on-call rotations, ensuring rapid incident response across our cloud + edge stack. Drive incident response automation, reducing MTTR and increasing system resilience. Ensure security, compliance, and best practices in observability & automation.

Collaborate & Mentor
Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability.

What Makes You a Great Fit?

Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation.
Experience in hybrid SaaS environments spanning cloud-native and global infrastructure.
Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability.
Proven track record of eliminating toil and improving operational efficiency through automation.
Passion for deep observability, networking-scale analytics, and automation at the edge.

If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment, we want to talk to you!

The About The Role section is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change.

Must-Have:

Observability & Alerting Expertise: Strong experience with Logs, Metrics, and Alerts, with a focus on high-fidelity alerting and automation.
Automation & Infrastructure as Code: Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation.
Cloud & Hybrid SaaS Experience: Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure.
Incident Response & Reliability Engineering: Strong on-call experience, with a track record of reducing MTTR through automation.
Kubernetes Mastery: Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments.

Nice-to-Have:

Networking & Edge Observability: Familiarity with monitoring routers, switches, and firewalls in a global PoP environment.
Data & Analytics in Observability: Experience with time-series databases and observability tooling (Prometheus, Grafana, OpenTelemetry, etc.).
Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting.
Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers.
High Availability & Disaster Recovery: Experience with HA/DR and migration.

Qualifications

Typically requires at least 18 years of related experience with a bachelor's degree, 15 years with a master's degree, or 12 years with a PhD; or equivalent experience. Excellent organizational agility and communication skills throughout the organization.

Environment

Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged.
Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth.
Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Nashik

Work from Office

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Navi Mumbai

Work from Office

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Thane

Work from Office

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Bengaluru

Work from Office

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Hyderabad

Work from Office

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Nizamabad

Work from Office

Position Summary F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where Why This Role is Unique: Our SaaS is hybrid running across public cloud and a global network of 50+ PoPs , delivering terabits of capacity . Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers , ensuring reliability, security, and performance at a massive scale. What Youll Do: Be the Force Behind Observability & Stability Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack , spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy , cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. Own & Automate Operations Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, Terraform . Improve networking observability , ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence Participate in on-call rotations , ensuring rapid incident response across our cloud + edge stack. Drive incident response automation , reducing MTTR and increasing system resilience . Ensure security, compliance, and best practices in observability & automation . 
Collaborate & Mentor Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability . What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation . Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability . Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge .If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment , we want to talk to you!The About The Role is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise Strong experience with Logs, Metrics, and Alerts , with a focus on high-fidelity alerting and automation . Automation & Infrastructure as Code Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation . Cloud & Hybrid SaaS Experience Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure . Incident Response & Reliability Engineering Strong on-call experience , with a track record of reducing MTTR through automation Kubernetes Mastery Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability Familiarity with monitoring routers, switches, and firewalls in a global PoP environment . 
Data & Analytics in Observability: Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.). Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically, it requires at least 18 years of related experience with a bachelor's degree, 15 years and a master's degree, or a PhD with 12 years' experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Mumbai

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where Why This Role is Unique: Our SaaS is hybrid, running across public cloud and a global network of 50+ PoPs, delivering terabits of capacity. Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers, ensuring reliability, security, and performance at a massive scale. What You'll Do: Be the Force Behind Observability & Stability: Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack, spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy, cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. Own & Automate Operations: Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, and Terraform. Improve networking observability, ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence: Participate in on-call rotations, ensuring rapid incident response across our cloud + edge stack. Drive incident response automation, reducing MTTR and increasing system resilience. Ensure security, compliance, and best practices in observability & automation.
Collaborate & Mentor: Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability. What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation. Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability. Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge. If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment, we want to talk to you! The About The Role section is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise: Strong experience with Logs, Metrics, and Alerts, with a focus on high-fidelity alerting and automation. Automation & Infrastructure as Code: Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation. Cloud & Hybrid SaaS Experience: Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure. Incident Response & Reliability Engineering: Strong on-call experience, with a track record of reducing MTTR through automation. Kubernetes Mastery: Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability: Familiarity with monitoring routers, switches, and firewalls in a global PoP environment.
Data & Analytics in Observability: Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.). Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically, it requires at least 18 years of related experience with a bachelor's degree, 15 years and a master's degree, or a PhD with 12 years' experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Karimnagar

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where Why This Role is Unique: Our SaaS is hybrid, running across public cloud and a global network of 50+ PoPs, delivering terabits of capacity. Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers, ensuring reliability, security, and performance at a massive scale. What You'll Do: Be the Force Behind Observability & Stability: Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack, spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy, cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. Own & Automate Operations: Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, and Terraform. Improve networking observability, ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence: Participate in on-call rotations, ensuring rapid incident response across our cloud + edge stack. Drive incident response automation, reducing MTTR and increasing system resilience. Ensure security, compliance, and best practices in observability & automation.
Collaborate & Mentor: Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability. What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation. Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability. Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge. If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment, we want to talk to you! The About The Role section is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise: Strong experience with Logs, Metrics, and Alerts, with a focus on high-fidelity alerting and automation. Automation & Infrastructure as Code: Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation. Cloud & Hybrid SaaS Experience: Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure. Incident Response & Reliability Engineering: Strong on-call experience, with a track record of reducing MTTR through automation. Kubernetes Mastery: Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability: Familiarity with monitoring routers, switches, and firewalls in a global PoP environment.
Data & Analytics in Observability: Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.). Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically, it requires at least 18 years of related experience with a bachelor's degree, 15 years and a master's degree, or a PhD with 12 years' experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Warangal

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where Why This Role is Unique: Our SaaS is hybrid, running across public cloud and a global network of 50+ PoPs, delivering terabits of capacity. Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers, ensuring reliability, security, and performance at a massive scale. What You'll Do: Be the Force Behind Observability & Stability: Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack, spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy, cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. Own & Automate Operations: Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, and Terraform. Improve networking observability, ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence: Participate in on-call rotations, ensuring rapid incident response across our cloud + edge stack. Drive incident response automation, reducing MTTR and increasing system resilience. Ensure security, compliance, and best practices in observability & automation.
Collaborate & Mentor: Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability. What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation. Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability. Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge. If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment, we want to talk to you! The About The Role section is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise: Strong experience with Logs, Metrics, and Alerts, with a focus on high-fidelity alerting and automation. Automation & Infrastructure as Code: Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation. Cloud & Hybrid SaaS Experience: Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure. Incident Response & Reliability Engineering: Strong on-call experience, with a track record of reducing MTTR through automation. Kubernetes Mastery: Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability: Familiarity with monitoring routers, switches, and firewalls in a global PoP environment.
Data & Analytics in Observability: Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.). Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically, it requires at least 18 years of related experience with a bachelor's degree, 15 years and a master's degree, or a PhD with 12 years' experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Khammam

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where Why This Role is Unique: Our SaaS is hybrid, running across public cloud and a global network of 50+ PoPs, delivering terabits of capacity. Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers, ensuring reliability, security, and performance at a massive scale. What You'll Do: Be the Force Behind Observability & Stability: Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack, spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy, cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. Own & Automate Operations: Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, and Terraform. Improve networking observability, ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence: Participate in on-call rotations, ensuring rapid incident response across our cloud + edge stack. Drive incident response automation, reducing MTTR and increasing system resilience. Ensure security, compliance, and best practices in observability & automation.
Collaborate & Mentor: Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability. What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation. Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability. Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge. If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment, we want to talk to you! The About The Role section is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise: Strong experience with Logs, Metrics, and Alerts, with a focus on high-fidelity alerting and automation. Automation & Infrastructure as Code: Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation. Cloud & Hybrid SaaS Experience: Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure. Incident Response & Reliability Engineering: Strong on-call experience, with a track record of reducing MTTR through automation. Kubernetes Mastery: Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability: Familiarity with monitoring routers, switches, and firewalls in a global PoP environment.
Data & Analytics in Observability: Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.). Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically, it requires at least 18 years of related experience with a bachelor's degree, 15 years and a master's degree, or a PhD with 12 years' experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Noida

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where Why This Role is Unique: Our SaaS is hybrid, running across public cloud and a global network of 50+ PoPs, delivering terabits of capacity. Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers, ensuring reliability, security, and performance at a massive scale. What You'll Do: Be the Force Behind Observability & Stability: Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack, spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy, cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. Own & Automate Operations: Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, and Terraform. Improve networking observability, ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence: Participate in on-call rotations, ensuring rapid incident response across our cloud + edge stack. Drive incident response automation, reducing MTTR and increasing system resilience. Ensure security, compliance, and best practices in observability & automation.
Collaborate & Mentor: Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability. What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation. Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability. Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge. If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment, we want to talk to you! The About The Role section is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise: Strong experience with Logs, Metrics, and Alerts, with a focus on high-fidelity alerting and automation. Automation & Infrastructure as Code: Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation. Cloud & Hybrid SaaS Experience: Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure. Incident Response & Reliability Engineering: Strong on-call experience, with a track record of reducing MTTR through automation. Kubernetes Mastery: Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability: Familiarity with monitoring routers, switches, and firewalls in a global PoP environment.
Data & Analytics in Observability: Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.). Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically, it requires at least 18 years of related experience with a bachelor's degree, 15 years and a master's degree, or a PhD with 12 years' experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

7.0 - 11.0 years

17 - 22 Lacs

Vijayawada

Work from Office

Position Summary: F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where Why This Role is Unique: Our SaaS is hybrid, running across public cloud and a global network of 50+ PoPs, delivering terabits of capacity. Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers, ensuring reliability, security, and performance at a massive scale. What You'll Do: Be the Force Behind Observability & Stability: Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack, spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy, cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. Own & Automate Operations: Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, and Terraform. Improve networking observability, ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. Lead Incident Response & Operational Excellence: Participate in on-call rotations, ensuring rapid incident response across our cloud + edge stack. Drive incident response automation, reducing MTTR and increasing system resilience. Ensure security, compliance, and best practices in observability & automation.
Collaborate & Mentor: Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability. What Makes You a Great Fit? Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation. Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability. Proven track record of eliminating toil and improving operational efficiency through automation. Passion for deep observability, networking-scale analytics, and automation at the edge. If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment, we want to talk to you! The About The Role section is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change. Must-Have: Observability & Alerting Expertise: Strong experience with Logs, Metrics, and Alerts, with a focus on high-fidelity alerting and automation. Automation & Infrastructure as Code: Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation. Cloud & Hybrid SaaS Experience: Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure. Incident Response & Reliability Engineering: Strong on-call experience, with a track record of reducing MTTR through automation. Kubernetes Mastery: Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability: Familiarity with monitoring routers, switches, and firewalls in a global PoP environment.
Data & Analytics in Observability: Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.). Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically, it requires at least 18 years of related experience with a bachelor's degree, 15 years and a master's degree, or a PhD with 12 years' experience; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace. F5 Networks, Inc. is an equal opportunity employer and strongly supports diversity in the workplace.

Posted 2 weeks ago

2.0 - 5.0 years

3 - 6 Lacs

Hyderabad

Work from Office

Keyloop bridges the gap between dealers, manufacturers, technology suppliers and car buyers. We empower car dealers and manufacturers to fully embrace digital transformation. How? By creating innovative technology that makes selling cars better for our customers, and buying and owning cars better for theirs. We use cutting-edge technology to link our clients’ systems, departments and sites. We provide an open technology platform that’s shaping the industry for the future. We use data to help clients become more efficient, increase profitability and give more customers an amazing experience. Want to be part of it? Overview We are looking for a motivated DevOps & AWS Engineer to join our Keyloop Infrastructure and Cloud Operations team. You will work closely with senior engineers to support the management of AWS environments, Infrastructure as Code (IaC) with Terraform, and application provisioning with Ansible. This role is ideal for someone who wants to develop strong DevOps skills in a real-world production environment, learn modern automation practices, and grow under the guidance of experienced mentors. Roles and Responsibilities Assist in the implementation and maintenance of CI/CD pipelines using tools like Git and Jenkins under senior team guidance. Support the development of infrastructure as code (IaC) using Terraform to automate the provisioning of AWS resources such as EC2,S3,RDS, and VPCs. Help maintain configuration management using Ansible, ensuring environments are consistent and compliant. Write basic shell scripts for automation tasks under supervision. Assist in monitoring and maintaining Windows and Linux servers, helping to ensure performance, uptime, and security. Collaborate with development, QA, and operations teams to follow best practices in DevOps workflows. Support system monitoring and reporting using tools such as CloudWatch and Grafana. 
Follow security best practices and support the team in maintaining compliance and proper IAM configuration. Participate in knowledge-sharing sessions and seek guidance from senior engineers to grow technical skills. Support incident response efforts and on-call tasks as part of a team rotation, with mentorship. Be open to researching new tools and assisting in proof-of-concept tasks for automation and infrastructure improvement. Skills / Knowledge & Experience. Essential: Good interpersonal and communication skills. Strong willingness to learn and grow in a dynamic, fast-paced environment. 2-5 years of relevant experience in DevOps or Cloud Engineering. Good understanding of AWS services (EC2, S3, IAM, VPC). Familiarity with version control using Git. Knowledge of Windows and Linux environments is a must. Exposure to scripting languages such as Bash or PowerShell. Eagerness to work with Terraform and Ansible; hands-on experience is a plus but not mandatory if you can demonstrate foundational knowledge. Good troubleshooting skills and a problem-solving mindset. Ability to follow instructions, seek clarification when needed, and document work clearly. Self-motivated and organised, with attention to detail. Desirable: Experience with Cloud Technologies (AWS, Azure). Familiarity with system monitoring tools that support proactive interventions to prevent service impact. Understanding of ITIL or other service management frameworks. Previous experience working in a 24/7 Infrastructure Operations environment. Additional Information: This position is work from office and requires presence during all shifts, including nights and weekends, as part of the 24/7 Operations Center; office cab transport is provided for all shifts. Why join us? We’re on a journey to become market leaders in our space – and with that comes some incredible opportunities. Collaborate and learn from industry experts from all over the globe. Work with game-changing products and services. 
Get the training and support you need to try new things, adapt to quick changes and explore different paths. Join Keyloop and progress your career, your way. An inclusive environment to thrive We’re committed to fostering an inclusive work environment. One that respects all dimensions of diversity. We promote an inclusive culture within our business, and we celebrate different employees and lifestyles – not just on key days, but every day. Be rewarded for your efforts We believe people should be paid based on their performance so our pay and benefits reflect this and are designed to attract the very best talent. We encourage everyone in our organisation to explore opportunities which enable them to grow their career through investment in their development but equally by working in a culture which fosters support and unbridled collaboration. Keyloop doesn’t require academic qualifications for this position. We select based on experience and potential, not credentials. We are also an equal opportunity employer committed to building a diverse and inclusive workforce. We value diversity and encourage candidates of all backgrounds to apply .

Posted 2 weeks ago

18.0 - 20.0 years

35 - 40 Lacs

Greater Noida

Work from Office

Position Summary F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where cloud meets global infrastructure ? We are looking for a top-tier SRE to drive Logs, Metrics, and Alerting , with a deep focus on Alerting automation at massive scale. Why This Role is Unique: Our SaaS is hybrid running across public cloud and a global network of 50+ PoPs , delivering terabits of capacity . Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers , ensuring reliability, security, and performance at a massive scale. What You'll Do: -- Be the Force Behind Observability & Stability Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack , spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy , cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. -- Own & Automate Operations Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, Terraform . Improve networking observability , ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. -- Lead Incident Response & Operational Excellence Participate in on-call rotations , ensuring rapid incident response across our cloud + edge stack. 
Drive incident response automation , reducing MTTR and increasing system resilience . Ensure security, compliance, and best practices in observability & automation . -- Collaborate & Mentor Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability . What Makes You a Great Fit- - Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation . - Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. - Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability . - Proven track record of eliminating toil and improving operational efficiency through automation. - Passion for deep observability, networking-scale analytics, and automation at the edge . If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment , we want to talk to you! Must-Have: Observability & Alerting Expertise Strong experience with Logs, Metrics, and Alerts , with a focus on high-fidelity alerting and automation . Automation & Infrastructure as Code Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation . Cloud & Hybrid SaaS Experience Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure . Incident Response & Reliability Engineering Strong on-call experience , with a track record of reducing MTTR through automation Kubernetes Mastery Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability Familiarity with monitoring routers, switches, and firewalls in a global PoP environment . Data & Analytics in Observability Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.) . 
Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically requires at least 18 years of related experience with a bachelor's degree, 15 years with a master's degree, or 12 years with a PhD; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

18.0 - 20.0 years

35 - 40 Lacs

Ghaziabad

Work from Office

Position Summary F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where cloud meets global infrastructure ? We are looking for a top-tier SRE to drive Logs, Metrics, and Alerting , with a deep focus on Alerting automation at massive scale. Why This Role is Unique: Our SaaS is hybrid running across public cloud and a global network of 50+ PoPs , delivering terabits of capacity . Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers , ensuring reliability, security, and performance at a massive scale. What You'll Do: -- Be the Force Behind Observability & Stability Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack , spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy , cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. -- Own & Automate Operations Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, Terraform . Improve networking observability , ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. -- Lead Incident Response & Operational Excellence Participate in on-call rotations , ensuring rapid incident response across our cloud + edge stack. 
Drive incident response automation , reducing MTTR and increasing system resilience . Ensure security, compliance, and best practices in observability & automation . -- Collaborate & Mentor Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability . What Makes You a Great Fit- - Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation . - Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. - Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability . - Proven track record of eliminating toil and improving operational efficiency through automation. - Passion for deep observability, networking-scale analytics, and automation at the edge . If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment , we want to talk to you! Must-Have: Observability & Alerting Expertise Strong experience with Logs, Metrics, and Alerts , with a focus on high-fidelity alerting and automation . Automation & Infrastructure as Code Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation . Cloud & Hybrid SaaS Experience Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure . Incident Response & Reliability Engineering Strong on-call experience , with a track record of reducing MTTR through automation Kubernetes Mastery Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability Familiarity with monitoring routers, switches, and firewalls in a global PoP environment . Data & Analytics in Observability Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.) . 
Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically requires at least 18 years of related experience with a bachelor's degree, 15 years with a master's degree, or 12 years with a PhD; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

18.0 - 20.0 years

35 - 40 Lacs

Faridabad

Work from Office

Position Summary F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where cloud meets global infrastructure ? We are looking for a top-tier SRE to drive Logs, Metrics, and Alerting , with a deep focus on Alerting automation at massive scale. Why This Role is Unique: Our SaaS is hybrid running across public cloud and a global network of 50+ PoPs , delivering terabits of capacity . Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers , ensuring reliability, security, and performance at a massive scale. What You'll Do: -- Be the Force Behind Observability & Stability Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack , spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy , cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. -- Own & Automate Operations Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, Terraform . Improve networking observability , ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. -- Lead Incident Response & Operational Excellence Participate in on-call rotations , ensuring rapid incident response across our cloud + edge stack. 
Drive incident response automation , reducing MTTR and increasing system resilience . Ensure security, compliance, and best practices in observability & automation . -- Collaborate & Mentor Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability . What Makes You a Great Fit- - Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation . - Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. - Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability . - Proven track record of eliminating toil and improving operational efficiency through automation. - Passion for deep observability, networking-scale analytics, and automation at the edge . If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment , we want to talk to you! Must-Have: Observability & Alerting Expertise Strong experience with Logs, Metrics, and Alerts , with a focus on high-fidelity alerting and automation . Automation & Infrastructure as Code Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation . Cloud & Hybrid SaaS Experience Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure . Incident Response & Reliability Engineering Strong on-call experience , with a track record of reducing MTTR through automation Kubernetes Mastery Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability Familiarity with monitoring routers, switches, and firewalls in a global PoP environment . Data & Analytics in Observability Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.) . 
Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically requires at least 18 years of related experience with a bachelor's degree, 15 years with a master's degree, or 12 years with a PhD; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

18.0 - 20.0 years

35 - 40 Lacs

Warangal

Work from Office

Position Summary F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where cloud meets global infrastructure ? We are looking for a top-tier SRE to drive Logs, Metrics, and Alerting , with a deep focus on Alerting automation at massive scale. Why This Role is Unique: Our SaaS is hybrid running across public cloud and a global network of 50+ PoPs , delivering terabits of capacity . Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers , ensuring reliability, security, and performance at a massive scale. What You'll Do: -- Be the Force Behind Observability & Stability Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack , spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy , cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. -- Own & Automate Operations Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, Terraform . Improve networking observability , ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. -- Lead Incident Response & Operational Excellence Participate in on-call rotations , ensuring rapid incident response across our cloud + edge stack. 
Drive incident response automation , reducing MTTR and increasing system resilience . Ensure security, compliance, and best practices in observability & automation . -- Collaborate & Mentor Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability . What Makes You a Great Fit- - Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation . - Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. - Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability . - Proven track record of eliminating toil and improving operational efficiency through automation. - Passion for deep observability, networking-scale analytics, and automation at the edge . If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment , we want to talk to you! Must-Have: Observability & Alerting Expertise Strong experience with Logs, Metrics, and Alerts , with a focus on high-fidelity alerting and automation . Automation & Infrastructure as Code Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation . Cloud & Hybrid SaaS Experience Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure . Incident Response & Reliability Engineering Strong on-call experience , with a track record of reducing MTTR through automation Kubernetes Mastery Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability Familiarity with monitoring routers, switches, and firewalls in a global PoP environment . Data & Analytics in Observability Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.) . 
Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically requires at least 18 years of related experience with a bachelor's degree, 15 years with a master's degree, or 12 years with a PhD; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago

18.0 - 20.0 years

35 - 40 Lacs

Gurugram

Work from Office

Position Summary F5 Inc. is actively seeking an exceptional Sr Principal Software Engineer (Individual Contributor) to play a pivotal role in our SRE Operations team for the groundbreaking F5XC Product. Are you an SRE Operations specialist with automation in your DNA? Do you thrive in fast-paced SaaS environments where cloud meets global infrastructure ? We are looking for a top-tier SRE to drive Logs, Metrics, and Alerting , with a deep focus on Alerting automation at massive scale. Why This Role is Unique: Our SaaS is hybrid running across public cloud and a global network of 50+ PoPs , delivering terabits of capacity . Our infrastructure spans cloud-native services and physical networking gear (routers, switches, firewalls), creating a uniquely challenging and exciting observability landscape. The Analytics & Observability platform will have deep reach across these layers , ensuring reliability, security, and performance at a massive scale. What You'll Do: -- Be the Force Behind Observability & Stability Drive end-to-end Observability (Logs, Metrics, and Alerts) across our hybrid SaaS stack , spanning cloud, edge, and physical network devices. Take ownership of Alerting strategy , cutting through noise while ensuring actionable, high-fidelity alerts. Implement intelligent automation to reduce operational toil and enhance real-time visibility. -- Own & Automate Operations Design, build, and manage automation for self-healing infrastructure across cloud + global PoPs. Develop automation for Kubernetes, ArgoCD, Helm Charts, Golang-based services, AWS, GCP, Terraform . Improve networking observability , ensuring our routers, switches, and firewalls are monitored at scale. Continuously eliminate manual ops work through automation and platform improvements. -- Lead Incident Response & Operational Excellence Participate in on-call rotations , ensuring rapid incident response across our cloud + edge stack. 
Drive incident response automation , reducing MTTR and increasing system resilience . Ensure security, compliance, and best practices in observability & automation . -- Collaborate & Mentor Work closely with application teams, network engineers, and SREs to improve reliability and performance. Mentor junior engineers, fostering a culture of automation-first thinking and deep observability . What Makes You a Great Fit- - Deep expertise in Logs, Metrics, and Alerting, with a strong focus on Alerting automation . - Experience in hybrid SaaS environments spanning cloud-native and global infrastructure. - Strong background in Kubernetes, Infrastructure-as-Code (Terraform), Golang, AWS/GCP, and networking observability . - Proven track record of eliminating toil and improving operational efficiency through automation. - Passion for deep observability, networking-scale analytics, and automation at the edge . If you love solving reliability challenges at global scale, automating everything, and working in a hybrid cloud + networking environment , we want to talk to you! Must-Have: Observability & Alerting Expertise Strong experience with Logs, Metrics, and Alerts , with a focus on high-fidelity alerting and automation . Automation & Infrastructure as Code Deep knowledge of Terraform, ArgoCD, Helm, Kubernetes, and Golang for automation . Cloud & Hybrid SaaS Experience Hands-on experience managing cloud-native (AWS/GCP) and edge infrastructure . Incident Response & Reliability Engineering Strong on-call experience , with a track record of reducing MTTR through automation Kubernetes Mastery Hands-on experience deploying, managing, and troubleshooting Kubernetes in production environments. Nice-to-Have: Networking & Edge Observability Familiarity with monitoring routers, switches, and firewalls in a global PoP environment . Data & Analytics in Observability Experience with time-series databases (Prometheus, Grafana, OpenTelemetry, etc.) . 
Security & Compliance Awareness: Understanding of secure-by-design principles for monitoring & alerting. Mentorship & Collaboration: Ability to mentor junior engineers and work cross-functionally with SREs, application teams, and network engineers. High Availability & Disaster Recovery: Experience with HA/DR and migration. Qualifications: Typically requires at least 18 years of related experience with a bachelor's degree, 15 years with a master's degree, or 12 years with a PhD; or equivalent experience. Excellent organizational agility and communication skills throughout the organization. Environment: Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 2 weeks ago


Featured Companies