10.0 - 15.0 years
12 - 17 Lacs
Chennai, Bengaluru
Work from Office
Strong knowledge of Linux internals (preferably RHEL, Ubuntu). Essential knowledge of Windows internals. Comprehensive understanding of DevOps, SRE, IaC, and 12-Factor principles. Excellent hands-on experience with configuration management, orchestration, and IaC tools (Ansible, Jenkins, Terraform). Strong understanding of virtualization technologies (KVM, Libvirt, oVirt, KubeVirt, OVM, OpenStack). Strong understanding of software-defined storage technologies (Ceph, GlusterFS). Strong understanding of repository and artifact management tools (Red Hat Satellite, Spacewalk, Nexus). Strong understanding of container technologies (Docker, Kubernetes, OpenShift). Strong understanding of ELK and its Beats (Auditbeat, Filebeat). Strong understanding of OS compliance policies (CIS Benchmark). Agile methodologies and their ceremonies. Architect, write, and implement software that improves the stability, scalability, and availability of products. Own multiple services and have the autonomy to do what suits the business and our customers in IT. Solve problems as they occur, and create solutions and automation to prevent them from happening again. Plan for reliability so that systems work across multiple datacenters/environments and handle outages. Conceptual understanding of infrastructure and how it works: DNS (authoritative and non-authoritative DNS, dynamic and BIND DNS, forwarders); SSL communication (SSL handshake, cipher suites, encryption algorithms); Active Directory (Security OUs, policies); certificates (SAN, client authentication, keystores, mutual SSL); load balancers; site selectors; firewalls; vault tools (CyberArk, HashiCorp); high availability. Knowledge of API communication (REST/SOAP) and of developing a new consumer/publisher for any API. Excellent scripting in Groovy (writing Jenkinsfiles), Bash, PowerShell, and Python. GitOps-driven configuration management and deployment.
Familiar with and open-minded toward open-source technologies. Team player who adapts quickly to context changes. Security awareness. Strong troubleshooting skills: dive deep into an issue, read logs, track the clues, and identify the problem. Strategic thinking with a research and development mindset.
Posted 2 weeks ago
8.0 years
0 Lacs
Hyderabad, Telangana, India
On-site
The Linux SME is responsible for designing, implementing, maintaining, and troubleshooting Linux-based infrastructure in enterprise environments, and serves as the go-to expert for Linux-related issues, ensuring high availability, security, and performance of systems.
Key Responsibilities:
Linux System Administration: Install, configure, and manage Linux servers (RHEL, Ubuntu, CentOS, SUSE, etc.). Performance Tuning & Optimization: Analyze system performance, optimize processes, and manage resource utilization. Automation & Scripting: Develop automation scripts using Bash, Python, Ansible, or Terraform for system management. Security & Compliance: Implement security best practices, manage firewall rules, and ensure compliance with industry standards. Troubleshooting & Incident Resolution: Diagnose and resolve system issues, kernel panics, and application failures. Patch Management & Updates: Apply patches and security updates, and manage package repositories. Cloud & Virtualization Support: Work with AWS, Azure, VMware, or OpenStack for cloud-based and virtualized Linux environments. Networking & Storage: Configure and troubleshoot TCP/IP, DNS, DHCP, NFS, SAN, and NAS storage solutions. Disaster Recovery & Backup: Implement backup strategies, RAID configurations, and system recovery plans. Mentoring & Documentation: Guide junior engineers and maintain detailed documentation of systems and procedures.
Required Experience & Qualifications:
✅ Education: Bachelor's degree in Computer Science, Information Technology, or equivalent experience. Certifications (Preferred): RHCE/RHCSA, LPIC-2/3, AWS Certified SysOps Administrator, or Kubernetes certifications.
✅ Experience: 8+ years of hands-on experience managing Linux environments. Strong knowledge of RHEL, CentOS, Ubuntu, SUSE, or other Linux distributions. Experience with automation tools (Ansible, Puppet, Chef, Terraform). Proficiency in scripting (Bash, Python, Perl). Expertise in cloud platforms (AWS, Azure, Google Cloud). Hands-on experience with containerization and orchestration (Docker, Kubernetes). Solid understanding of networking (TCP/IP, DNS, firewalls) and storage (LVM, RAID, NFS, SAN). Strong troubleshooting and performance tuning skills.
Posted 2 weeks ago
10.0 years
0 Lacs
Chennai, Tamil Nadu, India
On-site
10 years of experience in a relevant technical position in large organizations. Excellent knowledge of network security principles (very important). Automation and software experience: Ansible, Python, Chef. Experience with cloud network security solutions. Good knowledge of virtualization technologies: Docker, OpenShift, Kubernetes, VMware ESXi, KVM, OpenStack. Good knowledge of product licensing. Experience designing, implementing, and operating large-scale corporate network security solutions, including NGFWs, IDS/IPS, web proxies, load balancers, RAS, DNS, and certificates. Excellent technical knowledge of firewall products: Palo Alto, Check Point. Excellent technical knowledge of F5 products: GTM, LTM, and ASM. Excellent knowledge of multiple network security products: Bluecoat, Pulse Secure… Knowledge of open-source network security solutions: HAProxy, service mesh, pfSense, Squid. Interested candidates can share their CVs with bhakti.godbole@sphereitglobal.com
Posted 2 weeks ago
6.0 years
0 Lacs
Gurugram, Haryana, India
On-site
Are you passionate about building and maintaining large-scale production systems that support advanced data science and machine learning applications? Do you want to join a team at the heart of NVIDIA's data-driven decision-making culture? If so, we have a great opportunity for you! NVIDIA is seeking a Senior Site Reliability Engineer (SRE) for the Data Science & ML Platform(s) team. The role involves designing, building, and maintaining services that enable real-time data analytics, streaming, data lakes, observability, and ML/AI training and inferencing. The responsibilities include implementing software and systems engineering practices to ensure high efficiency and availability of the platform, as well as applying SRE principles to improve production systems and optimize service SLOs. The role also involves collaborating with our customers to plan and implement changes to the existing system while monitoring capacity, latency, and performance. To succeed in this position, you need a strong background in SRE practices, systems, networking, coding, capacity management, cloud operations, continuous delivery and deployment, and open-source cloud-enabling technologies like Kubernetes and OpenStack. A deep understanding of the challenges and standard methodologies of running large-scale distributed systems in production, solving complex issues, automating repetitive tasks, and proactively identifying potential outages is also necessary. Furthermore, excellent communication and collaboration skills, and a culture of diversity, intellectual curiosity, problem solving, and openness are essential. As a Senior SRE at NVIDIA, you will have the opportunity to work on innovative technologies that power the future of AI and data science, and be part of a dynamic and supportive team that values learning and growth. The role provides the autonomy to work on meaningful projects with the support and mentorship needed to succeed, and contributes to a culture of blameless postmortems, iterative improvement, and risk-taking. If you are seeking an exciting and rewarding career that makes a difference, we invite you to apply now!
What You’ll Be Doing
Develop software solutions to ensure reliability and operability of large-scale systems supporting machine-critical use cases. Gain a deep understanding of our system operations, scalability, interactions, and failures to identify improvement opportunities and risks. Create tools and automation to reduce operational overhead and eliminate manual tasks. Establish frameworks, processes, and standard methodologies to enhance operational maturity and team efficiency and to accelerate innovation. Define meaningful and actionable reliability metrics to track and improve system and service reliability. Oversee capacity and performance management to facilitate infrastructure scaling across public and private clouds globally. Build tools to improve our service observability for faster issue resolution. Practice sustainable incident response and blameless postmortems.
What We Need To See
6+ years of experience in SRE, cloud platforms, or DevOps with large-scale microservices in production environments. Master's or Bachelor's degree in Computer Science or Electrical Engineering or CE or equivalent experience. Strong understanding of SRE principles, including error budgets, SLOs, and SLAs. Proficiency in incident, change, and problem management processes. Skilled in problem-solving, root cause analysis, and optimization. Experience with streaming data infrastructure services, such as Kafka and Spark. Expertise in building and operating large-scale observability platforms for monitoring and logging (e.g., ELK, Prometheus). Proficiency in programming languages such as Python, Go, Perl, or Ruby. Hands-on experience with scaling distributed systems in public, private, or hybrid cloud environments. Experience in deploying, supporting, and supervising services, platforms, and application stacks.
Ways To Stand Out From The Crowd
Experience operating large-scale distributed systems with strong SLAs. Excellent coding skills in Python and Go and extensive experience in operating data platforms. Knowledge of CI/CD systems, such as Jenkins and GitHub Actions. Familiarity with Infrastructure as Code (IaC) methodologies and tools. Excellent interpersonal skills for identifying and communicating data-driven insights.
NVIDIA leads the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for exceptional people like you to help us accelerate the next wave of artificial intelligence. JR1999109
Posted 2 weeks ago
6.0 years
0 Lacs
Pune, Maharashtra, India
On-site
Are you passionate about building and maintaining large-scale production systems that support advanced data science and machine learning applications? Do you want to join a team at the heart of NVIDIA's data-driven decision-making culture? If so, we have a great opportunity for you! NVIDIA is seeking a Senior Site Reliability Engineer (SRE) for the Data Science & ML Platform(s) team. The role involves designing, building, and maintaining services that enable real-time data analytics, streaming, data lakes, observability, and ML/AI training and inferencing. The responsibilities include implementing software and systems engineering practices to ensure high efficiency and availability of the platform, as well as applying SRE principles to improve production systems and optimize service SLOs. The role also involves collaborating with our customers to plan and implement changes to the existing system while monitoring capacity, latency, and performance. To succeed in this position, you need a strong background in SRE practices, systems, networking, coding, capacity management, cloud operations, continuous delivery and deployment, and open-source cloud-enabling technologies like Kubernetes and OpenStack. A deep understanding of the challenges and standard methodologies of running large-scale distributed systems in production, solving complex issues, automating repetitive tasks, and proactively identifying potential outages is also necessary. Furthermore, excellent communication and collaboration skills, and a culture of diversity, intellectual curiosity, problem solving, and openness are essential. As a Senior SRE at NVIDIA, you will have the opportunity to work on innovative technologies that power the future of AI and data science, and be part of a dynamic and supportive team that values learning and growth. The role provides the autonomy to work on meaningful projects with the support and mentorship needed to succeed, and contributes to a culture of blameless postmortems, iterative improvement, and risk-taking. If you are seeking an exciting and rewarding career that makes a difference, we invite you to apply now!
What You’ll Be Doing
Develop software solutions to ensure reliability and operability of large-scale systems supporting machine-critical use cases. Gain a deep understanding of our system operations, scalability, interactions, and failures to identify improvement opportunities and risks. Create tools and automation to reduce operational overhead and eliminate manual tasks. Establish frameworks, processes, and standard methodologies to enhance operational maturity and team efficiency and to accelerate innovation. Define meaningful and actionable reliability metrics to track and improve system and service reliability. Oversee capacity and performance management to facilitate infrastructure scaling across public and private clouds globally. Build tools to improve our service observability for faster issue resolution. Practice sustainable incident response and blameless postmortems.
What We Need To See
6+ years of experience in SRE, cloud platforms, or DevOps with large-scale microservices in production environments. Master's or Bachelor's degree in Computer Science or Electrical Engineering or CE or equivalent experience. Strong understanding of SRE principles, including error budgets, SLOs, and SLAs. Proficiency in incident, change, and problem management processes. Skilled in problem-solving, root cause analysis, and optimization. Experience with streaming data infrastructure services, such as Kafka and Spark. Expertise in building and operating large-scale observability platforms for monitoring and logging (e.g., ELK, Prometheus). Proficiency in programming languages such as Python, Go, Perl, or Ruby. Hands-on experience with scaling distributed systems in public, private, or hybrid cloud environments. Experience in deploying, supporting, and supervising services, platforms, and application stacks.
Ways To Stand Out From The Crowd
Experience operating large-scale distributed systems with strong SLAs. Excellent coding skills in Python and Go and extensive experience in operating data platforms. Knowledge of CI/CD systems, such as Jenkins and GitHub Actions. Familiarity with Infrastructure as Code (IaC) methodologies and tools. Excellent interpersonal skills for identifying and communicating data-driven insights.
NVIDIA leads the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for exceptional people like you to help us accelerate the next wave of artificial intelligence. JR1999109
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Warangal
Work from Office
Role & responsibilities
Distributed-Application-Deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-Automation: Expand infrastructure provisioning and deployment automation, treating everything as code with a focus on ease of configuration. Environment Stability using Observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative Engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scale & Resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and Incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products.
Preferred candidate profile
5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent, with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Noida
Work from Office
Role & responsibilities
Distributed-Application-Deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-Automation: Expand infrastructure provisioning and deployment automation, treating everything as code with a focus on ease of configuration. Environment Stability using Observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative Engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scale & Resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and Incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products.
Preferred candidate profile
5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent, with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Greater Noida
Work from Office
Role & responsibilities
Distributed-Application-Deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-Automation: Expand infrastructure provisioning and deployment automation, treating everything as code with a focus on ease of configuration. Environment Stability using Observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative Engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scale & Resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and Incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products.
Preferred candidate profile
5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent, with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Mumbai
Work from Office
Role & responsibilities
Distributed-Application-Deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-Automation: Expand infrastructure provisioning and deployment automation, treating everything as code with a focus on ease of configuration. Environment Stability using Observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative Engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scale & Resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and Incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products.
Preferred candidate profile
5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent, with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Pune
Work from Office
Role & responsibilities
Distributed-Application-Deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-Automation: Expand infrastructure provisioning and deployment automation, treating everything as code with a focus on ease of configuration. Environment Stability using Observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative Engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scale & Resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and Incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products.
Preferred candidate profile
5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent, with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Ghaziabad
Work from Office
Role & responsibilities
Distributed-Application-Deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-Automation: Expand infrastructure provisioning and deployment automation, treating everything as code with a focus on ease of configuration. Environment Stability using Observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative Engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scale & Resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and Incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products.
Preferred candidate profile
5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent, with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Bengaluru
Work from Office
Role & responsibilities
Distributed-Application-Deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-Automation: Expand infrastructure provisioning and deployment automation, treating everything as code with a focus on ease of configuration. Environment Stability using Observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative Engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scale & Resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and Incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products.
Preferred candidate profile
5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent, with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Khammam
Work from Office
Role & responsibilities
Distributed-Application-Deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-Automation: Expand infrastructure provisioning and deployment automation, treating everything as code with a focus on ease of configuration. Environment Stability using Observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative Engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scale & Resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and Incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products.
Preferred candidate profile
5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent, with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Faridabad
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Hyderabad
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Nizamabad
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Gurugram
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Karimnagar
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Vijayawada
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Chittoor
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Mandya
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Hassan
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Mysuru
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Navi Mumbai
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago
6.0 - 10.0 years
25 - 35 Lacs
Thane
Work from Office
Role & responsibilities: Distributed application deployment: Deploy, contribute to, and support a highly distributed networking and security product platform at scale. Hyper-automation: Expand infrastructure provisioning and deployment automation, treating everything as code, with a focus on ease of configuration. Environment stability using observability: Develop and enhance existing observability practices, including metrics and alerts, to maintain the stability of the infrastructure, conducting regular monitoring and proactive troubleshooting. Collaborative engagement: Engage closely with application owners and SRE (Site Reliability Engineering) team members to execute roadmap initiatives and continuously improve existing systems, fostering a collaborative and cohesive work environment. Scalable & resilient systems: Design and deploy systems and infrastructure that are scalable and resilient to failure, ensuring high availability and reliability across configured failure domains. Continuous monitoring and incident management: Participate in an on-call support rotation, providing timely resolution of issues and ensuring operational excellence in managing and maintaining distributed networking and security products. Preferred candidate profile: 5+ years of product development experience in embedded operating systems. BS/MS degree in Computer Science or equivalent with 5+ years of software engineering and development experience. Hands-on experience with multiple programming languages such as Golang (must), C++, Python, and Java. Hands-on experience with FIPS 140-2 and Common Criteria. Ability to implement all phases of the development cycle for a software product, from understanding requirements through design, development, and deployment. Working knowledge of virtualization technologies such as KVM and Docker. Working knowledge of cloud orchestration systems such as Kubernetes and OpenStack. Excellent written and verbal communication skills.
Posted 2 weeks ago