Jobs
Interviews

9 Ebpf Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 9.0 years

0 Lacs

karnataka

On-site

As an AI Infrastructure Engineer at Cisco, you will join an innovative team with a mission to revolutionize how enterprises utilize AI. Operating with the agility of a startup and the focus of an incubator, we are building a close-knit group of AI and infrastructure experts. Our team is driven by bold ideas and a shared goal: to rethink systems from the ground up and deliver breakthrough solutions that redefine what is possiblefaster, leaner, and smarter. In this dynamic environment, where experimentation is abundant and new technologies are not just welcome but expected, you will collaborate with seasoned engineers, architects, and thinkers. Together, we craft iconic products that have the potential to reshape industries and unlock entirely new operational models for enterprises. If you are energized by solving challenging problems, enjoy pushing the boundaries of what is achievable, and aspire to shape the future of AI infrastructure, we are eager to meet you. Your role as an AI Infrastructure Engineer at Cisco will be instrumental in designing and implementing next-generation AI products. You will be focused on delivering high-performance, efficient, and reliable solutions that power AI workloads across Cisco's ecosystem. Your work will directly impact the performance, efficiency, reliability, and availability of AI systems for Cisco's customers, as well as drive advancements in AI and machine learning infrastructure. Key Responsibilities: - Design and develop node-level infrastructure components to support high-performance AI workloads. - Benchmark, analyze, and optimize the performance of AI infrastructure, including CUDA kernels and memory management for GPUs. - Minimize downtime through seamless configuration and upgrade architecture for software components. - Manage the installation and deployment of AI infrastructure on Kubernetes clusters, utilizing CRDs and operators. - Develop and deploy efficient telemetry collection systems for nodes and hardware components without impacting workload performance. - Work with distributed system fundamentals to ensure scalability, resilience, and reliability. - Collaborate across teams and time zones to shape the overall direction of AI infrastructure development and achieve shared goals. Minimum Qualifications: - Proficiency in programming languages such as Rust, C/C++, Golang, Python, or eBPF. - Strong understanding of Linux operating systems, including user space and kernel-level components. - Experience with Linux user space development, including packaging, logging, telemetry, and lifecycle management of processes. - Strong understanding of Kubernetes (K8s) and related technologies, such as custom resource definitions (CRDs). - Strong debugging and problem-solving skills for complex system-level issues. - Bachelor's degree or higher and a minimum of 5 years of relevant engineering work experience. Preferred Qualifications: - Linux kernel and device driver hands-on expertise is a plus. - Experience in GPU programming and optimization, including CUDA, UCX is a plus. - Experience with high-speed data transfer technologies such as RDMA. - Use of Nvidia GPU operators and Nvidia container toolkit and Nsight, CUPTI. - Nvidia MIG and MPS concepts for managing GPU consumption. At Cisco, we are dedicated to fostering an inclusive future where every individual brings their unique skills and perspectives together. Our employees celebrate diversity and focus on unlocking potential. We prioritize learning and development at every stage, offering opportunities for growth and multiple career paths. Our technology, tools, and culture support hybrid work trends, allowing everyone to excel and thrive. As a company, we recognize our responsibility to bring communities together, and our people are at the heart of that mission. One-third of our employees collaborate in our 30 employee resource organizations, known as Inclusive Communities, to connect, foster belonging, advocate for inclusivity, and make a positive impact. We provide dedicated paid time off for volunteering, allowing our employees to give back to causes they are passionate aboutnearly 86% of our employees actively participate. At Cisco, our purpose is driven by our people, making us a global leader in technology that powers the internet. We help our customers reimagine their applications, secure their enterprise, transform their infrastructure, and achieve their sustainability goals. Every step we take is a step towards a more inclusive future for all. Join us in taking your next step and being your authentic self with Cisco.,

Posted 1 week ago

Apply

8.0 - 12.0 years

0 Lacs

karnataka

On-site

Gruve is an innovative software services startup dedicated to transforming enterprises into AI powerhouses. Specializing in cybersecurity, customer experience, cloud infrastructure, and advanced technologies such as Large Language Models (LLMs), our mission is to assist customers in utilizing their data for making intelligent decisions. As a well-funded early-stage startup, Gruve offers a dynamic environment with strong customer and partner networks. We are currently looking for an experienced Staff Engineer with expertise in SONiC (Software for Open Networking in the Cloud), Networking, Security, and Linux. The ideal candidate should have a deep understanding of data plane and control plane networking, security mechanisms, and open-source networking stacks. In this role, you will be instrumental in designing, developing, and optimizing high-performance networking solutions based on SONiC, working on switch OS internals, and ensuring security at all levels. Key Responsibilities: - Design, develop, and optimize SONiC-based networking solutions for data center and cloud environments. - Contribute to SONiC's Control Plane, Data Plane, SAI (Switch Abstraction Interface), and integration with ASICs. - Develop and enhance network security mechanisms, including ACLs, firewall rules, and secure communication protocols. - Work with Linux kernel networking stack, DPDK, eBPF, and other high-performance packet processing frameworks. - Integrate and optimize FRR (Free Range Routing), BGP, OSPF, and other routing protocols within SONiC. - Collaborate with ASIC vendors to integrate new chipsets with SONiC through SAI API development. - Drive software development using C, C++, Python, and Go for various networking and security features. - Optimize Netfilter, iptables, nftables, and XDP/eBPF for security and performance enhancements. - Design and implement Zero Trust Security models for networking and cloud infrastructure. - Work on containerized networking (CNI), Kubernetes networking, and SDN solutions. - Debug and troubleshoot networking and security issues using tcpdump, Wireshark, gdb, strace, and perf tools. - Contribute to open-source networking projects and engage with the SONiC community. Basic Qualifications: - Bachelor's or Master's degree in computer science, electronics engineering, or a related field. - 8-12 years of experience in networking software development, security, and Linux systems programming. - Strong expertise in SONiC architecture, SAI, and open networking platforms. - Proficiency in L2/L3 networking protocols (BGP, OSPF, MPLS, VXLAN, EVPN, etc.). - Strong knowledge of network security concepts, including firewalling, VPNs, and DDoS mitigation. - Experience with Linux networking internals, Netfilter, iptables, nftables, XDP, and eBPF. - Proficiency in C, C++, Python, and Go for networking software development. - Strong debugging skills using tcpdump, Wireshark, gdb, strace, perf, and ASAN. - Experience working with network ASICs, Broadcom, Mellanox, Marvell, or Intel-based chipsets. - Good understanding of container networking, Kubernetes CNI, and SDN concepts. - Hands-on experience with CI/CD, Git, Jenkins, and automated testing frameworks. Preferred Qualifications: - Experience in DPDK, P4 programming, and FPGA-based networking solutions. - Contributions to open-source networking projects (SONiC, FRR, Linux kernel, etc.). - Knowledge of TLS, IPSec, MACsec, and secure boot mechanisms. - Experience working with public cloud networking (AWS, Azure, GCP). At Gruve, we value innovation, collaboration, and continuous learning. We strive to create a diverse and inclusive workplace where everyone can excel and contribute their best work. If you are passionate about technology and eager to make a difference, we encourage you to apply. Gruve is an equal opportunity employer, welcoming applicants from all backgrounds. We appreciate all applications, but only candidates selected for an interview will be contacted.,

Posted 1 week ago

Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

We are an innovation team on a mission to transform how enterprises harness AI. Operating with the agility of a startup and the focus of an incubator, we're building a tight-knit group of AI and infrastructure experts driven by bold ideas and a shared goal: to rethink systems from the ground up and deliver breakthrough solutions that redefine what's possible faster, leaner, and smarter. We thrive in a fast-paced, experimentation-rich environment where new technologies aren't just welcome they're expected. Here, you'll work side-by-side with seasoned engineers, architects, and thinkers to craft the kind of iconic products that can reshape industries and unlock entirely new models of operation for the enterprise. If you're energized by the challenge of solving hard problems, love working at the edge of what's possible, and want to help shape the future of AI infrastructure we'd love to meet you. Cisco is seeking a forward-thinking AI Infrastructure Engineer to help design and implement the next-generation AI products. This role will focus on delivering high-performance, efficient, and reliable solutions that power AI workloads across Cisco's ecosystem. As an AI Infrastructure Engineer at Cisco, you will play a pivotal role in shaping the AI systems that enable cutting-edge innovations. Your work will directly impact: - The performance and efficiency of AI workloads on the node. - The reliability and availability of AI systems for Cisco's customers. - Advancements in AI and machine learning infrastructure, enabling better utilization and improving efficiency for applications across industries. - Collaboration across internal teams to bring system-level innovation across different Cisco products. Your contributions will help Cisco maintain its leadership in AI infrastructure development and influence the broader AI and machine learning community. Key Responsibilities: - Design and develop node-level infrastructure components to support high-performance AI workloads. - Benchmark, analyze, and optimize the performance of AI infrastructure, including CUDA kernels and memory management for GPUs. - Minimize downtime through seamless config and upgrade architecture for software components. - Manage the installation and deployment of AI infrastructure on Kubernetes clusters, including the use of CRDs and operators. - Develop and deploy efficient telemetry collection systems for nodes and hardware components without impacting workload performance. - Work with distributed system fundamentals to ensure scalability, resilience, and reliability. - Collaborate across teams and time zones to shape the overall direction of AI infrastructure development and achieve shared goals. Minimum Qualifications: - Proficiency in programming languages such as Rust, C/C++, Golang, Python, or eBPF. - Strong understanding of Linux operating systems, including user space and kernel-level components. - Experience with Linux user space development, including packaging, logging, telemetry, and lifecycle management of processes. - Strong understanding of Kubernetes (K8s) and related technologies, such as custom resource definitions (CRDs). - Strong debugging and problem-solving skills for complex system-level issues. - Bachelor's degree+ and relevant 5+ years of Engineering work experience. Preferred Qualifications: - Linux kernel and device driver hands-on expertise is a plus. - Experience in GPU programming and optimization, including CUDA, UCX is a plus. - Experience with high-speed data transfer technologies such as RDMA. - Use of Nvidia GPU operators and Nvidia container toolkit and Nsight, CUPTI. - Nvidia MIG and MPS concepts for managing GPU consumption. #WeAreCisco where every individual brings their unique skills and perspectives together to pursue our purpose of powering an inclusive future for all. Our passion is connection we celebrate our employees" diverse set of backgrounds and focus on unlocking potential. Cisconians often experience one company, many careers where learning and development are encouraged and supported at every stage. Our technology, tools, and culture pioneered hybrid work trends, allowing all to not only give their best, but be their best. We understand our outstanding opportunity to bring communities together and at the heart of that is our people. One-third of Cisconians collaborate in our 30 employee resource organizations, called Inclusive Communities, to connect, foster belonging, learn to be informed allies, and make a difference. Dedicated paid time off to volunteer 80 hours each year allows us to give back to causes we are passionate about, and nearly 86% do! Our purpose, driven by our people, is what makes us the worldwide leader in technology that powers the internet. Helping our customers reimagine their applications, secure their enterprise, transform their infrastructure, and meet their sustainability goals is what we do best. We ensure that every step we take is a step towards a more inclusive future for all. Take your next step and be you, with us!,

Posted 1 week ago

Apply

6.0 - 11.0 years

19 - 34 Lacs

Pune, Chennai, Bengaluru

Hybrid

We are looking for an experienced Senior Networking Developer with expertise in developing and optimizing networking systems and protocol stacks. This role involves designing, implementing, and troubleshooting high-performance network solutions at the system and protocol level. Required Skills and Qualifications Technical Expertise: Experience in networking protocol or system-level development. Strong proficiency in C or C++ for low-level systems programming. In-depth knowledge of networking protocols, including L2/L3 protocols (e.g., Ethernet, ARP, IP, ICMP) and L4 protocols (e.g., TCP, UDP). Familiarity with advanced networking concepts such as QoS, load balancing, and multicast. System-Level Knowledge: Experience working with Linux networking subsystems (e.g., netfilter, iptables, eBPF). Hands-on experience with kernel-level networking stack development and optimization. Debugging and Analysis: Proficiency in network diagnostics tools (e.g., Wireshark, tcpdump, strace, gdb). Ability to troubleshoot issues at all layers of the OSI model. Preferred Skills Experience with high-performance networking technologies Experience with programmable network devices Experience with SDN controllers and related protocols like OpenFlow.

Posted 2 weeks ago

Apply

6.0 - 11.0 years

19 - 34 Lacs

Pune, Chennai, Bengaluru

Hybrid

We are looking for an experienced Senior Networking Developer with expertise in developing and optimizing networking systems and protocol stacks. This role involves designing, implementing, and troubleshooting high-performance network solutions at the system and protocol level. Required Skills and Qualifications Technical Expertise: Experience in networking protocol or system-level development. Strong proficiency in C or C++ for low-level systems programming. In-depth knowledge of networking protocols, including L2/L3 protocols (e.g., Ethernet, ARP, IP, ICMP) and L4 protocols (e.g., TCP, UDP). Familiarity with advanced networking concepts such as QoS, load balancing, and multicast. System-Level Knowledge: Experience working with Linux networking subsystems (e.g., netfilter, iptables, eBPF). Hands-on experience with kernel-level networking stack development and optimization. Debugging and Analysis: Proficiency in network diagnostics tools (e.g., Wireshark, tcpdump, strace, gdb). Ability to troubleshoot issues at all layers of the OSI model. Preferred Skills Experience with high-performance networking technologies Experience with programmable network devices Experience with SDN controllers and related protocols like OpenFlow.

Posted 2 weeks ago

Apply

10.0 - 14.0 years

0 Lacs

karnataka

On-site

You are joining an innovation team with a mission to revolutionize how enterprises utilize AI. Operating with the agility of a startup and the focus of an incubator, we are assembling a close-knit group of AI and infrastructure experts fueled by bold ideas and a common objective: to reimagine systems from the ground up and deliver groundbreaking solutions that redefine what's achievable - faster, leaner, and smarter. In our fast-paced, experimentation-rich environment, where new technologies are not just embraced but expected, you will collaborate closely with seasoned engineers, architects, and visionaries to develop iconic products capable of reshaping industries and introducing entirely new operational models for enterprises. If you are invigorated by the prospect of tackling challenging problems, enjoy pushing the boundaries of what is possible, and are eager to contribute to shaping the future of AI infrastructure, we are excited to connect with you. As Cisco seeks a forward-thinking Architect for AI Infrastructure Software, you will play a pivotal role in spearheading the development of the next-generation AI infrastructure platform. This strategic leadership position at the intersection of software engineering and AI systems will require you to define the vision, architecture, and execution of high-performance software that directly influences how enterprises deploy, scale, and optimize AI workloads. Your responsibilities will include mentoring a high-caliber team, delivering robust control and data plane solutions, and operating them as a SaaS service with a relentless focus on uptime, quality, and customer success. Additionally, you will guide strategic decisions on resource usages in generative AI systems and collaborate across functions to align product direction with infrastructure capabilities. Key Responsibilities: - Architect and develop a SaaS control plane emphasizing ease of use, scalability, and reliability. - Design data models to drive APIs, ensuring best practices for usability and operations. - Utilize Kubernetes (K8s) to build scalable, resilient, and high-availability (HA) architectures. - Demonstrate a profound understanding of Nvidia and AMD metric collection and AI-driven analysis. - Plan and coordinate engineering work, map tasks to releases, conduct code reviews, and address technical challenges to facilitate releases. - Generate architecture specifications and develop proof-of-concept (POC) solutions for clarity as necessary. - Collaborate with product management to comprehend customer requirements and build architecturally sound solutions, working closely with engineers on implementation to ensure alignment with architectural requirements. - Manage technical debt with a strong emphasis on upholding product quality. - Integrate AI tools into everyday engineering practices, including code reviews, early bug detection, and test coverage automation. Required Skills: - Deep expertise in Golang, Python, C++, eBPF. - Proficiency in Kubernetes (K8s), Helm, Kubebuilder, K8S Operator pattern. - Hands-on experience with CI/CD pipelines and their impact on release quality. - Demonstrated experience in building and running SaaS services. - Strong design skills in distributed systems and large-scale data collection. - Familiarity with SLA/SLO principles and managing application scalability. - Practical experience with the NVIDIA stack and CUDA development. Minimum Qualifications: - Demonstrable experience in Golang development. - Leading CI/CD tools and API-first design practices. - Operations of Kubernetes for running SaaS services. - AI tools and generative AI applications for engineering. - Comprehensive understanding of software release processes, including the use of feature flags to ensure predictability. - Proficiency in utilizing agents during coding, review, CI, and CD processes. - Bachelor's degree or equivalent with 10+ years of engineering experience. Preferred Qualifications: - Proven leadership experience in building and guiding SaaS software teams in high-growth, dynamic environments. - Master's degree or equivalent. #WeAreCisco #WeAreCisco where every individual brings their unique skills and perspectives together to pursue our purpose of powering an inclusive future for all. Our passion is connection - we celebrate our employees" diverse set of backgrounds and focus on unlocking potential. Cisconians often experience one company, many careers where learning and development are encouraged and supported at every stage. Our technology, tools, and culture pioneered hybrid work trends, allowing all to not only give their best but be their best. We understand our outstanding opportunity to bring communities together, and at the heart of that is our people. One-third of Cisconians collaborate in our 30 employee resource organizations, called Inclusive Communities, to connect, foster belonging, learn to be informed allies, and make a difference. Dedicated paid time off to volunteer - 80 hours each year - allows us to give back to causes we are passionate about, and nearly 86% do! Our purpose, driven by our people, is what makes us the worldwide leader in technology that powers the internet. Helping our customers reimagine their applications, secure their enterprise, transform their infrastructure, and meet their sustainability goals is what we do best. We ensure that every step we take is a step towards a more inclusive future for all. Take your next step and be you, with us!,

Posted 2 weeks ago

Apply

3.0 - 5.0 years

2 - 5 Lacs

Bengaluru / Bangalore, Karnataka, India

On-site

The Exaleap Performance Engineering team drives performance optimization and efficiency improvements by working as a trusted expert to teams across the entire engineering organization. We are looking to add a talented, Linux system performance engineer to work on industry-leading performance observability and analysis tools. The team has pioneered and built extensive profiling and eBPF-based tracing tools/Node Exporter tools/GPU Profiling tools and utilities. You will work on building cutting-edge observability, visualization, and analytics tooling to help us stay at the forefront of our domain. What you will do: Build, enhance, and operate performance observability, data analysis and visualization, and benchmarking tools. Learn, evaluate, and integrate new tools and technologies Engage with performance engineers and end engineering users to understand their needs and improve their experience Maintain strong relationships with cross-functional teams through clear communication. Must-Have Skills: 2-5 years of professional Software Engineering experience Demonstrated proficiency in GO language Excellent communication skills and ability to work in a team environment. Good Knowledge Linux performance tools like eBPFBCC tools Nice-to-Have Skills: Good understanding of systems (server) and performance engineering concepts. Knowledge of statistical analysis and data visualization techniques. Exposure to observability tools for machine learning (ML) models(Nvidia -SMI HTApytorch profiler/ DCGM Exporter. Mandatory Key Skills data visualization, performance engineering, machine learning, performance optimization, GO language*,Linux*, eBPF*

Posted 1 month ago

Apply

3.0 - 5.0 years

16 - 18 Lacs

Bengaluru

Work from Office

The Role The Exaleap Performance Engineering team drives performance optimization and efficiency improvements by working as a trusted expert to teams across the entire engineering organization. We are looking to add a talented, Linux system performance engineer to work on industry-leading performance observability and analysis tools. The team has pioneered and built extensive profiling and eBPF-based tracing tools/Node Exporter tools/GPU Profiling tools and utilities. You will work on building cutting-edge observability, visualization, and analytics tooling to help us stay at the forefront of our domain. What you will do: Build, enhance, and operate performance observability, data analysis and visualization, and benchmarking tools. Learn, evaluate, and integrate new tools and technologies Engage with performance engineers and end engineering users to understand their needs and improve their experience Maintain strong relationships with cross-functional teams through clear communication. Must-Have Skills: 2-5 years of professional Software Engineering experience Demonstrated proficiency in GO language Excellent communication skills and ability to work in a team environment. Good Knowledge Linux performance tools like eBPFBCC tools Nice-to-Have Skills: Good understanding of systems (server) and performance engineering concepts. Knowledge of statistical analysis and data visualization techniques. Exposure to observability tools for machine learning (ML) models(Nvidia -SMI HTApytorch profiler/ DCGM Exporter. Mandatory Key Skills data visualization,performance engineering,machine learning,performance optimization,GO language*,Linux*,eBPF*

Posted 2 months ago

Apply

5 - 9 years

7 - 11 Lacs

Bengaluru

Work from Office

About the Job: The Red Hat Performance and Scale Engineering org is looking for a Senior Software Engineer to join us in the OpenShift Virtualization (OCPv) Performance and Scale team. Red Hat OpenShift Virtualization, an included feature of Red Hat OpenShift, provides a modern platform for organizations to run and deploy their new and existing virtual machine (VM) workloads. The solution allows for easy migration and management of traditional virtual machines onto a trusted, consistent, and comprehensive hybrid cloud application platform. As a senior member of the team, you will be responsible for providing comprehensive storage performance and scalability assessments of Red Hat OpenShift Virtualization (OCPv). Our goal is to make OCPv the platform of choice for Red Hat's enterprise customers for leveraging virtualization technologies. You will help us achieve such goals through targeted improvements in performance and scalability of the OCPv platform. This role needs an engineer that thinks creatively, adapts to rapid change, and has the willingness to learn and apply new technologies. You will be joining a vibrant open source culture, and helping promote performance and innovation in this Red Hat engineering team. The broader mission of the Performance and Scale team is to establish performance and scale leadership of the Red Hat product and cloud services portfolio. The scope includes component level, system and solution analysis and targeted enhancements. The team collaborates with engineering, product management, product marketing and customer support as well as Red Hat's hardware and software ecosystem partners. What will you do? Formulate test plans, and carry out performance and scalability benchmarks against various storage components/features of the OCPv platform to characterize performance, drive product performance improvements, and detect performance regressions through data analysis and visualization Develop tools and automation to aid the performance benchmarking work Collaborate with other engineering teams to resolve performance issues Triage, debug, and solve customer/partner cases related to virtualization storage performance and scale Publish results, conclusions, recommendations and best practices via internal test reports, presentations, external blogs and official documentation to support our partners and customers. Participate in internal and external conferences about your work and results What will you bring? Performance benchmarking, data capture, data analysis, and data Experience with storage systems and protocols (NAS, SAN, NFS, iSCSI, RBD, etc) Experience with testing windows technologies like MsSql, Win Desktop Citrix VDI, .Net etc. Experience with container technologies (podman, Kubernetes) Experience with systems performance engineering and metrics collection and analysis tools such as iostat, vmstat, sar, perf, pcp, prometheus, Grafana, Elasticsearch Programming experience in Python Experience working with the Linux operating system Excellent written and verbal language skills in English The following are considered as a plus: 5+ years of relevant experience Experience of working with virtualization technologies such as VMware Familiarity with storage APIs (snapshot, clone, provision, attach), Data Protection and Disaster Recovery Experience of working with Ansible automation platform Knowledge of performance observability/profiling tools like eBPF, Flame Graphs Bachelor degree in Computer Science or related fields Experience of Git or similar version control system

Posted 2 months ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies