5.0 - 9.0 years
0 Lacs
Karnataka
On-site
As an AI Infrastructure Engineer at Cisco, you will join an innovative team with a mission to revolutionize how enterprises utilize AI. Operating with the agility of a startup and the focus of an incubator, we are building a close-knit group of AI and infrastructure experts. Our team is driven by bold ideas and a shared goal: to rethink systems from the ground up and deliver breakthrough solutions that redefine what is possible: faster, leaner, and smarter.

In this dynamic environment, where experimentation is abundant and new technologies are not just welcome but expected, you will collaborate with seasoned engineers, architects, and thinkers. Together, we craft iconic products that have the potential to reshape industries and unlock entirely new operational models for enterprises. If you are energized by solving challenging problems, enjoy pushing the boundaries of what is achievable, and aspire to shape the future of AI infrastructure, we are eager to meet you.

Your role as an AI Infrastructure Engineer at Cisco will be instrumental in designing and implementing next-generation AI products. You will focus on delivering high-performance, efficient, and reliable solutions that power AI workloads across Cisco's ecosystem. Your work will directly impact the performance, efficiency, reliability, and availability of AI systems for Cisco's customers, and will drive advancements in AI and machine learning infrastructure.

Key Responsibilities:
- Design and develop node-level infrastructure components to support high-performance AI workloads.
- Benchmark, analyze, and optimize the performance of AI infrastructure, including CUDA kernels and memory management for GPUs.
- Minimize downtime through seamless configuration and upgrade architecture for software components.
- Manage the installation and deployment of AI infrastructure on Kubernetes clusters, utilizing CRDs and operators.
- Develop and deploy efficient telemetry collection systems for nodes and hardware components without impacting workload performance.
- Work with distributed system fundamentals to ensure scalability, resilience, and reliability.
- Collaborate across teams and time zones to shape the overall direction of AI infrastructure development and achieve shared goals.

Minimum Qualifications:
- Proficiency in programming languages such as Rust, C/C++, Golang, Python, or eBPF.
- Strong understanding of Linux operating systems, including user space and kernel-level components.
- Experience with Linux user space development, including packaging, logging, telemetry, and lifecycle management of processes.
- Strong understanding of Kubernetes (K8s) and related technologies, such as custom resource definitions (CRDs).
- Strong debugging and problem-solving skills for complex system-level issues.
- Bachelor's degree or higher and a minimum of 5 years of relevant engineering work experience.

Preferred Qualifications:
- Hands-on expertise with the Linux kernel and device drivers is a plus.
- Experience in GPU programming and optimization, including CUDA and UCX, is a plus.
- Experience with high-speed data transfer technologies such as RDMA.
- Experience with Nvidia GPU operators, the Nvidia container toolkit, Nsight, and CUPTI.
- Familiarity with Nvidia MIG and MPS concepts for managing GPU consumption.

At Cisco, we are dedicated to fostering an inclusive future where every individual brings their unique skills and perspectives together. Our employees celebrate diversity and focus on unlocking potential. We prioritize learning and development at every stage, offering opportunities for growth and multiple career paths. Our technology, tools, and culture support hybrid work trends, allowing everyone to excel and thrive.

As a company, we recognize our responsibility to bring communities together, and our people are at the heart of that mission. One-third of our employees collaborate in our 30 employee resource organizations, known as Inclusive Communities, to connect, foster belonging, advocate for inclusivity, and make a positive impact. We provide dedicated paid time off for volunteering, allowing our employees to give back to causes they are passionate about; nearly 86% of our employees actively participate.

At Cisco, our purpose is driven by our people, making us a global leader in technology that powers the internet. We help our customers reimagine their applications, secure their enterprise, transform their infrastructure, and achieve their sustainability goals. Every step we take is a step towards a more inclusive future for all. Join us in taking your next step and being your authentic self with Cisco.
Posted 1 week ago