Senior System Software Engineer - Infrastructure

8 - 12 years

0 Lacs

Posted:2 weeks ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Senior System Software Engineer - Infrastructure at NVIDIA, you will be part of a groundbreaking team in Artificial Intelligence, High-Performance Computing, and Visualization. Your role will involve developing and implementing innovative architecture and software for large-scale storage and backup services to support AI/ML workloads and engineering workflows. **Role Overview:** You will be responsible for developing and managing enterprise-scale platforms that unify storage infrastructure and services, integrating enterprise appliances, networks, and open-source technologies. Your tasks will include developing and scaling REST APIs in Python/Go, automating storage operations, integrating intelligent observability and tracing into workflows, implementing agentic workflows, and building proof-of-concept integrations between infrastructure services and emerging agentic AI frameworks. **Key Responsibilities:** - Develop and manage enterprise-scale platforms that unify storage infrastructure and services - Develop and scale REST APIs in Python/Go - Automate storage operations for high reliability and performance - Integrate intelligent observability and tracing into workflows - Implement agentic workflows for self-healing and automation - Build proof-of-concept integrations with emerging agentic AI frameworks - Document practices and procedures, evaluate new technologies, and drive adoption of automation in enterprise storage services **Qualifications Required:** - BS in Computer Science (or equivalent experience) with 12+ years of relevant experience, MS with 10+ years, or Ph.D. with 8+ years - Extensive expertise in building large-scale, multi-threaded, distributed backend systems - Experience designing and building RESTful APIs using Python or Go - Familiarity with containerization & orchestration (Docker, Kubernetes) - Exposure to cloud platforms (AWS, Azure, GCP) - Experience with telemetry stacks (Prometheus, Grafana, Alert manager, ELK/Kibana) - Ability to collaborate across teams and communicate technical solutions effectively - Growth mindset to quickly adopt new frameworks in observability, AI automation, and infrastructure management If you want to stand out from the crowd, consider the following: - Contributions to open-source projects related to infrastructure, storage, or Python-based libraries - Strong background in Linux storage systems at an enterprise scale - Experience with Enterprise NAS (NetApp, Pure Storage), distributed filesystems (Lustre, GPFS, Ceph), or S3-compatible object storage - Experience with GenAI/agentic application frameworks or observability platforms - Proven track record in prototyping and productionizing intelligent automation workflows for large-scale infrastructure Join NVIDIA, where you will have the opportunity to work with some of the most forward-thinking people in the technology industry, competitive salaries, and a comprehensive benefits package. If you are creative and autonomous, NVIDIA wants to hear from you!,

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You