Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in pimpri chinchwad
>
Natobotics
>
On-Prem Infrastructure Engineer / SRE

On-Prem Infrastructure Engineer / SRE

Natobotics

10 years

0 Lacs

pimpri chinchwad maharashtra india

Posted:1 day ago| Platform:

Apply

Skills Required

reliability support engineering data management automation service monitoring analysis jenkins python optimization planning tuning efficiency resolve escalation collaboration documentation troubleshooting devops server scripting kubernetes mysql elasticsearch logstash debugging communication

Work Mode

On-site

Job Type

Full Time

Job Description

Location: Pan India

Experience:

5–10 Years

Role:

On-Prem Infrastructure Engineer / Site Reliability Engineer (SRE)

Job Summary

We are seeking a skilled On-Prem Infrastructure Engineer / SRE to manage and support NVIDIA’s on-prem engineering cloud infrastructure across multiple data centers. The ideal candidate will have strong experience in bare-metal infrastructure management, observability tools, automation, and production support. This role is critical in ensuring uptime, reliability, and operational excellence for engineering services.

Key Responsibilities

On-Prem Infrastructure Management

Manage and operate NVIDIA’s on-prem infrastructure across distributed data centers.
Maintain high availability, reliability, and readiness of on-prem engineering cloud environments.Perform lifecycle management of bare-metal servers and underlying hardware.

Service Level Management

Guard and maintain Service Level Agreements (SLAs) for mission-critical engineering services.
Implement and maintain monitoring, alerting, and incident response workflows.Drive root cause analysis (RCA), conduct post-mortems, and ensure corrective and preventive actions.

Observability & Monitoring

Deploy, configure, and manage observability tools such as

Prometheus, Grafana, ELK Stack

.Maintain KPI monitoring pipelines using

Jenkins, Python, and ELK

.Develop and enhance custom monitoring dashboards and business-specific alerting rules.

Automation & Optimization

Contribute to capacity planning, resource optimization, and performance tuning initiatives.Develop automation scripts/tools using

Python, Go, Bash

, or Jenkins pipelines.Improve operational efficiency through continuous automation.

Day-to-Day Operations & Support

Monitor system alerts, troubleshoot incidents, and resolve user-reported issues.Participate in

WAR rooms

during major or high-impact incidents.Ensure timely escalation and resolution of production issues.

Collaboration & Documentation

Create and maintain technical documentation for operational procedures, architectures, and troubleshooting steps.Work closely with engineering, DevOps, hardware, and data center teams to improve overall infrastructure reliability.

Required Skills & Experience

Strong hands-on experience in

bare-metal server management

using tools such as:

IPMI, Redfish, KVM

or similar technologies.

Experience With Automation And Scripting Using

Python, Go, Bash, Jenkins (CI/CD pipelines)

Practical Experience With Infrastructure Tools

Kubernetes, MySQL, Prometheus, Grafana, ELK (Elasticsearch, Logstash, Kibana)

.Solid understanding of system performance, capacity planning, and datacenter operations.Strong troubleshooting, incident-response, and operational debugging skills.Ability to work in fast-paced environments and handle production-critical scenarios.

Nice-to-Have Skills

Familiarity with

NVIDIA hardware

: GPUs, Tegra systems, DGX platforms, etc.Experience in large-scale distributed systems or high-performance computing environments.

Soft Skills

Strong communication and collaboration abilities.Analytical mindset with a focus on problem-solving.Ability to maintain composure under pressure in incident environments.Detail-oriented with strong documentation habits.ocumentation habits.

More Jobs at Natobotics

Principal Information Security Specialist

Mumbai, Maharashtra, India

Experience: Not specified

Salary: Not disclosed

React Developer

Pimpri Chinchwad, Maharashtra, India

Experience: Not specified

Salary: Not disclosed

VP - Principal Information Security

Mumbai, Maharashtra, India

8.0 - 8.0 yrs

Salary: Not disclosed

VP - FinOps

Mumbai, Maharashtra, India

5.0 - 5.0 yrs

Salary: Not disclosed

Finops - VP

Cuddalore, Tamil Nadu, India

10.0 - 10.0 yrs

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Natobotics

RecommendedJobs for You

On-Prem Infrastructure Engineer / SRE

Natobotics

pimpri chinchwad, maharashtra, india

On-Prem Infrastructure Engineer / SRE

Natobotics

pimpri chinchwad, maharashtra, india

Login to

Please Verify Your Phone or Email

Confirm Action

On-Prem Infrastructure Engineer / SRE