Infrastructure Engineer (VMware , OpenShift , GPU /CPU)

4 - 8 years

3 - 10 Lacs

Posted:18 hours ago| Platform: Foundit logo

Apply

Skills Required

Work Mode

On-site

Job Type

Full Time

Job Description

Key Responsibilities:

Create and manage development (Dev), UAT, and production (Prod) environments on bare metal and Red Hat Linux-based servers.

Harden Linux servers for security compliance, ensuring systems pass VAPT (Vulnerability Assessment & Penetration Testing).

Develop CI/CD pipelines from GitHub to Linux-based VMs running OpenShift Kubernetes clusters.

Ensure high availability, observability, and proactive alerting for the HybridAI SaaS platform.

Automate deployment of the HybridAI InfraMetrics Collector in customer on- prem environments.

Work with VMware vCenter and Kubernetes Cluster APIs to manage infrastructure resources and automate deployments and provide guidance on VM Optimizations.

Enable build cycles with expertise on virtualization, container orchestration, and hybrid infrastructure.

Deploy LLMs on GPU infrastructure ensuring optimal resource allocation and scaling for AI-driven applications.

Monitor infrastructure performance and implement proactive scaling solutions.

Collaborate with Head of Software Engineering to enforce API security, access control, and compliance policies.

Implement secure and compliant infrastructure aligned with ISO 27001 standards.

Experience:

  • 4+ years of hands-on DevOps and infrastructure engineering experience managing enterprise-grade datacenter environments.
  • Strong experience with Red Hat Linux and bare metal infrastructure management.
  • Expertise in Linux security hardening (firewall configuration, SELinux, system patching).
  • Deep knowledge of OpenShift Kubernetes (OCP) and container orchestration.
  • Hands-on experience in CPU/GPU profiling, resource allocation, and performance tuning
  • Experience with infrastructure as code (Terraform, Ansible)
  • Proficiency in CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, ArgoCD) for OpenShift & Linux-based deployments.
  • Hands-on experience with VMware stack (ESXi, vCenter, vMotion)
  • Cloud and on-prem experience, with exposure to AWS, GCP, Azure, and private cloud platforms.
  • Scripting and automation expertise (Bash, Python, Powershell).
  • Strong security background, including API security, authentication (OAuth, JWT, mTLS), and compliance with CIS benchmarks.
  • Experience with any of observability and monitoring tools, including:NVIDIA DCGM, Prometheus & Grafana,ELK Stack,DataDog, Splunk, or AppDynamics
  • Solid experience in ISO 27001 compliance, security best practices, and policy implementation
  • Comfortable working in agile, very fast-paced startup environments with ownership of infra outcomes

Nice-to-Have Skills

  • Experience with service mesh architectures (Istio, Linkerd).
  • Familiarity with Zero Trust security models.
  • Exposure to air-gapped Kubernetes deployments for security-sensitive environments.
  • Experience with automated compliance enforcement tools (OpenSCAP, Falco, Aqua Security).
  • Knowledge of hybrid cloud networking (VPCs, VPNs, private links between on-prem and cloud).
  • Hands-on experience with HashiCorp Vault for secrets management.
  • Exposure to additional compliance frameworks such as SOC 2 or NIST
  • Experience with AI/ML or HPC workloads beyond LLM applications

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Uplers logo
Uplers

Digital Services

Ahmedabad

RecommendedJobs for You