Senior Specialist – Cloud Engineering

11 years

0 Lacs

Posted: 1 week ago | Platform: LinkedIn

Work Mode: Remote

Job Type: Full Time

Job Description

Experience: 7–11 Years

Location: Bengaluru

Type: Permanent

Role: Senior Infrastructure Automation Engineer – Zero-Touch GPU Cloud Build & Upgrade

We are seeking a Senior Infrastructure Automation Engineer with 10 years of hands-on experience in building and scaling infrastructure automation systems to lead the design and implementation of a Zero-Touch Build, Upgrade, and Certification framework for our on-prem GPU cloud environment.

This role demands deep technical expertise across bare-metal provisioning, configuration management, and full-stack automation, from hardware to Kubernetes, built entirely on GitOps principles.

Key Responsibilities

  • Architect, lead, and implement a fully automated zero-touch deployment pipeline for GPU cloud infrastructure, spanning hardware, OS, and Kubernetes platform layers.
  • Build robust GitOps-based workflows to manage the full infrastructure lifecycle — from provisioning to continuous compliance.
  • Design and maintain automation for:
    • Bare-metal control: power cycling, provisioning, remote installs
    • Firmware & configuration flashing (BIOS, NIC, RAID, etc.)
    • Hardware inventory management
    • Configuration drift detection and remediation
  • Develop and extend internal automation frameworks using Ansible, Python, and related infrastructure tooling.
  • Serve as a technical authority and mentor, guiding junior engineers and collaborating with hardware, SRE, and platform engineering teams.
  • Lead architectural and design reviews for infrastructure automation systems.
  • Define and implement best practices for Infrastructure as Code (IaC), compliance, and operational resilience.
  • Champion automation-driven operational models, reducing manual intervention to near zero.

Bonus: Familiarity with Terraform, Chef, and cloud automation platforms.

Required Skills & Experience

  • 10 years of hands-on experience in infrastructure engineering, automation, and systems design, with a strong track record of delivering scalable, maintainable solutions.
  • Primary skills: Ansible, Python, ipmitool, firmware scripting, Linux shell scripting.
  • Deep expertise in:
    • Ansible for automation & configuration management
    • Python for scripting, integration, and automation logic
    • ipmitool & related tools for low-level hardware management (IPMI, Redfish)
  • Proven experience with bare-metal automation in data center environments, including:
    • Power control & PXE booting
    • BIOS/NIC/RAID firmware upgrades
    • Hardware & platform inventory systems
  • Strong foundation in Linux systems, networking, and Kubernetes infrastructure.
  • Fluency with GitOps workflows and tools.
  • Experience with CI/CD systems and Git-based pipelines for infrastructure.
  • Familiarity with infrastructure monitoring, logging, and drift detection.
  • Strong cross-team collaboration & communication skills, especially across hardware, platform, and SRE teams.

Bonus

  • Prior leadership or mentorship experience
  • Contribution to or maintenance of open-source infrastructure projects
  • Exposure to GPU-based compute stacks & high-performance workloads

Skills

Mandatory Skills:

  • Ansible
  • Terraform
  • Puppet
  • Chef
  • Scripting – Shell / PowerShell / Python
