Linux Engineer (KVM)

5 - 10 years

15 - 22 Lacs

Posted:None| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

We are looking for a skilled Linux Virtualization & HA Engineer to join our infrastructure team (design and maintain a fault-tolerant KVM cluster using Pacemaker and other open-source tools).

Key Responsibilities

  • Design, implement, and manage KVM-based virtualization on enterprise Linux platforms (RHEL/SUSE/Debian-based).
  • Deploy and configure Pacemaker and Corosync for HA cluster management.
  • Integrate shared storage, or for replicated VM storage.
  • Implement fencing/STONITH mechanisms to protect data integrity in failover scenarios.
  • Create and manage libvirt XML definitions and manage VMs via virsh and related tooling.
  • Configure pacemaker remote or equivalent to integrate guest VMs into HA clusters.
  • Develop and maintain cluster resource definitions, constraints, and failover policies.
  • Perform routine patching, upgrades, and cluster health checks.
  • Monitor and troubleshoot performance, network, and storage issues.
  • Implement backup and disaster recovery strategies for virtualized workloads.
  • Create detailed technical documentation and train internal teams on HA KVM environments.
  • Collaborate with DevOps, Networking, and Storage teams for infrastructure integration.

Required Skills & Qualifications

  • 5+ years experience with Linux system administration (RHEL, CentOS, SUSE, Debian, or similar).
  • Experience with SUSE Linux Enterprise High Availability Extension (SLE-HA) or RHEL HA Add-on.
  • Strong expertise in KVM/QEMU and libvirt.
  • Hands-on experience with Pacemaker/Corosync HA clusters.
  • Knowledge of DRBD, Ceph, or other shared storage technologies for VM replication.
  • Proficiency in networking concepts (bridging, VLANs, bonding, static routing, multicast for Corosync).
  • Experience with STONITH/fencing devices (IPMI, iLO, DRAC, etc.).
  • Scripting skills in Bash, and familiarity with Python or Ansible for automation.
  • Solid understanding of disaster recovery planning and HA design patterns.
  • Strong troubleshooting skills in large-scale Linux environments.
  • Familiarity with Proxmox VE, oVirt, or OpenStack.
  • Knowledge of Kubernetes/ KubeVirt for containerized virtualization.
  • Experience with monitoring tools (Zabbix, Prometheus, Grafana).
  • RHCSA/RHCE, SCA, or similar certifications.

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
CtrlS logo
CtrlS

Software / Automation

Silicon Valley

RecommendedJobs for You