Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in gurgaon
>
Rackspace Technology
>
Senior Systems Engineer HPC - R-21841

Senior Systems Engineer HPC - R-21841

Rackspace Technology

10 years

0 Lacs

gurgaon haryana india

Posted:2 months ago| Platform:

Apply

Skills Required

software patching resolve tuning scheduling networking linux tcp ip routing ethernet latency storage data lustre integrity support security controls compliance authentication ldap devops configuration management ansible terraform jenkins git automate packaging documentation training research planning design engineering ubuntu stack scripting python mpi aws azure gcp communication

Work Mode

On-site

Job Type

Full Time

Job Description

Responsibilities:

System Administration & Maintenance:

Install, configure, and maintain HPC clusters (hardware, software, operating systems), perform regular updates/patching, manage user accounts and permissions, and troubleshoot/resolve hardware or software issues.

Performance & Optimization:

Monitor and analyse system and application performance, identify bottlenecks, implement tuning solutions, and profile workloads to improve efficiency.

Cluster & Resource Management:

Manage and optimize job scheduling, resource allocation, and cluster operations using tools such as Slurm, LSF, Bright Cluster Manager /

Base Command Manager

, OpenHPC, and Warewulf.

Networking & Interconnects:

Configure, manage, and tune Linux networking (TCP/IP, DNS, routing) and high-speed HPC interconnects (InfiniBand, Ethernet) to ensure low-latency, high-bandwidth communication.

Storage & Data Management:

Implement and maintain large-scale storage and parallel file systems (Lustre, Ceph, GPFS), ensure data integrity, manage backups, and support disaster recovery.

Security & Authentication:

Implement security controls, ensure compliance with policies, and manage authentication and directory services such as LDAP and Active Directory.

DevOps & Automation:

Use configuration management and DevOps practices (Ansible, Terraform, Jenkins, Git) to automate deployments, application packaging (RPM/DEB), and system configurations.

User Support & Collaboration:

Provide technical support, documentation, and training to researchers; collaborate with scientists, HPC architects, and engineers to align infrastructure with research needs.

Planning & Innovation:

Contribute to the design and planning of HPC infrastructure upgrades, evaluate and recommend hardware/software solutions, and explore cloud-based HPC solutions where applicable.

Qualifications:

Bachelor’s degree in Computer Science, Engineering, or a related field (equivalent experience may substitute for degree)
Minimum of 10 years of systems experience, including at least 5 years working specifically with HPC
Strong knowledge of Linux operating systems (e.g., Rocky Linux, Ubuntu) with a fundamental understanding of Linux internals, system administration, and performance tuning
Experience building and managing RPM and DEB packages
Experience with cluster management tools such as Bright Cluster Manager, OpenHPC stack, or Warewulf
Proficiency with job schedulers and resource managers such as Slurm and LSF
Strong understanding of Linux networking (e.g., TCP/IP, DNS, routing) and HPC interconnects (e.g., InfiniBand, Ethernet) including performance tuning
Knowledge of parallel file systems such as Lustre, Ceph, or GPFS
Working knowledge of Linux authentication and directory services such as LDAP and Active Directory
Proficiency in scripting languages (e.g., Python, Bash, R) and familiarity with MPI libraries for parallel and distributed computing (nice to have)
Strong experience with DevOps and configuration management tools, including Ansible, Terraform, Jenkins, and Git
Knowledge of HPC in cloud environments (e.g., AWS, Azure, GCP HPC offerings) is a plus
Strong knowledge of Linux security, compliance standards, and data protection best practices
Excellent communication, interpersonal, and problem-solving skills

More Jobs at Rackspace Technology

Data Architect (Azure and Databricks)

Gurgaon, Haryana, India

Experience: Not specified

Salary: Not disclosed

Sr. Tableau Engineer - IN

Kolkata, Mumbai, New Delhi, Hyderabad, Pune, Chennai, Bengaluru

5 - 8 yrs

INR 7 - 10 Lacs

Customer Data Engineer II-IN (R-21168)-MongoDB

India

2.0 - 2.0 yrs

Salary: Not disclosed

PMO Technical Director IN - Cloud

Gurgaon, Haryana, India

10.0 - 10.0 yrs

Salary: Not disclosed

PMO Technical Director IN - Cloud

Bengaluru, Karnataka, India

10.0 - 10.0 yrs

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Rackspace Technology

Cloud Computing

San Antonio

Login to

Please Verify Your Phone or Email

Confirm Action

Senior Systems Engineer HPC - R-21841