Cognizant Hiring For HPC Engineer- Hyderabad/ Bangalore location

3 - 8 years

7 - 17 Lacs

Posted:1 day ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Dear candidate,

lavanya.kumaresan@cognizant.com

Role Overview

operating and managing High-Performance Computing (HPC) platforms

Key Responsibilities

  • HPC Infrastructure Management

  • Operate and maintain HPC clusters based on

    CentOS, RHEL

    , and hardware platforms like

    HPE

    and

    NVIDIA DGX

    .
  • Ensure optimal performance, scalability, and reliability of compute resources.
  • Storage Administration

  • Manage large-scale storage systems including

    Dell Isilon

    ,

    VAST Storage

    ,

    Lustre

    , and

    GPFS

    .
  • Implement data lifecycle management and optimize storage performance for HPC workloads.
  • Networking

  • Configure and maintain

    InfiniBand-based networking

    for low-latency, high-bandwidth communication.
  • Troubleshoot network performance issues and ensure secure connectivity.
  • Cluster and Job Scheduling

  • Administer cluster management tools such as

    Bright Cluster Manager

    ,

    Altair Grid Manager

    , and

    IBM LSF

    .
  • Optimize job scheduling and resource allocation for diverse workloads.
  • Monitoring and Automation

  • Implement monitoring solutions using

    Zabbix

    ,

    Grafana

    , and

    ELK Stack

    .
  • Automate provisioning and configuration using

    Cobbler

    ,

    Chef

    ,

    Ansible

    , and

    AWS ParallelCluster

    .
  • Performance Tuning & Troubleshooting

  • Conduct performance benchmarking and tuning for HPC workloads.
  • Diagnose and resolve hardware/software issues across compute, storage, and network layers.
  • Security & Compliance

  • Ensure HPC environment adheres to security best practices and compliance standards.

Required Skills & Qualifications

  • Technical Expertise

  • Strong knowledge of

    Linux OS (CentOS, RHEL)

    and HPC hardware platforms (

    HPE

    ,

    NVIDIA DGX

    ).
  • Hands-on experience with

    parallel file systems

    (Lustre, GPFS) and enterprise storage solutions.
  • Proficiency in

    InfiniBand networking

    and high-speed interconnects.
  • Familiarity with

    job schedulers

    and cluster management tools (IBM LSF, Bright Cluster Manager, Altair Grid Manager).
  • Automation & Scripting

  • Expertise in

    Ansible

    ,

    Chef

    ,

    Cobbler

    , and scripting languages (Bash, Python).
  • Experience with

    AWS ParallelCluster

    or similar cloud-based HPC solutions.
  • Monitoring & Logging

  • Practical experience with

    Zabbix

    ,

    Grafana

    , and

    ELK Stack

    for system health and performance monitoring.
  • Soft Skills

  • Strong problem-solving and analytical skills.
  • Ability to work in a fast-paced environment and lead technical teams.
  • Excellent communication and documentation skills.

Preferred Qualifications

  • Exposure to

    AI/ML workloads

    on HPC clusters.
  • Experience with

    containerization

    (Docker, Singularity) in HPC environments.
  • Knowledge of

    security hardening

    for HPC systems.

Education

  • Bachelors or Masters degree in Computer Science, Engineering, or related field.

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Cognizant logo
Cognizant

IT Services and IT Consulting

Teaneck New Jersey

RecommendedJobs for You