Job Description
High Performance Computing, AI and Labs is a critical element of HPE. We are focused on delivering innovative solutions that accelerate our customers' digital transformation, enabling them to tackle their complex, data-intensive workloads. Combining deep expertise with the development of the world's most cutting-edge, high-performance supercomputers, we are defining the next era of computing and delivering valuable insight and innovation. Join us and redefine what's next for you.

HPE makes Hybrid IT simple. HPE helps customers design the right mix of Hybrid IT to serve their unique needs. We bring next-generation infrastructure that uses intelligent software to simplify and accelerate the delivery of new apps, services, and business insights, providing new ways to deliver and manage IT on-premises and in the cloud.

What You'll Do

You are a dynamic, driven professional with a passion for success: yours, your company's, and your customers'. You bring knowledge and expertise in high-performance computing, cloud computing, or related technical fields, along with strong communication and collaboration skills, and you always conduct yourself with the highest professionalism and integrity.

Must be hands-on.
Develop a solid understanding of the Linux system and be able to test it.
Manage and maintain HPC clusters, including installation, configuration, and optimization of compute and management nodes.
Administer Linux/Unix-based systems, ensuring high availability, performance, and security.
Perform system imaging, software provisioning, and configuration management using tools such as Ansible.
Conduct hardware troubleshooting and coordinate with vendors or internal teams for hardware repairs and replacements.
Oversee lab systems used for development, testing, and release validation in HPC environments.
Manage storage systems (NFS, Lustre, GPFS, RAID) and ensure efficient data flow across the HPC environment.
Monitor system performance, perform regular health checks, and implement preventive maintenance measures.
Apply OS, firmware, and security updates to maintain system stability and compliance.
Develop and maintain automation scripts (using Bash, Python, or Ansible) to improve operational efficiency.
Document system configurations, maintenance procedures, and troubleshooting guides.
Collaborate with cross-functional teams across geographies to resolve issues, plan upgrades, and support project activities.
Provide guidance and mentoring to less-experienced staff members.

What You Need To Bring

Bachelor's or Master's engineering degree in Computer Science or Information Systems.
Typically 4-8 years of experience.
Strong proficiency in Linux/Unix administration (installation, configuration, tuning, troubleshooting).
Experience managing HPC clusters (e.g., HPE Cray, Slurm, PBS, LSF).
Solid understanding of networking fundamentals (TCP/IP, DNS, DHCP, VLANs).
Experience with storage management systems such as NFS, Lustre, or GPFS.
Hands-on experience in hardware diagnostics and maintenance.
Familiarity with system monitoring tools such as Prometheus, Grafana, or Nagios.
Working knowledge of containerization (Docker, Singularity) and virtualization technologies is a plus.
Proficiency in shell scripting (Bash).
Familiarity with Python or Ansible for automation and orchestration.
Ability to automate routine tasks and enhance operational efficiency.
Strong troubleshooting and problem-solving skills with a focus on root cause analysis.
Experience in maintaining accurate system documentation and change logs.

Additional Skills

Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, DevOps, Distributed Computing, Microservices Fluency, Full Stack Development, Security-First Mindset, Solutions Design, Testing & Automation, User Experience (UX)

What We Can Offer You

Health & Wellbeing
We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial, and emotional wellbeing.

Personal & Professional Development
We also invest in your career because the better you are, the better we all are. We have specific programs to help you reach any career goals you have, whether you want to become a knowledge expert in your field or apply your skills to another division.

Unconditional Inclusion
We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Let's Stay Connected

Follow @HPECareers on Instagram to see the latest on people, culture, and tech at HPE.

#india #highperformancecompute

Job: Engineering
Job Level: TCP_03

HPE is an Equal Employment Opportunity/Veterans/Disabled/LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together.

Hewlett Packard Enterprise is EEO Protected Veteran/Individual with Disabilities.

HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.

No Fees Notice & Recruitment Fraud Disclaimer

It has come to HPE's attention that there has been an increase in recruitment fraud whereby scammers impersonate HPE or HPE-authorized recruiting agencies and offer fake employment opportunities to candidates. These scammers often seek to obtain personal information or money from candidates.

Please note that Hewlett Packard Enterprise (HPE), its direct and indirect subsidiaries and affiliated companies, and its authorized recruitment agencies/vendors will never charge any candidate a registration fee, hiring fee, or any other fee in connection with its recruitment and hiring process. The credentials of any hiring agency that claims to be working with HPE for recruitment of talent should be verified by candidates, and candidates shall be solely responsible for conducting such verification. Any candidate or individual who relies on the erroneous representations made by fraudulent employment agencies does so at their own risk, and HPE disclaims liability for any damages or claims that result from any such communication.