11 GPFS Jobs

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

6.0 - 10.0 years

0 Lacs

noida, uttar pradesh

On-site

We are looking for a highly skilled HPC/GPU Operations Engineer with 6-10 years of experience to oversee the management, optimization, and maintenance of high-performance computing (HPC) infrastructure, specifically focusing on GPU-accelerated workloads. Your primary responsibilities will include ensuring the reliability, efficiency, and scalability of HPC systems used for scientific computing, AI/ML, and data-intensive applications.

As an IC3 level professional, you will be tasked with the following responsibilities:
- Managing and administering HPC clusters, GPU nodes, and high-speed interconnects.
- Configuring GPU-accelerated workloads for AI/ML, scientific computing, and simulations.
- Monitoring system performance, troubleshooting issues, and optimizing resource utilization.
- Installing, configuring, and maintaining HPC-related software, libraries, and tools such as CUDA, OpenMP, and MPI.
- Supporting containerized workflows using technologies like Docker and Singularity.
- Ensuring software compatibility with GPU architectures from NVIDIA, AMD, and Intel.
- Tuning GPU and CPU performance for specific workloads, including benchmarking and profiling.
- Utilizing monitoring tools like Prometheus, Grafana, and Slurm to track system health and efficiency.
- Optimizing scheduling and resource allocation in workload managers such as Slurm, PBS, and LSF.
- Upholding system security and access control for HPC resources.
- Applying software patches, firmware updates, and security best practices.
- Assisting in regulatory compliance for HPC environments.
- Providing support to researchers, data scientists, and engineers using HPC resources.
- Developing and maintaining documentation on best practices, troubleshooting, and system usage.
- Conducting training sessions or workshops on HPC/GPU computing.

To qualify for this role, you should possess the following technical skills:
- Experience in managing HPC clusters and GPU-based computing environments.
- Proficiency in Linux system administration, scripting (Bash, Python), and automation (Ansible, Terraform).
- Knowledge of parallel computing, GPU programming (CUDA, OpenCL), and HPC frameworks.
- Familiarity with networking technologies such as InfiniBand and RDMA, storage solutions like Lustre, GPFS, NFS, and virtualization.

At Oracle, we are dedicated to leveraging cutting-edge technology to address current challenges while fostering an inclusive work environment that encourages innovation and collaboration. We offer a range of competitive benefits, flexible medical, life insurance, and retirement options, and support employee engagement in volunteer programs. We are committed to providing accessibility assistance or accommodation for individuals with disabilities throughout the employment process. If you require such assistance, please contact us via email at accommodation-request_mb@oracle.com or by phone at +1 888 404 2494 in the United States.

Posted 13 hours ago

Apply

2.0 - 7.0 years

0 - 0 Lacs

hyderabad

On-site

Job Description: We are seeking a talented Linux System Administrator with expertise in managing High-Performance Computing (HPC) servers to join our team. The ideal candidate will be responsible for the deployment, configuration, optimization, and maintenance of our HPC infrastructure, ensuring maximum performance and reliability for our clients.

Responsibilities:
- Install, configure, and maintain Linux-based HPC clusters, including hardware and software components such as job schedulers, parallel file systems, and MPI libraries.
- Optimize system performance and resource utilization to meet the computational demands of our clients' workloads.
- Monitor system health and performance, troubleshoot issues, and implement solutions to minimize downtime and disruptions.
- Collaborate with researchers, engineers, and other stakeholders to understand their computational requirements and tailor HPC solutions to meet their needs.
- Develop and maintain documentation, best practices, and standard operating procedures for HPC system administration tasks.
- Implement security measures to protect HPC systems and data from unauthorized access and cyber threats.
- Stay up-to-date with the latest developments in HPC technologies and best practices, and recommend upgrades or improvements as needed.
- Provide technical support and training to end-users, including researchers and application developers, to help them utilize HPC resources effectively.

Requirements:
- Bachelor's degree in Computer Science, Engineering, or related field.
- Proven experience as a Linux System Administrator with a focus on HPC environments.
- In-depth knowledge of Linux operating systems, cluster management tools (e.g., Slurm, PBS Pro), and parallel file systems (e.g., Lustre, GPFS).
- Experience with HPC hardware components such as compute nodes, interconnects (e.g., InfiniBand), and storage systems.
- Proficiency in scripting languages (e.g., Bash, Python) for automation and system administration tasks.

Posted 2 days ago

Apply

4.0 - 8.0 years

0 Lacs

karnataka

On-site

As an ideal candidate, you should possess extensive software development experience with expertise in file systems, concurrency, multithreading, server architectures, and distributed systems. Your hands-on experience in developing scale-out and high availability storage solutions should be exemplary. A strong command over system-level C/C++ development, especially in Linux/UNIX environments, is essential. You should have a thorough understanding of parallel file system solutions such as Lustre and GPFS, as well as familiarity with NVM storage technology and distributed key-value storage systems. While prior experience in these areas is beneficial, a willingness to learn and adapt is equally valuable. Your attention to detail and dedication to delivering high-quality results should be evident in your work. Being a collaborative team player with excellent communication skills is crucial, along with the ability to take initiative and work independently. Effective time management, prioritization, multitasking, and meeting deadlines are key skills required to thrive in this fast-paced environment. If you are driven by challenging projects, eager to contribute your expertise to innovative solutions, and possess the qualities mentioned above, we encourage you to apply and be a part of our dynamic team.

Posted 3 days ago

Apply

3.0 - 7.0 years

0 Lacs

karnataka

On-site

You should have proven experience as a Linux Systems Administrator, focusing on HPC environments. Your understanding of Linux operating systems such as CentOS, Ubuntu, and Red Hat should be strong. You should also have intermediate knowledge of the SLURM resource scheduler. Hands-on experience with AWS services related to HPC, such as EC2, S3, FSx for Lustre, AWS Batch, and AWS ParallelCluster, is required. Familiarity with parallel file systems like Lustre and GPFS, and with network storage solutions, is essential. Knowledge of GPU computing and working with GPU-enabled HPC systems on AWS is a plus. Experience with configuration management tools such as Ansible, Puppet, and Chef is desired. Moreover, experience with cloud-based HPC solutions and hybrid HPC environments will be beneficial for this role.

Posted 3 weeks ago

Apply

10.0 - 20.0 years

40 - 50 Lacs

Bengaluru

Work from Office

Lead HPC delivery teams, define strategy, mentor staff, align capacity with business goals, and manage performance.

Required Candidate Profile: Strong leadership and technical experience in HPC, with ability to manage teams and drive innovation.

Posted 1 month ago

Apply

2.0 - 6.0 years

2 - 6 Lacs

Hyderabad, Pune

Hybrid

Roles and Responsibilities:
- Conduct thorough investigations into payment discrepancies and resolve issues promptly.
- Utilize LexisNexis and Intelligent Tracks tools to process payments efficiently.
- Provide UAT support for new systems implementations.
- Collaborate with internal teams to ensure seamless execution of banking operations.
- Manage payment operations, including SWIFT payments, bank transfers, and other international transactions.

Desired Candidate Profile:
- 2-6 years of experience in banking operations or a related field.
- Strong understanding of GBR (Good Business Rules) principles.
- Proficiency in using various software applications such as HotScan, Bankers Almanac, Swift GPI, etc.

Posted 1 month ago

Apply

8.0 - 13.0 years

40 - 80 Lacs

Bengaluru, Delhi / NCR, Mumbai (All Areas)

Work from Office

- Graduate with 8+ years in product / solution management.
- Product Manager / Offering Manager in Cloud, IaaS, HPC, or related high-tech domains.
- Hands-on understanding of technologies like NVIDIA GPU stacks, containerized HPC (Singularity, Docker), scheduling systems (SLURM, PBS), and Lustre / GPFS.
- Familiarity with as-a-Service constructs, subscription models, and TCO discussions.
- Product lifecycle management.
- Project and cross-functional stakeholder management.
- Strong articulation, documentation, and influencing ability.
- Able to interact across sales, delivery, product, and finance.

Suitable candidates may forward their updated profiles in strict confidence to hr33@hectorandstreak.com or call on 9699224920.

Posted 1 month ago

Apply

10.0 - 20.0 years

40 - 80 Lacs

Bengaluru, Delhi / NCR, Mumbai (All Areas)

Work from Office

- Bachelor's degree in Computer Science with 10+ years of experience with HPC environments.
- Experience in HPC architecture and design, with a proven track record of delivering complex HPC solutions.
- Experience in designing and implementing HPC solutions on public cloud, private cloud, and on-premises infrastructure.
- Knowledge of HPC technologies, including MPI, OpenMP, InfiniBand, GPFS, Lustre, and other file systems, cluster management tools such as Slurm, Torque, or LSF, and scheduling software such as PBSPro.
- Excellent communication skills, including the ability to communicate technical concepts to both technical and non-technical audiences.
- Experience with virtualization and containerization technologies such as Docker, Kubernetes, and Singularity.
- Strong understanding of networking technologies and protocols, including TCP/IP, InfiniBand, and RDMA.
- Familiarity with one or more programming languages such as C, C++, Fortran, Python, or Java.
- Experience working in a multi-vendor, multi-cloud environment.
- Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.

Suitable candidates may forward their updated profiles in strict confidence to hr33@hectorandstreak.com or call on 9699224920.

Posted 1 month ago

Apply

2.0 - 7.0 years

2 - 5 Lacs

Hyderabad

Work from Office

SME - HYD - SWIFT PAYMENTS

Posted 2 months ago

Apply

1.0 - 6.0 years

2 - 3 Lacs

Pune

Work from Office

PUNE - Swift payments

Posted 2 months ago

Apply

5 - 10 years

4 - 8 Lacs

Gurugram

Work from Office

AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation. At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We create spaces to empower everyone to speak up, make change, and drive the culture at AHEAD. We are an equal opportunity employer, and do not discriminate based on an individual's race, national origin, color, gender, gender identity, gender expression, sexual orientation, religion, age, disability, marital status, or any other protected characteristic under applicable law, whether actual or perceived. We embrace all candidates that will contribute to the diversification and enrichment of ideas and perspectives at AHEAD.

The High-Performance Computing Storage Engineer is primarily responsible for the overall health and maintenance of storage technologies in our managed services customers' environments. Our Storage Engineers are valued members of the Managed Services Infrastructure Practice, responsible for Tier 3 incident management, service request management, and change management infrastructure support for all Managed Services customers.

Key Responsibilities:
- Provide enterprise-level operational support to Managed Services customers for incident, problem, and change management activities.
- Plan and perform maintenance activities.
- Assess customer environments for performance and design issues and propose resolutions.
- Work across technical teams to troubleshoot complex infrastructure issues.
- Create and maintain detailed documentation.
- Serve as a subject matter expert and escalation point for storage technologies.
- Work with vendors to resolve storage issues.
- Communicate with customers and internal team with transparency.
- Participate in on-call rotation.
- Complete training and certification as assigned to further skills and knowledge.

Skills Required:
- Bachelor's degree or equivalent in Information Systems or related field. Unique education, specialized experience, skills, knowledge, training, or certification may be substituted for education.
- 5+ years of expert level experience managing storage infrastructure in high-performance computing environments, including file systems, storage appliances, and data workflows.
- Experience configuring, maintaining, and tuning Ceph clusters.
- Experience configuring, maintaining, and tuning distributed file systems (e.g., Lustre, GPFS, NFS, GlusterFS).
- Experience with InfiniBand networking preferred.
- 1+ years working with monitoring platforms; Elastic Observability is a bonus.
- 1+ years working with an enterprise ITSM system; ServiceNow is a bonus.
- Familiarity with high-performance computing (HPC) schedulers (e.g., SLURM, PBS, Torque) and their interaction with data storage systems.
- Understanding of data protection mechanisms, including data replication, backup strategies, and disaster recovery in HPC environments.
- Experience with containerization (Docker, Singularity) in an HPC context for data processing and application deployment.
- Solid working knowledge of Linux and scripting a plus.
- Experience with machine learning or data science workflows in HPC environments a plus.
- Managed Services or consulting experience is required.
- Strong background with customer service.
- High level problem-solving and communication skills.
- Strong oral and written communications skills.
- Related Storage certifications are a bonus.

Why AHEAD: Through our daily work and internal groups like Moving Women AHEAD and RISE AHEAD, we value and benefit from diversity of people, ideas, experience, and everything in between. We fuel growth by stacking our office with top-notch technologies in a multi-million-dollar lab, by encouraging cross department training and development, and by sponsoring certifications and credentials for continued learning.

USA Employment Benefits include:
- Medical, Dental, and Vision Insurance
- 401(k)
- Paid company holidays
- Paid time off
- Paid parental and caregiver leave
- Plus more! See benefits https://www.aheadbenefits.com/ for additional details.

The compensation range indicated in this posting reflects the On-Target Earnings (OTE) for this role, which includes a base salary and any applicable target bonus amount. This OTE range may vary based on the candidate's relevant experience, qualifications, and geographic location.

Posted 2 months ago

Apply