81 Slurm Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 - 6.0 years

6 - 15 Lacs

nashik, navi mumbai

Work from Office

Position Overview The Middle-level HPC/AI Engineer manages day-to-day operations of GPU cluster infrastructure, including SLURM job scheduling, system monitoring, and user support. This role works closely with the senior team to maintain high availability and performance of AI computing resources. Key Responsibilities Manage SLURM cluster operations including job queue monitoring, node management, and workload optimization Perform node state management and troubleshoot job failures and system issues Implement and maintain Ansible playbooks for configuration management and deployment automation Support Kubernetes cluster operations and containerized workload management Provide user support fo...

Posted 3 days ago

AI Match Score
Apply

8.0 - 12.0 years

0 Lacs

hyderabad, all india

On-site

Role Overview: As an HPC Engineer Sr, your primary focus will be developing and running Bioinformatic workflows/pipelines leveraging and managing WDL engines like miniWDL, Slurm, and R on AWS cloud utilizing technologies such as AWS Parallel Cluster, R Workbench, Batch, ECS, and Kubernetes. You will also be responsible for configuring Slurm partitions, converting standalone jobs to Slurm, installing and configuring R Workbench with scalable backends like Slurm and Kubernetes, as well as working with Dockers. Key Responsibilities: - Developing and running Bioinformatic workflows/pipelines using WDL engines like miniWDL, Slurm, and R on AWS cloud - Utilizing technologies like AWS Parallel Clus...

Posted 6 days ago

AI Match Score
Apply

4.0 - 9.0 years

35 - 40 Lacs

bengaluru

Hybrid

Oracle Cloud Infrastructure (OCI) Cluster Networking team is building an ultra-high-performance network to support AI/ML/HPC workloads. Join us to design systems that scale from tens to hundreds of thousands of GPUs without sacrificing performance. Our team develops and tunes the software and hardware stack for distributed workloads using libraries such as NCCL on high-speed networks. Strong knowledge and practical experience with NCCL is essential for this role. Youll apply collective communication libraries to tune system performance at a previously unheard-of scaleour approach to scaling is cutting edge. Preferred Qualifications: Bachelors / Masters in Computer Science or related engineer...

Posted 1 week ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

hyderabad, telangana, india

On-site

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiencesfrom AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challengesstriving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Toge...

Posted 1 week ago

AI Match Score
Apply

4.0 - 8.0 years

0 Lacs

vapi, gujarat

On-site

As a Linux enthusiast with experience in High-Performance Computing (HPC), you have the opportunity to contribute to cutting-edge AI solutions for drug discovery at Meril Life Sciences Pvt. Ltd. **Key Responsibilities:** - Manage and maintain HPC clusters (SLURM/PBS) - Administer Linux servers (RHEL, Ubuntu, CentOS) - Support AI/ML and genomics workflows in HPC environments - Deploy and manage containerized environments (Docker/Singularity) - Automate system tasks using Bash/Python scripting - Collaborate closely with data scientists, bioinformaticians & AI engineers **Required Skills & Experience:** - 3-5 years of Linux system and HPC administration experience - Hands-on experience with sci...

Posted 2 weeks ago

AI Match Score
Apply

18.0 - 24.0 years

0 Lacs

hyderabad, telangana, india

On-site

You will lead the AIN-based data science organization that builds and operates scientific computational, data, and AI/ML pipelines and workflows for Amgen Research. You will operate across India/EU/US (Eastern, Central, Pacific) time zones and coordinate outcomes across the ARIA, Global Research, ATMOS Tech, and ATMOS AI&D. You will own portfolio outcomes across the Research data ecosystem, AI/ML for research, and high-performance computing enablement, ensuring robust, resilient pipelines and reliable, scalable services. Core Responsibilities Set multi-year strategy, organization design, and investment roadmap for research compute & AI platforms. Establish standards and promote methods for r...

Posted 2 weeks ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

IndiaAI is currently focused on developing India's next-gen foundational LLMs and is seeking a Senior ML Engineer who is skilled in large-scale pre-training, distributed GPU systems, and data creation pipelines. As a Senior ML Engineer, your primary responsibility will be to work with cutting-edge technologies such as Megatron-LM, NVIDIA NeMo, DeepSpeed, PyTorch Distributed, and SLURM to train models exceeding 7B parameters on multi-node GPU clusters. Key Responsibilities: - Build and optimize LLM pre-training pipelines for models exceeding 7B parameters. - Implement distributed training using PyTorch Distributed, DeepSpeed (ZeRO/FSDP), Megatron-LM, and NVIDIA NeMo. - Manage multi-node GPU j...

Posted 2 weeks ago

AI Match Score
Apply

4.0 - 7.0 years

5 - 10 Lacs

hyderabad

Work from Office

L2 kill HPC Engineer with Application Expertise Role Overview: An L2 HPC (High-Performance Computing) Engineer with an application skillset is responsible for supporting, troubleshooting, and maintaining HPC infrastructure and assisting users with scientific and engineering applications. They operate between infrastructure and application layers, ensuring optimal performance and availability of both. Core Responsibilities: HPC Cluster Support: Manage day-to-day operations of HPC clusters (Slurm, PBS, LSF), monitor jobs, and node health, and manage user issues at L2. Application Support & Optimization: Support scientific/engineering applications (ANSYS, Gaussian, GROMACS, OpenFOAM, etc.) incl...

Posted 2 weeks ago

AI Match Score
Apply

3.0 - 5.0 years

0 Lacs

bengaluru, karnataka, india

On-site

About Us Atria University desires and enables research impact beyond publications. We operate without traditional departments (HoDs). Faculty are housed within Centers of Excellence (CoE), fostering deep, cross-disciplinary collaboration. This role will primarily be affiliated with the Bio-AI Hub/CoE. Why this role Help build India's next wave of Bio-AI: genomic and protein foundation models, multi-omics modelling, generative design for enzymes and pathways, and AI-assisted DBTL loops with wet-lab partners. You'll have real datasets, compute, and translational collaborations. What you'll do Lead research on Bio-AI foundation models (e.g., DNA FMs, protein LMs, generative design/diffusion for...

Posted 3 weeks ago

AI Match Score
Apply

4.0 - 7.0 years

0 Lacs

hyderabad, telangana, india

On-site

About The Role We are looking for a Senior Cloud Engineer who can deploy, automate, and optimize cloud infrastructure on AWS. This role requires hands-on technical depth, ownership of cloud environments & HPC, and the ability to work with engineers while collaborating with platform, DevOps, and engineering teams. Key Responsibilities Design, implement, and optimize scalable AWS architectures leveraging services such as EC2, RDS, EKS, FSx, VPC, Lambda, IAM, and CloudWatch. Automate using Terraform, CloudFormation, and configuration management tools. Develop and manage CI/CD pipelines and cloud-native development tooling. Implement network, performance, and security best practices across workl...

Posted 3 weeks ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

bengaluru, karnataka, india

On-site

The ideal candidate is a self-motivated, multi-tasker, and demonstrated team-player. You will be a lead developer responsible for the development of new software products and enhancements to existing products. You should excel in working with large-scale applications and frameworks and have outstanding communication and leadership skills. Senior Software Engineer Responsibilities Design core, backend software components using primarily Python, other languages are good to have Interface with other teams to incorporate their innovations and vice versa Conduct design and code reviews to maintain high standards Analyze and improve efficiency, scalability, and stability of various system resource...

Posted 3 weeks ago

AI Match Score
Apply

8.0 - 13.0 years

30 - 40 Lacs

bengaluru

Work from Office

Required Skills: HPC & AI Infrastructure Extensive knowledge of HPC technologies and workload scheduler such as Slurm and/or Altair PBS Pro, Proficient in HPC cluster management tools, including HPE Cluster Management (HPCM) and/or NVIDIA Base Command Manager. Experience with HPC cluster managers like HPE Cluster Management (HPCM) and/or NVIDIA Base Command Manager. Good understanding with high-speed networking stacks (InfiniBand, Mellanox) and performance tuning of HPC components. Solid grasp of high-speed networking technologies, such as InfiniBand and Ethernet. Containerization & Orchestration Extensive hands-on experience with containerization technologies such as Docker, Podman, and Sin...

Posted 4 weeks ago

AI Match Score
Apply

1.0 - 7.0 years

0 Lacs

pune, maharashtra, india

On-site

Title- Cloud Automation and Test Engineer (HCS) Location- Pune Onsite Client- ONIX /Data Matica Solutions Job Description Onix is seeking a motivated Cloud Automation and Test Engineer to enhance the quality and reliability of the HyperCompute Cluster Service (HCS) on Google Cloud. This role involves designing and implementing comprehensive automation tests for APIs, End-to-End workflows, and critical user journeys. Key Responsibilities ? API Test Development: ? Design and implement automated, comprehensive test suites for all HyperCompute Cluster Service (HCS) API actions, focusing on functional correctness, error handling, and performance. ? Utilize API testing tools and frameworks (e.g., ...

Posted 4 weeks ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

hyderabad, telangana, india

On-site

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. To...

Posted 4 weeks ago

AI Match Score
Apply

1.0 - 7.0 years

0 Lacs

hyderabad, telangana, india

On-site

Title- Cloud Automation and Test Engineer (HCS) Location- Pune Onsite Client- ONIX /Data Matica Solutions Job Description Onix is seeking a motivated Cloud Automation and Test Engineer to enhance the quality and reliability of the HyperCompute Cluster Service (HCS) on Google Cloud. This role involves designing and implementing comprehensive automation tests for APIs, End-to-End workflows, and critical user journeys. Key Responsibilities ? API Test Development: ? Design and implement automated, comprehensive test suites for all HyperCompute Cluster Service (HCS) API actions, focusing on functional correctness, error handling, and performance. ? Utilize API testing tools and frameworks (e.g., ...

Posted 1 month ago

AI Match Score
Apply

10.0 - 12.0 years

0 Lacs

remote, india

Remote

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn't changed - we're here to stop breaches, and we've redefined modern security with the world's most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day and this traffic is growing daily . Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We're also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own thei...

Posted 1 month ago

AI Match Score
Apply

10.0 - 15.0 years

0 Lacs

chennai, tamil nadu, india

On-site

Join our team as a Lead Linux Admin, where you will oversee the management and maintenance of Linux-based systems within a high-performance computing or research environment. You will be responsible for software stack installation, system upgrades, and ensuring system stability while collaborating with developers and researchers. Apply now to contribute your expertise to a dynamic technical team. Responsibilities Manage and maintain Linux servers and environments to ensure high availability and optimal performance Install, configure, and manage software packages using EasyBuild Perform installation and upgrades of R bundles to maintain compatibility and stability Collaborate with developers,...

Posted 1 month ago

AI Match Score
Apply

3.0 - 5.0 years

0 Lacs

ahmedabad, gujarat, india

On-site

Job Description Basic Qualifications: BS or MS degree in CS or related engineering or science field with 3+ years of relevant experience Experience with benchmarking and troubleshooting or optimizing performance of a system. Experience with coding, scripting, and automation. Background in Networking. General Linux skills. Demonstrated ability to lead complex projects, independently resolve ambiguity, collaborate with stakeholders across teams, and communicate effectively. Desired qualifications: Experience working on clusters, e.g., running HPC/AI workloads, or maintaining an HPC/AI system. Experience troubleshooting or tuning performance on distributed systems. Familiarity with elements of ...

Posted 1 month ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

coimbatore, tamil nadu, india

On-site

We are seeking a talented and experienced Senior Software Engineer with expertise in C++ and computer graphics to join our innovative team. In this role, you will work on the design, development, and optimization of advanced 3D visualization and rendering technologies, pushing the boundaries of performance, scalability, and usability. If you thrive in a challenging and collaborative environment, we would love to hear from you! Responsibilities Perform complex analysis, design, development, testing, and debugging of 3D visualization web applications Design, develop, and test full vertical visualization features - back-end computation and rendering, data management and storage, and front-end c...

Posted 1 month ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

hyderabad, telangana, india

On-site

Join Amgen's Mission of Serving Patients At Amgen, if you feel like you're part of something bigger, it's because you are. Our shared missionto serve patients living with serious illnessesdrives all that we do. Since 1980, we've helped pioneer the world of biotech in our fight against the world's toughest diseases. With our focus on four therapeutic areas Oncology, Inflammation, General Medicine, and Rare Disease we reach millions of patients each year. As a member of the Amgen team, you'll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller happier lives. Our award-winning culture is collaborati...

Posted 1 month ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

In this role, you will be responsible for designing architectures for meta-learning, self-reflective agents, and recursive optimization loops. You will also build simulation frameworks grounded in Bayesian dynamics, attractor theory, and teleo-dynamics. Moreover, your duties will include developing systems integrating graph rewriting, knowledge representation, and neurosymbolic reasoning. Additionally, you will conduct research on fractal intelligence structures, swarm-based agent coordination, and autopoietic systems. Your role will involve advancing Mobius's knowledge graph with ontologies supporting logic, agency, and emergent semantics. Furthermore, you will integrate logic into distribu...

Posted 1 month ago

AI Match Score
Apply

4.0 - 6.0 years

7 - 14 Lacs

bengaluru

Work from Office

Manage, configure, and troubleshoot HPC ( High Performance Computing ) clusters with SLURM, ROCKS/XCAT, and parallel file systems; support Linux/Windows servers and optimize scientific computing performance.

Posted 1 month ago

AI Match Score
Apply

3.0 - 7.0 years

0 Lacs

hyderabad, telangana

On-site

Join Aganitha and contribute to the discovery of medicines that will impact lives! Aganitha, located in Hyderabad, India, is dedicated to accelerating drug discovery and development for Biopharma and Biotech R&D through in silico solutions. Leveraging Computational Biology & Chemistry, High throughput Sciences, AI, ML, HPC, Cloud, Data, and DevOps, our cross-domain science and technology team of experts are transforming the biopharma and biotech industries. We are continuously expanding our world-class multi-disciplinary team in Genomics, AI, and Cloud computing, with the goal of accelerating drug discovery and development. At Aganitha, we find joy in working in an innovation-rich, research-...

Posted 1 month ago

AI Match Score
Apply

5.0 - 7.0 years

0 Lacs

india

On-site

Preferred Qualifications: Bachelors / Masters in Computer Science or related engineering fields Experience with RDMA programming, including but not limited to GPUDirect RDMA Experience with distributed workload managers like Slurm or K8s Experience with Linux Performance tools Experience in SDN, NFV, Cloud Networking Experience in Infrastructure-as-a-Service, viz. OpenStack, AWS, GCP, Azure Responsibilities 5+ years of experience with software (systems/application) development 2+ years of experience with collective communications libraries like NCCL, RCCL, MPI and GPU frameworks like CUDA and ROCm. 2+ years of experience with ML training frameworks like PyTorch, TensorFlow Proficient at prog...

Posted 1 month ago

AI Match Score
Apply

6.0 - 10.0 years

0 Lacs

thiruvananthapuram, kerala, india

On-site

Job Description Bachelor's or Master's degree in Computer Science & Engineering 6-10+ years of professional experience in full stack development, with a proven track record of deploying web applications in production environments. Strong fundamentals in data structures & algorithms demonstrated through complex system design and problem-solving in computer network domain. Experience in at least one backend language (e.g., Java, Python, Go), frontend framework including SQL/NoSQL databases and cloud platforms (e.g., AWS, Azure, GCP). Hands-on experience with observability tools (e.g., Prometheus, Grafana, ELK/EFK, OpenTelemetry). Experience with container orchestration (Kubernetes, Docker) and...

Posted 1 month ago

AI Match Score
Apply
Page 1 of 4
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies