92 HPC Jobs - Page 4

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

15.0 - 19.0 years

40 - 60 Lacs

Bengaluru

Work from Office

Job Area: Information Technology Group, Information Technology Group > IT Management

General Summary:

We are enabling a world where everyone and everything can be intelligently connected. Our 5G and AI innovations are the power behind the connected intelligent edge. You'll find our technologies behind and inside the innovations that deliver significant value across multiple industries and to billions of people every day. Our engineering teams rely heavily on the latest High Performance Computing (HPC) technologies to design and develop new products using electronic design automation (EDA) tools. This role provides an opportunity to manage and deliver a portfolio of software solutions and services for core engineering teams. You will gain experience leading a portfolio of critical projects while building scalable and fault-tolerant software solutions that are deployed on some of the largest supercomputing infrastructures across the globe.

What are we looking for?

The Engineering Software Solutions and Data Services (ESSDS) team is looking for an experienced software development manager, preferably with exposure to HPC technologies. The ESSDS team (aligned with Engineering IT) is responsible for developing software solutions that enable the High Performance Compute grid and large-scale, distributed, analytical applications. The team works on components and services for HPC infrastructure optimization, hardware IP management systems, petabyte-scale cloud data platforms, and machine learning solutions and pipelines. This role will lead a team of about 20 software developers working on the team's portfolio of software products and services. The ideal candidate is a seasoned software development manager experienced in engaging with business and technical stakeholders, understanding complex problem statements, and proposing value-driven software solutions.

What will you do?

This role's responsibilities include:
- Lead and manage a team of software developers and project managers, providing mentorship and guidance to foster professional growth.
- Provide technical expertise across a portfolio of software development projects.
- Identify opportunities and deliver solutions for EDA workflow optimizations.
- Set and manage team priorities in line with organizational goals and objectives, working closely with a diverse set of stakeholders in Engineering IT.
- Oversee the entire software development lifecycle, from planning and design to implementation, testing, and deployment, for the portfolio of products and services developed by the team.
- Collaborate with global teams to define project requirements, scope, and deliverables.
- Ensure the delivery of high-quality software solutions that meet business objectives and customer needs.
- Implement best practices for software development, including coding standards, code reviews, and automated testing.
- Manage project timelines and resources to ensure successful project completion.
- Stay updated with the latest industry trends and technologies to drive continuous improvement and innovation.
- Build a culture of collaboration, accountability, and continuous learning within the team.

What do we want to see?

The ideal candidate will be able to demonstrate some of the following skills:
- 14+ years of hands-on experience in software engineering, with at least 6 years in a leadership role.
- Strong proficiency in programming languages such as Java, C++, Python, Rust or similar.
- Expertise in software lifecycle management, version control, and CI/CD best practices for quality, agility and security.
- Proven ability to manage multiple projects and conflicting priorities.
- Experience with public cloud environments such as AWS, Azure or Google Cloud.
- Experience with microservices architecture and containerization.
- Familiarity with EDA and the semiconductor design process.
- Ability to explain technical concepts and analysis implications clearly to a wide audience.
- Exposure to HPC technologies is a plus.
- Bachelor's or Master's in Computer Science or a related field.

Minimum Qualifications:
- 7+ years of IT-related work experience with a Bachelor's degree, OR 9+ years of IT-related work experience without a Bachelor's degree.
- 4+ years in a leadership role in projects/programs.

Posted 3 months ago

6 - 8 years

6 - 10 Lacs

Hyderabad

Work from Office

Senior High Performance Computing Engineer

What you will do

Let’s do this. Let’s change the world. In this vital role you will be responsible for deploying, maintaining and supporting HPC infrastructure in a multi-cloud environment. This is a hands-on engineering role that requires deep technical expertise in HPC technology and standard methodologies.
- Implement and manage cloud-based infrastructure for HPC environments that support data science (e.g. AI/ML workflows, image analysis).
- Collaborate with data scientists and ML engineers to deploy scalable machine learning models into production.
- Ensure the security, scalability, and reliability of HPC systems in the cloud.
- Optimize cloud resources for cost-effective and efficient use.
- Stay up to date with the latest cloud services and industry-standard processes.
- Provide technical leadership and guidance in cloud and HPC systems management.
- Develop and maintain CI/CD pipelines for deploying resources to multi-cloud environments.
- Monitor and fix cluster operations/applications and cloud environments.
- Document system design and operational procedures.

Must-Have Skills:
- Expert with Linux/Unix system administration (RHEL, CentOS, Ubuntu, etc.).
- Proficiency with job scheduling and resource management tools (SLURM, PBS, LSF, etc.).
- Good understanding of parallel computing, MPI, OpenMP, and GPU acceleration (CUDA, ROCm).
- Knowledge of storage architectures and distributed file systems (Lustre, GPFS, Ceph).
- Experience with containerization technologies (Singularity, Docker) and cloud-based HPC solutions.
- Expert in scripting languages (Python, Bash) and containerization technologies (Docker, Kubernetes).
- Familiarity with automation tools (Ansible, Puppet, Chef) for system provisioning and maintenance.
- Understanding of networking protocols, high-speed interconnects, and security best practices.
- Demonstrable experience in cloud computing (AWS, Azure, GCP) and cloud architecture.
- Experience with infrastructure as code (IaC) tools like Terraform or CloudFormation, and with Git.

What we expect of you

We are all different, yet we all use our unique contributions to serve patients. Expert knowledge of large Linux environments, networking, storage, and cloud-related technologies is expected. The candidate will also have expertise in root-cause analysis and remediation while working with a team and stakeholders. Top-level communication and documentation skills are required. Expertise in coding in Python, Bash, and YAML is expected.

Good-to-Have Skills:
- Experience with Kubernetes (EKS) and service mesh architectures.
- Knowledge of AWS Lambda and event-driven architectures.
- Familiarity with AWS CDK, Ansible, or Packer for cloud automation.
- Exposure to multi-cloud environments (Azure, GCP).

Basic Qualifications:
- Bachelor’s degree in computer science, IT, or related field with 6-8 years of hands-on HPC administration or a related field.

Additional Skills:
- Experience supporting research in healthcare life sciences.
- Deep, extensive experience with High Performance Computing (HPC) and cluster management.
- Familiarity with machine learning frameworks (TensorFlow, PyTorch) and data pipelines.
- Certifications in cloud architecture (AWS Certified Solutions Architect, Google Cloud Professional Cloud Architect, etc.).
- Experience in an Agile development environment.
- Prior work with distributed computing and big data technologies (Hadoop, Spark).

Professional Certifications (preferred):
- Red Hat Certified Engineer (RHCE) or Linux Professional Institute Certification (LPIC).
- AWS Certified Solutions Architect – Associate or Professional.

Preferred Qualifications:

Soft Skills:
- Strong analytical and problem-solving skills.
- Ability to work effectively with global, virtual teams.
- Effective communication and collaboration with cross-functional teams.
- Ability to work in a fast-paced, cloud-first environment.

Shift Information:

This position is required to be onsite and to participate in 24/5 and weekend on-call coverage on a rotating basis, and may require you to work a later shift. Candidates must be willing and able to work off hours, as required based on business requirements.

What you can expect of us

As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we’ll support your journey every step of the way. In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards.

Apply now for a career that defies imagination. Objects in your future are closer than they appear. Join us. careers.amgen.com

As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease. Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Posted 4 months ago

5 - 10 years

4 - 8 Lacs

Gurugram

Work from Office

AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation. At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We create spaces to empower everyone to speak up, make change, and drive the culture at AHEAD. We are an equal opportunity employer, and do not discriminate based on an individual's race, national origin, color, gender, gender identity, gender expression, sexual orientation, religion, age, disability, marital status, or any other protected characteristic under applicable law, whether actual or perceived. We embrace all candidates that will contribute to the diversification and enrichment of ideas and perspectives at AHEAD.

The High-Performance Computing Storage Engineer is primarily responsible for the overall health and maintenance of storage technologies in our managed services customers' environments. Our Storage Engineers are valued members of the Managed Services Infrastructure Practice, responsible for Tier 3 incident management, service request management and change management infrastructure support for all Managed Services customers.

Key Responsibilities
- Provide enterprise-level operational support to Managed Services customers for incident, problem, and change management activities
- Plan and perform maintenance activities
- Assess customer environments for performance and design issues and propose resolutions
- Work across technical teams to troubleshoot complex infrastructure issues
- Create and maintain detailed documentation
- Serve as a subject matter expert and escalation point for storage technologies
- Work with vendors to resolve storage issues
- Communicate with customers and internal team with transparency
- Participate in on-call rotation
- Complete training and certification as assigned to further skills and knowledge

Skills Required
- Bachelor's degree or equivalent in Information Systems or related field. Unique education, specialized experience, skills, knowledge, training, or certification may be substituted for education
- 5+ years of expert-level experience managing storage infrastructure in high-performance computing environments, including file systems, storage appliances, and data workflows
- Experience configuring, maintaining, and tuning Ceph clusters
- Experience configuring, maintaining, and tuning distributed file systems (e.g., Lustre, GPFS, NFS, GlusterFS)
- Experience with InfiniBand networking preferred
- 1+ years working with monitoring platforms; Elastic Observability is a bonus
- 1+ years working with an enterprise ITSM system; ServiceNow is a bonus
- Familiarity with high-performance computing (HPC) schedulers (e.g., SLURM, PBS, Torque) and their interaction with data storage systems
- Understanding of data protection mechanisms, including data replication, backup strategies, and disaster recovery in HPC environments
- Experience with containerization (Docker, Singularity) in an HPC context for data processing and application deployment
- Solid working knowledge of Linux and scripting a plus
- Experience with machine learning or data science workflows in HPC environments a plus
- Managed Services or consulting experience is required
- Strong background in customer service
- High-level problem-solving and communication skills
- Strong oral and written communication skills
- Related storage certifications are a bonus

Why AHEAD:

Through our daily work and internal groups like Moving Women AHEAD and RISE AHEAD, we value and benefit from diversity of people, ideas, experience, and everything in between. We fuel growth by stacking our office with top-notch technologies in a multi-million-dollar lab, by encouraging cross-department training and development, and by sponsoring certifications and credentials for continued learning.

USA Employment Benefits include
- Medical, Dental, and Vision Insurance
- 401(k)
- Paid company holidays
- Paid time off
- Paid parental and caregiver leave
- Plus more! See https://www.aheadbenefits.com/ for additional details.

The compensation range indicated in this posting reflects the On-Target Earnings (OTE) for this role, which includes a base salary and any applicable target bonus amount. This OTE range may vary based on the candidate's relevant experience, qualifications, and geographic location.

Posted 4 months ago

5 - 10 years

4 - 8 Lacs

Gurugram

Work from Office

AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation. At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We create spaces to empower everyone to speak up, make change, and drive the culture at AHEAD. We are an equal opportunity employer, and do not discriminate based on an individual's race, national origin, color, gender, gender identity, gender expression, sexual orientation, religion, age, disability, marital status, or any other protected characteristic under applicable law, whether actual or perceived. We embrace all candidates that will contribute to the diversification and enrichment of ideas and perspectives at AHEAD.

The High-Performance Computing Network Engineer is primarily responsible for the overall health and maintenance of storage technologies in our managed services customers' environments. Our Network Engineers are valued members of the Managed Services Infrastructure Practice, responsible for Tier 3 incident management, service request management and change management infrastructure support for all Managed Services customers.

Key Responsibilities
- Provide enterprise-level operational support to Managed Services customers for incident, problem, and change management activities
- Plan and perform maintenance activities
- Assess customer environments for performance and design issues and propose resolutions
- Work across technical teams to troubleshoot complex infrastructure issues
- Create and maintain detailed documentation
- Serve as a subject matter expert and escalation point for storage technologies
- Work with vendors to resolve storage issues
- Communicate with customers and internal team with transparency
- Participate in on-call rotation
- Complete training and certification as assigned to further skills and knowledge

Skills Required
- Bachelor's degree or equivalent in Information Systems or related field. Unique education, specialized experience, skills, knowledge, training, or certification may be substituted for education
- 5+ years of expert-level experience managing network infrastructure in high-performance computing environments
- Experience configuring, maintaining and troubleshooting Nvidia/Mellanox (Cumulus OS) switches required
- Strong knowledge of Kubernetes and its networking components (CNI, Service Mesh, etc.)
- Understanding of VPNs, load balancers, VPCs, and hybrid cloud networking
- Experience with both Ethernet and InfiniBand networking
- 1+ years working with monitoring platforms; Elastic Observability is a bonus
- 1+ years working with an enterprise ITSM system; ServiceNow is a bonus
- Familiarity with high-performance computing (HPC) schedulers (e.g., SLURM, PBS, Torque) and their interaction with data storage systems
- Experience with network containerization (Docker, Singularity) in an HPC context for data processing and application deployment
- Solid working knowledge of Linux and Python scripting a plus
- Previous experience with network automation tools such as Ansible, Puppet, or Chef a plus
- Experience with machine learning or data science workflows in HPC environments a plus
- Managed Services or consulting experience is required
- Strong background in customer service
- High-level problem-solving and communication skills
- Strong oral and written communication skills
- Related network certifications are a bonus

Why AHEAD:

Through our daily work and internal groups like Moving Women AHEAD and RISE AHEAD, we value and benefit from diversity of people, ideas, experience, and everything in between. We fuel growth by stacking our office with top-notch technologies in a multi-million-dollar lab, by encouraging cross-department training and development, and by sponsoring certifications and credentials for continued learning.

USA Employment Benefits include
- Medical, Dental, and Vision Insurance
- 401(k)
- Paid company holidays
- Paid time off
- Paid parental and caregiver leave
- Plus more! See https://www.aheadbenefits.com/ for additional details.

The compensation range indicated in this posting reflects the On-Target Earnings (OTE) for this role, which includes a base salary and any applicable target bonus amount. This OTE range may vary based on the candidate's relevant experience, qualifications, and geographic location.

Posted 4 months ago

10 - 20 years

45 - 50 Lacs

Hyderabad

Work from Office

SENIOR OPTIMIZATION ENGINEER
Hyderabad

Role: Senior Optimization Engineer
Contract type: Permanent

About the role

As a Senior Optimization Engineer, you provide technical expertise in the numerical optimization of thermo-fluid systems, which are core to the company's business; this includes trading off and optimizing energy efficiency, production cost and operating economy. You engage with global product teams to solicit business needs and convert them into computational decision-making workflows, methods and tools that radically change how the company's products are designed, deployed and operated. Key targets include improving engineering effectiveness and developing disruptive, innovative methods for model-based design and operation of the company's systems. You also take an active part in product development to support design engineers in adopting and using new methods and tools. You work closely with other teams in Systems, Controls, and the ML/AI COE, including the teams responsible for model development (to drive the development of optimization-friendly thermo-fluid models) and controls engineering (to promote the use of computational optimization strategies such as MPC and RTO as needed).

We are offering you

We are committed to offering competitive benefits programs for all of our employees and enhancing our programs when necessary. A dynamic and international work environment in a company with high-technology products. A strong growth strategy and people who are passionate about what they do.

Requirements

Education: MEng or PhD in a relevant engineering discipline (e.g., applied mathematics, mechanical or chemical engineering) with 5+ years of experience.

Skills and qualifications
- Proven ability to capture engineering design and operation problems as mathematical programming problems (NLPs), including attention to reliable convergence of such problems.
- Proficient with the mathematical theory (applied mathematics, numerical analysis and functional analysis), algorithmic foundations (notably existence and convergence proofs), and methods/tools for numerical optimization (SQP, interior point method, etc.) of large-scale systems.
- Experience using common algorithms/solvers for large-scale gradient-based non-linear programs, e.g., IPOPT, CONOPT, KNITRO, and WORHP, including their respective applicability to different types of problems.
- Experience formulating and solving discrete optimization problems and using common solvers, including CPLEX and Gurobi.
- Familiarity with physics-based modeling principles and best practices for thermo-fluid systems, such as vapor compression cycles or power plants.
- Familiarity and experience with development of computational platforms and tools in Python or equivalent.
- Familiarity with using HPC and cloud-based platforms for computation at scale.
- Demonstrated ability to work as part of a multidisciplinary team and an entrepreneurial attitude towards technological innovation in a global environment.
- Self-starter who is well-organized in an international team environment, with proven communication skills.

Responsibilities
- Deployment. Help ensure that methods and tools developed in the group impact the company's business through engagement in global product projects, including capture and formulation of computational problems arising in such projects.
- Methods, tools and algorithms. Ensure that appropriate computational methods, tools and algorithms for large-scale numerical optimization are based on sound mathematical foundations and are deployed to match the needs of the company's business, including contributing to the architecture, development, testing and documentation of the methods and tools developed in the group.
- Modeling for optimization. Support the development of mathematical models for thermo-fluid systems that are built on principles and best practices that secure reliable application of numerical optimization algorithms.
- Talent. Support development and training of staff within the company's product teams and within the Computational Engineering group; contribute to the talent pipeline by supervising student internships and theses.

Posted 4 months ago

2 - 6 years

35 - 40 Lacs

Bengaluru

Work from Office

Skills required:
- 2-6 years of experience preferred.
- Very strong data structure and algorithmic skills.
- Experience in identifying performance bottlenecks and designing/implementing optimizations to relieve them.
- Experience in software development using C/C++ and debugging skills on multicore systems.
- Experience in performance analysis for data center, HPC (High Performance Computing), and MPI (Message Passing Interface) applications.
- Experience in x86 (or other architecture-based) optimizations.
- Understanding of the cache subsystem, instruction set architecture, and pipeline (for any CPU).

Nice To Have:
- Bonus skills: experience with Intel MKL libraries, linear algebra, core math, x86 assembly programming.
- Knowledge of one or more CPU profiling tools.
- Bachelor's or master's degree in computer engineering or related field.

Responsibilities:
- Problem solving across multiple software layers (user space, kernel, applications, libraries) and hardware.
- Optimization/development of the CPU performance stack (applications, libraries) for client server processors.
- Analyze and solve performance and scalability bottlenecks when code is running on multi-core, multi-node deployments.
- Innovate and publish papers and patents, and participate in technical conferences to advance technologies.
- Continuously learn and grow along with the evolving x86 server CPU architecture and application landscape.
- Lead collaborative approaches with multiple teams.
- Mentor others to achieve integrated projects.

Posted 4 months ago

5 - 10 years

20 - 35 Lacs

Bengaluru

Work from Office

Develop and optimize HPC applications and algorithms using CUDA, MPI, and OpenMP on Azure and cluster systems. Support scientific teams by modernizing codebases and enabling GPU acceleration.

Required Candidate Profile: Software engineer with 5+ years in HPC programming, scientific code optimization, GPU computing, and collaboration with research teams.

Posted 4 months ago

5 - 10 years

15 - 30 Lacs

Bengaluru

Work from Office

Design and manage HPC infrastructure for geophysics, simulation, and ML/AI using Azure and Linux. Optimize compute environments and support job schedulers, file systems, and parallel processing workflows.

Required Candidate Profile: Experienced HPC engineer with 5–10 years in Linux, Azure, job schedulers, and supporting scientific workloads in a large-scale enterprise environment.

Posted 4 months ago

5.0 - 10.0 years

18 - 33 Lacs

Pune

Work from Office

Job Description:

We are seeking a highly motivated and technically proficient Technical Marketing Engineer to support our ambition in India. You will use your technical expertise to extend our R&D team's reach to support customers. Your proactive nature and understanding of the automotive business will enhance our product market penetration and create business wins. You will be a key contributor in spearheading our marketing campaign in the Indian market to spread Telechips' brand image, product superiority, and solution advantages.

Roles & Responsibilities:
- Report to the Regional Manager.
- Drive business development and customer engagement in India's automotive market.
- Build and maintain strong relationships with customers.
- Participate in marketing campaigns to enhance brand and product awareness.
- Provide on-site technical support to customers.
- Willingness to travel for business requirements.

Requirements:
- Strong and proactive communication skills
- Mature, independent, responsible, motivated & trustworthy
- Minimum 5 years of experience in relevant engineering roles (R&D, FAE, Technical Support) in the automotive electronics industry
- Background in software engineering or experience with a mass-production project, e.g., experience in developing automotive applications, middleware, or BSP related to IVI or Cluster systems
- Proficient in Android & Linux-based development environments
- Proficient in development using C or C++
- Broad understanding of Application Processors (AP) and related work experience
- Analytical ability in problem-solving and technical troubleshooting skills
- Meticulous in making plans to ensure responsibilities are fully covered

Preferred Qualifications:
- Existing relationships with automotive Tier-1s and OEMs
- Knowledge of modern vehicle architecture, such as SDV, HPC, ADAS, Functional Safety, and Cybersecurity
- Experience in project management or relevant certifications
- Experience with Yocto development
- Experience in writing and managing technical documentation

Employee Benefits:
- Comprehensive leave program: Marriage, Childbirth, Bereavement, Relocation, Long Service, and other special leaves in line with company policy
- Meal allowance to support daily living costs
- Annual health check-up for employee well-being
- Congratulatory & condolence support for life events
- Team building activity funding to encourage collaboration and connection

Posted Date not available

5.0 - 10.0 years

20 - 35 Lacs

Pune

Work from Office

[About the Hiring Company]
> Fabless semiconductor company headquartered in South Korea, focused on automotive and multimedia solutions.
> Customers include Tier-1 automotive parts makers and industrial/OA equipment manufacturers.
> Global operations in 8 countries and 10 cities (https://www.telechips.com/)

[Job Responsibilities]

1. Job Brief
> We are seeking a highly motivated and technically proficient Technical Marketing Engineer to support our ambition in India.
> You will use your technical expertise to extend our R&D team's reach to support customers.
> Your proactive nature and understanding of the automotive business will enhance our products' market penetration and create business wins. You will be a key contributor in spearheading our marketing campaign in the Indian market to spread Telechips' brand image, product superiority and solution advantages.

2. Roles & Responsibilities
> Report to the Regional Manager.
> Drive business development and customer engagement in India's automotive market.
> Build and maintain strong relationships with customers.
> Participate in marketing campaigns to enhance brand and product awareness.
> Provide on-site technical support to customers
> Willingness to travel for business requirements

3. Requirements
> Strong and proactive communication skills
> Mature, independent, responsible, motivated & trustworthy
> Minimum 5 years' experience in relevant engineering roles (R&D, FAE, Technical Support) in the automotive electronics industry
> Background in software engineering or experience with a mass-production project, e.g., experience in developing automotive applications, middleware, or BSP related to IVI or Cluster systems
> Proficient in Android & Linux-based development environments.
> Proficient in development using C or C++
> Broad understanding of Application Processors (AP) and related work experience
> Analytical ability in problem solving and technical troubleshooting skills
> Meticulous in making plans to ensure responsibilities are fully covered

4. Preferred Qualifications
> Existing relationships with automotive Tier-1s and OEMs
> Knowledge of modern vehicle architecture such as SDV, HPC, ADAS, Functional Safety, and Cybersecurity
> Experience in project management or relevant certifications
> Experience with Yocto development
> Experience in writing and managing technical documentation

[Mandatory Qualification & Skills]
> Bachelor's degree in Electronic Engineering / Computer Engineering
> Minimum 5 years' experience in relevant engineering roles (R&D, FAE, Technical Support) in the automotive electronics industry
> Hindi (Native), English (Fluent)

[Working Condition and Compensation]
> Primary Location: Upcoming India representative office (Pune). Pre-launch: Co-working space in India / HQ in Korea / Singapore branch
> Working Hours: 8 hours per day, Mon-Fri, 09:00 - 18:00 (adjustable)
> Comprehensive leave program: Marriage, Childbirth, Bereavement, Relocation, Long Service, and other special leaves in line with company policy
> Meal allowance to support daily living costs
> Annual health check-up for employee well-being
> Congratulatory & condolence support for life events
> Team building activity funding to encourage collaboration and connection

Posted Date not available

8.0 - 13.0 years

0 - 1 Lacs

Pune, Bengaluru

Hybrid

Role & responsibilities
Leadership and Strategy: Provide delivery assurance and serve as the lead design authority to ensure seamless execution of enterprise-grade container platforms, including Red Hat OpenShift and SUSE Rancher, Private Cloud AI, and HPC/AI solutions, fully aligned with customer AI/ML strategies and business objectives. Align solution architecture with NVIDIA Enterprise AI Factory design principles, including modular scalability, GPU optimization, and hybrid cloud orchestration. Oversee planning, risk management, and stakeholder alignment throughout the project lifecycle to ensure successful outcomes.
Solution Planning and Design: Architect and optimize end-to-end solutions across container orchestration and HPC workload management domains, leveraging platforms such as Red Hat OpenShift and SUSE Rancher and/or workload schedulers like Slurm and Altair PBS Pro. Ensure seamless integration of container and AI platforms with the broader software ecosystem, including NVIDIA AI Enterprise as well as open-source DevOps and AI/ML tools and frameworks.
Opportunity Assessment: Lead technical responses to RFPs, RFIs, and customer inquiries, ensuring alignment with business and technical requirements. Conduct proof-of-concept (PoC) engagements to validate solution feasibility, performance, and integration within customer environments. Assess customer infrastructure and workloads to recommend optimal configurations using validated reference architectures from strategic partners such as Red Hat, NVIDIA, and SUSE, along with components from the open-source ecosystem.
Innovation and Research: Stay current with emerging technologies, industry trends, and best practices across HPC, Kubernetes, container platforms, hybrid cloud, and security to inform solution design and innovation.
Customer-Centric Mindset: Act as a trusted advisor to enterprise customers, ensuring alignment of AI solutions with business goals. Translate complex technical concepts into value propositions for stakeholders.
Team Collaboration: Collaborate with cross-functional teams, including subject matter experts in infrastructure components (such as servers, storage, and networking) and data science teams, to ensure cohesive and integrated solution delivery. Mentor technical consultants and contribute to internal knowledge sharing through tech talks and innovation forums.
Preferred candidate profile
Required Skills:
1. HPC & AI Infrastructure
Extensive knowledge of HPC technologies and workload schedulers such as Slurm and/or Altair PBS Pro.
Proficiency in HPC cluster management tools, including Cluster Management (HPCM) and/or NVIDIA Base Command Manager.
Good understanding of high-speed networking stacks (InfiniBand, Mellanox) and performance tuning of HPC components.
Solid grasp of high-speed networking technologies, such as InfiniBand and Ethernet.
2. Containerization & Orchestration
Extensive hands-on experience with containerization technologies such as Docker, Podman, and Singularity.
Proficiency with at least two container orchestration platforms: CNCF Kubernetes, Red Hat OpenShift, SUSE Rancher (RKE/K3S), Canonical Charmed Kubernetes.
Strong understanding of GPU technologies, including the NVIDIA GPU Operator for Kubernetes-based environments and DCGM (Data Center GPU Manager) for GPU health and performance monitoring.
3. Operating Systems & Virtualization
Extensive experience in Linux system administration, including package management, boot process troubleshooting, performance tuning, and network configuration.
Proficiency with multiple Linux distributions, with hands-on expertise in at least two of the following: RHEL, SLES, and Ubuntu.
Experience with virtualization technologies, including KVM and OpenShift Virtualization, for deploying and managing virtualized workloads in hybrid cloud environments.
4. Cloud, DevOps & MLOps
Solid understanding of hybrid cloud architectures and experience working with major cloud platforms in conjunction with on-premises infrastructure.
Familiarity with DevOps practices, including CI/CD pipelines, infrastructure as code (IaC), and microservices-based application delivery.
Experience integrating and operationalizing open-source AI/ML tools and frameworks, supporting the full model lifecycle from development to deployment.
Good understanding of cloud-native security, observability, and compliance frameworks, ensuring secure and reliable AI/ML operations at scale.
5. Networking & Protocols
Strong understanding of core networking principles, including DNS, TCP/IP, routing, and load balancing, essential for designing resilient and scalable infrastructure.
Working knowledge of key data access protocols, such as S3, NFS, and SMB/CIFS, for data access, transfer, and integration across hybrid environments.
6. Programming & Automation
Proficiency in scripting or programming languages such as Python and Bash.
Experience automating infrastructure and AI workflows.
7. Soft Skills & Leadership
Excellent problem-solving, analytical thinking, and communication skills for engaging both technical and non-technical stakeholders.
Proven ability to lead complex technical projects from requirements gathering through architecture, design, and delivery.
Strong business acumen with the ability to align technical solutions with client challenges and objectives.
Qualifications:
Bachelor’s/master’s degree in computer science, information technology, or a related field.
Professional certifications in AI infrastructure, containers, and Kubernetes are highly desirable, such as RHCSA, RHCE, CNCF certifications (CKA, CKAD, CKS), and NVIDIA-Certified Associate - AI Infrastructure and Operations.
Typically, 8–10 years of hands-on experience in architecting and implementing HPC, AI/ML, and container platform solutions within hybrid or private cloud environments, with a strong focus on scalability, performance, and enterprise integration.
Regards
Bhaskar Dasegowda
+91 9880540033
Bangalore, KA India
bhaskar.dasegowda@encora.com
Bhaskar.encora@gmail.com
encora.com

Posted Date not available

Apply

12.0 - 15.0 years

20 - 25 Lacs

Bengaluru

Work from Office

As a Lead AI Solutions Architect, you will spearhead the design, deployment, and optimization of AI infrastructure built on hybrid/private cloud environments using validated reference architectures. You'll architect scalable, high-performance solutions tailored to enterprise AI workloads, working closely with cross-functional teams and enterprise clients. This is a critical, hands-on leadership role for someone with deep technical expertise, business acumen, and a passion for delivering next-gen AI platforms.
Key Responsibilities
Serve as the lead technical authority for designing and implementing enterprise-grade AI and container platforms.
Architect solutions aligned with Enterprise AI Factory design principles and validated infrastructure patterns.
Lead planning, risk management, and stakeholder alignment for complex infrastructure deployments.
Optimize end-to-end workflows across container orchestration (Kubernetes, OpenShift) and HPC workload management (Slurm, PBS Pro).
Integrate AI platforms with broader enterprise ecosystems and data pipelines.
Conduct PoC engagements, validate performance, and ensure production readiness.
Lead technical responses for RFPs, RFIs, and other customer engagements.
Collaborate across Engineering, Product, Sales, and Customer Success to deliver unified solutions.
Stay ahead of emerging AI infrastructure technologies, DevOps practices, and hybrid cloud trends.
Required Skills & Experience
12-15 years of experience in architecting HPC, AI/ML, and hybrid cloud environments.
Strong expertise in HPC technologies, including Slurm, PBS Pro, and cluster management tools.
Proficiency in containerization (Docker, Podman, Singularity) and orchestration platforms (Kubernetes, OpenShift, SUSE Rancher).
Solid Linux system administration and virtualization skills (KVM, OpenShift Virtualization).
Deep understanding of hybrid/multi-cloud architectures and DevOps automation.
Strong scripting/programming experience in Python, Bash, or equivalent.
Excellent problem-solving, stakeholder management, and executive communication skills.
Demonstrated ability to act as a trusted advisor to enterprise customers.

Posted Date not available

Apply

3.0 - 6.0 years

5 - 7 Lacs

Bengaluru

Work from Office

Implement HPC and Linux solutions for customers within our facilities. Conduct proofs of concept for customer requirements. Play a key role in the customer service domain throughout the post-sales cycle, including demos and operational walkthroughs.

Posted Date not available

Apply

6.0 - 11.0 years

2 - 5 Lacs

Hyderabad

Work from Office

This role specializes in performing and enabling remote technical support of IBM software, hardware, and solutions. Provides technical support assistance to clients and/or IBM field support using problem determination/problem source identification skills. Uses technical and negotiation skills in collaboration with other support operations/organizations to prioritize and diagnose problems to resolution. Communicates action plans to the client or IBM representative as appropriate. Recommends and implements new technical support tools, procedures, and processes, or improvements to existing ones. May provide training for and mentor others on the team. Contributes to department attainment of organizational objectives and high client satisfaction.
Primary Duties:
Provides technical support assistance to clients and/or IBM field support using problem determination/problem source identification skills.
Uses technical and negotiation skills in collaboration with other support operations/organizations to prioritize and diagnose problems to resolution.
Communicates action plans to the client or IBM representative as appropriate.
Recommends and implements new technical support tools, procedures, and processes, or improvements to existing ones.
Provide world-class customer service to large enterprise users.
Investigate and resolve support issues independently and productively.
Handle critical customer issues and hot-line support issues independently.
Handle urgent customer situations and provide emergency solutions or fixes to customers.
Coordinate independently within the team and across other teams (Development, Product Management, Sales) on critical customer support issues or escalations.
Develop the knowledge base, procedures, and support tools to improve service efficiency.
Required education
Bachelor's Degree
Required technical and professional expertise
Minimum of 6 years of experience required.
Bachelor's Degree
System-level knowledge of Unix/Linux/Windows
Communication and interpersonal skills
Experience in problem troubleshooting, analysis, and resolution
Demonstrated productivity and quality results in customer issue handling
Support positions, additional data points on the required skill set:
1. Strong technical foundation in Linux and Windows. It's hard to train a candidate in Linux, Windows, and Spectrum products all at once.
2. Demonstrated examples showing how a candidate has picked up and owned a skill.
3. Great communication skills, since this is a customer-facing role.
4. Also, our teams are remote and widely spread out, so this is essential!
Experience or exposure to HPC is a big plus, but we can relax on this for lower bands.
Preferred technical and professional experience
1) Experience using GitHub to manage source code
2) Good communication (verbal and written) and interpersonal skills
3) UX and UI design experience is an asset
4) Knowledge and experience of IBM WebSphere, Webpack, and Spring Framework are highly desired
5) Experience with scikit-learn and pandas for data processing, prediction, and inferencing
6) Knowledge and experience of IBM Watson Machine Learning is highly desired
7) Knowledge of IBM Spectrum LSF is desired

Posted Date not available

Apply

6.0 - 11.0 years

35 - 55 Lacs

Mumbai

Work from Office

Design and implement groundbreaking GPU computers that run demanding deep learning, high-performance computing, and computationally intensive workloads. Optimize computer vision algorithms and hardware accelerators for performance and quality KPIs.
Required Candidate profile
Optimize algorithms for best performance on GPU tensor cores.
Strong know-how in data structures and algorithms.
Hands-on expertise with GPU computing (CUDA, OpenCL, OpenACC) and HPC (MPI, OpenMP).

Posted Date not available

Apply

5.0 - 10.0 years

30 - 45 Lacs

Bengaluru

Work from Office

Hiring AI/ML Architect | Lead GPU-based Deep Learning Infra Design | Expert in Performance Tuning, CV Algorithms, Cloud Strategy, Resource Optimization | Strong Leadership, Benchmarking, and Team Mentoring Skills | Apply now!

Posted Date not available

Apply

7.0 - 11.0 years

0 - 0 Lacs

Hyderabad

Work from Office

Role & responsibilities
Services:
AWS Operations and DevOps Support
24/7 Monitoring
Incident Management with SLAs
Cost Optimization & Governance
Adhere to Security & Compliance Best Practices
Automation of provisioning and workflows
Deliverables:
Monthly reports (usage, cost, incidents)
Cloud architecture documentation
Security posture and compliance assessments
DevOps pipeline maintenance
Requirements:
AWS Advanced or Premier Partner status preferred
Experience with container platforms (ECS & EKS) and ML workloads
Relevant certifications and references
Experience with HPC (AWS ParallelCluster, AWS Batch) / ML (SageMaker) for life sciences data workloads is a plus

Posted Date not available

Apply
Featured Companies