Jobs
Interviews

8 Singularity Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 9.0 years

0 Lacs

thane, maharashtra

On-site

You will play a pivotal role in the design and implementation of cutting-edge GPU computers optimized for demanding deep learning, high-performance computing, and computationally intensive workloads. Your expertise will be essential in identifying architectural enhancements and innovative approaches to accelerate our deep learning models. Addressing strategic challenges related to compute, networking, and storage design for large-scale, high-performance workloads will be a key responsibility. Additionally, you will contribute to effective resource utilization in a heterogeneous computing environment, evolve our cloud strategy, perform capacity modeling, and plan for growth across our products and services. As an architect, you are tasked with translating business requirements pertaining to AI-ML algorithms into a comprehensive set of product objectives encompassing workload scenarios, end user expectations, compute infrastructure, and execution timelines. This translation should culminate in a plan to operationalize the algorithms efficiently. Furthermore, you will be responsible for benchmarking and optimizing Computer Vision Algorithms and Hardware Accelerators based on performance and quality KPIs. Your role will involve fine-tuning algorithms for optimal performance on GPU tensor cores and collaborating with cross-functional teams to streamline workflows spanning data curation, training, optimization, and deployment. Providing technical leadership and expertise for project deliverables is a core aspect of this position, along with leading, mentoring, and managing the technical team to ensure successful outcomes. Your contributions will be instrumental in driving innovation and achieving project milestones effectively. Key Qualifications: - Possess an MS or PhD in Computer Science, Electrical Engineering, or a related field. - Demonstrated expertise in deploying complex deep learning architectures. - Minimum of 5 years of relevant experience in areas such as Machine Learning (with a focus on Deep Neural Networks), DNN adaptation and training, code development for DNN training frameworks (e.g., Caffe, TensorFlow, Torch), numerical analysis, performance analysis, model compression, optimization, and computer architecture. - Strong proficiency in data structures, algorithms, and C/C++ programming. - Hands-on experience with PyTorch, TensorRT, CuDNN, GPU computing (CUDA, OpenCL, OpenACC), and HPC (MPI, OpenMP). - Thorough understanding of container technologies like Docker, Singularity, Shifter, Charliecloud. - Proficient in Python programming, bash scripting, and operating systems including Windows, Ubuntu, and Centos. - Excellent communication, collaboration, and problem-solving skills. Good To Have: - Practical experience with HPC cluster job schedulers such as Kubernetes, SLURM, LSF. - Familiarity with cloud computing architectures. - Hands-on exposure to Software Defined Networking and HPC cluster networking. - Working knowledge of cluster configuration management tools like Ansible, Puppet, Salt. - Understanding of fast, distributed storage systems and Linux file systems for HPC workloads. This role offers an exciting opportunity to contribute to cutting-edge technology solutions and make a significant impact in the field of deep learning and high-performance computing. If you are a self-motivated individual with a passion for innovation and a track record of delivering results, we encourage you to apply.,

Posted 2 days ago

Apply

6.0 - 10.0 years

0 Lacs

noida, uttar pradesh

On-site

We are looking for a highly skilled HPC/GPU Operations Engineer with 6-10 years of experience to oversee the management, optimization, and maintenance of high-performance computing (HPC) infrastructure, specifically focusing on GPU-accelerated workloads. Your primary responsibilities will include ensuring the reliability, efficiency, and scalability of HPC systems used for scientific computing, AI/ML, and data-intensive applications. As an IC3 level professional, you will be tasked with the following responsibilities: - Managing and administering HPC clusters, GPU nodes, and high-speed interconnects. - Configuring GPU-accelerated workloads for AI/ML, scientific computing, and simulations. - Monitoring system performance, troubleshooting issues, and optimizing resource utilization. - Installing, configuring, and maintaining HPC-related software, libraries, and tools such as CUDA, OpenMP, and MPI. - Supporting containerized workflows using technologies like Docker and Singularity. - Ensuring software compatibility with GPU architectures from NVIDIA, AMD, and Intel. - Tuning GPU and CPU performance for specific workloads, including benchmarking and profiling. - Utilizing monitoring tools like Prometheus, Grafana, and Slurm to track system health and efficiency. - Optimizing scheduling and resource allocation in workload managers such as Slurm, PBS, and LSF. - Upholding system security and access control for HPC resources. - Applying software patches, firmware updates, and security best practices. - Assisting in regulatory compliance for HPC environments. - Providing support to researchers, data scientists, and engineers using HPC resources. - Developing and maintaining documentation on best practices, troubleshooting, and system usage. - Conducting training sessions or workshops on HPC/GPU computing. To qualify for this role, you should possess the following technical skills: - Experience in managing HPC clusters and GPU-based computing environments. - Proficiency in Linux system administration, scripting (Bash, Python), and automation (Ansible, Terraform). - Knowledge of parallel computing, GPU programming (CUDA, OpenCL), and HPC frameworks. - Familiarity with networking technologies such as Infiniband and RDMA, storage solutions like Lustre, GPFS, NFS, and virtualization. At Oracle, we are dedicated to leveraging cutting-edge technology to address current challenges while fostering an inclusive work environment that encourages innovation and collaboration. We offer a range of competitive benefits, flexible medical, life insurance, and retirement options, and support employee engagement in volunteer programs. We are committed to providing accessibility assistance or accommodation for individuals with disabilities throughout the employment process. If you require such assistance, please contact us via email at accommodation-request_mb@oracle.com or by phone at +1 888 404 2494 in the United States.,

Posted 1 week ago

Apply

2.0 - 6.0 years

0 Lacs

telangana

On-site

You will be joining our team as a System Development Engineer focusing on the Hybrid Scientific Computing Stack. A strong background in computer science and software development is required for this role, and knowledge of quantum computing would be an added advantage. Your responsibilities will include working on backend services such as FastAPI, Celery, OAuth, PostgreSQL, and Redis. You will also be involved in hybrid job orchestration using tools like Celery, RabbitMQ, Slurm, and Kubernetes. Containerized workflows using Docker, Singularity, and Helm will be part of your tasks. Monitoring and observability tasks will involve tools like Prometheus, Grafana, Loki, and Flower. Cloud-based deployment on platforms like GCP, AWS, and Azure, as well as secure on-prem server management, will also be within your purview. Additionally, you will work on scientific environments involving CUDA, Qiskit, Conda, GROMACS, and Lmod. To qualify for this position, you should hold a minimum Bachelor's Degree in Computer Science or related fields and have at least 2 years of professional work experience in full-stack systems engineering. Proficiency in Python (FastAPI/Celery), Linux (Ubuntu/Debian), and DevOps is required. Familiarity with cloud-native tools like Docker, Kubernetes, Helm, and GitHub Actions is essential. Experience with Slurm, GPU resource allocation, and secure job execution will be beneficial. Any familiarity with quantum SDKs such as Qiskit, PennyLane, and Cirq will be considered a bonus.,

Posted 3 weeks ago

Apply

2.0 - 6.0 years

0 Lacs

telangana

On-site

You will be joining our team as a Systems Development Engineer for the Hybrid Scientific Computing Stack. A strong background in computer science and software development is essential for this role, with knowledge of quantum computing considered a valuable asset. Your responsibilities will include managing backend services such as FastAPI, Celery, OAuth, PostgreSQL, and Redis. You will also be involved in hybrid job orchestration using tools like Celery, RabbitMQ, Slurm, and Kubernetes, as well as working on containerized workflows with Docker, Singularity, Helm, and Kubernetes. Monitoring and observability tasks will involve tools like Prometheus, Grafana, Loki, and Flower. Additionally, you will be responsible for cloud-based deployment on platforms like GCP, AWS, and Azure, as well as secure on-prem server management including GPU/CPU scheduling, RBAC, and SSH-only access. Familiarity with scientific environments such as CUDA, Qiskit, Conda, GROMACS, and Lmod will also be part of your role. To qualify for this position, you should hold a minimum Bachelor's Degree in Computer Science or related fields and have at least 2 years of professional work experience in full-stack systems engineering. Proficiency in Python (FastAPI/Celery), Linux (Ubuntu/Debian), and DevOps is required. You should also be familiar with cloud-native tools like Docker, Kubernetes, Helm, and GitHub Actions. Experience with Slurm, GPU resource allocation, and secure job execution will be beneficial. Any familiarity with quantum SDKs such as Qiskit, PennyLane, and Cirq will be considered a bonus.,

Posted 3 weeks ago

Apply

8.0 - 13.0 years

40 - 80 Lacs

Bengaluru, Delhi / NCR, Mumbai (All Areas)

Work from Office

Graduate with 8+ years in product / solution management Product Manager / Offering Manager in Cloud, IaaS, HPC, or related high-tech domains. Hands-on understanding of technologies like NVIDIA GPU stacks, containerized HPC (Singularity, Docker), scheduling systems (SLURM, PBS), Lustre / GPFS Familiarity with as-a-Service constructs, subscription models, and TCO discussions. Product lifecycle management. Project and cross-functional stakeholder management Strong articulation, documentation, and influencing ability Able to interact across sales, delivery, product, and finance Suitable candidates may forward their updated profiles in strict confidence to hr33@hectorandstreak.com or call on 9699224920

Posted 1 month ago

Apply

10.0 - 20.0 years

40 - 80 Lacs

Bengaluru, Delhi / NCR, Mumbai (All Areas)

Work from Office

Bachelor's degree in Computer Science with 10+ years of experience with HPC environments Experience in HPC architecture and design, with a proven track record of delivering complex HPC solutions. Experience in designing and implementing HPC solutions on public cloud, private cloud, and on-premises infrastructure. Knowledge of HPC technologies, including M PI, OpenMP, Infiniband, GPFS, Lustre , and other file systems, cluster management tools such as Slurm , Torque, or LSF, and scheduling software such as PBSPro. Excellent communication skills, including the ability to communicate technical concepts to both technical and non-technical audiences. Experience with virtualization and containerization technologies such as Docker, Kubernetes, and Singularity. Strong understanding of networking technologies and protocols, including TCP/IP, Infiniband, and RDMA. Familiarity with one or more programming languages such as C, C++, Fortran, Python, or Java. Experience working in a multi-vendor, multi-cloud environment. Strong problem-solving skills and the ability to work under pressure in a fast-paced environment. Suitable candidates may forward their updated profiles in strict confidence to hr33@hectorandstreak.com or call on 9699224920

Posted 1 month ago

Apply

10.0 - 12.0 years

0 Lacs

Hyderabad / Secunderabad, Telangana, Telangana, India

On-site

Introduction A career in IBM Software means youll be part of a team that transforms our customers challenges into solutions. Seeking new possibilities and always staying curious, we are a team dedicated to creating the worlds leading AI-powered, cloud-native software solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career. We are seeking a skilled back-end developer to join our IBM Software team. As part of our team, you will be responsible for developing and maintaining high-quality software products, working with a variety of technologies and programming languages. IBMs product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive. Your role and responsibilities As a IBM Spectrum LSF Backend Software Developer, you will be responsible for designing and developing components and features for IBM Spectrum LSF, and would be involved in designing , developing and discussing product delivery & strategy. You should also have leadership quality to manage and work as technical leads/software architect and be able to deliver end to end features. As part of worldwide development team, you will be collaborating with team members and clients from different timezone to support business success. You will be addressing product issues reported from clients and providing solutions of fixes in timely manner. Be an avid coder who can get his hands dirty and be involved in the coding to the deepest level. Work other developers in the dev team to maintain and improve code base. Work in an Agile environment of continuous deliverable. Youll learn directly from Sr members/leaders in this field Required education Bachelors Degree Required technical and professional expertise . Proven knowledge of software development principles and agile development experience . 10+ years of experience and strong knowledge in C, C++ . Working experience of Java and Python . 3+ years of experience in development of systems or enterprise software on Linux . Good knowledge of Linux kernel, system administration, networking, and performance . Good knowledge of distributed system and enterprise software . Self learner . Proactive approach . Excellent communication skills Preferred technical and professional experience . Experience with container (docker, singularity, podman) and container-based platform . Experience working with Git, AWS, Azure, Google Cloud . Good understanding and development experience on Windows . Development experience with GPU . Client interaction experience

Posted 2 months ago

Apply

10.0 - 12.0 years

0 Lacs

Hyderabad / Secunderabad, Telangana, Telangana, India

On-site

Introduction A career in IBM Software means youll be part of a team that transforms our customers challenges into solutions. Seeking new possibilities and always staying curious, we are a team dedicated to creating the worlds leading AI-powered, cloud-native software solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career. We are seeking a skilled back-end developer to join our IBM Software team. As part of our team, you will be responsible for developing and maintaining high-quality software products, working with a variety of technologies and programming languages. IBMs product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive. Your role and responsibilities As a IBM Spectrum LSF Backend Software Developer, you will be responsible for designing and developing components and features for IBM Spectrum LSF, and would be involved in designing , developing and discussing product delivery & strategy. You should also have leadership quality to manage and work as technical leads/software architect and be able to deliver end to end features. As part of worldwide development team, you will be collaborating with team members and clients from different timezone to support business success. You will be addressing product issues reported from clients and providing solutions of fixes in timely manner. Be an avid coder who can get his hands dirty and be involved in the coding to the deepest level. Work other developers in the dev team to maintain and improve code base. Work in an Agile environment of continuous deliverable. Youll learn directly from Sr members/leaders in this field Required education Bachelors Degree Required technical and professional expertise 1). Completion of Computer Science/Computer Engineering Degree 2). 10+ years of working experience on software development projects with Agile process 3). Experience of designing functional specification and implementing with code 4). Experience of using Javascripts, HTML, CSS etc web technology to debug the frontend page 5). Experience of using Linux OS and Mysql/MariaDB database 6). Understanding of the principles of Object Oriented software design and programming Requirements for PAC , Explorer and Process Manager: 1). Strong knowledge of Java with a focus on web base UI development 2). Work experiences of AngularJS, Nodejs, RESTful web services API 3). Knowledge and experience of C/C++ development Requirements for RTM 1). Strong knowledge of PHP + Mysql on web base UI development 2). Experiences of Cacti monitoring framework 3). Knowledge and experience of C development Requirements for Lsfsuite, Simulator and Predictor 1). Knowledge of ansible software deployment over network 2). Experiences of python and flask to build web application 3). Experience of container-docker, singularity and podman Preferred technical and professional experience 1). Experience of using github to manage source code 2). Good communication (verbal and written) and interpersonal skills 3). UX and UI Design experience is an asset. 4). Knowledge and experience of IBM Websphere, Webpack, Spring framework are highly desired 5). experience of scikit-learn and pandas for data processing, prediction and inferencing 6). Knowledge and experience of IBM Watson Machine Learning is highly desired 7). Knowledge of IBM Spectrum LSF is desired

Posted 2 months ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies