Jobs
Interviews

758 Cuda Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 years

1 - 2 Lacs

sān

On-site

We are building a distributed LLM inference network that combines idle GPU capacity from around the world into a single cohesive plane of compute that can be used for running large-language models like DeepSeek and Llama 4. At any given moment, we have over 5,000 GPUs and hundreds of terabytes of VRAM connected to the network. We are a small, well-funded team working on difficult, high-impact problems at the intersection of AI and distributed systems. We primarily work in-person from our office in downtown San Francisco. **Responsibilities ** - Design and implement optimization techniques to increase model throughput and reduce latency across our suite of models - Deploy and maintain large language models at scale in production environments - Deploy new models as they are released by frontier labs - Implement techniques like quantization, speculative decoding, and KV cache reuse - Contribute regularly to open source projects such as SGLang and vLLM - Deep dive into underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vLLM, SGLang, CUDA, and other libraries to debug ML performance issues - Collaborate with the engineering team to bring new features and capabilities to our inference platform - Develop robust and scalable infrastructure for AI model serving - Create and maintain technical documentation for inference systems **Requirements ** - 3+ years of experience writing high-performance, production-quality code - Strong proficiency with Python and deep learning frameworks, particularly PyTorch - Demonstrated experience with LLM inference optimization techniques - Hands-on experience with SGLang and vLLM, with contributions to these projects strongly preferred - Familiarity with Docker and Kubernetes for containerized deployments - Experience with CUDA programming and GPU optimization - Strong understanding of distributed systems and scalability challenges - Proven track record of optimizing AI models for production environments Nice to Have - Familiarity with TensorRT and TensorRT-LLM - Knowledge of vision models and multimodal AI systems - Experience implementing techniques like quantization and speculative decoding - Contributions to open source machine learning projects - Experience with large-scale distributed computing **Compensation **We offer competitive compensation, equity in a high-growth startup, and comprehensive benefits. The base salary range for this role is

Posted 2 hours ago

Apply

7.0 years

0 Lacs

thane, maharashtra, india

On-site

Job Description: You will provide leadership in designing and implementing ground-breaking GPU computers that run demanding deep learning, high-performance computing, and computationally intensive workloads. We seek an expert to identify architectural changes and/or completely new approaches for accelerating our deep learning models. As an expert, you will help us with the strategic challenges we encounter, including compute, networking, and storage design for large scale, high-performance workloads, effective resource utilization in a heterogeneous computing environment, evolving our private/public cloud strategy, capacity modelling, and growth planning across our products and services. As an architect you are responsible for converting business needs associated with AI-ML algorithms in to a set of product goals covering workload scenarios, end user expectations, compute infrastructure and time of execution; this should lead to a plan for making the algorithms production ready Benchmark and optimise the Computer Vision Algorithms and the Hardware Accelerators for performance and quality KPIs. Optimize algorithms for optimal performance on the GPU tensor cores. Collaborate with various teams to drive an end to end workflow from data curation and training to performance optimization and deployment. Assign tasks to the team and monitor as well Skills Required MS or PhD in Computer Science, Electrical Engineering, or related field. A strong background in deployment of complex deep learning architectures . 7+ years of relevant experience in at least a few of the following relevant areas is required in your work history: Machine learning (with focus on Deep Neural Networks), including understanding of DL fundamentals; Experience adapting and training DNNs for various tasks; Experience developing code for one or more of the DNN training frameworks (such as Caffe, TensorFlow or Torch): Numerical analysis, Performance analysis, Model compression and Optimization & Computer architecture. Strong Data structures and Algorithms know-how with Excellent C/C++ programming skills. Hands-on expertise with PyTorch, TensorRT, CuDNN Hand-on expertise with GPU computing (CUDA, OpenCL, OpenACC) and HPC (MPI, OpenMP) In-depth understanding of container technologies like Docker, Singularity, Shifter, Charliecloud. Proficient in Python programming and bash scripting. Proficient in Windows, Ubuntu and Centos operating systems. Excellent communication and collaboration skills. Self-motivated and able to find creative practical solutions to problems. Good to have Hands-on experience with HPC cluster job schedulers such as Kubernetes, SLURM, LSF. Familiarity with cloud computing architectures Hands-on experience with Software Defined Networking and HPC cluster networking. Working knowledge of cluster configuration management tools such as Ansible, Puppet, Salt. Understanding of fast, distributed storage systems and Linux file systems for HPC workloads. About Company: AIRA Matrix provides artificial intelligence based solutions for Life Sciences applications. Our solutions improve efficiency, diagnostic accuracy, and turnaround times in pathology, microbiology and ophthalmology workflows across pharmaceutical and healthcare laboratories. We leverage machine and deep learning techniques to develop diagnostic, prognostic, and predictive solutions. Our solutions provide cost benefits in the pharmaceutical domain, by speeding up pre-clinical drug development timelines, and by enhancing the efficiency of environmental monitoring required in manufacturing. In healthcare applications, our solutions improve treatment outcomes by aiding disease stratification and enabling management protocols tailored to individual patients. Our clients and partners include leading hospitals, pharmaceutical companies, CROs, and research labs around the world. Our deep learning platforms with existing network models and pre-built AI applications provide the foundation for fast customizations and help tackle any unique challenges in your image analysis and study management workflows. Our flexible service model enables the swift deployment of these custom solutions with minimal resource and time commitment from your side. Our Application Development Team plays an important role in developing competent customer facing applications to access our AI solutions and enterprise-level image management systems in life sciences. -- Regards, Surya Prajapati Talent Acquisition Specialist Email : surya.prajapati@airamatrix.com Website : https://www.airamatrix.com Dosti Pinnacle, 801, Rd No. 22, Wagle Industrial Estate , Thane (W) Maharashtra, India - 400604.

Posted 2 hours ago

Apply

0 years

0 Lacs

coimbatore, tamil nadu, india

On-site

Company Description GANSHY Solutions excels in connecting the right talents with the right opportunities. Whether you are a professional seeking a new position or a client looking to hire a top performer, we are uniquely positioned to meet your needs throughout Europe. Role Description This is a full-time on-site role for an Embedded Software Developer, located in Coimbatore . The Embedded Software Developer will be responsible for developing embedded software, programming, debugging, and designing software. The developer will work closely with the engineering team to create efficient and reliable software solutions for embedded systems. Qualifications Skills in Embedded Software Programming and Embedded Software Experience working with Mono and Stereo camera Experience in image detection algorithm is advantages Experience in training ML-Mode Python, Pytorch yolov11, mm2action Conda CUDA understanding is advantages

Posted 3 hours ago

Apply

0 years

0 Lacs

bengaluru, karnataka, india

On-site

We are seeking PhD holders—freshers or experienced—with expertise in High-Performance Computing (HPC) or related fields such as Computer Science, Engineering, Physics, or Computational Science. If you are passionate about performance optimization, parallel computing, and tackling complex computational challenges, this role is for you. Key Qualifications PhD in HPC or a related discipline. Strong programming skills in C, C++, or Python. Familiarity with MPI, OpenMP, CUDA, or other parallel computing frameworks (preferred). Passion for performance, scalability, and impactful problem-solving. Excellent analytical, research, and problem-solving skills. Who Can Apply Fresh PhD graduates eager to pursue cutting-edge research in HPC. Experienced researchers/professionals with academic or industry background in HPC. Skills: research,mpi,cuda,hpc,openmp

Posted 4 hours ago

Apply

11.0 - 15.0 years

0 Lacs

bangalore, karnataka

On-site

As an AI Infrastructure Engineer at Cisco, you will be part of an innovation team focused on transforming how enterprises utilize AI. Operating with the agility of a startup and the focus of an incubator, you'll work alongside seasoned engineers, architects, and thinkers to craft iconic products that reshape industries. If you're energized by solving hard problems and shaping the future of AI infrastructure, this role is for you. **Key Responsibilities:** - Design and develop node-level infrastructure components for high-performance AI workloads. - Benchmark, analyze, and optimize AI infrastructure performance, including CUDA kernels and GPU memory management. - Ensure minimal downtime through seamless configuration and upgrade architecture for software components. - Manage installation and deployment of AI infrastructure on Kubernetes clusters, including CRDs and operators. - Develop efficient telemetry collection systems for nodes and hardware components without impacting workload performance. - Ensure scalability, resilience, and reliability through distributed system fundamentals. - Collaborate across teams and time zones to influence the direction of AI infrastructure development. **Minimum Qualifications:** - Proficiency in programming languages such as C/C++, Golang, Python, or eBPF. - Strong understanding of Linux operating systems, including user space and kernel-level components. - Experience with Linux user space development, packaging, logging, telemetry, and process lifecycle management. - Knowledge of Kubernetes (K8s) and related technologies like CRDs. - Excellent debugging and complex problem-solving skills. - Bachelor's degree or higher with 11+ years of relevant engineering experience. **Preferred Qualifications:** - Hands-on expertise in Linux kernel and device drivers. - Experience in GPU programming and optimization, including CUDA and UCX. - Familiarity with high-speed data transfer technologies like RDMA. - Proficiency in Nvidia GPU operators, Nvidia container toolkit, Nsight, CUPTI, Nvidia MIG, and MPS concepts. Cisco is dedicated to fostering an inclusive environment where every individual's unique skills and perspectives contribute to our purpose of powering an inclusive future for all. With a focus on connection, learning, and development, Cisconians have the opportunity to grow within the company. By leveraging technology, tools, and culture to pioneer hybrid work trends, Cisco empowers its employees to give their best and be their best. Cisco's commitment to bringing communities together is demonstrated through initiatives like Inclusive Communities, where employees collaborate to foster belonging, learn, and make a difference. Additionally, dedicated paid time off for volunteering enables employees to give back to causes they are passionate about. Cisco's purpose, driven by its people, positions the company as a worldwide leader in technology that powers the internet, helping customers reimagine their applications, secure their enterprise, transform their infrastructure, and meet sustainability goals. Join Cisco and take the next step towards a more inclusive future for all.,

Posted 1 day ago

Apply

2.0 - 6.0 years

10 - 15 Lacs

pune

Work from Office

The opportunity We are currently looking for a Computer Vision Engineer, to join our office in Pune. As a Computer Vision Engineer, you will support our clients in defining and implementing a data journey aligned with their strategic objectives. Some of your responsibilities will include: Work with the research team to research, develop, evaluate, and optimize various computer vision and deep learning models for different problems. Take ownership to drive computer vision solutions and meet customer requirements. Deploying developed computer vision models on edge devices after optimization to meet customer requirements and maintain them to later improve to address additional customer requirements in the future. Developing data handling and machine learning pipelines for training In-depth understanding of computer vision models including object detection, semantic segmentation, and key-point detection Implementing algorithms in robust, efficient, and well-tested code. Skills and attributes for success To qualify for the role, you must have Experience of minimum 2 years Ability to develop Deep Learning frameworks to solve problems. Design and create platforms for image processing and visualization. Knowledge of computer vision libraries. Understanding of dataflow programming. B.E. (E&TC /Computer / IT / Mechanical / Electronics) C++, video analytics, CUDA, Deepstream

Posted 1 day ago

Apply

10.0 years

0 Lacs

bangalore rural, karnataka, india

On-site

Cohesity is the leader in AI-powered data security. Over 13,600 enterprise customers, including over 85 of the Fortune 100 and nearly 70% of the Global 500, rely on Cohesity to strengthen their resilience while providing Gen AI insights into their vast amounts of data. Formed from the combination of Cohesity with Veritas’ enterprise data protection business, the company’s solutions secure and protect data on-premises, in the cloud, and at the edge. Backed by NVIDIA, IBM, HPE, Cisco, AWS, Google Cloud, and others, Cohesity is headquartered in Santa Clara, CA, with offices around the globe. We’ve been named a Leader by multiple analyst firms and have been globally recognized for Innovation, Product Strength, and Simplicity in Design , and our culture. Want to join the leader in AI-powered data security? At Cohesity, we are shaping the future of enterprise data protection and management with AI-driven innovation. Our AI Solutions team partners with marquee customers to design and validate cutting-edge solutions powered by Cohesity Gaia. By combining enterprise data management with Generative AI and advanced ML, we help customers solve real-world problems while influencing the future of our products. As an AI Solution Architect, you’ll be the key product and technology expert, bridging sales, product, engineering, and customer success. You will design, deploy, and validate Cohesity Gaia solutions—including DataProtect, SmartFiles, and AI-powered RAG workflows—helping customers accelerate AI adoption with confidence. This is a customer-facing, hybrid role requiring both technical depth and business acumen. How You'll Spend Your Time Here Customer Solution Design & Adoption Architect AI/ML solutions leveraging Cohesity Gaia, DataProtect, Kubernetes, and NVIDIA GPUs. Lead POCs, accelerate adoption, and act as a trusted advisor for AI integrations across cloud, storage, and enterprise platforms. Field & Product Collaboration Partner with Sales and field teams to win complex AI-driven opportunities. Provide feedback on AI use cases, SaaS plugins, and deployment patterns. Validate new features as “customer zero” before general release. Design, Testing & Validation Create reference architectures combining Gaia with DataProtect and SmartFiles. Benchmark LLMs, Agentic AI workflows, and SaaS adapters for enterprise readiness. Ensure solutions meet compliance, resilience, and responsible AI standards. Enablement & Evangelism Train field teams, partners, and customers on AI, GPU acceleration, and LLMOps. Lead workshops, roadmap presentations, and technical deep dives. Publish whitepapers and best practices to establish Cohesity as an AI thought leader. Development & Tools Build lightweight scripts, plugins, and adapters to simplify integrations and POCs. Document validation checklists, GPU/LLM configs, and performance benchmarks. WE'D LOVE TO TALK TO YOU IF YOU HAVE MANY OF THE FOLLOWING: Technical Expertise Deep knowledge of Cohesity DataProtect, SmartFiles, and SaaS-based data management. Experience architecting AI/ML/GenAI solutions across hybrid and on-prem environments. Hands-on expertise with NVIDIA GPUs (H100, H200, L40S, RTX 6000 Blackwell, A100), CUDA, TensorRT, and Triton. Familiarity with vector databases (Milvus, Pinecone, etc) and LLMs (OpenAI, Gemini, Claude, Cohere, Llama 2, Mistral). Proficiency in Kubernetes, OpenShift, MLOps pipelines, and scripting (Python, Bash, PowerShell). Enterprise Experience 10+ years in customer-facing solution roles with enterprise-grade AI or data solutions. Proven success in leading POCs, integrations, and enterprise rollouts. Soft Skills Strong communication and presentation skills with both technical and executive stakeholders. Structured, proactive, and self-driven in fast-paced environments. Education & Travel Bachelor’s or Master’s in Computer Science, Engineering, or related field. Willingness to travel 10–20% for customer and field engagements. Key Technologies Data & Platforms: Cohesity DataProtect, SmartFiles, Gaia, NAS/external file services (NetApp, Isilon) Infrastructure Virtualization: Kubernetes (OpenShift, VMware Tanzu, Rancher, Nutanix NKE, Amazon EKS, Azure Stack HCI) AI/ML Frameworks: RAG, Agentic AI, LLMOps, NVIDIA Triton, TensorRT LLMs: OpenAI GPT, Google Gemini, Anthropic Claude, Cohere, Mistral, Llama 2, Falcon, fine-tuned/custom models Vector Databases: Milvus, Pinecone Hardware: Deployment/administration of NVIDIA GPU nodes (L40S, A100, H100, H200, RTX 6000 Blackwell) Development: Python, Bash, PowerShell – scripting for quick plugins, adapters, scrapers during POCs Data Privacy Notice For Job Candidates For information on personal data processing, please see our Privacy Policy . Equal Employment Opportunity Employer (EEOE) Cohesity is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status or any other category protected by law. If you are an individual with a disability and require a reasonable accommodation to complete any part of the application process, or are limited in the ability or unable to access or use this online application process and need an alternative method for applying, you may contact us at 1-855-9COHESITY or talent@cohesity.com for assistance. In-Office Expectations Cohesity employees who are within a reasonable commute (e.g. within a forty-five (45) minute average travel time) work out of our core offices 2-3 days a week of their choosing. Interested candidates based outside of the designated areas are welcome to apply, provided they have the right to work in the job location.

Posted 2 days ago

Apply

6.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Netradyne harnesses the power of Computer Vision and Edge Computing to revolutionize the modern-day transportation ecosystem. We are a leader in fleet safety solutions. With growth exceeding 4x year over year, our solution is quickly being recognized as a significant disruptive technology. Our team is growing, and we need forward-thinking, uncompromising, competitive team members to continue to facilitate our growth. Role:- Embedded Software Engineer- Multimedia Developer Department/Group:Device Team Experience: - 6 to 10 years About Netradyne Founded in 2015, Netradyne is a technology company that leverages expertise in Artificial Intelligence, Deep Learning, and Edge Computing to bring transformational solutions to the transportation industry. Netradyne’ s technology is already deployed in thousands of vehicles; and our customers drive everything from passenger cars to semi-trailers on interstates, suburban roads, rural highways—even off-road. Responsibilities: Individual in this position should have a zeal to learn and ability to come up with innovative solutions for our latest cutting-edge embedded vision platforms Candidate should have experience on design and implementing solutions for embedded systems. Individual will be responsible to take Audio/Video data captured by camera and process it in real time and store it efficiently using right encoding techniques Should have ability to handle probes at different source/sinc points in pipeline and carry out the required functionality Essential Skills: Linux/ Android, Middleware, Multimedia – Camera, GStreamer ( Video/Audio pipeline) or equivalent streaming framework exposure Ability to develop software in C, C++ Hands on experience on design and implementation of software modules on embedded systems Good oral and written skills Rapidly adapt to a challenging work environment, Team spirit and good communication skills Optional Skills: Linux device driver and RTOS is a plus. GPU, Cuda, OpenCL, SNPE, Algorithm Optimization is a plus. Qualifications and Education Requirements BE/ME Or BTECH/MTECH in a related field We are committed to an inclusive and diverse team. Netradyne is an equal-opportunity employer. We do not discriminate based on race, color, ethnicity, ancestry, national origin, religion, sex, gender, gender identity, gender expression, sexual orientation, age, disability, veteran status, genetic information, marital status, or any legally protected status. If there is a match between your experiences/skills and the Company's needs, we will contact you directly. Netradyne is an equal-opportunity employer. Applicants only - Recruiting agencies do not contact. Recruitment Fraud Alert! There has been an increase in fraud that targets job seekers. Scammers may present themselves to job seekers as Netradyne employees or recruiters. Please be aware that Netradyne does not request sensitive personal data from applicants via text/instant message or any unsecured method; does not promise any advance payment for work equipment set-up and does not use recruitment or job-sourcing agencies that charge candidates an advance fee of any kind. Official communication about your application will only come from emails ending in ‘@netradyne.com’ or ‘@us-greenhouse-mail.io’. Please review and apply to our available job openings at Netradyne.com/company/careers. For more information on avoiding and reporting scams, please visit the Federal Trade Commission's job scams website.

Posted 2 days ago

Apply

2.0 years

0 Lacs

pune, maharashtra, india

On-site

The opportunity We are currently looking for a Computer Vision Engineer, to join our office in Pune. As a Computer Vision Engineer, you will support our clients in defining and implementing a data journey aligned with their strategic objectives. Some of your responsibilities will include: Work with the research team to research, develop, evaluate, and optimize various computer vision and deep learning models for different problems. Take ownership to drive computer vision solutions and meet customer requirements. Deploying developed computer vision models on edge devices after optimization to meet customer requirements and maintain them to later improve to address additional customer requirements in the future. Developing data handling and machine learning pipelines for training In-depth understanding of computer vision models, including object detection, semantic segmentation, and key-point detection Implementing algorithms in robust, efficient, and well-tested code. Skills and attributes for success To qualify for the role, you must have Minimum 2 years of professional experience Ability to develop Deep Learning frameworks to solve problems. Design and create platforms for image processing and visualization. Knowledge of computer vision libraries. Understanding of dataflow programming. B.E. (E&TC/Computer/IT / Mechanical / Electronics) C++, video analytics, CUDA, Deepstream

Posted 2 days ago

Apply

2.0 - 6.0 years

0 Lacs

karnataka

On-site

Role Overview: Motherson Health & Medical is dedicated to promoting health and well-being by facilitating access to top-quality and cost-effective healthcare solutions. Leveraging the extensive expertise of the Motherson Group, which boasts a global presence of over 180,000 employees and 350+ manufacturing facilities across 41 countries, we strive to revolutionize the healthcare sector by enhancing access to affordable and high-quality healthcare services worldwide. By collaborating with various entities within the healthcare ecosystem, including universities, hospitals, research groups, startups, and healthcare organizations, we continuously enhance our capabilities and deliver cutting-edge products to our clientele. Join us in our mission to transform the healthcare industry and make quality healthcare accessible to all. Key Responsibilities: - Write efficient and clean runtime code for a range of applications, from quick prototypes to complex desktop applications - Lead strategies using tools for static/dynamic analysis, memory management, code coverage, and software analysis techniques - Oversee integration and deployment processes - Demonstrate aptitude for technical writing and meticulous process documentation Qualifications Required: - Bachelor's degree in Software Engineering/Computer Science or a related field, along with 2+ years of work experience - Proficiency in coding with C++, OpenGL, and CUDA - Influence the Software Development Life Cycle processes and best practices collaboratively - Knowledge of efficient Version Control systems - Lead in creating estimates for code implementation time/resource for assigned tasks and projects Additional Details: The team at Motherson Health & Medical is expanding in alignment with the growth of our Group, offering numerous growth opportunities for you to advance your career alongside us. Join us in our quest to drive positive change in global healthcare.,

Posted 2 days ago

Apply

5.0 - 9.0 years

0 Lacs

maharashtra

On-site

Role Overview: As an HPC Application Specialist at Corning, you will be a crucial member of the global Scientific Computing team, contributing to the development and utilization of scientific software on the HPC Clusters. Your role will involve collaborating with various teams and communities to identify, develop, and implement solutions that support Modeling and Scientific Computing objectives, particularly focusing on the solid mechanics community. Key Responsibilities: - Engage with the diverse global HPC user community, addressing issues ranging from individual support tickets to participation in modeling projects. - Develop, validate, and apply numerical models for solving nonlinear FEA, fracture mechanics, and other applied mechanics problems using open-source tools like MOOSE and FEniCS. - Expand the capabilities of FEM-based open-source software as required, collaborating with external code owners and internal stakeholders. - Create models tailored for HPC environments, ensuring efficient resource utilization, scalability, and parallel execution. - Research, install, configure, maintain, and optimize a wide range of commercial and open-source scientific software for HPC clusters. - Collaborate with scientists and engineers to identify, model, and automate complex scientific processes. - Research and resolve software build, execution, and performance issues promptly and efficiently. - Conduct training sessions on new software or HPC capabilities and maintain technical documentation for the HPC user community. - Foster relationships to drive collaboration and partnerships for enhanced technology services within the community. Qualifications Required: - Experience developing complex numerical models using open-source tools like MOOSE and FEniCS, with a deep understanding of the source code. - Proficiency in solid mechanics and FEA, along with a strong grasp of solid mechanics concepts and theory. - Skilled in programming languages for scientific high-performance computing such as C/C++, Python, and FORTRAN. - Familiarity with HPC and parallel programming concepts and techniques like MPI, OpenMP, OpenACC, and CUDA. - Proven experience in developing, configuring, and troubleshooting applications in Linux-based environments. - Sound knowledge of High-Performance Computing (HPC) environment and related technologies. - Ability to write, port, debug, analyze, and optimize parallel programs effectively. - Understanding of the software development process and strong communication, troubleshooting, and problem-solving skills. - Adaptable to changing requirements and capable of working well both independently and within project teams. Additional Company Details: Corning is a pioneering company known for its breakthrough innovations in glass, ceramic, and materials science. The company's technologies span diverse fields, from ocean depths to outer space, pushing the boundaries of what's achievable. Corning's people play a vital role in driving the company and the world forward, breaking through limitations and expectations daily to make a lasting impact. At Corning, you'll find endless opportunities to contribute to transformative projects in various sectors, including connecting the unconnected, advancing automotive technology, revolutionizing home entertainment, and facilitating the delivery of life-saving medicines. Join Corning and be a part of the innovation journey.,

Posted 2 days ago

Apply

5.0 - 10.0 years

8 - 10 Lacs

belgaum

Work from Office

We are seeking a highly skilled software engineer, very proficient in C++ or Python coding. What You'll Do Design, develop, and maintain high-performance software applications using Python & C++. Optimize and enhance existing software for efficiency, scalability, and reliability. Collaborate with cross-functional teams, including mechanical, electrical, computer vision, and software engineers. Implement best practices in software engineering, including code reviews, unit testing, and documentation. Debug, troubleshoot, and resolve software defects and performance issues. Work with modern development tools, version control systems (Git), and CI/CD pipelines. Develop algorithms and data structures to solve complex computational problems. Ensure security and compliance standards are met in software development. What You'll Need Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field. Strong proficiency in C++ (C++11/14/17/20) and Python. Experience with multi-threading, concurrency, and performance optimization. Familiarity with software development methodologies and design patterns. Knowledge of system programming, memory management, and debugging tools. Experience with version control systems (Git) and agile development practices. Strong problem-solving skills and ability to work in a fast-paced environment. Professional experience in software development preferred. You'll Stand Out Experience with GPU programming (CUDA, OpenCL) or parallel computing. Knowledge of networking protocols and distributed systems. Exposure to machine learning frameworks (TensorFlow, PyTorch) or scientific computing. Experience with DevOps tools (Docker, Kubernetes, CI/CD pipelines)

Posted 3 days ago

Apply

3.0 years

0 Lacs

hyderabad, telangana, india

Remote

🚀 Hiring: AI/ML Engineer @ Apple – Offshore Team (Remote, Full-Time, Hyderabad, Telangana) ✅ Location: Hyderabad, Telangana (Remote within India) ✅ Client: Apple (Offshore team) ✅ Compensation: ₹25 LPA – ₹48 LPA (depends on experience) ✅ Experience: 3+ years (Multiple Positions) Minimum Qualifications Bachelors or M.S. in Computer Science (or related fields) or equivalent experience Expert in Python programming and tensor operations, including https://github.com/rougier/numpy-100 Expert in at least one ML framework (PyTorch preferred but TensorFlow or JAX are also fine), including flash-attention, efficient kv-caching, DDP and FSDP Hands-on experience in establishing ML benchmarks (data, models, metrics) Involvement with open-source projects and experience with collaborative software development Excellent communication skills Readiness to encounter unforeseen challenges and to solve them Preferred Qualifications Bachelors or M.S. in Computer Science (or related fields) or equivalent experience Expert in Python programming and tensor operations, including https://github.com/rougier/numpy-100 Expert in at least one ML framework (PyTorch preferred but TensorFlow or JAX are also fine), including flash-attention, efficient kv-caching, DDP and FSDP Hands-on experience in establishing ML benchmarks (data, models, metrics) Active development in open-source projects and experience with collaborative software development Experience with CUDA programming, and/or High-Performance Computing and/or distributed computing Excellent communication skills Readiness to encounter unforeseen challenges and to solve them Past research experience C++ programming is a plus MLX experience is a plus Description Work with researchers on the team to build high-performance and scalable software addressing novel ML research algorithms. Apply solid software engineering skills, leverage experience to deal with the unexpected, explore research software solutions and pave the way to future Machine Learning toolboxes. Be part of a small team dedicated to advancing ML algorithms and techniques — Is this you? If so, we’d love to hear from you! (Compensation range is indicative and will vary based on years of experience and skill fit.)

Posted 3 days ago

Apply

5.0 - 10.0 years

8 - 12 Lacs

bengaluru

Work from Office

We are looking for a Software Toolchain Engineer to join our engineering team. In this role, you will be developing, maintaining, and optimizing software development toolchains, including compilers, linkers, debuggers, build systems, and related infrastructure. In particular, the focus is on Software Development Kit (SDK) porting and retargeting pretrained neural network models on an AI inference engine. You have: Bachelors or Masters degree in Computer Science, Computer Engineering, or related field. 5+ years of experience in software toolchain. Strong proficiency in C/C++, Python, PyTorch. Deep working knowledge of compiler internals and linking/loading processes. Experience in CUDA. Understanding of AI neural-network architectures, and formats such as ONNX. Neural-network optimizations such as model quantization, pruning. Understanding of systems-level topics like memory management, scheduling, and multi-core compute. Experience with version control systems like Git. At least one major compiler infrastructure (e.g., GCC). It would be nice if you also had: Contributions to open-source compiler or toolchain projects. Experience in programming kernel functions. Familiarity with deployment to edge devices or cloud inference platforms. Knowledge of low-level programming, embedded systems, or hardware architectures (CPU/DSP/GPU). Design, develop, and maintain software development toolchain, including compilers, linkers, debuggers, static analysis tools, code generators, and build systems. Adopt and customize third-party and open-source tools and technologies to meet specific needs. Identify bottlenecks and areas for improvement within the existing toolchain. Optimize performance and output for specific hardware targets based on a hardware-accelerated multi-core RISC-V system. Collaborate with AI/ML experts, hardware and embedded software engineers. Develop SDK: native API, libraries, plugin to PyTorch and/or TensorFlow backend. Develop user environment from PyTorch to instruction simulator.

Posted 3 days ago

Apply

3.0 years

0 Lacs

gurugram, haryana, india

On-site

Job Description We aim to bring about a new paradigm in medical image diagnostics; providing intelligent, holistic, ethical, explainable and patient centric care. We are looking for innovative problem solvers. We want people who can empathize with the consumer, understand business problems, and design and deliver intelligent products. We are looking for a System Administrator to manage and optimize our on-premise and cloud infrastructure, ensuring reliability, security, and scalability for high-throughput AI workloads. As a System Administrator , you will be responsible for managing servers, storage, network, and compute infrastructure powering our AI development and deployment pipelines. You will ensure seamless handling of large medical imaging datasets (DICOM/NIfTI), maintain high availability for research and production systems. Key Responsibilities Infrastructure & Systems Management Manage Linux-based servers, GPU clusters, and network storage for AI training and inference workloads. Configure and maintain message queue systems (RabbitMQ, ActiveMQ, Kafka) for large-scale, asynchronous AI pipeline execution. Set up and maintain service beacons and health checks to proactively monitor the state of critical services (XNAT pipelines, FastAPI endpoints, AI model inference servers). Maintain PACS integration, DICOM routing, and high-throughput data transfer for medical imaging workflows. Manage hybrid infrastructure (on-prem + cloud) including auto-scaling compute for large training tasks. Implement monitoring and alerting systems for infrastructure uptime, resource utilization, and failures. Service Monitoring & Reliability Implement automated service checking for all production and development services using Prometheus, Grafana, or similar tools. Configure beacon agents to trigger alerts and self-healing scripts for service restarts when anomalies are detected. Set up log aggregation and anomaly detection to catch failures in AI processing pipelines early. Ensure 99.9% uptime for mission-critical systems and clinical services. Security & Compliance Enforce secure access control (IAM, VPN, RBAC, MFA) and maintain audit trails for all system activities. Ensure compliance with HIPAA, GDPR, ISO 27001 for medical data storage and transfer. Encrypt medical imaging data (DICOM/NIfTI) at rest and in transit. Automation & DevOps Develop automation scripts for service restarts, scaling GPU resources, and pipeline deployments. Work with DevOps teams to integrate infrastructure monitoring with CI/CD pipelines. Optimize AI pipeline orchestration with MQ-based task handling for scalable performance. Backup, Disaster Recovery & High Availability Manage data backup policies for medical datasets, AI model artifacts, and PostgreSQL/MongoDB databases. Implement failover systems for MQ brokers and imaging data services to ensure uninterrupted AI processing. Collaboration & Support Work closely with AI engineers and data scientists to optimize compute resource utilization. Support teams in troubleshooting infrastructure and service issues. Maintain license servers and specialized imaging software environments. Skills and Qualifications Required: 3+ years of Linux systems administration experience with a focus on service monitoring and high-availability environments . Experience with message queues (RabbitMQ, ActiveMQ, Kafka) for distributed AI workloads. Familiarity with beacons, service health monitoring, self-healing automation . Experience managing GPU clusters (NVIDIA CUDA, drivers, dockerized AI workflows). Hands-on with cloud platforms (AWS, GCP, Azure). Networking fundamentals (firewalls, VPNs, load balancers). Hands-on experience with GPU-enabled servers (NVIDIA CUDA, drivers, dockerized AI workflows). Experience managing large datasets (100GB–TB scale), preferably in healthcare or scientific research. Familiarity with cloud platforms (AWS EC2, S3, EKS or equivalents). Knowledge of cybersecurity best practices and compliance frameworks (HIPAA, ISO 27001). Preferred: Experience with PACS, XNAT, or medical imaging servers. Familiarity with Prometheus, Grafana, ELK stack, SaltStack beacons , or similar monitoring tools. Knowledge of Kubernetes or Docker Swarm for container orchestration. Basic scripting knowledge (Bash, Python) for task automation. Exposure to database administration (PostgreSQL, MongoDB). Scripting skills (Bash, Python, PowerShell) for automation and troubleshooting. Understanding of databases (PostgreSQL, MongoDB) used in AI pipelines. Education: BE/B Tech Experience: 3-5 Years

Posted 4 days ago

Apply

0 years

0 Lacs

new delhi, delhi, india

On-site

We are seeking a Physical AI Engineer to design, develop, and implement AI-driven control and decision-making systems for humanoid robots and embodied agents. This role involves integrating vision-language-action models, reinforcement learning, imitation learning, and real-time robotics systems to create robots capable of performing complex tasks in dynamic environments. Key Responsibilities Integrate AI models (transformers, diffusion policy networks, LLMs, vision-language models) with physical humanoid robots. Design real-time control frameworks that enable AI decision-making to translate into smooth, safe, and efficient motor actions. Develop pipelines to align simulation-to-reality (sim2real) and optimise robot learning for real-world deployment. Apply multi-modal AI learning (vision, audio, haptics, proprioception) to enhance robot perception and adaptability. Collaborate with hardware teams to calibrate robot sensors, optimise energy efficiency, and ensure reliable AI control execution. Develop safety protocols for autonomous decision-making in environments with humans. Conduct experiments in areas such as human-robot interaction, autonomous navigation, dexterous manipulation, and multi-agent collaboration . Research and implement emerging paradigms in Physical AI , including embodied GPTs, action-conditioned transformers, and world-model-based learning. Required Qualifications Bachelor’s or Master’s degree in Robotics, Mechatronics, Computer Science, AI, or a related field (Ph.D. preferred for senior positions). Strong background in machine learning , particularly reinforcement learning, imitation learning, or embodied AI systems. Hands-on experience with robotics simulation environments (Isaac Sim, MuJoCo, Unity, Gazebo). Proficiency in Python and good knowledge of C++/ROS2 for robotics integration. Deep understanding of robotics control theory, kinematics, and sensor fusion . Demonstrated work with humanoid robots, quadrupeds, or robotic arms , preferably on vision-language-action models. Preferred Skills Familiarity with NVIDIA Jetson, CUDA optimisation, and real-time robotics inference . Experience with large foundation models adapted for robotics (OpenAI VLA, Google RT-2, Gr00T). Knowledge of safety-critical autonomous systems . Exposure to distributed training of large models on RTX/HPC clusters. Working knowledge of teleoperation frameworks (for collecting demonstration data). What We Offer The chance to work on cutting-edge humanoid robotics with embodied AI . Access to advanced computing infrastructure ( NVIDIA RTX 6000 GPUs, Jetson Thor platforms, VR mocap systems, and Unitree robots ). A multi-disciplinary team environment spanning AI research and robotics engineering. Competitive remuneration and growth opportunities for leadership in next-gen Physical AI projects .

Posted 4 days ago

Apply

7.0 - 12.0 years

18 - 24 Lacs

bengaluru

Work from Office

Responsibilities: * Develop GPU models using C++ with Yocto framework, optimize performance through CUDA programming. * Collaborate on CI/CD pipelines for efficient build processes.

Posted 4 days ago

Apply

6.0 years

0 Lacs

chennai, tamil nadu, india

On-site

Domain: Artificial Intelligence / Machine Learning Industry: Information Technology / Product Engineering / Semiconductor / Hardware Manufacturing Job Overview ➡️ Product company,052 experience is mandate. We are seeking an experienced AI Product Manager to lead a team of algorithm engineers in developing and deploying advanced machine learning, deep learning, computer vision, and image processing solutions. The role involves managing both people and technology while driving innovation and scalability in AI systems for high-performance environments. Key Responsibilities Lead and mentor a team of algorithm engineers, ensuring their growth and success. Develop and maintain infrastructure required for large-scale deployment and execution of algorithms. Collaborate with cross-functional teams including data scientists, software engineers, and product managers. Optimize algorithm performance and resource utilization to achieve business goals. Stay updated on the latest advancements in AI, ML, DL, computer vision, and infrastructure technologies. Drive continuous improvements in development processes, methodologies, and tools. Qualifications Educational Requirements: PhD with 6+ years of industry experience, OR M.Tech/Master’s degree with 8+ years of experience, OR B.Tech/Bachelor’s degree with 10+ years of experience Minimum Experience At least 1 year in a Manager/Lead role. 8+ years in programming (Python, C++, CUDA). 8+ years in ML, AI, DL algorithm development. 2–3 years in image processing & computer vision (mandatory). Prior experience in product/semiconductor/hardware manufacturing companies (must-have). Preferred Qualifications Background from Tier-1 institutes (IIT, NIT, IIIT, VIT, etc.). Experience with ML frameworks (TensorFlow, PyTorch, Scikit-learn). Familiarity with GPU architecture and algo development toolkits (e.g., Docker, Apptainer). Experience in high-performance computing, parallel programming, and distributed systems. Strong leadership and people management track record. Stability in career (minimum 2 years tenure in each organization). High academic performance (8+ CGPA desirable). Additional Details Relocation to Chennai is mandatory (relocation expenses covered). Hybrid model: 3 days working from office. Open to candidates globally who are willing to relocate. Interview Process: 3 Technical Rounds + Fitment Round + HR Round. 👉 Important Note This role is not focused on data science/analytics. We are looking for candidates with ML framework development or ML algorithm development experience, not analytics backgrounds. Skills: image processing,computer vision,algorithm development,machine learning,ml

Posted 4 days ago

Apply

2.0 - 6.0 years

0 Lacs

karnataka

On-site

You are seeking a Software Engineer specializing in Video Pipelines to join the Video AI Platform team. Your primary responsibility will involve developing robust and high-performance video pipelines for Video-on-Demand (VoD) and Live streaming systems. Your hands-on approach will be crucial in implementing modules for video decoding, encoding, transcoding, and modification to ensure the platform delivers low-latency, high-quality video experiences at scale. Key Responsibilities: - Build and maintain video ingestion, decoding, encoding, and transcoding pipelines for VoD and Live systems. - Integrate adaptive bitrate streaming (HLS, DASH) into delivery pipelines. - Collaborate with frameworks like FFmpeg, GStreamer, NVIDIA Video Codec SDK, and VAAPI to implement efficient video processing components. - Ensure pipeline compatibility with multiple codecs and containers (H.264/H.265, AV1, VP9, MP4, MKV, TS). In terms of Video Modification Modules, your tasks will include: - Implementing frame-accurate transformations such as redaction (face/voice blurring), reframing, auto-zoom, and overlays. - Building timeline-aware components that align scene metadata with video streams for precise modifications. - Optimizing GPU-accelerated filters for real-time and batch processing. Regarding Performance & Scalability, you will: - Profile and tune pipelines for low-latency live streaming and high-throughput VoD workflows. - Contribute to scaling strategies for large video libraries and live event workloads. - Optimize for cloud cost efficiency while maintaining reliability. Your Collaboration & Execution responsibilities entail: - Translating designs into production components in collaboration with senior engineers. - Integrating model outputs from AI teams into video pipelines (e.g., scene tagging, redaction cues). - Participating in code reviews, testing, and deployment automation. Qualifications: Must-Have: - 2-5 years of experience in video pipeline or multimedia systems engineering. - Strong coding skills in C++ and/or Python. - Hands-on experience with FFmpeg, GStreamer, libx264/x265, NVENC/DEC. - Understanding of video codecs and streaming protocols (H.264/H.265, VP9, AV1, HLS, DASH, RTMP). - Familiarity with GPU acceleration (CUDA, NVENC/DEC, VAAPI, or equivalent). Nice-to-Have: - Exposure to cloud-native deployments (AWS/GCP/Azure, Docker, Kubernetes). - Experience in real-time video editing or transformation pipelines. - Familiarity with timeline-based metadata, content retrieval, or AI-driven video modifications. - Knowledge of adaptive streaming and edge delivery optimizations.,

Posted 4 days ago

Apply

2.0 - 5.0 years

7 - 12 Lacs

pune

Work from Office

Work with the research team to research, develop, evaluate, and optimize various computer vision and deep learning models for different problems. Take ownership to drive computer vision solutions and meet customer requirements.

Posted 4 days ago

Apply

4.0 - 9.0 years

30 - 45 Lacs

hyderabad

Work from Office

Role Overview: The Staff Engineer will be responsible for architecting and implementing advanced quantization algorithms for edge AI applications. You will lead technical initiatives, mentor junior team members, and drive continuous improvement in model compression and optimization techniques for LLMs and other deep learning models. Key Responsibilities: Architectural Leadership: o Design and develop robust quantization strategies and algorithms for AI inference on edge devices. o Lead system-level design discussions and collaborate closely with hardware and research teams. Mentorship & Code Review: o Mentor mid-level and junior engineers, providing technical guidance and best practices. o Conduct thorough code reviews and ensure high standards of quality and performance. Innovation & Optimization: o Stay abreast of the latest research in model quantization and compression, and drive the adoption of innovative techniques. o Develop and maintain performance benchmarks, and continuously optimize algorithms for low latency and high energy efficiency. Cross-Functional Collaboration: o Work with the Quantizer Group Manager and Tech Lead to align technical roadmaps with product objectives. o Participate in regular strategy sessions to set technical direction and priorities. Qualifications: Bachelors or Masters degree in Computer Science, Electrical Engineering, or a related field (Ph.D. is a plus). 5-8+ years of industry experience in deep learning, model optimization, or related areas. Demonstrated experience with quantization techniques, LLM optimization, and software development using Python/C++. Strong problem-solving skills and a passion for innovation in edge AI technologies. What We Offer: An opportunity to work on pioneering edge AI technologies that redefine the future of real-time inference. A collaborative environment where innovation is at the core of our culture. Competitive compensation, comprehensive benefits, and significant opportunities for professional growth.

Posted 4 days ago

Apply

5.0 years

0 Lacs

kolkata, west bengal, india

On-site

Key Responsibilities Responsible for building and maintaining robust machine learning pipelines ensuring efficient model deployment monitoring and lifecycle management within a cloud-based environment Extensive expertise in MLOps specifically with Google Cloud Platform GCP and Vertex AI and a deep understanding of model performance drift detection and GPU accelerators Build and maintain scalable MLOps pipelines in GCP Vertex AI for endtoend machine learning workflows Manage the full MLOps lifecycle from data preprocessing model training and deployment to model monitoring and drift detection Implement realtime model monitoring and drift detection to ensure optimal model performance over time Optimize model training and inference processes using GPU accelerators and CUDA Collaborate with cross functional teams to automate and streamline machine learning model deployment and monitoring Utilize Python 310 with libraries such as pandas NumPy and TensorFlow to handle data processing and model development Set up infrastructure for continuous training testing and deployment of machine learning models Ensure scalability security and high availability in all machine learning operations by implementing best practices in MLOps Requirements 5 years of experience in MLOps and building ML pipelines 3 years of experience in GCP Vertex AI Deep understanding of the MLOps lifecycle and automation of ML workflows Proficient in Python 310 and related libraries such as pandas NumPy and TensorFlow Strong experience in GPU accelerators and CUDA for model training and optimization Proven experience in model monitoring drift detection and maintaining model accuracy over time Strong problemsolving skills with the ability to work in a fast paced environment Knowledge of data versioning and model version control techniques Familiarity with TensorFlow Extended TFX or other ML workflow orchestration frameworks Skills Mandatory Skills : MLOps, Google Cloud Platform GCP and Vertex AI. Good to Have Skills : MLOPS - Cloud (AWS/ GCP/ Azure)

Posted 5 days ago

Apply

2.0 - 6.0 years

0 Lacs

delhi

On-site

You are a Software Engineer specializing in Video Pipelines, and you will be a valuable addition to our Video AI Platform team. Your primary responsibility will be to develop robust and high-performance video pipelines for Video-on-Demand (VoD) and Live streaming systems. Your expertise will ensure that our platform delivers top-notch video experiences at scale with low latency. Your role involves building and maintaining video pipelines for ingestion, decoding, encoding, and transcoding, integrating adaptive bitrate streaming into delivery pipelines. You will work with frameworks like FFmpeg, GStreamer, NVIDIA Video Codec SDK, and VAAPI to implement efficient video processing components compatible with various codecs and containers. In addition, you will be tasked with implementing video modification modules for frame-accurate transformations such as redaction, reframing, auto-zoom, and overlays. Your responsibilities will include building timeline-aware components and optimizing GPU-accelerated filters for real-time and batch processing. You will play a crucial role in profiling and tuning pipelines for low-latency live streaming and high-throughput VoD workflows, contributing to scaling strategies for large video libraries and live event workloads. Your optimization efforts will focus on cloud cost efficiency while ensuring system reliability. Collaboration is key in this role, as you will work closely with senior engineers to bring designs into production components, collaborate with AI teams for integrating model outputs into video pipelines, and actively participate in code reviews, testing, and deployment automation. For this role, you must have a minimum of 2-5 years of experience in video pipeline or multimedia systems engineering. Strong coding skills in C++ and/or Python are essential, along with hands-on experience in FFmpeg, GStreamer, libx264/x265, NVENC/DEC, and a solid understanding of video codecs and streaming protocols. Familiarity with GPU acceleration is a must. Additional qualifications that would be beneficial include exposure to cloud-native deployments, real-time video editing, familiarity with timeline-based metadata, and knowledge of adaptive streaming and edge delivery optimizations.,

Posted 5 days ago

Apply

1.0 years

0 Lacs

noida, uttar pradesh, india

On-site

Job Description Designation : Software Engineer - Description : Individual contributor Understand Software/System components and associated specifications. Ability to modify the architecture to enhance accuracy using different techniques like hyper parameter tuning, changing validation and test strategy, choosing right loss and activation functions. Good understanding of Visual Foundation models like YOLO, MAE, GrondingDIno etc. Apply statistical techniques for data analysis. Optimization of neural architecture for production deployment. Good at developing Heuristics post inference to reduce False Required : Skills : Good knowledge of Maths and Statistics particularly in the areas of calculus, linear algebra, and Bayesian statistics. Working knowledge of CNN based architecture on imaging or video applications. Good understanding of programming languages like Python, C/C++. Good understanding of Tensorflow/Keras APIs. Good understanding of OpenCV library. Good understanding of various optimization techniques for production deployment. Demonstratable track record of delivering on production environment. Image processing/audio signal processing exposure is a plus. Hands-on knowledge of using GPUs (CUDA) for tensorflow/pyTorch Skills : Strong analytical and problem-solving abilities, with quick adaptation to new technologies, methodologies, and systems. Self-starter, having positive attitude, ability to proactively identify issues and/or opportunities for improvement. Mature, ability to understand business perspective of requirements and interpersonal relationships when interacting with non team members. Team : Graduate/Post-Graduate in Computer Science/Maths/Stats/Physics from reputed institution Consistently good academic record having minimum 75% throughout (10th std : At least 1 year of work experience in the field of ML/AI : Sector - 68, Noida Job Types : Full-time, : Health insurance Leave encashment Provident Fund (ref:hirist.tech)

Posted 5 days ago

Apply

10.0 - 31.0 years

10 - 17 Lacs

sector 126, noida

On-site

We at HCLTech are seeking a highly skilled and experienced Technical Architect to join our team and play a pivotal role in transforming business processes and enhancing operational efficiency. The ideal candidate will possess strong expertise in Artificial Intelligence, a proven track record of success in developing and implementing Machine learning , Python, NLP and a passion for leveraging technology to drive business transformation. Job Description Role: Technical Architect (AI/ML Architect) Experience: 10+ years Location: Noida Working Model: Hybrid Notice Period Required: Any Responsibilities & Qualifications: Must have skills: Research, develop, optimize and productize for artificial intelligence and machine learning applications · In-depth knowledge of machine learning, text extraction and image processing techniques & Natural Language Processing · Experience in developing machine learning, text extraction and image processing algorithms, and optimizing the existing algorithms. · Coding experience in Python. · Experience in products like Tensorflow, CUDA toolkit, CuDNN, Theano, Caffe, Torch, OpenCV or similar tools. · Experience working with Cloud Services – AWS, Azure etc. · Takes ownership of self-professional development · The desired candidate should be a quick thinker and should have a track record of high performance product development. Additionally, the candidate should be innovative, dedicated and result oriented. · Research experience is preferred. · Strong background in Math specially in statistics , modelling and matrices. · Only candidates with B.Tech & M.Tech background are considered. Benefits: · Competitive salary and comprehensive benefits package · Opportunity to work on cutting-edge projects. · Collaborative and supportive work environment · Professional development and training opportunities Kindly share across your resume at Prabha.kumari@hcltech.com along with the below requested details: · Total experience: · Relevant experience in NLP , Gen AI , ML Models , Python : · Notice Period: · Education Qualification

Posted 5 days ago

Apply

Exploring CUDA Jobs in India

India has emerged as a hub for tech talent, with a growing demand for professionals skilled in CUDA programming. CUDA, which stands for Compute Unified Device Architecture, is a parallel computing platform and programming model developed by NVIDIA. As more companies in India look to leverage GPU acceleration for their computing needs, the demand for CUDA developers is on the rise.

Top Hiring Locations in India

  1. Bangalore
  2. Pune
  3. Hyderabad
  4. Chennai
  5. Mumbai

Average Salary Range

The average salary range for CUDA professionals in India varies based on experience: - Entry-level: INR 4-6 lakhs per annum - Mid-level: INR 8-12 lakhs per annum - Experienced: INR 15-20 lakhs per annum

Career Path

In the field of CUDA programming, a typical career path may include: - Junior CUDA Developer - CUDA Developer - Senior CUDA Developer - CUDA Tech Lead

Related Skills

Apart from proficiency in CUDA programming, professionals in this field are often expected to have knowledge or experience in: - C/C++ programming - Parallel computing - GPU architecture - Machine learning algorithms

Interview Questions

  • What is CUDA and how does it differ from traditional programming models? (basic)
  • Explain the difference between threads and blocks in CUDA. (basic)
  • What is shared memory in CUDA and why is it important? (medium)
  • How do you optimize memory access in CUDA programming? (medium)
  • Can you explain the concept of warp divergence in CUDA? (medium)
  • What is kernel launch overhead in CUDA and how can it be minimized? (advanced)
  • How do you handle error checking in CUDA programming? (basic)
  • Explain the concept of coalesced memory access in CUDA. (medium)
  • What are the different types of memory available in CUDA? (basic)
  • How do you debug CUDA code? (medium)
  • Explain the purpose of the cudaMemcpy function in CUDA. (basic)
  • How do you handle synchronization in CUDA programming? (medium)
  • What is the significance of grid and block dimensions in CUDA? (basic)
  • Explain the concept of warp size in CUDA. (basic)
  • How do you optimize performance in CUDA kernels? (medium)
  • What is the difference between global, shared, and constant memory in CUDA? (medium)
  • Can you explain the concept of texture memory in CUDA? (medium)
  • How do you handle race conditions in CUDA programming? (medium)
  • What are the advantages of using CUDA for parallel computing? (basic)
  • Explain the concept of warp shuffle in CUDA. (advanced)
  • How do you handle dynamic memory allocation in CUDA? (basic)
  • What is the purpose of the nvcc compiler in CUDA programming? (basic)
  • How do you profile and optimize CUDA applications? (medium)
  • Can you explain the concept of occupancy in CUDA? (advanced)

Closing Remark

As the demand for CUDA professionals continues to grow in India, now is the perfect time to upskill and pursue career opportunities in this field. By mastering CUDA programming and related skills, you can position yourself as a valuable asset in the tech industry. Prepare diligently, showcase your expertise confidently, and embark on a rewarding career journey in CUDA development.

cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies