Home
Jobs

229 Cuda Jobs - Page 6

Filter
Filter Interviews
Min: 0 years
Max: 25 years
Min: ₹0
Max: ₹10000000
Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Linkedin logo

Description - External Role – AIML Data Scientist Location : Kochi Mode of Interview - In Person Date : 14th June 2025 (Saturday) Job Description Be a hands on problem solver with consultative approach, who can apply Machine Learning & Deep Learning algorithms to solve business challenges Use the knowledge of wide variety of AI/ML techniques and algorithms to find what combinations of these techniques can best solve the problem Improve Model accuracy to deliver greater business impact Estimate business impact due to deployment of model Work with the domain/customer teams to understand business context , data dictionaries and apply relevant Deep Learning solution for the given business challenge Working with tools and scripts for sufficiently pre-processing the data & feature engineering for model development – Python / R / SQL / Cloud data pipelines 4. Design , develop & deploy Deep learning models using Tensorflow / Pytorch Experience in using Deep learning models with text, speech, image and video data Design & Develop NLP models for Text Classification, Custom Entity Recognition, Relationship extraction, Text Summarization, Topic Modeling, Reasoning over Knowledge Graphs, Semantic Search using NLP tools like Spacy and opensource Tensorflow, Pytorch, etc Design and develop Image recognition & video analysis models using Deep learning algorithms and open source tools like OpenCV Knowledge of State of the art Deep learning algorithms Optimize and tune Deep Learnings model for best possible accuracy Use visualization tools/modules to be able to explore and analyze outcomes & for Model validation eg: using Power BI / Tableau Work with application teams, in deploying models on cloud as a service or on-prem Deployment of models in Test / Control framework for tracking Build CI/CD pipelines for ML model deployment Integrating AI&ML models with other applications using REST APIs and other connector technologies Constantly upskill and update with the latest techniques and best practices. Write white papers and create demonstrable assets to summarize the AIML work and its impact. Technology/Subject Matter Expertise Sufficient expertise in machine learning, mathematical and statistical sciences Use of versioning & Collaborative tools like Git / Github Good understanding of landscape of AI solutions – cloud, GPU based compute, data security and privacy, API gateways, microservices based architecture, big data ingestion, storage and processing, CUDA Programming Develop prototype level ideas into a solution that can scale to industrial grade strength Ability to quantify & estimate the impact of ML models Softskills Profile Curiosity to think in fresh and unique ways with the intent of breaking new ground. Must have the ability to share, explain and “sell” their thoughts, processes, ideas and opinions, even outside their own span of control Ability to think ahead, and anticipate the needs for solving the problem will be important Ability to communicate key messages effectively, and articulate strong opinions in large forums Desirable Experience: Keen contributor to open source communities, and communities like Kaggle Ability to process Huge amount of Data using Pyspark/Hadoop Development & Application of Reinforcement Learning Knowledge of Optimization/Genetic Algorithms Operationalizing Deep learning model for a customer and understanding nuances of scaling such models in real scenarios Optimize and tune deep learning model for best possible accuracy Understanding of stream data processing, RPA, edge computing, AR/VR etc Appreciation of digital ethics, data privacy will be important Experience of working with AI & Cognitive services platforms like Azure ML, IBM Watson, AWS Sagemaker, Google Cloud will all be a big plus Experience in platforms like Data robot, Cognitive scale, H2O.AI etc will all be a big plus Show more Show less

Posted 3 weeks ago

Apply

0 years

0 Lacs

Rajkot, Gujarat, India

On-site

Linkedin logo

Stride Dynamics We are an early-stage Robotics startup developing autonomous aerial robots. We are IIT Kanpur Alumni with extensive experience building autonomous systems for government, defence and enterprises in India and abroad. With Stride Dynamics, we envision leading the next generation of autonomous aerial robots in India and making global standard products for defence, government and enterprises. The Role We are looking for someone with a passion for working on hardware and autonomous systems. As a robotics engineer, you will work on our core technology for autonomous flight and contribute from conceptualisation to deployment. You will have the opportunity to work on concepts like localisation, controls, perception, navigation, and planning. We are working on developing aerial vehicles with very robust localisation, enabling them to navigate in any conditions (indoors, dark, dusty, high altitude GNSS jamming scenarios, etc.). The Work Design, develop and debug the autonomy software stack for our systems. Work on computer vision, learning based perception, and localisation for aerial systems. A lot of testing in real-world environments. Document and maintain efficient, modular, and reliable C++ code. Develop and improve algorithms for various autonomy modules. Research, understand and implement state-of-the-art methods.. We’re looking for someone with Experience with hardware and implementing algorithms. Experience in C++, Python and ROS. Experience with computer vision, localisation (filtering, PGO, visual odometry). Has Experience with Linux Development Environment and tools like CMake, Git, etc. Bonus if you: Have hands-on experience with robots in the form of projects or competitions. Experience/knowledge of Deep Learning based approaches in Robotics. Experience with GPU/VPU-accelerated programming (eg, CUDA, OpenCL). Published research in the Robotics domain. If you match the above, why us Work in a culture that celebrates innovation, creativity, and the freedom to challenge the status quo. Work with a team of people who are passionate about hardware and robotics. Join us and help us design the future of drones! Apart from the above job description, if you think you can contribute in any other domains (eg, embedded software, hardware, machine learning), feel free to reach out to us. Show more Show less

Posted 3 weeks ago

Apply

6.0 - 9.0 years

27 - 42 Lacs

Chennai

Work from Office

Naukri logo

Description - External Role – AIML Data Scientist Location : Kochi Mode of Interview - In Person Date : 14th June 2025 (Saturday) Job Description: 1. Be a hands on problem solver with consultative approach, who can apply Machine Learning & Deep Learning algorithms to solve business challenges a. Use the knowledge of wide variety of AI/ML techniques and algorithms to find what combinations of these techniques can best solve the problem b. Improve Model accuracy to deliver greater business impact c. Estimate business impact due to deployment of model 2. Work with the domain/customer teams to understand business context , data dictionaries and apply relevant Deep Learning solution for the given business challenge 3. Working with tools and scripts for sufficiently pre-processing the data & feature engineering for model development – Python / R / SQL / Cloud data pipelines 4. Design , develop & deploy Deep learning models using Tensorflow / Pytorch 5. Experience in using Deep learning models with text, speech, image and video data a. Design & Develop NLP models for Text Classification, Custom Entity Recognition, Relationship extraction, Text Summarization, Topic Modeling, Reasoning over Knowledge Graphs, Semantic Search using NLP tools like Spacy and opensource Tensorflow, Pytorch, etc b. Design and develop Image recognition & video analysis models using Deep learning algorithms and open source tools like OpenCV c. Knowledge of State of the art Deep learning algorithms 6. Optimize and tune Deep Learnings model for best possible accuracy 7. Use visualization tools/modules to be able to explore and analyze outcomes & for Model validation eg: using Power BI / Tableau 8. Work with application teams, in deploying models on cloud as a service or on-prem a. Deployment of models in Test / Control framework for tracking b. Build CI/CD pipelines for ML model deployment 9. Integrating AI&ML models with other applications using REST APIs and other connector technologies 10. Constantly upskill and update with the latest techniques and best practices. Write white papers and create demonstrable assets to summarize the AIML work and its impact. Technology/Subject Matter Expertise Sufficient expertise in machine learning, mathematical and statistical sciences Use of versioning & Collaborative tools like Git / Github Good understanding of landscape of AI solutions – cloud, GPU based compute, data security and privacy, API gateways, microservices based architecture, big data ingestion, storage and processing, CUDA Programming Develop prototype level ideas into a solution that can scale to industrial grade strength Ability to quantify & estimate the impact of ML models Softskills Profile Curiosity to think in fresh and unique ways with the intent of breaking new ground. Must have the ability to share, explain and “sell” their thoughts, processes, ideas and opinions, even outside their own span of control Ability to think ahead, and anticipate the needs for solving the problem will be important Ability to communicate key messages effectively, and articulate strong opinions in large forums Desirable Experience: Keen contributor to open source communities, and communities like Kaggle Ability to process Huge amount of Data using Pyspark/Hadoop Development & Application of Reinforcement Learning Knowledge of Optimization/Genetic Algorithms Operationalizing Deep learning model for a customer and understanding nuances of scaling such models in real scenarios Optimize and tune deep learning model for best possible accuracy Understanding of stream data processing, RPA, edge computing, AR/VR etc Appreciation of digital ethics, data privacy will be important Experience of working with AI & Cognitive services platforms like Azure ML, IBM Watson, AWS Sagemaker, Google Cloud will all be a big plus Experience in platforms like Data robot, Cognitive scale, H2O.AI etc will all be a big plus

Posted 3 weeks ago

Apply

12.0 - 18.0 years

35 - 40 Lacs

Bengaluru, Mumbai (All Areas)

Work from Office

Naukri logo

designing robust software architectures integrating advanced algorithms optimising solutions for real-world UAV applications like Vision based UAV navigation developing multi-modal architectures for applications -Advanced Driver Assistance Systems Required Candidate profile Expert application and integration of cutting-edge DL/CV. Designing scalable and modular systems. Strategic hardware utilization for performance. Optimizing DL inference

Posted 3 weeks ago

Apply

8.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Linkedin logo

Job Description/Preferred Qualifications Key Responsibilities : • Lead and mentor a team of algorithm engineers, providing guidance and support to ensure their professional growth and success. • Develop and maintain the infrastructure required for the deployment and execution of algorithms at scale. • Collaborate with data scientists, software engineers, and product managers to design and implement robust and scalable algorithmic solutions. • Optimize algorithm performance and resource utilization to meet business objectives. • Stay up to date with the latest advancements in algorithm engineering and infrastructure technologies and apply them to improve our systems. • Drive continuous improvement in development processes, tools, and methodologies. Qualifications: • Bachelor's or master's degree in computer science, Engineering, or a related field. • Proven experience in developing computer vision and image processing algorithm and ML/DL algorithm. • Familiar with high performance computing, parallel programming and distributed systems. • Strong leadership and team management skills, with a track record of successfully leading engineering teams. • Proficiency in programming languages such as Python, C++ and CUDA. • Excellent problem-solving and analytical skills. • Strong communication and collaboration abilities. Preferred Qualifications : • Experience with machine learning frameworks and libraries (e.g., TensorFlow, PyTorch, Scikit-learn). • Experience with GPU architecture and algo development toolkit like Docker, Apptainer. Minimum Qualifications • Bachelor's degree plus 8 + years of experience • Master's degree plus 8 + years of experience • Familiar with high performance computing, parallel programming and distributed systems. Show more Show less

Posted 3 weeks ago

Apply

5.0 years

0 Lacs

Chandigarh, India

On-site

Linkedin logo

Company Profile Oceaneering is a global provider of engineered services and products, primarily to the offshore energy industry. We develop products and services for use throughout the lifecycle of an offshore oilfield, from drilling to decommissioning. We operate the world's premier fleet of work class ROVs. Additionally, we are a leader in offshore oilfield maintenance services, umbilicals, subsea hardware, and tooling. We also use applied technology expertise to serve the defense, entertainment, material handling, aerospace, science, and renewable energy industries. Position Summary Purpose As a Senior Application Software Engineer , you will be responsible for designing, developing, and maintaining the server-side logic of our software systems. The ideal candidate will have expertise in C++ and Python, and experience with Rust is a plus. Duties And Responsibilities Required Experience Design, develop, and maintain server-side software systems and APIs Write efficient, scalable, and maintainable code using C++, Python, and Rust (Optional) Extensive design and development skills in C++ 11. Having knowledge of C++ 14/C++17 will be added advantage Thorough knowledge of the standard library, STL containers, and algorithms Solid understanding of complexity theory (big-O) of algorithms in general, and how the C++ containers fit in Understanding of performance tuning (w.r.t time/space) and how to do performance analysis and optimization Experience in Multi-threaded software development Excellent knowledge of Synchronization objects (Mutex, Semaphore, condition variables, etc) including their appropriate use cases and distinctions Ability to pick up new software languages, interfaces, and software tools Collaborate with cross-functional teams to understand business requirements and develop technical solutions Optimize performance and scalability of backend systems Troubleshoot and debug production issues Keep up to date with industry trends and emerging technologies Ability to extrapolate, develop and refine detailed tasks given high-level goals Experience with developing code in Linux Knowledge of SQL and NoSQL databases Object-oriented software design experience Experience writing unit/integration tests and test procedures Experience with issue tracking software, Jira experience a plus Experience with revision control software, GitHub experience a plus Experience with Rust a plus Experience with QT a plus OpenCV, CUDA, PCL, and experience with Image processing / Computer Vision is a plus Experience with one or more of docker, podman, and Kubernetes is a plus Experience with middleware such as MQTT, DDS, ROS, ROS2 is a plus Knowledge of containerization and container orchestration using tools like Docker and Kubernetes Supervisory Responsibilities This position has no supervisory responsibilities. Reporting Relationships Reports to Sr. Software Manager Engineering. Qualifications Qualifications Bachelor's or master’s degree in Computer Science, Software Engineering, or a related field Software development experience in C++ with 5+ years of experience proven experience designing and developing scalable and high-performance backend systems Experience with cloud computing platforms such as AWS, Google Cloud, or Azure a plus Experience with writing requirements, design documentation, and test cases Ability to read, understand, debug, and modify existing product code Language Skills The employee must have: Excellent English communication skills, both written and verbal. The ability to read and interpret technical documentation, process standards, and safety standards written in English. The ability to write reports, business correspondence, and procedure manuals written in English. The ability to effectively present information in English and respond to questions in English. Closing Statement Working Conditions This position is considered OFFICE WORK which is characterized as follows. Almost exclusively indoors during the day and occasionally at night Occasional exposure to airborne dust in the workplace Work surface is stable (flat) Occasional exposure to project worksites to include fabrication, assembly, test, and customer attraction sites Physical Activity/Requirements The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform essential functions. This position is considered LIGHT work. OCCASIONAL FREQUENT CONSTANT Lift up to 20 pounds Climbing, stooping, kneeling, squatting, and reaching Lift up to 10 pounds Standing Repetitive movements of arms and hands Sit with back supported Show more Show less

Posted 3 weeks ago

Apply

0 years

0 Lacs

India

On-site

Linkedin logo

About the Role: We are seeking an experienced MLOps Engineer with a strong background in NVIDIA GPU-based containerization and scalable ML infrastructure ( Contractual - Assignment Basis) . You will work closely with data scientists, ML engineers, and DevOps teams to build, deploy, and maintain robust, high-performance machine learning pipelines using NVIDIA NGC containers, Docker, Kubernetes , and modern MLOps practices. Key Responsibilities: Design, develop, and maintain end-to-end MLOps pipelines for training, validation, deployment, and monitoring of ML models. Implement GPU-accelerated workflows using NVIDIA NGC containers, CUDA, and RAPIDS . Containerize ML workloads using Docker and deploy on Kubernetes (preferably with GPU support like NVIDIA device plugin for K8s) . Integrate model versioning, reproducibility, CI/CD, and automated model retraining using tools like MLflow, DVC, Kubeflow, or similar . Optimize model deployment for inference on NVIDIA hardware using TensorRT, Triton Inference Server , or ONNX Runtime-GPU . Manage cloud/on-prem GPU infrastructure and monitor resource utilization and model performance in production. Collaborate with data scientists to transition models from research to production-ready pipelines. Required Skills: Proficiency in Python and ML libraries (e.g., TensorFlow, PyTorch, Scikit-learn). Strong experience with Docker , Kubernetes , and NVIDIA GPU containerization (NGC, nvidia-docker) . Familiarity with NVIDIA Triton Inference Server , TensorRT , and CUDA . Experience with CI/CD for ML (GitHub Actions, GitLab CI, Jenkins, etc.). Deep understanding of ML lifecycle management , monitoring, and retraining. Experience working with cloud platforms (AWS/GCP/Azure) or on-prem GPU clusters. Preferred Qualifications: Experience with Kubeflow , Seldon Core , or similar orchestration tools. Exposure to Airflow , MLflow , Weights & Biases , or DVC . Knowledge of NVIDIA RAPIDS and distributed GPU workloads. MLOps certifications or NVIDIA Deep Learning Institute training (preferred but not mandatory). Show more Show less

Posted 3 weeks ago

Apply

7.0 years

0 Lacs

Delhi, India

On-site

Linkedin logo

TCS HIRING!! ROLE: Gen AI - AI/ML Engineer YEAR OF EXP: 7 + YEARS LOCATION: DELHI Responsibilities: - Fine-tune large language models (LLMs) like Llama for domain-specific applications. - Build LangChain workflows to orchestrate intelligent task handling and document retrieval. - Optimize model performance with hyperparameter tuning and efficient training on Nvidia GPUs. - Design and implement machine learning models for tasks like protein sequencing and structure prediction. Required Skills - Deep understanding of LLMs, fine-tuning, and retrieval-augmented generation workflows. - Proficiency in LangChain, Llama, and tools for hyperparameter optimization (e.g., Optuna, Ray Tune). - Knowledge of bioinformatics, particularly in protein sequencing and modeling. - Strong coding skills in Python and experience with deep learning frameworks (TensorFlow, PyTorch). - Familiarity with GPU-based training environments (CUDA). Show more Show less

Posted 3 weeks ago

Apply

7.0 years

0 Lacs

Thane, Maharashtra, India

On-site

Linkedin logo

Job Description: You will provide leadership in designing and implementing ground-breaking GPU computers that run demanding deep learning, high-performance computing, and computationally intensive workloads. We seek an expert to identify architectural changes and/or completely new approaches for accelerating our deep learning models. As an expert, you will help us with the strategic challenges we encounter, including compute, networking, and storage design for large scale, high-performance workloads, effective resource utilization in a heterogeneous computing environment, evolving our private/public cloud strategy, capacity modelling, and growth planning across our products and services. As an architect you are responsible for converting business needs associated with AI-ML algorithms in to a set of product goals covering workload scenarios, end user expectations, compute infrastructure and time of execution; this should lead to a plan for making the algorithms production ready Benchmark and optimise the Computer Vision Algorithms and the Hardware Accelerators for performance and quality KPIs. Optimize algorithms for optimal performance on the GPU tensor cores. Collaborate with various teams to drive an end to end workflow from data curation and training to performance optimization and deployment. Assign tasks to the team and monitor as well Skills Required MS or PhD in Computer Science, Electrical Engineering, or related field. A strong background in deployment of complex deep learning architectures . 7+ years of relevant experience in at least a few of the following relevant areas is required in your work history: Machine learning (with focus on Deep Neural Networks), including understanding of DL fundamentals; Experience adapting and training DNNs for various tasks; Experience developing code for one or more of the DNN training frameworks (such as Caffe, TensorFlow or Torch): Numerical analysis, Performance analysis, Model compression and Optimization & Computer architecture. Strong Data structures and Algorithms know-how with Excellent C/C++ programming skills. Hands-on expertise with PyTorch, TensorRT, CuDNN Hand-on expertise with GPU computing (CUDA, OpenCL, OpenACC) and HPC (MPI, OpenMP) In-depth understanding of container technologies like Docker, Singularity, Shifter, Charliecloud. Proficient in Python programming and bash scripting. Proficient in Windows, Ubuntu and Centos operating systems. Excellent communication and collaboration skills. Self-motivated and able to find creative practical solutions to problems. Good to have Hands-on experience with HPC cluster job schedulers such as Kubernetes, SLURM, LSF. Familiarity with cloud computing architectures Hands-on experience with Software Defined Networking and HPC cluster networking. Working knowledge of cluster configuration management tools such as Ansible, Puppet, Salt. Understanding of fast, distributed storage systems and Linux file systems for HPC workloads. About Company: AIRA Matrix provides artificial intelligence based solutions for Life Sciences applications. Our solutions improve efficiency, diagnostic accuracy, and turnaround times in pathology, microbiology and ophthalmology workflows across pharmaceutical and healthcare laboratories. We leverage machine and deep learning techniques to develop diagnostic, prognostic, and predictive solutions. Our solutions provide cost benefits in the pharmaceutical domain, by speeding up pre-clinical drug development timelines, and by enhancing the efficiency of environmental monitoring required in manufacturing. In healthcare applications, our solutions improve treatment outcomes by aiding disease stratification and enabling management protocols tailored to individual patients. Our clients and partners include leading hospitals, pharmaceutical companies, CROs, and research labs around the world. Our deep learning platforms with existing network models and pre-built AI applications provide the foundation for fast customizations and help tackle any unique challenges in your image analysis and study management workflows. Our flexible service model enables the swift deployment of these custom solutions with minimal resource and time commitment from your side. Our Application Development Team plays an important role in developing competent customer facing applications to access our AI solutions and enterprise-level image management systems in life sciences. -- Regards, Surya Prajapati Talent Acquisition Specialist Email : surya.prajapati@airamatrix.com Website : https://www.airamatrix.com Dosti Pinnacle, 801, Rd No. 22, Wagle Industrial Estate , Thane (W) Maharashtra, India - 400604. Show more Show less

Posted 3 weeks ago

Apply

40.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

About CynLr Just like a baby’s brain, CynLr Visual Intelligence stack makes Robots to instinctively see & pick any object under any ambience, without any training . ( a demo video link ). Today, we don’t have a robot that can fit a screw into a nut without slipping a thread. Imagine what it would take for a robot to assemble a Smartphone or a car by putting together 1000s of parts with varied shapes and weights, all in random orientations. Thus factories become complex, needing heavy customization of their environment. CynLr-enabled visual robots intuitively handle any object, even from a clutter – a universal alternative to custom machines, simplifying factory lines into modular LEGO blocks of micro-factories. Simplifying factories with robots that can pick & place any object has been a 40-year-old pipe dream - touted as The Holy Grail of Robotics . As a SW developer, you will be responsible for building the entire Vision & Learning SW Stack, Task & Robot Control OS, extensively optimising the performance of these algorithms, translating them into an API library for Internal Engineers and Customers, along with creating a UI layer for the end users. Requirements in Practice : Setting up the Project Framework and breaking down the composition of the Software - scalability, modularity & maintainability. Set up a development framework for the distributed development of Software to ensure – Scalability, Modularity and Maintainability. Design and set the Coding standards and guidelines along with the review process to assess the Scalability, Modularity and Maintainability of the code across every stage of Software Development Lifecycle Identify the platforms, Setup Tool Chains and support APIs needed for every stage of the Development process – Visual Studio IDE setup, Continuous Integration, Debugging for Multi-Threads, Licensing, Cross Language Licensing, Dependency Bundling, etc. Set up the C++ project templates for Services Architecture, API structure and state machine. Design the development process for Parallel Threaded Coding and debugging. Train the team with Debugging skills and the nuances of performance improvements with C++. Design the Dependency architecture and version management system Design the release management system & API/Library Licensing. Design Test Frameworks for Integration, distribution, Performance profiling & Safety. Must have an understanding of : Compiler working and construction. CPU architectures – x86, x64 & ARM Hardware-associated driver development. OS and layers (Board Support Packages, BIOS, UEFI, BootLoader) Memory architectures and optimizations. MIMD, SIMD Good to have experience and practice with GPU-based application development. Knowledge of CUDA (Excellency is not necessary) State machine architecture Realtime computing UI-based deployable application development Team Structure: The engineering team will comprise of – Algo Team, GPU Team, Software Dev Team & HW Team. Members of other teams will be passive members of each team, apart from the team they lead. The Algo Team will provide the Neural Models & Vision algorithms, while the GPU Team will provide the GPU optimizations for the algos, Hardware team will provide the HW integration and the Software team with translate GPU optimized algos into SW blocks. Each team will split the implementation among other teams and guide them through the implementation. Every team member will be a passive member of all other teams. What will you do? Your contribution is predominantly (but not confined only to) architecting, defining test cases, reviewing & implementing all the SW & firmware parts and development effort - the image processing, Neural Models, APIs, vision sequences, manipulation sequences, drivers, etc. Your thorough knowledge of system architecture, C++ skill set, design patterns & your penchant for application-oriented SW design thought processes, will drive and implement the whole software design & development part. You will also provide architectures and approaches for the drivers to be developed to keep them modular and scalable, and consistently review them all. How will you Do? You have complete freedom here, but you will be subjected to reviews. Since this is a startup and the product is not yet well-defined, you would be the one with the responsibility of defining it. Expect things to be not orderly and requirements to not be solid. Part of your design effort largely involves requirements building, too and developing architectures that are agnostic to such requirement changes. The SW part of the product significantly evolves as per your thought process and will henceforth carry your signature in it. You will also be building a team as the product evolves to maintain and develop further. Though confined to a focused area, the work is pretty much expected to be entrepreneurial with the exact advantages and difficulties of a startup. Show more Show less

Posted 3 weeks ago

Apply

5.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Linkedin logo

Who We Are Applied Materials is the global leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. We design, build and service cutting-edge equipment that helps our customers manufacture display and semiconductor chips – the brains of devices we use every day. As the foundation of the global electronics industry, Applied enables the exciting technologies that literally connect our world – like AI and IoT. If you want to work beyond the cutting-edge, continuously pushing the boundaries of science and engineering to make possible the next generations of technology, join us to Make Possible® a Better Future. What We Offer Location: Bangalore,IND, Chennai,IND At Applied, we prioritize the well-being of you and your family and encourage you to bring your best self to work. Your happiness, health, and resiliency are at the core of our benefits and wellness programs. Our robust total rewards package makes it easier to take care of your whole self and your whole family. We’re committed to providing programs and support that encourage personal and professional growth and care for you at work, at home, or wherever you may go. Learn more about our benefits. You’ll also benefit from a supportive work culture that encourages you to learn, develop and grow your career as you take on challenges and drive innovative solutions for our customers. We empower our team to push the boundaries of what is possible—while learning every day in a supportive leading global company. Visit our Careers website to learn more about careers at Applied. Technical Lead - Software About Applied Applied Materials is the leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. Our expertise in modifying materials at atomic levels and on an industrial scale enables customers to transform possibilities into reality. At Applied Materials, our innovations make possible the technology shaping the future. Our Team Our team is developing a high-performance computing solution for low-latency and high throughput image processing and deep-learning workloads that will enable our Chip Manufacturing process control equipment to offer differentiated value to our customers. Your Opportunity As a technical lead, you will get the opportunity to grow in the field of high-performance computing, complex system design and low-level optimizations for better cost of ownership. Roles and Responsibility As a technical lead, you will be responsible for designing and implementing High performance computing software solutions for our organization. You will work closely with cross-functional teams, including software engineers, product managers, and business stakeholders, to understand requirements and translate them into architectural/software designs that meet business needs. You will be a subject Matter expert to unblock software engineers in the HPC domain. You will be expected to profile systems to understand bottlenecks, optimize workflows and code and processes to improve cost of ownership. Identify and mitigate technical risks and issues throughout the software development lifecycle. Lead the design and implementation of complex software components and systems. Ensure that software systems are scalable, reliable, and maintainable. Mentor and coach junior software engineers. Your primary focus will be on implementing features of high quality with maintainable and extendable code following software development best practices Our Ideal Candidate Someone who has the drive and passion to learn quickly, has the ability to multi-task and switch contexts based on business needs. Qualifications 5 to 10 years of experience in Design and coding in C/C++ preferably in Linux Environment. Very good knowledge of Data structures, Algorithms and Complexity analysis. In depth experience in Multi-threading, Thread Synchronization, Inter process communication, and Distributed computing fundamentals. Very Good knowledge of Operating systems internals (Linux Preferred), Networking and Storage systems. Experience in performance profiling at application and system level (e.g. vtune, Oprofiler, perf, Nividia Nsight etc.) Experience in low level code optimization techniques using Vectorization and Intrinsics, cache-aware programming, lock free data structures etc. Excellent problem-solving and analytical skills. Strong communication and collaboration abilities. Ability to mentor and coach junior team members. Experience in Agile development methodologies. Additional Qualifications: Experience in GPU programming using CUDA, OpenMP, OpenACC, OpenCL etc. Good Knowledge of Work-flow orchestration Software like Apache Airflow, Apache Spark, Apache storm or Intel TBB flowgraph etc. Experience in developing Distributed High Performance Computing software using Parallel programming frameworks like MPI, UCX etc. Experience in HPC Job-Scheduling and Cluster Management Software (SLURM, Torque, LSF etc.) Good knowledge of Low-latency and high-throughput data transfer technologies (RDMA, RoCE, InfiniBand) Familiarity with microservices architecture and containerization technologies (docker/singularity) and low latency Message queues. Education : Bachelor's Degree or higher in Computer science or related Disciplines. Applied Materials is committed to diversity in its workforce including Equal Employment Opportunity for Minorities, Females, Protected Veterans and Individuals with Disabilities. Additional Information Time Type: Full time Employee Type: Assignee / Regular Travel: Yes, 10% of the Time Relocation Eligible: Yes Applied Materials is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, ancestry, religion, creed, sex, sexual orientation, gender identity, age, disability, veteran or military status, or any other basis prohibited by law. Show more Show less

Posted 3 weeks ago

Apply

5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation of groundbreaking GPU compute clusters that powers all AI research across NVIDIA. We seek an expert to build and operate these clusters at high reliability, efficiency, and performance and drive foundational improvements and automation to improve researchers productivity. As a Site Reliability Engineer, you are responsible for the big picture of how our systems relate to each other, we use a breadth of tools and approaches to tackle a broad spectrum of problems. Practices such as limiting time spent on reactive operational work, blameless postmortems and proactive identification of potential outages factor into iterative improvement that is key to both product quality and interesting dynamic day-to-day work. SRE's culture of diversity, intellectual curiosity, problem solving and openness is important to our success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to build an environment that provides the support and mentorship needed to learn and grow. What You'll Be Doing In this role you will be building and improving our ecosystem around GPU-accelerated computing including developing large scale automation solutions. You will also be maintaining and building deep learning AI-HPC GPU clusters at scale and supporting our researchers to run their flows on our clusters including performance analysis and optimizations of deep learning workflows. You will design, implement and support operational and reliability aspects of large scale distributed systems with focus on performance at scale, real time monitoring, logging, and alerting. Design and implement state-of-the-art GPU compute clusters. Optimize cluster operations for maximum reliability, efficiency, and performance. Drive foundational improvements and automation to enhance researcher productivity. Troubleshoot, diagnose, and root cause of system failures and isolate the components/failure scenarios while working with internal & external partners. Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity. Practice sustainable incident response and blameless postmortems and Be part of an on-call rotation to support production systems Write and review code, develop documentation and capacity plans, debug the hardest problems, live, on some of the largest and most complex systems in the world. Implement remediations across software and hardware stack according to plan, while keeping a thorough procedural record and data log and Manage upgrades and automated rollbacks across all clusters. What We Need To See Bachelor’s degree in computer science, Electrical Engineering or related field or equivalent experience with a minimum 5+ years of experience designing and operating large scale compute infrastructure. Proven experience in site reliability engineering for high-performance computing environments with operational experience of at least 2K GPUs cluster. Deep understanding of GPU computing and AI infrastructure. Passion for solving complex technical challenges and optimizing system performance. Experience with AI/HPC advanced job schedulers, and ideally familiarity with schedulers such as Slurm. Working knowledge of cluster configuration management tools such as BCM or Ansible and infrastructure level applications, such as Kubernetes, Terraform, MySQL, etc. In depth understating of container technologies like Docker, Enroot, etc. Experience programming in Python and Bash scripting. Ways To Stand Out From The Crowd Interest in crafting, analyzing, and fixing large-scale distributed systems. Familiarity with NVIDIA GPUs, Cuda Programming, NCCL, MLPerf benchmarking, InfiniBand with IBoIP and RDMA. Experience with Cloud Deployment, BCM, Terraform. Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads. Multi-cloud experience. JR1993756 Show more Show less

Posted 3 weeks ago

Apply

5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

NVIDIA has continuously reinvented itself. Our invention of the GPU sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. Today, research in artificial intelligence is booming worldwide, which calls for highly scalable and massively parallel computation horsepower that NVIDIA GPUs excel. NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to solve, that only we can address, and that matter to the world. This is our life’s work , to amplify human creativity and intelligence. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join our diverse team and see how you can make a lasting impact on the world! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation of groundbreaking GPU compute clusters that powers all AI research across NVIDIA. We seek an expert to build and operate these clusters at high reliability, efficiency, and performance and drive foundational improvements and automation to improve researchers productivity. As a Site Reliability Engineer, you are responsible for the big picture of how our systems relate to each other, we use a breadth of tools and approaches to tackle a broad spectrum of problems. Practices such as limiting time spent on reactive operational work, blameless postmortems and proactive identification of potential outages factor into iterative improvement that is key to both product quality and interesting dynamic day-to-day work. SRE's culture of diversity, intellectual curiosity, problem solving and openness is important to our success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to build an environment that provides the support and mentorship needed to learn and grow. What You'll Be Doing In this role you will be building and improving our ecosystem around GPU-accelerated computing including developing large scale automation solutions. You will also be maintaining and building deep learning AI-HPC GPU clusters at scale and supporting our researchers to run their flows on our clusters including performance analysis and optimizations of deep learning workflows. You will design, implement and support operational and reliability aspects of large scale distributed systems with focus on performance at scale, real time monitoring, logging, and alerting. Design and implement state-of-the-art GPU compute clusters. Optimize cluster operations for maximum reliability, efficiency, and performance. Drive foundational improvements and automation to enhance researcher productivity. Troubleshoot, diagnose, and root cause of system failures and isolate the components/failure scenarios while working with internal & external partners. Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity. Practice sustainable incident response and blameless postmortems and Be part of an on-call rotation to support production systems Write and review code, develop documentation and capacity plans, debug the hardest problems, live, on some of the largest and most complex systems in the world. Implement remediations across software and hardware stack according to plan, while keeping a thorough procedural record and data log and Manage upgrades and automated rollbacks across all clusters. What We Need To See Bachelor’s degree in computer science, Electrical Engineering or related field or equivalent experience with a minimum 5+ years of experience designing and operating large scale compute infrastructure. Proven experience in site reliability engineering for high-performance computing environments with operational experience of at least 2K GPUs cluster. Deep understanding of GPU computing and AI infrastructure. Passion for solving complex technical challenges and optimizing system performance. Experience with AI/HPC advanced job schedulers, and ideally familiarity with schedulers such as Slurm. Working knowledge of cluster configuration management tools such as BCM or Ansible and infrastructure level applications, such as Kubernetes, Terraform, MySQL, etc. In depth understating of container technologies like Docker, Enroot, etc. Experience programming in Python and Bash scripting. Ways To Stand Out From The Crowd Interest in crafting, analyzing, and fixing large-scale distributed systems. Familiarity with NVIDIA GPUs, Cuda Programming, NCCL, MLPerf benchmarking, InfiniBand with IBoIP and RDMA. Experience with Cloud Deployment, BCM, Terraform. Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads. Multi-cloud experience. JR1993564 Show more Show less

Posted 3 weeks ago

Apply

10.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

Our Company Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen. We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours! The Firefly GenAI engineering team is looking for a passionate ML Engineer to enable and accelerate the Adobe GenAI creative flows powered by pioneering machine learning models. Our goal is to seamlessly enable users to transform Creative Intent into Creative Success, through GenAI features, in our revolutionary content creation applications: Firefly, PPro, Express, Photoshop, Podcast and others. This is a unique opportunity to work on powerful new and innovative generative technologies. In this role, you will work closely with ML researchers, bringing groundbreaking AI experiences to our users by enabling large-scale productisation through cloud based services. Responsibilities Design and develop the GenAI backend services for Firefly, creating GPU optimized, efficient model pipelines that power the generative AI features on Firefly website, PPro, Photoshop, Illustrator, Express, Stock and other applications/surfaces Work on large-scale stateful and stateless distributed systems for AI workflows with cost efficient strategies Collaborate closely with data scientists and engineers to ensure the GenAI services are effectively integrated into Adobe products and systems Ensure scalable and reliable cloud services with observability, logging and tracing to enable quick detection, understanding and resolution of run-time issues Explore and research new and emerging ML and MLOps technologies to continuously improve Adobe`s GenAI engineering effectiveness and efficiency Requirements 10+ years of experience Excellent programming skills (Python) Proven programming skills with standard machine learning tools such as Python, PyTorch, and TensorFlow Experience in pioneering Generative AI technologies, such as diffusion and/or GAN models Track record of delivering cloud-scale, data-driven products, and services that are widely adopted with large customer bases Advanced understanding of AI/ML, including ML frameworks Passion for staying up to date with the latest trends and technologies in AI/ML - in the cloud and on device Bachelor’s / Masters degree in engineering Bonus Experience with GPU optimization ( CUDA, Triton, TRT, AOT) Experience converting models from various frameworks like PyTorch and TensorFlow to other target formats, to ensure compatibility and optimized performance across different platforms Adobe is proud to be an Equal Employment Opportunity employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other applicable characteristics protected by law. Learn more. Adobe aims to make Adobe.com accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email accommodations@adobe.com or call (408) 536-3015. Show more Show less

Posted 3 weeks ago

Apply

5.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Linkedin logo

As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation of groundbreaking GPU compute clusters that powers all AI research across NVIDIA. We seek an expert to build and operate these clusters at high reliability, efficiency, and performance and drive foundational improvements and automation to improve researchers productivity. As a Site Reliability Engineer, you are responsible for the big picture of how our systems relate to each other, we use a breadth of tools and approaches to tackle a broad spectrum of problems. Practices such as limiting time spent on reactive operational work, blameless postmortems and proactive identification of potential outages factor into iterative improvement that is key to both product quality and interesting dynamic day-to-day work. SRE's culture of diversity, intellectual curiosity, problem solving and openness is important to our success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to build an environment that provides the support and mentorship needed to learn and grow. What You'll Be Doing In this role you will be building and improving our ecosystem around GPU-accelerated computing including developing large scale automation solutions. You will also be maintaining and building deep learning AI-HPC GPU clusters at scale and supporting our researchers to run their flows on our clusters including performance analysis and optimizations of deep learning workflows. You will design, implement and support operational and reliability aspects of large scale distributed systems with focus on performance at scale, real time monitoring, logging, and alerting. Design and implement state-of-the-art GPU compute clusters. Optimize cluster operations for maximum reliability, efficiency, and performance. Drive foundational improvements and automation to enhance researcher productivity. Troubleshoot, diagnose, and root cause of system failures and isolate the components/failure scenarios while working with internal & external partners. Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity. Practice sustainable incident response and blameless postmortems and Be part of an on-call rotation to support production systems Write and review code, develop documentation and capacity plans, debug the hardest problems, live, on some of the largest and most complex systems in the world. Implement remediations across software and hardware stack according to plan, while keeping a thorough procedural record and data log and Manage upgrades and automated rollbacks across all clusters. What We Need To See Bachelor’s degree in computer science, Electrical Engineering or related field or equivalent experience with a minimum 5+ years of experience designing and operating large scale compute infrastructure. Proven experience in site reliability engineering for high-performance computing environments with operational experience of at least 2K GPUs cluster. Deep understanding of GPU computing and AI infrastructure. Passion for solving complex technical challenges and optimizing system performance. Experience with AI/HPC advanced job schedulers, and ideally familiarity with schedulers such as Slurm. Working knowledge of cluster configuration management tools such as BCM or Ansible and infrastructure level applications, such as Kubernetes, Terraform, MySQL, etc. In depth understating of container technologies like Docker, Enroot, etc. Experience programming in Python and Bash scripting. Ways To Stand Out From The Crowd Interest in crafting, analyzing, and fixing large-scale distributed systems. Familiarity with NVIDIA GPUs, Cuda Programming, NCCL, MLPerf benchmarking, InfiniBand with IBoIP and RDMA. Experience with Cloud Deployment, BCM, Terraform. Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads. Multi-cloud experience. JR1993756 Show more Show less

Posted 3 weeks ago

Apply

8.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Linkedin logo

Job Requisition ID # 25WD85491 Position Overview We are looking for an experienced Principal Software Engineer to join our platform team focusing on AI/ML Platform (AMP). This team builds and maintains central components to fast track the development of new ML/AI models such as model development studio, feature store, model serving and model observability. The ideal candidate would have a background in ML Ops, Data engineering and DevOps with the experience of building high scale deployment architectures and observability. As an important contributor to our engineering team, you will help shape the future of our AI/ML capabilities, delivering solutions that inspire value for our organization. You will report directly to an Engineering Manager, and you will be based in Pune. Responsibilities System design: You will design, implement and manage software systems for the AI/ML Platform and orchestrate the full ML development lifecycle for the partner teams Mentoring: Spreading your knowledge, sharing best practices and doing design reviews to step up the expertise at the team level Multi-cloud architecture: Define components which leverages strengths from multiple cloud platforms (e.g., AWS, Azure) to optimize performance, cost, and scalability AI/ML observability: You will build systems for monitoring performance of AI/ML models and find insights on the underlying data such as drift detection, data fairness/bias and anomalies ML Solution Deployment: You will develop tools for building and deploying ML artefacts in production environments and facilitating a smooth transition from development to deployment Big Data Management: Automate and orchestrate tasks related to managing big data transformation and processing and build large-scale data stores for ML artifacts Scalable Services: Design and implement low-latency, scalable prediction, and inference services to support the diverse needs of our users Cross-Functional Collaboration: Collaborate across diverse teams, including machine learning researchers, developers, product managers, software architects, and operations, fostering a collaborative and cohesive work environment End-to-end ownership: You will take the end-to-end ownership of the components and work with other engineers in the team including design, architecture, implementation, rollout and onboarding support to partner teams, production on-call support, testing/verification, investigations etc Minimum Qualifications Educational Background: Bachelor’s degree in Computer Science or equivalent practical experience Experience: Over 8 years of experience in software development and engineering, delivering production systems and services Prior experience of working with MLOps team at the intersection of the expertise across ML model deployments, DevOps and data engineering Hands-on skills: Ability to fluently translate the design into high quality code in golang, python, Java Knowledge of DevOps practices, containerization, orchestration tools such as CI/CD, Terraform, Docker, Kubernetes, Gitops Demonstrate knowledge of distributed data processing frameworks, orchestrators, and data lake architectures using technologies such as Spark, Airflow, iceberg/ parquet formats Prior collaborations with Data science teams to deploy their models, setting up ML observability for inference level monitoring Exposure for building RAG based applications by collaborating with other product teams, Data scientists/AI engineers Demonstrate creative problem-solving skills with the ability to break down problems into manageable components Knowledge of Amazon AWS and/or Azure cloud for solutioning large scale application deployments Excellent communication and collaboration skills, fostering teamwork and effective information exchange Preferred Qualifications Experience in integrating with third party vendors Experience in latency optimization with the ability to diagnose, tune, and enhance the efficiency of serving systems Familiarity with tools and frameworks for monitoring and managing the performance of AI/ML models in production (e.g., MLflow, Kubeflow, TensorBoard) Familiarity with distributed model training/inference pipelines using (KubeRay or equivalent) Exposure to leveraging GPU computing for AI/ML workloads, including experience with CUDA, OpenCL, or other GPU programming tools, to significantly enhance model training and inference performance Exposure to ML libraries such as PyTorch, TensorFlow, XGBoost, Pandas, and ScikitLearn Learn More About Autodesk Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made. We take great pride in our culture here at Autodesk – our Culture Code is at the core of everything we do. Our values and ways of working help our people thrive and realize their potential, which leads to even better outcomes for our customers. When you’re an Autodesker, you can be your whole, authentic self and do meaningful work that helps build a better future for all. Ready to shape the world and your future? Join us! Salary transparency Salary is one part of Autodesk’s competitive compensation package. Offers are based on the candidate’s experience and geographic location. In addition to base salaries, we also have a significant emphasis on discretionary annual cash bonuses, commissions for sales roles, stock or long-term incentive cash grants, and a comprehensive benefits package. Diversity & Belonging We take pride in cultivating a culture of belonging and an equitable workplace where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging Are you an existing contractor or consultant with Autodesk? Please search for open jobs and apply internally (not on this external site). Show more Show less

Posted 3 weeks ago

Apply

0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

Mandatory Skills: 1. Python 2. Rest end point API (FAST or FLASK) 3. Kubernetes /Docker containers 4. Redis 5. Neo 4 J Graph Databases or Vector J (any of the two ) Responsibilities: Design, develop, and maintain APIs using Python. Implement containerization and orchestration using Docker and Kubernetes. Integrate APIs for data exchange and ensure seamless data flow between systems Collaborate with cross-functional teams to define, design, and ship new features. Ensure the performance, quality, and responsiveness of applications. Identify and correct bottlenecks and fix bugs. Help maintain code quality, organization, and automation. Provide technical leadership and guidance to the development team Skills: Proven experience as a Solution Architect or similar role. Strong proficiency in Python and experience with API development. Hands-on experience with Docker and Kubernetes. Experience with API integration for data exchange. Solid understanding of software development principles and methodologies. Excellent problem-solving skills and attention to detail. Strong communication and teamwork skills. Nice to have: Architect and develop user interfaces using Node.js, React, and HTML ML programming skills, CUDA GPU Show more Show less

Posted 3 weeks ago

Apply

4.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

Job Category: AIML Job Type: Full Time Job Location: Bengaluru Mangalore Experience: 4-8 Years Skills: AI AWS/AZURE/GCP Azure ML C computer vision data analytics Data Modeling Data Visualization deep learning Descriptive Analytics GenAI Image processing Java LLM models ML ONNX Predictive Analytics Python R Regression/Classification Models SageMaker SQL TensorFlow Position Overview We are looking for an experienced AI/ML Engineer to join our team in Bengaluru. The ideal candidate will bring a deep understanding of machine learning, artificial intelligence, and big data technologies, with proven expertise in developing scalable AI/ML solutions. You will lead technical efforts, mentor team members, and collaborate with cross-functional teams to design, develop, and deploy cutting edge AI/ML applications. Job Details Job Category: AI/ML Engineer. Job Type: Full-Time Job Location: Bengaluru Experience Required: 4-8 Years About Us We are a multi-award-winning creative engineering company. Since 2011, we have worked with our customers as a design and technology enablement partner, guiding them on their digital transformation journeys. Roles And Responsibilities Design, develop, and deploy deep learning models for object classification, detection, and segmentation using CNNs and Transfer Learning. Implement image preprocessing and advanced computer vision pipelines. Optimize deep learning models using pruning, quantization, and ONNX for deployment on edge devices. Work with PyTorch, TensorFlow, and ONNX frameworks to develop and convert models. Accelerate model inference using GPU programming with CUDA and cuDNN. Port and test models on embedded and edge hardware platforms. ( Orin, Jetson, Hailo ) Conduct research and experiments to evaluate and integrate GenAI technologies in computer vision tasks. Explore and implement cloud-based AI workflows, particularly using AWS/Azure AI/ML services. Collaborate with cross-functional teams for data analytics, data processing, and large-scale model training. Required Skills Strong programming experience in Python. Solid background in deep learning, CNNs, and transfer learning and Machine learning basics. Expertise in object detection, classification, segmentation. Proficiency with PyTorch, TensorFlow, and ONNX. Experience with GPU acceleration (CUDA, cuDNN). Hands-on knowledge of model optimization (pruning, quantization). Experience deploying models to edge devices (e.g., Jetson, mobile, Orin, Hailo ) Understanding of image processing techniques. Familiarity with data pipelines, data preprocessing, and data analytics. Willingness to explore and contribute to Generative AI and cloud-based AI solutions. Good problem-solving and communication skills. Preferred (Nice-to-Have) Experience with C/C++. Familiarity with AWS Cloud AI/ML tools (e.g., SageMaker, Rekognition). Exposure to GenAI frameworks like OpenAI, Stable Diffusion, etc. Knowledge of real-time deployment systems and streaming analytics. Qualifications Graduation/Post-graduation in Computers, Engineering, or Statistics from a reputed institute. What We Offer Competitive salary and benefits package. Opportunity to work in a dynamic and innovative environment. Professional development and learning opportunities. Visit us on: CodeCraft Technologies LinkedIn : CodeCraft Technologies LinkedIn Instagram : CodeCraft Technologies Instagram Show more Show less

Posted 3 weeks ago

Apply

40.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Linkedin logo

Company Overview KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA focuses more than average on innovation and we invest 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world’s leading technology providers to accelerate the delivery of tomorrow’s electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us. Group/Division With over 40 years of semiconductor process control experience, chipmakers around the globe rely on KLA to ensure that their fabs ramp next-generation devices to volume production quickly and cost-effectively. Enabling the movement towards advanced chip design, KLA's Global Products Group (GPG), which is responsible for creating all of KLA’s metrology and inspection products, is looking for the best and the brightest research scientist, software engineers, application development engineers, and senior product technology process engineers. The LS-SWIFT Division of KLA’s Global Products Group provides patterned wafer inspection systems for high-volume semiconductor manufacturing. Its mission is to deliver market-leading cost of ownership in defect detection for a broad range of applications in the production of semiconductors. Customers from the foundry, logic, memory, automotive, MEMS, advanced packaging and other markets rely upon high-sample wafer inspection information generated by LS-SWIFT products. LS (Laser Scanning) systems enable cost-effective patterned wafer defect detection for the industry’s most sophisticated process technologies deployed in leading-edge foundry, logic, DRAM, and NAND fabs. SWIFT (Simultaneous Wafer Inspection at Fast Throughput) systems deliver all-wafer-surface (frontside, backside, and edge) macro inspection that is critical for automotive IC, MEMS, and advanced packaging processes as well as foundry/logic and memory fabs. LS-SWIFT operates from a global footprint that includes the US, Singapore, India and Germany, and serves a worldwide customer base across Asia, Europe and North America. Job Description/Preferred Qualifications Key Responsibilities: Lead and mentor a team of algorithm engineers, providing guidance and support to ensure their professional growth and success. Develop and maintain the infrastructure required for the deployment and execution of algorithms at scale. Collaborate with data scientists, software engineers, and product managers to design and implement robust and scalable algorithmic solutions. Optimize algorithm performance and resource utilization to meet business objectives. Stay up-to-date with the latest advancements in algorithm engineering and infrastructure technologies, and apply them to improve our systems. Drive continuous improvement in development processes, tools, and methodologies. Qualifications: Bachelor's or Master's degree in Computer Science, Engineering, or a related field. Proven experience in developing computer vision and image processing algorithm and ML/DL algorithm. Familiar with high performance computing, parallel programming and distributed systems. Strong leadership and team management skills, with a track record of successfully leading engineering teams. Proficiency in programming languages such as Python, C++ and CUDA. Excellent problem-solving and analytical skills. Strong communication and collaboration abilities. Preferred Qualifications: Experience with machine learning frameworks and libraries (e.g., TensorFlow, PyTorch, Scikit-learn). Experience with GPU architecture and algo development toolkit like Docker, Apptainer. Minimum Qualifications Bachelor's degree plus 8 + years of experience Master's degree plus 8 + years of experience Familiar with high performance computing, parallel programming and distributed systems. We offer a competitive, family friendly total rewards package. We design our programs to reflect our commitment to an inclusive environment, while ensuring we provide benefits that meet the diverse needs of our employees. KLA is proud to be an equal opportunity employer Be aware of potentially fraudulent job postings or suspicious recruiting activity by persons that are currently posing as KLA employees. KLA never asks for any financial compensation to be considered for an interview, to become an employee, or for equipment. Further, KLA does not work with any recruiters or third parties who charge such fees either directly or on behalf of KLA. Please ensure that you have searched KLA’s Careers website for legitimate job postings. KLA follows a recruiting process that involves multiple interviews in person or on video conferencing with our hiring managers. If you are concerned that a communication, an interview, an offer of employment, or that an employee is not legitimate, please send an email to talent.acquisition@kla.com to confirm the person you are communicating with is an employee. We take your privacy very seriously and confidentially handle your information. Show more Show less

Posted 3 weeks ago

Apply

8.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Linkedin logo

Job Title: AI Manager Location: Chennai ( Hybrid role – 3 days per week onsite. Candidates must be willing to relocate to Chennai) Key Responsibilities: • Lead and mentor a team of algorithm engineers, providing guidance and support to ensure their professional growth and success. • Develop and maintain the infrastructure required for the deployment and execution of algorithms at scale. • Collaborate with data scientists, software engineers, and product managers to design and implement robust and scalable algorithmic solutions. • Optimize algorithm performance and resource utilization to meet business objectives. • Stay up-to-date with the latest advancements in algorithm engineering and infrastructure technologies, and apply them to improve our systems. • Drive continuous improvement in development processes, tools, and methodologies. Qualifications: • Bachelor's or Master's degree in Computer Science, Engineering, or a related field. • Proven experience in developing computer vision and image processing algorithm and ML/DL algorithm. • Familiar with high performance computing, parallel programming and distributed systems. • Strong leadership and team management skills, with a track record of successfully leading engineering teams. • Proficiency in programming languages such as Python, C++ and CUDA. • Excellent problem-solving and analytical skills. • Strong communication and collaboration abilities. Preferred Qualifications: • Experience with machine learning frameworks and libraries (e.g., TensorFlow, PyTorch, Scikit-learn). • Experience with GPU architecture and algo development toolkit like Docker, Apptainer. Minimum Must Have's: • Bachelor's degree plus 8 + years of experience • Master's degree plus 8 + years of experience • Familiar with high performance computing, parallel programming and distributed systems. You can also share your resume at hr@alphasqmax.com Show more Show less

Posted 3 weeks ago

Apply

4.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

Company: Qualcomm India Private Limited Job Area: Engineering Group, Engineering Group > Systems Engineering General Summary: General Summary: Qualcomm’s Graphics PSE team is a part of the Graphics System team and is responsible for the overall quality of the Graphics IP in silicon. As a member of our Graphics PSE team, you will be working closely with architects, designers, verification, and software engineers to take the GPU from pre-Sil stage to tape out to silicon bring-up and to CS(Customer Samples). Job Functions/General Responsibilities: In this position, you will be responsible for developing graphics applications using graphics API like DirectX, OpenGL ES , Vulkan, improving coverage, creating GPU bring-up test-plans and test methodologies. Analyzing and enabling new games and benchmark in pre-Si environment. Provide debug support in pre-Silicon environment (functional model) and driving end to end solutions for silicon bring-up issues including failure debug. We are looking for highly motivated engineers that enjoy working in a fast-paced environment with minimal guidance. Candidates must have strong programming, communication and teamwork skills and approach difficult challenges as learning opportunities. Critical “Must Have” Skills/experience For Role Strong programming in C/C++. GPU APIs knowledge (Vulkan/Direct3D/OpenGL/Direct X / OpenCL etc.) GPU architecture. Strong analytical skill. 4+ years of relevant experience. Minimum Qualifications Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 10+ years of Systems Engineering or related work experience. OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 8+ years of Systems Engineering or related work experience. OR PhD in Engineering, Information Systems, Computer Science, or related field and 5+ year of Systems Engineering or related work experience. Minimum Qualifications: Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Systems Engineering or related work experience. OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Systems Engineering or related work experience. OR PhD in Engineering, Information Systems, Computer Science, or related field and 1+ year of Systems Engineering or related work experience. Preferred Skills/experience For Role: Experience with at least one of: Perl, Python, TCL Games/graphics application development. OpenCL/CUDA knowledge. Graphics driver development or modelling experience. Post-silicon enablement and bring-up. Prior experience in working in emulation environments for development and debug. Debug tools including JTAG and kernel debuggers Applicants : Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries). Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law. To all Staffing and Recruiting Agencies : Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications. If you would like more information about this role, please contact Qualcomm Careers. 3074282 Show more Show less

Posted 3 weeks ago

Apply

10.0 - 20.0 years

100 - 200 Lacs

Hyderabad

Hybrid

Naukri logo

Compiler Lead Hyderabad Founded by highly respected Silicon Valley veterans - with its design centers established in Santa Clara, California. / Hyderabad/Bangalore A US based well-funded product-based startup looking for Highly talented Verification Engineers for the following roles. We are looking for a highly experienced systems engineer with deep expertise in compilers, machine learning infrastructure, and system-level performance optimization. This role is hands-on and research-driven, ideal for someone who thrives on solving low-level performance challenges and building core infrastructure that powers next-generation AI workloads. Key Responsibilities: Compiler Design & Optimization Develop and enhance compiler toolchains based on LLVM, MLIR, Open64, or Glow. Build and optimize intermediate representations, custom dialects, and code generation flows for AI accelerators. Implement transformations and optimizations for latency, memory usage, and compute efficiency. AI System Integration Work closely with hardware teams to co-design compilers targeting custom silicon. Integrate compiler backends with ML frameworks like PyTorch, TensorFlow, or ONNX. Build graph-level and kernel-level transformations for AI training and inference pipelines. Performance Tuning & System Analysis Conduct low-level profiling and performance tuning across compiler and runtime layers. Identify and eliminate bottlenecks across CPU/GPU/NPU workloads. Develop parallel programming solutions leveraging SIMD, multi-threading, and heterogeneous computing. Tooling & Infrastructure Develop tooling for performance analysis, debug, and test automation. Contribute to internal SDKs and devkits used by AI researchers and system engineers. Required Skills & Experience: Strong compiler development experience using LLVM, MLIR, Glow, or similar toolchains. Proficiency in C/C++, with solid command of Python for tooling and automation. In-depth understanding of compiler internals, including IR design, lowering, codegen, and scheduling. Deep knowledge of hardware-software co-design, particularly for AI/ML workloads. Experience with runtime systems, memory models, and performance modeling. Solid grasp of parallel and heterogeneous computing paradigms. Nice to Have: Experience working with custom AI hardware or edge inference platforms. Familiarity with quantization, scheduling for dataflow architectures, or compiler autotuning. Contributions to open-source compiler projects (e.g., LLVM, MLIR, TVM). Qualifications: Bachelors or Masters degree in Computer Science, Electrical Engineering, or a related field. 10–15 years of relevant hands-on experience in compilers, systems programming, or AI infrastructure. Contact: Uday Mulya Technologies muday_bhaskar@yahoo.com "Mining The Knowledge Community"

Posted 3 weeks ago

Apply

5.0 years

0 Lacs

Coimbatore, Tamil Nadu, India

On-site

Linkedin logo

Job Overview We are looking for an experienced Computer Vision Engineer with expertise in image processing, machine learning, and deep learning. The ideal candidate should have hands-on experience developing and deploying computer vision algorithms and deep learning models using Python, OpenCV, and YOLO, along with proficiency in CUDA and NumPy. This role involves research, model optimization, and real-world deployment of cutting-edge computer vision Responsibilities : Develop and deploy computer vision algorithms and deep learning models for diverse applications. Design and implement computer vision models using state-of-the-art techniques and frameworks. Explore and analyze unstructured data like images through image processing techniques. Analyze, evaluate, and optimize existing computer vision systems for improved performance and accuracy. Test and validate computer vision code and models, ensuring robustness and reliability. Research and implement new computer vision technologies to stay at the forefront of the field. Collaborate with cross-functional teams to develop innovative solutions meeting project requirements. Monitor the performance and accuracy of deployed computer vision models, making necessary adjustments. Maintain and update computer vision systems to ensure their continued functionality and relevance. Provide technical support and guidance to team members and customers using computer vision : 5 years of experience as a Computer Vision Engineer. Bachelor's degree in Computer Science or a related field. Proven experience in developing and deploying computer vision systems. Strong knowledge of computer vision algorithms, libraries, and tools such as: OpenCV, TensorFlow, PyTorch, Keras, NumPy, scikit-image, Matplotlib, Seaborn, YOLO, etc. Familiarity with GPU acceleration and optimization tools like CUDA, OpenCL, OpenGL. Expertise in computer vision applications, including : Object detection, image classification, text detection & OCR, face detection, generative models, video analytics, object tracking, and model Experience with runtime AI frameworks such as ONNX, TensorRT, OpenVINO. Hands-on experience with cloud platforms (AWS, Azure), Docker, Kubernetes, and GitHub. Experience in training models using GPU computing or cloud-based environments. Familiarity with machine learning and deep learning concepts and frameworks. Strong problem-solving and analytical skills. Ability to work independently in a fast-paced environment and collaborate effectively in a Qualifications : Experience in real-time video processing and streaming analytics. Knowledge of Edge AI and deployment on embedded systems. Exposure to 3D vision, SLAM, and depth estimation. Contributions to open-source computer vision projects. (ref:hirist.tech) Show more Show less

Posted 3 weeks ago

Apply

5.0 years

0 Lacs

Coimbatore, Tamil Nadu, India

On-site

Linkedin logo

Job Description MulticoreWare Inc develops heterogeneous software for Media and Automotive markets. It develops and licenses world-class video codec libraries, such as the widely used x265 (open-source HEVC encoding library) and supports tier-1 global streaming video services and video encoding solution providers. As a Senior SW Engineer/Technical Lead, you will join MulticoreWare’s growing team of world-class engineers, and architects in its offices at Chennai, India, to implement, and deliver solutions in video pipelines and products. You will work in a result-oriented team with a culture of transparency and minimal bureaucracy. MulticoreWare provides a collaborative work environment with opportunities to learn, grow and progress in your career path. Key Responsibilities Implement, test, and deliver production-quality software, architected for problems in video pipelines of internal and customer IPs Lead the execution and implementation of product features with a small team of engineers and mentor the junior engineers within the team Conduct meticulous code-reviews to ensure bug-free and high-quality deliverables. Contribute towards product release cycles by developing tests deployed in continuous integration frameworks, fixing bugs identified during release, etc. Interface with customers, support customer evaluations of products and debugging of customer issues Qualifications Bachelor’s degree in Computer Science, Computer Engineering, or Electrical Engineering with 5 + years’ experience, or a Master’s degree in similar areas with 3+ years’ experience Excellent C/C++ development and debugging experience in creating production-grade software on Windows and Unix-based platforms Hands-on experience in video coding with an in-depth knowledge of HEVC, AVC, VP9, AV1 and VVC Strong interest in video/imaging technologies, including video/image compression, analytics, and streaming/delivery Demonstrated analytical, problem solving and troubleshooting skills, with a meticulous attention to detail Working knowledge of code management systems such as Git and Bitbucket Good verbal and written communication skills Preferred Qualifications Experience with continuous integration frameworks like Jenkins Experience with heterogeneous programming libraries such as OpenCL, CUDA Experience with platform-specific software optimizations like SIMD to enable higher performance software Understanding of Agile / Scrum methodologies Experience in interacting with geographically distributed teams for reviews and meetings Skills:- Video codecs, Multimedia, C++, GStreamer, FFmpeg and HEVC Show more Show less

Posted 3 weeks ago

Apply

40.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Linkedin logo

About CynLr Just like a baby’s brain, CynLr Visual Intelligence stack makes Robots to instinctively see & pick any object under any ambience, without any training . ( a demo video link ). Today, we don’t have a robot that can fit a screw into a nut without slipping a thread. Imagine what it would take for a robot to assemble a Smartphone or a car by putting together 1000s of parts with varied shapes and weights, all in random orientations. Thus factories become complex, needing heavy customization of their environment. CynLr enabled visual robots intuitively handles any object, even from a clutter – a universal alternative to custom machines, simplifying factory lines into modular LEGO blocks of micro-factories. Simplifying factories with robots that can pick & place any object has been a 40 year old pipe dream - touted as The Holy Grail of Robotics . As a SW developer, you will be responsible for building the entire Vision & Learning SW Stack, Task & Robot Control OS, extensively optimising the performance of these algorithms, translating them into API library for Internal Engineers and Customers, along with creating a UI layer for the end users. Requirements in Practice : Excellent C++ skillset across Windows and Linux Platforms. Optimizing processing cycles for large image pipelines and memory handling is a plus. (Camera generates 1 GB of Data every second) Experience with State-Machine Architecture is a plus Experienced with entire SW development Lifecycle & best practices. OOP driven thought process – scalability, modularity, polymorphic coding Deep architecture understanding with thorough knowledge of design patterns and adept in designing architectures for parallel threading Multi-threaded programming libraries like pThreads, OPENMP, OPENCL Experience with API architecture design and development Experienced with DLL & Static Library building and distribution. Adept with translating and optimizing algorithms, especially DSP (which includes image processing) and/or Neural Net Architectures ( at least one of the two is needed ) Implementing and Optimizing Algorithms Adept with building test frameworks for codes. Must have a strong inclination for documentation and code readability. Must have an understanding of : Compiler working and construction. CPU architectures – x86, x64 & ARM Hardware-associated driver development. OS and layers (Board Support Packages, BIOS, UEFI, BootLoader) Memory architectures and optimizations. MIMD, SIMD Good to have experience and practice with GPU-based application development. Knowledge of CUDA (Excellency is not necessary) State machine architecture Realtime computing UI-based deployable application development Team Structure: The engineering team will comprise of – Algo Team, GPU Team, SW Dev Team & HW Team. Members of other teams will be passive members of each team apart from the team they lead. The Algo Team will provide the Neural Models & Vision algorithms, while the GPU Team will provide the GPU optimizations for the algos, HW team will provide the HW integration and SW team with translate GPU optimized algos into SW blocks. Each team will split the implementation among other teams and guide them through the implementation. Every team member will be a passive member of all other teams. What will you do? Your contribution is predominantly (but not confined only to) architecting, defining test cases, reviewing & implementing all the SW & firmware parts and development effort - the image processing, Neural Models, APIs, vision sequences, manipulation sequences, drivers, etc. Your thorough knowledge of system architecture, C++ skill set, design patterns & your penchant for application-oriented SW design thought processes, will drive and implement the whole software design & development part. You will also provide architectures and approaches for the drivers to be developed to keep them modular and scalable, and consistently review them all. How will you Do? You have complete freedom here, but you will be subjected to reviews. Since this is a startup and the product is not yet well-defined, you would be the one with the responsibility of defining it. Expect things to be not orderly and requirements to not be solid. Part of your design effort largely involves requirements building too and developing architectures that are agnostic to such requirement changes. The SW part of the product significantly evolves as per your thought process and will henceforth carry your signature in it. You will also be building a team as the product evolves to maintain and develop further. Though confined to a focused area, the work is pretty much expected to be entrepreneurial with the exact advantages and difficulties of a startup. Show more Show less

Posted 3 weeks ago

Apply

Exploring CUDA Jobs in India

India has emerged as a hub for tech talent, with a growing demand for professionals skilled in CUDA programming. CUDA, which stands for Compute Unified Device Architecture, is a parallel computing platform and programming model developed by NVIDIA. As more companies in India look to leverage GPU acceleration for their computing needs, the demand for CUDA developers is on the rise.

Top Hiring Locations in India

  1. Bangalore
  2. Pune
  3. Hyderabad
  4. Chennai
  5. Mumbai

Average Salary Range

The average salary range for CUDA professionals in India varies based on experience: - Entry-level: INR 4-6 lakhs per annum - Mid-level: INR 8-12 lakhs per annum - Experienced: INR 15-20 lakhs per annum

Career Path

In the field of CUDA programming, a typical career path may include: - Junior CUDA Developer - CUDA Developer - Senior CUDA Developer - CUDA Tech Lead

Related Skills

Apart from proficiency in CUDA programming, professionals in this field are often expected to have knowledge or experience in: - C/C++ programming - Parallel computing - GPU architecture - Machine learning algorithms

Interview Questions

  • What is CUDA and how does it differ from traditional programming models? (basic)
  • Explain the difference between threads and blocks in CUDA. (basic)
  • What is shared memory in CUDA and why is it important? (medium)
  • How do you optimize memory access in CUDA programming? (medium)
  • Can you explain the concept of warp divergence in CUDA? (medium)
  • What is kernel launch overhead in CUDA and how can it be minimized? (advanced)
  • How do you handle error checking in CUDA programming? (basic)
  • Explain the concept of coalesced memory access in CUDA. (medium)
  • What are the different types of memory available in CUDA? (basic)
  • How do you debug CUDA code? (medium)
  • Explain the purpose of the cudaMemcpy function in CUDA. (basic)
  • How do you handle synchronization in CUDA programming? (medium)
  • What is the significance of grid and block dimensions in CUDA? (basic)
  • Explain the concept of warp size in CUDA. (basic)
  • How do you optimize performance in CUDA kernels? (medium)
  • What is the difference between global, shared, and constant memory in CUDA? (medium)
  • Can you explain the concept of texture memory in CUDA? (medium)
  • How do you handle race conditions in CUDA programming? (medium)
  • What are the advantages of using CUDA for parallel computing? (basic)
  • Explain the concept of warp shuffle in CUDA. (advanced)
  • How do you handle dynamic memory allocation in CUDA? (basic)
  • What is the purpose of the nvcc compiler in CUDA programming? (basic)
  • How do you profile and optimize CUDA applications? (medium)
  • Can you explain the concept of occupancy in CUDA? (advanced)

Closing Remark

As the demand for CUDA professionals continues to grow in India, now is the perfect time to upskill and pursue career opportunities in this field. By mastering CUDA programming and related skills, you can position yourself as a valuable asset in the tech industry. Prepare diligently, showcase your expertise confidently, and embark on a rewarding career journey in CUDA development.

cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies