
8 GPU Optimization Jobs

JobPe aggregates listings for easy access; you apply directly on the original job portal.

5.0 - 7.0 years

0 Lacs

Mumbai, Maharashtra, India

On-site

Job Title: Developer (C++, CUDA)
Work Mode: Work from Office
Location: Mumbai (local candidates only)
Experience: 5+ years
Notice Period: Immediate joiner (within 20 days)

Job Objective
We are seeking an experienced Developer with a strong background in C++, CUDA programming, and Linux to guide our development team in building cutting-edge solutions for device integration and high-performance computing tasks. This is a hands-on leadership position that combines technical expertise with team management skills to deliver high-quality software products.

Primary Responsibilities
Software Development: Develop and maintain high-performance applications using C++ and CUDA. Design and implement parallel algorithms for GPUs to accelerate computational workloads.
Performance Optimization: Optimize CUDA kernels for performance, scalability, and memory efficiency. Analyze performance bottlenecks and propose innovative solutions.
Code Review and Testing: Conduct code reviews to ensure adherence to coding standards and best practices. Develop and execute test cases to validate functionality and performance.
Collaboration: Work closely with the software engineering and research teams to understand requirements and deliver robust solutions. Provide technical guidance and mentoring to junior team members when necessary.
Documentation: Write and maintain technical documentation, including design specifications and user manuals.

Required Skills
C++: Strong proficiency in modern C++ (C++11/14/17/20).
CUDA Programming: Extensive experience in developing, debugging, and optimizing CUDA applications.
GPU Optimization: Familiarity with memory hierarchy, shared memory, streams, and warp-level operations in CUDA.
Parallel Computing: Solid understanding of parallel algorithms and multi-threaded programming.
Mathematical and Analytical Skills: Strong foundation in linear algebra, calculus, and numerical methods.
Tools: Experience with debugging/profiling tools like Nsight, CUDA Memcheck, or similar.

Note: Interested candidates, please share your updated resume at [HIDDEN TEXT].

Posted 1 day ago


5.0 - 9.0 years

0 Lacs

Chennai, Tamil Nadu

On-site

Are you ready to experiment, learn, and implement in the field of ML, Python, Computer Vision, hardware platforms like Jetson Nano and Raspberry Pi, and cloud services? Join us on a new adventure where your expertise can revolutionize the dynamics of our organization. We believe in selection, not rejection, and we are excited to welcome you to our team.

OptiSol is your destination for a stress-free and balanced lifestyle. We provide a nurturing environment where your career can flourish. As a certified GREAT PLACE TO WORK for 4 consecutive years, we value open communication, accessible leadership, diversity, and work-life balance with flexible policies. At OptiSol, you can thrive both personally and professionally. We are at the forefront of AI and innovation, shaping the future together. Join us on this journey of learning and growth.

What we like to see in you:
- Core Competencies: Programming, AI Expertise, IoT Hardware, Protocols (RTSP, TCP, MQTT, Modbus, UART), Leadership
- Bachelor's/Master's in Computer Science, ECE, or related fields
- Expertise in Python and relevant ML frameworks (TensorFlow, PyTorch, OpenCV)
- Strong understanding of neural networks, transfer learning, and optimization techniques
- Proficiency with Jetson Nano, Raspberry Pi, Arduino, and related platforms
- Familiarity with RTSP, TCP, MQTT, Modbus, UART
- Proven experience in leading technical teams and managing projects

What do we expect:
- Hands-on experience with model quantization, multitask learning, and zero-shot fine-tuning
- Familiarity with OCR and multimodal LLMs for different data types
- Experience in API development using FastAPI and Django, plus Android AI apps
- Knowledge of industrial cameras, motor drivers, and Ethernet switches
- Proficiency in CUDA programming and GPU optimization

What You'll Bring to the Table:
- Lead a team of AI engineers and promote a creative and collaborative environment
- Work with stakeholders to define project goals and milestones
- Deliver high-quality AI-driven solutions on time
- Tweak machine learning and computer vision algorithms for tasks like object detection
- Deploy AI on edge devices like Jetson Nano or Raspberry Pi
- Integrate AI models with electronics and solve hardware-software challenges
- Manage AI services on cloud platforms (AWS, Azure, Google Cloud)
- Test, validate, and document solutions following industry best practices

Core benefits you'll gain:
- Lead and inspire a team of engineers in a collaborative setting
- Directly shape project goals and ensure project success
- Gain hands-on experience in cutting-edge AI and computer vision applications
- Dive into edge devices like Jetson Nano and Raspberry Pi
- Manage AI services on cloud platforms securely and efficiently
- Develop a well-rounded skill set in AI and hardware

Join us as an OptiSolite and discover a fulfilling career with us. Explore life at OptiSol and learn more about our culture on our Insta Page.

Posted 2 weeks ago


4.0 - 6.0 years

0 Lacs

Mumbai, Maharashtra, India

On-site

Position: C++ Developer

Note: Candidates must have end-to-end experience in C++. Only candidates local to Mumbai are accepted; candidates from other states should not apply.

Contract-to-hire position
Number of positions: 1
Duration: 1-year contractual position
Budget: 18 LPA
Experience Range: 4 to 5 years
Notice Period: Immediate; candidate should join within 10 days
Location: Kandivali, Mumbai (only candidates local to Mumbai are accepted)
Work Mode: Work from Office
Interview Process: 1st round: Technical; 2nd round: Technical; then HR round
Only candidates interested in a contract position should apply.

JD: We are seeking an experienced Developer with a strong background in C++, CUDA programming, and Linux to guide our development team in building cutting-edge solutions for device integration and high-performance computing tasks. This is a hands-on leadership position that combines technical expertise with team management skills to deliver high-quality software products.

Primary Responsibilities:
Software Development: Develop and maintain high-performance applications using C++ and CUDA. Design and implement parallel algorithms for GPUs to accelerate computational workloads.
Performance Optimization: Optimize CUDA kernels for performance, scalability, and memory efficiency. Analyze performance bottlenecks and propose innovative solutions.
Code Review and Testing: Conduct code reviews to ensure adherence to coding standards and best practices. Develop and execute test cases to validate functionality and performance.
Collaboration: Work closely with the software engineering and research teams to understand requirements and deliver robust solutions. Provide technical guidance and mentoring to junior team members when necessary.
Documentation: Write and maintain technical documentation, including design specifications and user manuals.

Required Skills:
C++: Strong proficiency in modern C++ (C++11/14/17/20).
CUDA Programming: Extensive experience in developing, debugging, and optimizing CUDA applications.
GPU Optimization: Familiarity with memory hierarchy, shared memory, streams, and warp-level operations in CUDA.
Parallel Computing: Solid understanding of parallel algorithms and multi-threaded programming.
Mathematical and Analytical Skills: Strong foundation in linear algebra, calculus, and numerical methods.
Tools: Experience with debugging/profiling tools like Nsight, CUDA Memcheck, or similar.

Posted 2 weeks ago


4.0 - 6.0 years

0 Lacs

Mumbai, Maharashtra, India

On-site

Hiring: C++ Developer | Contract | Mumbai (Kandivali)
Location: Kandivali, Mumbai (Work from Office)
CTC: ₹14-15 LPA
Experience: 4-5 years
Notice Period: Immediate joiners (max 10 days)
Client: ACG World

Responsibilities
- Develop and maintain high-performance applications using modern C++ and CUDA
- Design and optimize parallel algorithms for GPU acceleration
- Identify and resolve performance bottlenecks
- Collaborate with teams to deliver scalable, robust solutions
- Conduct code reviews and create technical documentation

Required Skills
- Strong proficiency in modern C++ (C++11/14/17/20)
- Expertise in CUDA programming and GPU optimization
- Solid understanding of parallel computing and multi-threaded programming
- Strong problem-solving and analytical skills
- Experience with debugging/profiling tools (e.g., Nsight, CUDA Memcheck)

Only Mumbai-based candidates will be considered.
Urgent hiring. Interviews ongoing daily!
Apply now or share your CV at [HIDDEN TEXT] with the subject "C++".

Posted 2 weeks ago


10.0 - 14.0 years

0 Lacs

Karnataka

On-site

As an Applied AI/GenAI ML Director within the Asset and Wealth Management Technology Team at JPMorgan Chase, you will provide deep engineering expertise and work across agile teams to enhance, build, and deliver trusted market-leading technology products in a secure, stable, and scalable way. You will leverage your deep expertise to consistently challenge the status quo, innovate for business impact, lead the strategic development behind new and existing products and technology portfolios, and remain at the forefront of industry trends, best practices, and technological advances. This role will focus on establishing and nurturing common capabilities, best practices, and reusable frameworks, creating a foundation for AI excellence that accelerates innovation and consistency across business functions. Your responsibilities will include establishing and promoting a library of common ML assets, including reusable ML models, features stores, data pipelines, and standardized templates. You will lead efforts to create shared tools and platforms that streamline the end-to-end ML lifecycle across the organization. Additionally, you will create curative solutions using GenAI workflows through advanced proficiency in large language models (LLMs) and related techniques, and gain experience with creating a Generative AI evaluation and feedback loop for GenAI/ML pipelines. You will advise on the strategy and development of multiple products, applications, and technologies, serving as a lead advisor on the technical feasibility and business need for AIML use cases. Furthermore, you will liaise with firm-wide AI ML stakeholders, translating highly complex technical issues, trends, and approaches to leadership to drive the firm's innovation and enable leaders to make strategic, well-informed decisions about technology advancements. 
You will also influence across business, product, and technology teams and successfully manage senior stakeholder relationships, championing the firm's culture of diversity, opportunity, inclusion, and respect.

To be successful in this role, you must have formal training or certification on Machine Learning concepts and at least 10 years of applied experience, along with 5+ years of experience leading technologists to manage, anticipate, and solve complex technical items within your domain of expertise. An MS and/or PhD in Computer Science, Machine Learning, or a related field is required, as well as at least 10 years of experience in one of the programming languages like Python, Java, C/C++, etc., with intermediate Python skills being a must. You should have a solid understanding of using ML techniques, especially in Natural Language Processing (NLP) and Large Language Models (LLMs), hands-on experience with machine learning and deep learning methods, and the ability to work on system design from ideation through completion with limited supervision. Practical cloud-native experience such as AWS is necessary, along with good communication skills, a passion for detail and follow-through, and the ability to work effectively with engineers, product managers, and other ML practitioners.

Preferred qualifications for this role include experience with Ray, MLFlow, and/or other distributed training frameworks, in-depth understanding of Embedding based Search/Ranking, Recommender systems, Graph techniques, and other advanced methodologies, advanced knowledge in Reinforcement Learning or Meta Learning, and a deep understanding of Large Language Model (LLM) techniques, including Agents, Planning, Reasoning, and other related methods. Experience with building and deploying ML models on cloud platforms such as AWS and AWS tools like Sagemaker, EKS, etc., is also desirable.

Posted 3 weeks ago


1.0 - 5.0 years

0 Lacs

Karnataka

On-site

Qualcomm India Private Limited is a leading technology innovator in the Engineering Group, specializing in Software Engineering. As a Qualcomm Software Engineer, you will play a crucial role in designing, developing, modifying, and validating cutting-edge embedded and cloud edge software applications. Your work will contribute to the creation of world-class products that exceed customer expectations. Collaboration with systems, hardware, architecture, and test engineers is essential to design system-level software solutions that meet performance requirements and interfaces. The ideal candidate holds a Bachelor's degree in Engineering, Information Systems, Computer Science, or a related field and possesses 1-3 years of work experience in embedded software and/or drivers. You should be detail-oriented with strong analytical and problem-solving skills, highly organized, and proficient in C/C++ programming and ARM assembly language. A solid understanding of embedded system architecture, 2D and 3D graphics technology, multimedia on embedded systems, and GPU optimization is required. Experience with virtualization technologies, GPU as a compute engine, and modern 3D graphics applications using OpenGLES API is advantageous. Knowledge of operating systems such as Android, QNX, embedded Linux, Genivi, and Integrity is preferred. Proficiency in graphics frameworks like Kanzi and QT, industry-standard software tools, and excellent communication skills are necessary for this role. Qualcomm values diversity and is an equal opportunity employer. If you require accommodations during the application/hiring process due to a disability, Qualcomm is committed to providing accessible support. Qualcomm expects its employees to adhere to all applicable policies and procedures, including safeguarding confidential information. To all Staffing and Recruiting Agencies: Qualcomm's Careers Site is exclusively for individuals seeking job opportunities at Qualcomm. 
Staffing agencies or individuals represented by agencies are not authorized to use this site for submissions. Unsolicited resumes or applications will not be accepted. For further information about this role, please contact Qualcomm Careers directly.

Posted 1 month ago


5.0 - 8.0 years

4 - 7 Lacs

Mumbai, Navi Mumbai

Work from Office

Duration: 1-year contractual position
Location: Remote
Notice Period: Within 20 days
Education: B.Tech, B.E
Interview Process: 1st: Technical, 2nd: Technical round, 3rd: HR round
Mandatory: End-to-end C++ skills

Skills Required:
- C, C++
- Qt/QML
- OOPs
- STL, Data Structures
- JavaScript
- Automotive Product Development
- Android Application Development
- Java
- API
- GitLab CI/CD
- GitHub, Gerrit
- Jira, Zoho
- PostgreSQL, SQLite, JSON
- MVVM Architecture
- Testing
- Debugging
- Linux, Unix

Job Description:
We are seeking an experienced Developer with a strong background in C++, CUDA programming, and Linux to guide our development team in building cutting-edge solutions for device integration and high-performance computing tasks. This is a hands-on leadership position that combines technical expertise with team management skills to deliver high-quality software products.

Primary Responsibilities:

Software Development:
- Develop and maintain high-performance applications using C++ and CUDA.
- Design and implement parallel algorithms for GPUs to accelerate computational workloads.

Performance Optimization:
- Optimize CUDA kernels for performance, scalability, and memory efficiency.
- Analyze performance bottlenecks and propose innovative solutions.

Code Review and Testing:
- Conduct code reviews to ensure adherence to coding standards and best practices.
- Develop and execute test cases to validate functionality and performance.

Collaboration:
- Work closely with the software engineering and research teams to understand requirements and deliver robust solutions.
- Provide technical guidance and mentoring to junior team members when necessary.

Documentation:
- Write and maintain technical documentation, including design specifications and user manuals.

Required Skills:
- C++: Strong proficiency in modern C++ (C++11/14/17/20).
- CUDA Programming: Extensive experience in developing, debugging, and optimizing CUDA applications.
- GPU Optimization: Familiarity with memory hierarchy, shared memory, streams, and warp-level operations in CUDA.
- Parallel Computing: Solid understanding of parallel algorithms and multi-threaded programming.
- Mathematical and Analytical Skills: Strong foundation in linear algebra, calculus, and numerical methods.
- Tools: Experience with debugging/profiling tools like Nsight, CUDA Memcheck, or similar.

Posted 1 month ago


6.0 - 12.0 years

6 - 12 Lacs

Bengaluru / Bangalore, Karnataka, India

On-site

- Optimize existing GPU implementations for ILT software.
- Design new GPU-accelerated algorithms for large-scale geometric data handling for ILT.
- Collaborate with cross-functional teams to ensure seamless integration of GPU features.
- Lead benchmarking and performance testing initiatives.
- Stay current on GPU technology trends and design the latest advancements into the system.
- Work closely with customers and hardware vendors to deliver optimal solutions rapidly.

The Impact You Will Have:
- Enhance the performance and efficiency of ILT software through optimized GPU implementations.
- Develop innovative GPU-accelerated algorithms that handle large-scale geometric data efficiently.
- Ensure seamless integration of GPU features into existing Mask Synthesis tools.
- Lead performance testing to ensure the highest standards of software quality.
- Drive technological advancements by integrating the latest GPU trends into our systems.
- Contribute to the rapid manufacturing of new chips by delivering optimal solutions swiftly.

What You'll Need:
- M.S. or Ph.D. in Computer Science or a related field.
- 6+ years of experience working with GPU-accelerated systems.
- Proficiency in CUDA, OpenCL, ROCm, or related technologies.
- Expertise in C++ and Python.
- Experience in distributed computing environments.
- Strong troubleshooting and collaboration skills.

Posted 2 months ago
