Jobs
Interviews

439 Cuda Jobs - Page 8

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

0.0 - 8.0 years

0 Lacs

Chennai, Tamil Nadu

On-site

Job Title: AI Infrastructure Engineer Experience: 8+ Years Location: Onsite ( Note: The selected candidate is required to relocate to Kovilpatti, Tamil Nadu for the initial three-month project training session . Post training, the candidate will be relocated to one of our onsite locations: Chennai, Hyderabad, or Pune , based on project allocation.) Job Summary: We are looking for an experienced AI Infrastructure Engineer to architect and manage scalable, secure, and high-performance infrastructure tailored for enterprise AI and ML applications. The ideal candidate will collaborate with data scientists, DevOps, and cybersecurity teams to build reliable platforms for efficient model development, training, and deployment. Key Responsibilities: Design and implement end-to-end AI infrastructure using cloud-native tools (Azure, AWS, GCP). Build secure and scalable compute environments with GPU/TPU acceleration for model training and inference. Develop and maintain CI/CD and MLOps pipelines for the AI/ML lifecycle. Optimize large-scale AI workloads using distributed computing and hardware-aware strategies. Manage containerized deployments using orchestration platforms like Kubernetes (AKS, EKS, GKE) and Docker. Ensure system reliability, monitoring, observability, and performance tuning for real-time inference services. Implement automated rollback, logging, and infrastructure monitoring tools. Collaborate with cybersecurity teams to enforce security, data privacy, and regulatory compliance. Technical Skills: Cloud Platforms: Azure Machine Learning, AWS SageMaker, GCP Vertex AI Infrastructure-as-Code: Terraform, ARM Templates, Bicep Containerization & Orchestration: Docker, Kubernetes (AKS, EKS, GKE) MLOps Tools: MLflow, Kubeflow, Azure DevOps, GitHub Actions GPU/TPU Acceleration: CUDA, NVIDIA Triton Inference Server Security & Compliance: TLS, IAM, RBAC, Azure Key Vault Performance: Endpoint scaling, latency optimization, model caching, and resource allocation Qualifications: Bachelor's or Master's in Computer Engineering, Cloud Architecture, or a related field Microsoft Certified: Azure Solutions Architect or DevOps Engineer Expert (preferred) Proven experience deploying and managing large-scale ML pipelines and AI workloads Strong understanding of infrastructure security, networking, and cloud-based AI environments Job Type: Full-time Pay: Up to ₹80,000.00 per month Ability to commute/relocate: Tamulinadu, Tamil Nadu: Reliably commute or willing to relocate with an employer-provided relocation package (Required) Application Question(s): Expected Salary in Annual (INR) Experience: AI Infrastructure Engineer : 8 years (Required) Work Location: In person

Posted 1 month ago

Apply

0.0 - 8.0 years

0 Lacs

Chennai, Tamil Nadu

Remote

Job Title: AI Infrastructure Engineer Experience: 8+ Years *Location: The selected candidate is required to work onsite at our Chennai location for the initial six-month project training and execution period. After the six months , the candidate will be offered remote opportunities.* Job Summary: We are looking for an experienced AI Infrastructure Engineer to architect and manage scalable, secure, and high-performance infrastructure tailored for enterprise AI and ML applications. The ideal candidate will collaborate with data scientists, DevOps, and cybersecurity teams to build reliable platforms for efficient model development, training, and deployment. Key Responsibilities: Design and implement end-to-end AI infrastructure using cloud-native tools (Azure, AWS, GCP). Build secure and scalable compute environments with GPU/TPU acceleration for model training and inference. Develop and maintain CI/CD and MLOps pipelines for the AI/ML lifecycle. Optimize large-scale AI workloads using distributed computing and hardware-aware strategies. Manage containerized deployments using orchestration platforms like Kubernetes (AKS, EKS, GKE) and Docker. Ensure system reliability, monitoring, observability, and performance tuning for real-time inference services. Implement automated rollback, logging, and infrastructure monitoring tools. Collaborate with cybersecurity teams to enforce security, data privacy, and regulatory compliance. Technical Skills: Cloud Platforms: Azure Machine Learning, AWS SageMaker, GCP Vertex AI Infrastructure-as-Code: Terraform, ARM Templates, Bicep Containerization & Orchestration: Docker, Kubernetes (AKS, EKS, GKE) MLOps Tools: MLflow, Kubeflow, Azure DevOps, GitHub Actions GPU/TPU Acceleration: CUDA, NVIDIA Triton Inference Server Security & Compliance: TLS, IAM, RBAC, Azure Key Vault Performance: Endpoint scaling, latency optimization, model caching, and resource allocation Qualifications: Bachelor's or Master's in Computer Engineering, Cloud Architecture, or a related field Microsoft Certified: Azure Solutions Architect or DevOps Engineer Expert (preferred) Proven experience deploying and managing large-scale ML pipelines and AI workloads Strong understanding of infrastructure security, networking, and cloud-based AI environments Job Type: Full-time Pay: Up to ₹80,000.00 per month Ability to commute/relocate: Chennai, Tamil Nadu: Reliably commute or planning to relocate before starting work (Required) Application Question(s): Expected Salary in Annual (INR) Experience: AI Infrastructure Engineer : 8 years (Required) Work Location: In person

Posted 1 month ago

Apply

0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Company Description Echoleads.ai leverages AI-powered sales agents to engage, qualify, and convert leads through real-time voice conversations. Our voice bots act as scalable sales representatives, making thousands of smart, human-like calls daily to follow up instantly, ask the right questions, and book appointments effortlessly. Echoleads integrates seamlessly with lead sources like Meta Ads, Google Ads, and CRMs, ensuring leads are never missed. Serving modern sales and marketing teams across various industries, our AI agents proficiently handle outreach, lead qualification, and appointment setting. About the Role: We are seeking a highly experienced Voice AI /ML Engineer to lead the design and deployment of real-time voice intelligence systems. This role focuses on ASR, TTS, speaker diarization, wake word detection, and building production-grade modular audio processing pipelines to power next-generation contact center solutions, intelligent voice agents, and telecom-grade audio systems. You will work at the intersection of deep learning, streaming infrastructure, and speech/NLP technology, creating scalable, low-latency systems across diverse audio formats and real-world applications. Key Responsibilities: Voice & Audio Intelligence: Build, fine-tune, and deploy ASR models (e.g., Whisper, wav2vec2.0, Conformer) for real-time transcription. Develop and finetune high-quality TTS systems using VITS, Tacotron, FastSpeech for lifelike voice generation and cloning. Implement speaker diarization for segmenting and identifying speakers in multi-party conversations using embeddings (x-vectors/d-vectors) and clustering (AHC, VBx, spectral clustering). Design robust wake word detection models with ultra-low latency and high accuracy in noisy conditions. Real-Time Audio Streaming & Voice Agent Infrastructure: Architect bi-directional real-time audio streaming pipelines using WebSocket, gRPC, Twilio Media Streams, or WebRTC. Integrate voice AI models into live voice agent solutions, IVR automation, and AI contact center platforms. Optimize for latency, concurrency, and continuous audio streaming with context buffering and voice activity detection (VAD). Build scalable microservices to process, decode, encode, and stream audio across common codecs (e.g., PCM, Opus, μ-law, AAC, MP3) and containers (e.g., WAV, MP4). Deep Learning & NLP Architecture: Utilize transformers, encoder-decoder models, GANs, VAEs, and diffusion models, for speech and language tasks. Implement end-to-end pipelines including text normalization, G2P mapping, NLP intent extraction, and emotion/prosody control. Fine-tune pre-trained language models for integration with voice-based user interfaces. Modular System Development: Build reusable, plug-and-play modules for ASR, TTS, diarization, codecs, streaming inference, and data augmentation. Design APIs and interfaces for orchestrating voice tasks across multi-stage pipelines with format conversions and buffering. Develop performance benchmarks and optimize for CPU/GPU, memory footprint, and real-time constraints. Engineering & Deployment: Writing robust, modular, and efficient Python code Experience with Docker, Kubernetes, cloud deployment (AWS, Azure, GCP) Optimize models for real-time inference using ONNX, TorchScript, and CUDA, including quantization, context-aware inference, model caching. On device voice model deployment.

Posted 1 month ago

Apply

6.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Hi, We have an immediate requirement for HPC/AI Application Engineer position with our organization SHI Locuz Enterprise Solutions Pvt Ltd. PFB Job details: Work Location - Pune/Mumbai/Bangalore Work Experience - 6+years(relevant) Subject Matter Expert Skills Required- HPC Application Installation & Deployment, AI Application Deployment. PFB JD for your reference: Senior HPC/AI Applications Engineer Experienced HPC/AI Applications Engineer with 5+ years in High-performance computing and AI application deployment. Expert at architecting, optimizing, and benchmarking CPU/GPU-intensive environments, ensuring maximum efficiency in scientific and ML workloads. Mastery over Open-source and Commercial HPC/AI Applications. Deep experience installing, benchmarking, and fine-tuning open-source applications, libraries, and compilers across CPU and GPU platforms. Proficient deploying and optimizing and benchmarking scientific codes (WRF, OpenFOAM, LAMMPS, GROMACS, Quantum Espresso, VASP, NAMD, BLAST, GATK, Ansys, Abaqus, MATLAB, LS‑DYNA, Nastran, CAE/CFX) etc. Compiler & Library Optimization - Advanced user of Intel OneAPI, AOCC, NVIDIA HPC SDK, GNU, LLVM, PGI compilers, and MPI libraries (OpenMPI, MPICH, Intel MPI). Deep profiling insights via Nsight, VTune, PAPI. Expert in AI frameworks: TensorFlow (CPU/GPU), PyTorch, Keras, Theano, Caffe, cuDNN. Strong knowledge of NVIDIANGC, NIM & NeMo. Proficient with workload & resource managers (PBS, LSF, SLURM, Kubernetes). Knowledge of application installation tools source code, cmake, spack, easy build, mamba etc. Benchmarking experience in accelerated HPC: HPL, HPCG, STREAM and MLPerf and scientific applications. Skilled in NVIDIA GPU tuning, CUDA and NIM workflows, kernel optimization, memory throughput tuning, and multi-GPU scaling strategies. Knowledge of frameworks such as Hugging Face, OpenAI, or other GenAI platforms. Knowledge in data preprocessing and model evaluation tool. Fluent in Bash, Python, and other scripting languages to automate installation, deployment, performance testing, and administrative tasks. Strong interpersonal skills; versed in customer interaction, technical documentation, and collaboration with cross-functional teams.

Posted 1 month ago

Apply

5.0 years

0 Lacs

Hubli, Karnataka, India

On-site

Read Before Applying Candidates available to join within 15–30 days will be preferred. What we’re building demands hard work, tenacity, ownership, creativity, and significantly more man-hours than an average job. If you prefer routine over the dynamic unpredictability of innovation, Astr isn’t for you. Our fast-paced environment requires curiosity, resilience, a passion for continuous learning, and comfort with rapid change. Why You Should NOT Join Us If high stakes, tight deadlines, and significant challenges don’t excite you, then our mission to revolutionize defence technology might not align with your career goals. We seek individuals ready to push boundaries and motivated by urgency and impact. Why You SHOULD Join Us We offer the opportunity to work on groundbreaking projects that strengthen national defence , supported by a team of A-players . About This Role We are seeking a Computer Vision Engineer to develop and implement cutting-edge vision-based algorithms for long-range surveillance and defence applications . The ideal candidate will have prior experience working on thermal cameras, long-range object detection, or visual surveillance systems . You will play a key role in designing and developing real-time object detection and tracking solutions for our next-generation defence products. This role requires expertise in Deep Learning, Computer Vision, and Embedded AI . Key Responsibilities ✅ Computer Vision Algorithm Development Develop, optimize, and implement object detection and tracking algorithms . Work on small object detection and long-range target tracking using visible spectrum and thermal imaging . ✅ Deep Learning & AI Implementation Train and fine-tune CNN-based models (YOLO, Faster R-CNN, SSD, etc.) for real-world defence applications. Deploy deep learning models on edge devices (Nvidia Jetson, ARM-based processors, etc.) . ✅ Hardware Integration & Optimization Collaborate with the embedded team to ensure real-time inference on low-power devices . Optimize algorithms for low-latency and power-efficient operation . ✅ Testing & Deployment Conduct field testing and performance evaluation of computer vision models in real-world environments . Work with defence teams to integrate vision-based solutions into weapon systems and surveillance platforms . Requirements 🔹 Must-Have Skills 1–5 years of experience in Computer Vision / AI / Deep Learning . Hands-on experience with thermal imaging, long-range surveillance, or security systems . Proficiency in Python, C++, OpenCV, TensorFlow, PyTorch, or YOLO . Strong understanding of CNNs, object detection, and real-time tracking . Experience deploying models on Jetson, Raspberry Pi, or ARM-based edge devices . Understanding of multi-threading, real-time processing, and hardware acceleration (CUDA, OpenCL) . 🔹 Good-to-Have Skills Experience with sensor fusion (LiDAR, Radar, IR cameras, etc.) . Knowledge of SLAM, motion prediction. Prior work in defence, aerospace, or surveillance applications . What We Expect from You ✅ Genuine curiosity and a problem-solving mindset . ✅ Execution, not just ideas —we value doers over talkers . ✅ Ability to work in a high-pressure, fast-paced environment . ✅ A strong sense of ownership and accountability . What You Can Expect from Us 🚀 The chance to work on hard, meaningful problems that directly impact national security . 💡 An A-team of high-energy, high-performance individuals. ⚡ High levels of responsibility and the ability to take ownership. 🎯 A mission-driven environment where your work truly matters . About Astr Defence Astr Defence is an award-winning Indian Defence Manufacturer pioneering next-generation armaments and counter-drone solutions to address modern security challenges . We collaborate with India’s elite forces to develop mission-critical technologies.

Posted 1 month ago

Apply

0.0 years

0 Lacs

Chandigarh, India

On-site

Company Profile Since year 2003, Oceaneering’s India Center has been an integral part of operations for Oceaneering’s robust product and service offerings across the globe. This center caters to diverse business needs, from oil and gas field infrastructure, subsea robotics to automated material handling & logistics. Our multidisciplinary team offers a wide spectrum of solutions, encompassing Subsea Engineering, Robotics, Automation, Control Systems, Software Development, Asset Integrity Management, Inspection, ROV operations, Field Network Management, Graphics Design & Animation, and more. In addition to these technical functions, Oceaneering India Center plays host to several crucial business functions, including Finance, Supply Chain Management (SCM), Information Technology (IT), Human Resources (HR), and Health, Safety & Environment (HSE). Our world class infrastructure in India includes modern offices, industry-leading tools and software, equipped labs, and beautiful campuses aligned with the future way of work. Oceaneering in India as well as globally has a great work culture that is flexible, transparent, and collaborative with great team synergy. At Oceaneering India Center, we take pride in “Solving the Unsolvable” by leveraging the diverse expertise within our team. Join us in shaping the future of technology and engineering solutions on a global scale. Position Summary We are seeking a proactive and enthusiastic Application Software Engineer to join our dynamic software team. This role is ideal for a fresher with a strong foundation in programming languages such as C++, Java, and Python, as well as a basic understanding of front-end development and testing. As a Software Developer, you will have the opportunity to work on a variety of projects and enhance your technical skills. Working in a multidisciplinary team you will be responsible for making sure that the software systems meet the customer specifications and work within their site constraints. ESSENTIAL Duties And Responsibilities Design, develop, and maintain server-side software systems and APIs. Write efficient, scalable, and maintainable code using C++, Python, and Rust (Optional) Extensive design and development skills in C++ 11. Having knowledge of C++ 14/C++17 will be added advantage. Thorough knowledge of the standard library, STL containers, and algorithms Solid understanding of complexity theory (big-O) of algorithms in general, and how the C++ containers fit in Understanding of performance tuning (w.r.t time/space) and how to do performance analysis and optimization. Experience in multi-threaded software development Excellent knowledge of Synchronization objects (Mutex, Semaphore, condition variables, etc) including their appropriate use cases and distinctions OpenCV, CUDA, PCL, and experience with Image processing / Computer Vision is a plus. Experience with one or more of docker, podman, and Kubernetes is a plus. Experience with middleware such as MQTT, DDS, ROS, ROS2 is a plus. NON-ESSENTIAL Carry out additional duties as assigned. Qualifications REQUIRED Bachelor / Master degree, preferably in Computer Science, Automation Technology or Information Technology; 0-1 years’ of experience in writing application software for technical applications; Understanding of networking hardware and software including UDP and TCP; Ability to read, understand, debug and modify existing product code; Experience with writing requirements, design documentation, and test cases Ability to read, understand, debug, and modify existing product code DESIRED Experience with programming in Linux; (tool chains, IDE’s, etc.); Experience with versions control systems, preferably githib; Knowledge of object-oriented analysis & design methodologies and design patterns; Experience with programming in Java, Python; Experience with XML and web services; Proficiency in C++, Java, and Python programming languages. Basic understanding of front-end technologies such as HTML, CSS, and JavaScript. Familiarity with software testing principles and practices. Knowledge, Skills, Abilities, And Other Characteristics Ensures that important information from management is shared with employees and others as appropriate Gives and receives constructive feedback Ensures that regular consistent communication takes place within area of responsibility Self-motivated, confident and passionate Provides vision and inspiration to peers and subordinates. Able to make decisions in conflicting situation Should be comfortable with ambiguity. Able to set priorities in a fast-paced, rapidly changing environment. How To Apply Oceaneering’s policy is to provide equal employment opportunities to all applicants. How To Apply Regular full-time employees who apply will be considered along with external candidates. Employees with less than six months with their current position are not eligible to apply for job postings. Please discuss your interest in the position with your current manager/supervisor prior to submitting your completed application. It is highly recommended to apply through the PeopleSoft or Oceanet portals. How To Apply In addition, we make a priority of providing learning and development opportunities to enable employees to achieve their potential and take charge of their future. As well as developing employees in a specific role, we are committed to lifelong learning and ongoing education, including developing people skills and identifying future supervisors and managers. Every month, hundreds of employees are provided training, including HSE awareness, apprenticeships, entry and advanced level technical courses, management development seminars, and leadership and supervisory training. We have a strong ethos of internal promotion. We can offer long-term employment and career advancement across countries and continents. Working at Oceaneering means that if you have the ability, drive, and ambition to take charge of your future-you will be supported to do so and the possibilities are endless.

Posted 1 month ago

Apply

2.0 - 7.0 years

11 - 15 Lacs

Bengaluru

Work from Office

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded Underpinning our mission is the AMD culture We push the limits of innovation to solve the worlds most important challenges We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives AMD together we advance_ GPU Kernel Developer AI Models The Role AMD is looking for a GPU kernel development engineer who is talented in developing high performance kernels for state-of-the-art and upcoming GPU hardware You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology The Person Experienced in GPU kernel development and optimization for AI/HPC applications Strong technical and analytical skills in GPU computing, hardware architecture, and deep understanding of HIP/CUDA/OpenCL/Triton development Ability to work as part of a team, deliver to project scope, and communicate to a technical/non-technical audience Key Responsibilities Develop high performance GPU kernels for key AI operators on AMD GPUs Optimize GPU code using structured and disciplined methodology profiling to identify gaps, roofline-analysis on hardware, identify key set of optimizations, establish uplift and line-of-sight, prototype and develop optimizations Support mission-critical workloads in NLP/LLM, Recommendation, Vision and Audio Collaborate and interact with system level performance architects, GPU hardware specialists, power/clock tuning teams, performance validation teams, and performance marketing teams to analyze and optimize training and inference for AI Work with open-source framework maintainers to understand their requirements and have your code changes integrated upstream Debug, maintain and optimize GPU kernels, understand and drive AI operator performance (GEMM, Attention, Distributed scale-up/out communication, etc ) Apply your knowledge of software engineering best practices Preferred Experience Knowledge of GPU computing (HIP, CUDA, OpenCL, Triton) Knowledge and experience in optimizing GPU kernels Expertise in using profiling, debugging tools Core understanding of GPU hardware Excellent C/C++/Python programming and software design skills, including debugging, performance analysis, and test design Academic Credentials Masters or PhD or equivalent experience in Computer Science, Computer Engineering, or related field Benefits offered are described: AMD benefits at a glance AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law We encourage applications from all qualified candidates and will accommodate applicantsneeds under the respective laws throughout all stages of the recruitment and selection process

Posted 1 month ago

Apply

12.0 years

6 - 9 Lacs

Hyderābād

On-site

Our vision is to transform how the world uses information to enrich life for all . Micron Technology is a world leader in innovating memory and storage solutions that accelerate the transformation of information into intelligence, inspiring the world to learn, communicate and advance faster than ever. Principal / Senior Systems Performance Engineer Micron Data Center and Client Workload Engineering in Hyderabad, India, is seeking a senior/principal engineer to join our dynamic team. The successful candidate will primarily contribute to the ML development, ML DevOps, HBM program in the data center by analyzing how AI/ML workloads perform on the latest MU-HBM, Micron main memory, expansion memory and near memory (HBM/LP) solutions, conduct competitive analysis, showcase the benefits that workloads see with MU-HBM’s capacity / bandwidth / thermals, contribute to marketing collateral, and extract AI/ML workload traces to help optimize future HBM designs. Job Responsibilities: The Job Responsibilities include but are not limited to the following: Design, implement, and maintain scalable & reliable ML infrastructure and pipelines. Collaborate with data scientists and ML engineers to deploy machine learning models into production environments. Automate and optimize ML workflows, including data preprocessing, model training, evaluation, and deployment. Monitor and manage the performance, reliability, and scalability of ML systems. Troubleshoot and resolve issues related to ML infrastructure and deployments. Implement and manage distributed training and inference solutions to enhance model performance and scalability. Utilize DeepSpeed, TensorRT, vLLM for optimizing and accelerating AI inference and training processes. Understand key care abouts when it comes to ML models such as: transformer architectures, precision, quantization, distillation, attention span & KV cache, MoE, etc. Build workload memory access traces from AI models. Study system balance ratios for DRAM to HBM in terms of capacity and bandwidth to understand and model TCO. Study data movement between CPU, GPU and the associated memory subsystems (DDR, HBM) in heterogeneous system architectures via connectivity such as PCIe/NVLINK/Infinity Fabric to understand the bottlenecks in data movement for different workloads. Develop an automated testing framework through scripting. Customer engagements and conference presentations to showcase findings and develop whitepapers. Requirements: Strong programming skills in Python and familiarity with ML frameworks such as TensorFlow, PyTorch, or scikit-learn. Experience in data preparation: cleaning, splitting, and transforming data for training, validation, and testing. Proficiency in model training and development: creating and training machine learning models. Expertise in model evaluation: testing models to assess their performance. Skills in model deployment: launching server, live inference, batched inference Experience with AI inference and distributed training techniques. Strong foundation in GPU and CPU processor architecture Familiarity with and knowledge of server system memory (DRAM) Strong experience with benchmarking and performance analysis Strong software development skills using leading scripting, programming languages and technologies (Python, CUDA, C, C++) Familiarity with PCIe and NVLINK connectivity Preferred Qualifications: Experience in quickly building AI workflows: building pipelines and model workflows to design, deploy, and manage consistent model delivery. Ability to easily deploy models anywhere: using managed endpoints to deploy models and workflows across accessible CPU and GPU machines. Understanding of MLOps: the overarching concept covering the core tools, processes, and best practices for end-to-end machine learning system development and operations in production. Knowledge of GenAIOps: extending MLOps to develop and operationalize generative AI solutions, including the management of and interaction with a foundation model. Familiarity with LLMOps: focused specifically on developing and productionizing LLM-based solutions. Experience with RAGOps: focusing on the delivery and operation of RAGs, considered the ultimate reference architecture for generative AI and LLMs. Data management: collect, ingest, store, process, and label data for training and evaluation. Configure role-based access control; dataset search, browsing, and exploration; data provenance tracking, data logging, dataset versioning, metadata indexing, data quality validation, dataset cards, and dashboards for data visualization. Workflow and pipeline management: work with cloud resources or a local workstation; connect data preparation, model training, model evaluation, model optimization, and model deployment steps into an end-to-end automated and scalable workflow combining data and compute. Model management: train, evaluate, and optimize models for production; store and version models along with their model cards in a centralized model registry; assess model risks, and ensure compliance with standards. Experiment management and observability: track and compare different machine learning model experiments, including changes in training data, models, and hyperparameters. Automatically search the space of possible model architectures and hyperparameters for a given model architecture; analyze model performance during inference, monitor model inputs and outputs for concept drift. Synthetic data management: extend data management with a new native generative AI capability. Generate synthetic training data through domain randomization to increase transfer learning capabilities. Declaratively define and generate edge cases to evaluate, validate, and certify model accuracy and robustness. Embedding management: represent data samples of any modality as dense multi-dimensional embedding vectors; generate, store, and version embeddings in a vector database. Visualize embeddings for improvised exploration. Find relevant contextual information through vector similarity search for RAGs. Education: Bachelor’s or higher (with 12+ years of experience) in Computer Science or related field. About Micron Technology, Inc. We are an industry leader in innovative memory and storage solutions transforming how the world uses information to enrich life for all . With a relentless focus on our customers, technology leadership, and manufacturing and operational excellence, Micron delivers a rich portfolio of high-performance DRAM, NAND, and NOR memory and storage products through our Micron® and Crucial® brands. Every day, the innovations that our people create fuel the data economy, enabling advances in artificial intelligence and 5G applications that unleash opportunities — from the data center to the intelligent edge and across the client and mobile user experience. To learn more, please visit micron.com/careers All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status. To request assistance with the application process and/or for reasonable accommodations, please contact hrsupport_india@micron.com Micron Prohibits the use of child labor and complies with all applicable laws, rules, regulations, and other international and industry labor standards. Micron does not charge candidates any recruitment fees or unlawfully collect any other payment from candidates as consideration for their employment with Micron. AI alert : Candidates are encouraged to use AI tools to enhance their resume and/or application materials. However, all information provided must be accurate and reflect the candidate's true skills and experiences. Misuse of AI to fabricate or misrepresent qualifications will result in immediate disqualification. Fraud alert: Micron advises job seekers to be cautious of unsolicited job offers and to verify the authenticity of any communication claiming to be from Micron by checking the official Micron careers website in the About Micron Technology, Inc.

Posted 1 month ago

Apply

1.0 years

15 Lacs

Mumbai

On-site

Job Title : C++ Developer Duration : 1-year contractual position Experience Range : 5 to 8 years Notice Period : Within 20 days Location : Kandivali, Mumbai (Only local candidates of Mumbai are acceptable) Education : B.Tech, B.E Interview Process : 1st- Technical, 2nd - Technical round & 3rd - HR Round Mandatory : End-to-end C++ skills Skills Required : - C, C++ - Qt/QML - OOPs - STL, Data Structures - JavaScript - Automotive Product Development - Android Application Development - Java - API - GitLab CI/CD - GitHub, Gerrit - Jira, Zoho - PostgreSQL, SQLite, JSON - MVVM Architecture - Testing - Debugging - Linux, Unix Job Description : We are seeking an experienced Developer with a strong background in C++, CUDA programming, and Linux to guide our development team in building cutting-edge solutions for device integration and high-performance computing tasks. This is a hands-on leadership position that combines technical expertise with team management skills to deliver high-quality software products. Primary responsibilities : Software Development : - Develop and maintain high-performance applications using C++ and CUDA. - Design and implement parallel algorithms for GPUs to accelerate computational workloads. Performance Optimization : - Optimize CUDA kernels for performance, scalability, and memory efficiency. - Analyze performance bottlenecks and propose innovative solutions. Code Review and Testing : - Conduct code reviews to ensure adherence to coding standards and best practices. - Develop and execute test cases to validate functionality and performance. Collaboration : - Work closely with the software engineering and research teams to understand requirements and deliver robust solutions. - Provide technical guidance and mentoring to junior team members when necessary. Documentation : - Write and maintain technical documentation, including design specifications and user manuals. Required Skills : - C++ : Strong proficiency in modern C++ (C++11/14/17/20). - CUDA Programming : Extensive experience in developing, debugging, and optimizing CUDA applications. - GPU Optimization : Familiarity with memory hierarchy, shared memory, streams, and warp-level operations in CUDA. - Parallel Computing : Solid understanding of parallel algorithms and multi-threaded programming. - Mathematical and Analytical Skills : Strong foundation in linear algebra, calculus, and numerical methods. - Tools : Experience with debugging/profiling tools like Nsight, CUDA Memcheck, or similar. Send your CVs to hr@basebiz.in for faster screening Job Types: Full-time, Contractual / Temporary Contract length: 12 months Pay: Up to ₹1,500,000.00 per year Benefits: Health insurance Provident Fund Schedule: Day shift Fixed shift Monday to Friday Morning shift Night shift Application Question(s): This is a one-year contractual job. Are you okay with it? Are you in Mumbai? How many years of end-to-end experience in C++ experience do you have? C++: Strong proficiency in modern C++ (C++11/14/17/20) CUDA Programming: Extensive experience in developing, debugging, and optimizing CUDA applications GPU Optimization: Familiarity with memory hierarchy, shared memory, streams, and warp-level operations in CUDA Parallel Computing: Solid understanding of parallel algorithms and multi-threaded programming. Tools: Experience with debugging/profiling tools like Nsight, CUDA Memcheck, or similar. What is your Notice period? What is your CCTC? What is your ECTC (Max salary offered is 15 LPA)? Write in your response, which of the following skills do you not have? ● C, C++ ● Qt/QML ● OOPs ● STL, Data Structures ● JavaScript ● Automotive Product Development ● Android Application Development ● Java ● API ● GitLab CI/CD ● GitHub, Gerrit ● Jira, Zoho ● PostgreSQL, SQLite, JSON ● MVVM Architecture ● Testing ● Debugging ● Linux, Unix Work Location: In person Application Deadline: 23/07/2025 Expected Start Date: 21/07/2025

Posted 1 month ago

Apply

4.0 - 5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Responsibilities From Research to Reality: You'll be the bridge between cutting-edge academic research (activity recognition, multi-camera tracking, Video Language Models) and deployable, production-grade features on https://deeplabel.app . Building Solutions: Design, develop, and optimize Deep Learning models for human activity analysis in videos, from initial concept to final deployment. System Ownership: Take charge of your modules, ensuring they're robust, efficient, and seamlessly integrated with our AI platform, collaborating closely with our full-stack team. Requirements 4-5 years of industry experience, with 2-3 years specifically in practical, project-driven Deep Learning in the Video AI domain. Rock-solid Python coding skills. Deep practical knowledge of PyTorch and ONNX. Proven track record of deploying data pipelines for Computer Vision projects. The ability to independently set up, troubleshoot, and optimize Linux workstations (CUDA, OpenCV). A strong grasp of Deep Learning concepts (optimizers, attention, masking, model tuning). Demonstrated experience with activity detection or object detection implementations. A keen ability to read and implement new approaches from research papers. Your Personality Is Just As Important You're incredibly curious and love to experiment. You have an unwavering commitment to learning and overcoming challenges. You're a great communicator and thrive in a collaborative environment. You take full ownership of your work, seeing it through to successful deployment. This job was posted by Vinay Ts from Streamingo.

Posted 1 month ago

Apply

8.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

About the team: The AI Core team is a small and nimble team focused on AI innovation, experimental modelling, and scientific validation. Its mission is to build highly efficient, adaptable and efficacious large models for multiple general-purpose applications in chemical metrology. To stay aligned with the mission, the team follows advancements in AI, including next-generation deep learning architectures, autonomous agents, large-scale optimization algorithms, reinforcement learning methodologies, and innovative data simulation techniques. Machine Learning Scientist The Machine Learning Scientist is critical to the AI Core team by creating AI models from the ground up and steering model development through every stage, leading to the fulfilment of the research objective. Responsibilities: Experiment with advanced architectures like CNNs, RNNs, transformers, autoencoders etc. appropriate for the research objective Develop training strategies (e.g., self-supervised learning, few-shot learning) - Optimize loss functions and metrics for performance, manage hyperparameter tuning - Optimize training pipelines and debug training failures - Develop reproducible training/evaluation pipelines. Skills: Expertise in PyTorch/TensorFlow or other frameworks Strong Python skills (NumPy, SciPy, scikit-learn) and GPU acceleration (CUDA, cuDNN) Experience with ML experiment tracking (W&B, MLflow etc.) Experience with RL frameworks (Stable Baselines3, Ray RLlib) Passion: Building AI agents with superior capabilities Qualifications: Bachelor's or Master's degree in Data Science, Computer Science, or a related technical field. 8+ years of experience in machine learning and deep learning Prior roles within AI research teams Background in chemometrics, spectroscopy, or analytical chemistry, desirable Advantage: Networks involving very large matrix operations About Picarro: We are the world's leader in timely, trusted, and actionable data using enhanced optical spectroscopy. Our solutions are used in a wide variety of applications, including natural gas leak detection, ethylene oxide emissions monitoring, semiconductor fabrication, pharmaceutical, petrochemical, atmospheric science, air quality, greenhouse gas measurements, food safety, hydrology, ecology, and more. Our software and hardware are designed and manufactured in Santa Clara, California and are used in over 90 countries worldwide based on over 65 patents related to cavity ring-down spectroscopy (CRDS) technology and are unparalleled in their precision, ease of use, and reliability. All qualified applicants will receive consideration for employment without regard to race, sex, color, religion, national origin, protected veteran status, gender identity, social orientation, nor on the basis of disability. Posted positions are not open to third party recruiters/agencies and unsolicited resume submissions will be considered free referrals. If you are an individual with a disability and require reasonable accommodation to complete any part of the application process or are limited in the ability or unable to access or use this online application process and need an alternative method for applying, you may contact Picarro, Inc. at disabilityassistance@picarro.com for assistance.

Posted 1 month ago

Apply

5.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Who We Are Applied Materials is the global leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. We design, build and service cutting-edge equipment that helps our customers manufacture display and semiconductor chips – the brains of devices we use every day. As the foundation of the global electronics industry, Applied enables the exciting technologies that literally connect our world – like AI and IoT. If you want to work beyond the cutting-edge, continuously pushing the boundaries of science and engineering to make possible the next generations of technology, join us to Make Possible® a Better Future. What We Offer Location: Bangalore,IND, Chennai,IND At Applied, we prioritize the well-being of you and your family and encourage you to bring your best self to work. Your happiness, health, and resiliency are at the core of our benefits and wellness programs. Our robust total rewards package makes it easier to take care of your whole self and your whole family. We’re committed to providing programs and support that encourage personal and professional growth and care for you at work, at home, or wherever you may go. Learn more about our benefits. You’ll also benefit from a supportive work culture that encourages you to learn, develop and grow your career as you take on challenges and drive innovative solutions for our customers. We empower our team to push the boundaries of what is possible—while learning every day in a supportive leading global company. Visit our Careers website to learn more about careers at Applied. Technical Lead - Software About Applied Applied Materials is the leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. Our expertise in modifying materials at atomic levels and on an industrial scale enables customers to transform possibilities into reality. At Applied Materials, our innovations make possible the technology shaping the future. Our Team Our team is developing a high-performance computing solution for low-latency and high throughput image processing and deep-learning workloads that will enable our Chip Manufacturing process control equipment to offer differentiated value to our customers. Your Opportunity As a technical lead, you will get the opportunity to grow in the field of high-performance computing, complex system design and low-level optimizations for better cost of ownership. Roles and Responsibility As a technical lead, you will be responsible for designing and implementing High performance computing software solutions for our organization. You will work closely with cross-functional teams, including software engineers, product managers, and business stakeholders, to understand requirements and translate them into architectural/software designs that meet business needs. You will be a subject Matter expert to unblock software engineers in the HPC domain. You will be expected to profile systems to understand bottlenecks, optimize workflows and code and processes to improve cost of ownership. Identify and mitigate technical risks and issues throughout the software development lifecycle. Lead the design and implementation of complex software components and systems. Ensure that software systems are scalable, reliable, and maintainable. Mentor and coach junior software engineers. Your primary focus will be on implementing features of high quality with maintainable and extendable code following software development best practices Our Ideal Candidate Someone who has the drive and passion to learn quickly, has the ability to multi-task and switch contexts based on business needs. Qualifications 5 to 10 years of experience in Design and coding in C/C++ preferably in Linux Environment. Very good knowledge of Data structures, Algorithms and Complexity analysis. In depth experience in Multi-threading, Thread Synchronization, Inter process communication, and Distributed computing fundamentals. Very Good knowledge of Operating systems internals (Linux Preferred), Networking and Storage systems. Experience in performance profiling at application and system level (e.g. vtune, Oprofiler, perf, Nividia Nsight etc.) Experience in low level code optimization techniques using Vectorization and Intrinsics, cache-aware programming, lock free data structures etc. Excellent problem-solving and analytical skills. Strong communication and collaboration abilities. Ability to mentor and coach junior team members. Experience in Agile development methodologies. Additional Qualifications Experience in GPU programming using CUDA, OpenMP, OpenACC, OpenCL etc. Good Knowledge of Work-flow orchestration Software like Apache Airflow, Apache Spark, Apache storm or Intel TBB flowgraph etc. Experience in developing Distributed High Performance Computing software using Parallel programming frameworks like MPI, UCX etc. Experience in HPC Job-Scheduling and Cluster Management Software (SLURM, Torque, LSF etc.) Good knowledge of Low-latency and high-throughput data transfer technologies (RDMA, RoCE, InfiniBand) Familiarity with microservices architecture and containerization technologies (docker/singularity) and low latency Message queues. Experience in Java and Python programming Education Bachelor's Degree or higher in Computer science or related Disciplines. Applied Materials is committed to diversity in its workforce including Equal Employment Opportunity for Minorities, Females, Protected Veterans and Individuals with Disabilities. Additional Information Time Type: Full time Employee Type Assignee / Regular Travel Yes, 10% of the Time Relocation Eligible: Yes Applied Materials is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, ancestry, religion, creed, sex, sexual orientation, gender identity, age, disability, veteran or military status, or any other basis prohibited by law.

Posted 1 month ago

Apply

5.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Who We Are Applied Materials is the global leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. We design, build and service cutting-edge equipment that helps our customers manufacture display and semiconductor chips – the brains of devices we use every day. As the foundation of the global electronics industry, Applied enables the exciting technologies that literally connect our world – like AI and IoT. If you want to work beyond the cutting-edge, continuously pushing the boundaries of science and engineering to make possible the next generations of technology, join us to Make Possible® a Better Future. What We Offer Location: Bangalore,IND, Chennai,IND At Applied, we prioritize the well-being of you and your family and encourage you to bring your best self to work. Your happiness, health, and resiliency are at the core of our benefits and wellness programs. Our robust total rewards package makes it easier to take care of your whole self and your whole family. We’re committed to providing programs and support that encourage personal and professional growth and care for you at work, at home, or wherever you may go. Learn more about our benefits. You’ll also benefit from a supportive work culture that encourages you to learn, develop and grow your career as you take on challenges and drive innovative solutions for our customers. We empower our team to push the boundaries of what is possible—while learning every day in a supportive leading global company. Visit our Careers website to learn more about careers at Applied. About Applied Applied Materials is the leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. Our expertise in modifying materials at atomic levels and on an industrial scale enables customers to transform possibilities into reality. At Applied Materials, our innovations make possible the technology shaping the future. What You’ll Do You will be responsible leading research, design, development and implementation of algorithmic modules. You will perform algorithmic concept and feasibility for image processing algorithms, including problem analysis, data gathering, literature review, concept selection and evaluation and implementation constrains. Your Key Responsibilities Will Be Demonstrates understanding of computer vision and machine learning / deep learning algorithms for image analysis, pattern recognition, metrology, and object detection. Prototyping algorithms approaches to solve complex problems and drive integration of new algo solutions to product. Algorithm optimization to reduce computational cost and influence hardware requirements. Experience with machine learning frameworks such as TensorFlow, PyTorch, or Keras, especially in the context of image analysis. Knowledge of GPU programming using CUDA or OpenCL. You will be a great fit if you have Strong background in computer vision, image processing, and DL technology Expertise in C/C++, Python, MATLAB and working experience with Tensor Flow, Pytorch or similar framework Academic background in Computer Vision; Deep Learning, Machine Learning, or Artificial Intelligence. Experience working in Semiconductor industry. Excellent analytical, problem-solving, and organizational skills, along with strong interpersonal communication skills. Experience in managing teams, sets organizational priorities and allocates resources. Ph.D. or Master’s in computer science/engineering or similar fields with 5+ years of experience Education Master’s / Research background- Qualification preferred – Image/ Signal Processing, Computer Vison, Deep Learning Years Of Experience 5- 10 Years Applied Materials is committed to diversity in its workforce including Equal Employment Opportunity for Minorities, Females, Protected Veterans and Individuals with Disabilities. Additional Information Time Type: Full time Employee Type Assignee / Regular Travel Yes, 10% of the Time Relocation Eligible: Yes Applied Materials is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, ancestry, religion, creed, sex, sexual orientation, gender identity, age, disability, veteran or military status, or any other basis prohibited by law.

Posted 1 month ago

Apply

7.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Who We Are Applied Materials is the global leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. We design, build and service cutting-edge equipment that helps our customers manufacture display and semiconductor chips – the brains of devices we use every day. As the foundation of the global electronics industry, Applied enables the exciting technologies that literally connect our world – like AI and IoT. If you want to work beyond the cutting-edge, continuously pushing the boundaries of science and engineering to make possible the next generations of technology, join us to Make Possible® a Better Future. What We Offer Location: Bangalore,IND, Chennai,IND At Applied, we prioritize the well-being of you and your family and encourage you to bring your best self to work. Your happiness, health, and resiliency are at the core of our benefits and wellness programs. Our robust total rewards package makes it easier to take care of your whole self and your whole family. We’re committed to providing programs and support that encourage personal and professional growth and care for you at work, at home, or wherever you may go. Learn more about our benefits. You’ll also benefit from a supportive work culture that encourages you to learn, develop and grow your career as you take on challenges and drive innovative solutions for our customers. We empower our team to push the boundaries of what is possible—while learning every day in a supportive leading global company. Visit our Careers website to learn more about careers at Applied. Software Architect About Applied Applied Materials is the leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. Our expertise in modifying materials at atomic levels and on an industrial scale enables customers to transform possibilities into reality. At Applied Materials, our innovations make possible the technology shaping the future. Our Team Our team is developing a high-performance computing solution for low-latency and high throughput image processing and deep-learning workload that enables our Chip Manufacturing process control equipment to offer differentiated value to our customers. Your Opportunity As an architect, you will get the opportunity to grow in the field of high-performance computing, complex system design and low-level optimizations for better cost of ownership. Roles and Responsibility As a Software Architect, you will be responsible for designing and implementing High performance computing software solutions for our organization. You will work closely with cross-functional teams, including software engineers, product managers, and business stakeholders, to understand requirements and translate them into architectural/software designs that meet business needs. You will be coding and developing quick prototypes to establish your design with real code and data. You will be a subject Matter expert to unblock software engineers in the HPC domain. You will be expected to profile systems to understand bottlenecks, optimize workflows and code and processes to improve cost of ownership. Conduct technical reviews and provide guidance to software engineers during the development process. Identify and mitigate technical risks and issues throughout the software development lifecycle. Evaluate and recommend appropriate technologies and frameworks to meet project requirements. Lead the design and implementation of complex software components and systems. Ensure that software systems are scalable, reliable, and maintainable. Mentor and coach junior software architects and engineers. Your primary focus will be on ensuring that the software systems are scalable, reliable, maintainable and cost effective. Our Ideal Candidate Someone who has the drive and passion to learn quickly, has the ability to multi-task and switch contexts based on business needs. Qualifications 7 to 15 years of experience in Design and coding in C/C++ preferably in Linux Environment. Very good knowledge Data structure and Algorithms and complexity analysis. Experience in developing Distributed High Performance Computing software using Parallel programming frameworks like MPI, UCX etc. In depth experience in Multi-threading, Thread Synchronization, Inter process communication, and distributed computing fundamentals. Very Good knowledge of Computer science fundamentals like, Operating systems internals (Linux Preferred), Networking and Storage systems. Experience in performance profiling at application and system level (e.g. vtune, Oprofiler, perf, Nividia Nsight etc.) Experience in low level code optimization techniques using Vectorization and Intrinsics, cache-aware programming, lock free data structures etc. Experience in GPU programming using CUDA, OpenMP, OpenACC, OpenCL etc. Familiarity with microservices architecture and containerization technologies (docker/singularity) and low latency Message queues. Excellent problem-solving and analytical skills. Strong communication and collaboration abilities. Ability to mentor and coach junior team members. Experience in Agile development methodologies. Additional Qualifications Experience in HPC Job-Scheduling and Cluster Management Software (SLURM, Torque, LSF etc.) Good knowledge of Low-latency and high-throughput data transfer technologies (RDMA, RoCE, InfiniBand) Good Knowledge of Work-flow orchestration Software like Apache Airflow, Apache Spark, Apache storm or Intel TBB flowgraph etc. Education Bachelor's Degree or higher in Computer science or related Disciplines. Years Of Experience 7 - 15 Years Additional Information Time Type: Full time Employee Type Assignee / Regular Travel Yes, 10% of the Time Relocation Eligible: Yes Applied Materials is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, ancestry, religion, creed, sex, sexual orientation, gender identity, age, disability, veteran or military status, or any other basis prohibited by law.

Posted 1 month ago

Apply

5.0 - 10.0 years

11 - 16 Lacs

Gurugram

Work from Office

Looking for challenging roleIf you really want to make a difference - make it with us Can we energize society and fight climate change at the same time At Siemens Energy, we can. Our technology is key, but our people make the difference. Brilliant minds innovate. They connect, create, and keep us on track towards changing the worlds energy systems. Their spirit fuels our mission. We are seeking a highly skilled and driven Senior AI Engineer to join our team as a founding member, developing the critical data and AI infrastructure for training foundation models for power grid applications. You will be instrumental in building and optimizing the end-to-end systems, data pipelines, and training processes that will power our AI research. Working closely with research scientists, you will translate cutting-edge research into robust, scalable, and efficient implementations, enabling the rapid development and deployment of transformational AI solutions. This role requires deep hands-on expertise in distributed training, data engineering, MLOps, a proven track record of building scalable AI infrastructure. Your new role- challenging and future- oriented Design, build, and rigorously optimize everything necessary for large-scale training, fine-tuning and/or inference with different model architectures. Includes the complete stack from dataloading to distributed training to inference; to maximize the MFU (Model Flop Utilization) on the compute cluster. Collaborate closely and proactively with research scientists, translating research models and algorithms into high-performance, production-ready code and infrastructure. Ability to implement, integrate & test latest advancements from research publications or open-source code. Relentlessly profile and resolve training performance bottlenecks, optimizing every layer of the training stack from data loading to model inference for speed and efficiency. Contribute to technology evaluations and selection of hardware, software, and cloud services that will define our AI infrastructure platform. Experience with MLOps frameworks (MLFlow, WnB, etc) to implement best practices across the model lifecycle- development, training, validation, and monitoring- ensuring reproducibility, reliability, and continuous improvement. Create thorough documentation for infrastructure, data pipelines, and training procedures, ensuring maintainability and knowledge transfer within the growing AI lab. Stay at the forefront of advancements in large-scale training strategies and data engineering and proactively driving improvements and innovation in our workflows and infrastructure. High-agency individual demonstrating initiative, problem-solving, and a commitment to delivering robust and scalable solutions for rapid prototyping and turnaround. We dont need superheroes, just super minds Bachelor's or masters degree in computer science, Engineering, or a related technical field. 5+ years of hands-on experience in a role specifically building and optimizing infrastructure for large-scale machine learning systems Deep practical expertise with AI frameworks (PyTorch, Jax, Pytorch Lightning, etc). Hands-on experience with large-scale multi-node GPU training, and other optimization strategies for developing large foundation models, across various model architectures. Ability to scale solutions involving large datasets and complex models on distributed compute infrastructure. Excellent problem-solving, debugging, and performance optimization skills, with a data-driven approach to identifying and resolving technical challenges. Strong communication and teamwork skills, with a collaborative approach to working with research scientists and other engineers. Experience with MLOps best practices for model tracking, evaluation and deployment. Desired skills Public GitHub profile demonstrating a track record of open-source contributions to relevant projects in data engineering or deep learning infrastructure is a BIG PLUS. Experience with performance monitoring and profiling tools for distributed training and data pipelines. Experience writing CUDA/Triton/CUTLASS kernels.

Posted 1 month ago

Apply

0.0 - 5.0 years

4 - 8 Lacs

Bengaluru

Work from Office

Research Engineer position at IBM India Research Lab is a challenging, dynamic and highly innovative role. Some of our current areas of work where we are actively looking for top talent are: Optimized runtime stacks for foundation model workloads including fine-tuning, inference serving and large-scale data engineering, with a focus on multi-stage tuning including reinforcement learning, inference-time compute, and data preparation needs for complex AI systems. Optimizing models to run on multiple accelerators including IBM’s AIU accelerator leveraging compiler optimizations, specialized kernels, libraries and tools. Developing use cases that effectively leverage the infrastructure and models to deliver value Pre-training language and multi-modal foundation models working with large scale distributed training procedures, model alignment, creating specialized pipelines for various tasks including effective LLM-generated data pipelines, creating frameworks for collecting human data and deploying models in user-centric platforms. Required education Bachelor's Degree Preferred education Master's Degree Required technical and professional expertise You should have one or more of the following: A master’s degree in computer science, AI or related fields from a top institution 0-8 years of experience working with modern ML techniques including but not limited to model architectures, data processing, fine-tuning techniques, reinforcement learning, distributed training, inference optimizations Experience with big data platforms like Ray and Spark Experience working with Pytorch FSDP and HuggingFace libraries Programming experience in one of the followingPython, web development technologies Growth mindset and a pragmatic attitude Preferred technical and professional experience Peer-reviewed research at top machine learning or systems conferences Experience working with pytorch.compile, CUDA, triton kernels, GPU scheduling, memory management Experience working with open-source communities

Posted 1 month ago

Apply

0.0 - 5.0 years

7 - 11 Lacs

Bengaluru

Work from Office

Research Scientist position at IBM India Research Lab is a challenging, dynamic and highly innovative role, where you will be responsible for coming up with new innovative ideas, developing solutions working as a team, building prototypes, publishing research papers and demonstrating the value of your ideas in an enterprise setting. Some of our current areas of work where we are actively looking for top researchers are: Optimized runtime stacks for foundation model workloads including fine-tuning, inference serving and large-scale data engineering, with a focus on multi-stage tuning including reinforcement learning, inference-time compute, and data preparation needs for complex AI systems. Optimizing models to run on multiple accelerators including IBM’s AIU accelerator leveraging compiler optimizations, specialized kernels, libraries and tools. Innovative use cases that effectively leverage the infrastructure and models to deliver value Pre-training language and multi-modal foundation models working with large scale distributed training procedures, model alignment, and creating specialized pipelines for various tasks including effective LLM-generated data pipelines. Required education Bachelor's Degree Required technical and professional expertise You should have one or more of the following: A master’s degree in computer science, AI or related fields from a top institution 0-8 years of experience working with modern ML techniques including but not limited to model architectures, data processing, fine-tuning techniques, reinforcement learning, distributed training, inference optimizations Experience with big data platforms like Ray and Spark Experience working with Pytorch FSDP and HuggingFace libraries Programming experience in one of the followingPython, web development technologies Growth mindset and a pragmatic attitude Preferred technical and professional experience Peer-reviewed research at top machine learning or systems conferences Experience working with pytorch.compile, CUDA, triton kernels, GPU scheduling, memory management Experience working with open-source communities

Posted 1 month ago

Apply

7.0 - 9.0 years

37 - 40 Lacs

Ahmedabad, Bengaluru, Mumbai (All Areas)

Work from Office

Dear Candidate, We are hiring a Computer Vision Engineer to develop AI-driven solutions for image recognition, object detection, and video analysis. The role requires expertise in deep learning, computer vision algorithms, and real-time processing techniques. Key Responsibilities: Develop and optimize computer vision models using OpenCV, TensorFlow, and PyTorch. Implement object detection, segmentation, and facial recognition algorithms. Process and analyze large-scale image and video datasets. Optimize deep learning models for real-time inference on edge devices. Collaborate with AI and software teams to integrate vision solutions into applications. Required Skills & Qualifications: Computer Vision Frameworks: OpenCV, DLIB, MediaPipe Deep Learning: TensorFlow, PyTorch, Keras Algorithms: CNNs, YOLO, Faster R-CNN, Mask R-CNN Programming: Python, C++, CUDA Edge AI: TensorRT, OpenVINO, NVIDIA Jetson Experience with autonomous systems, OCR, and SLAM is a plus. Soft Skills: Strong troubleshooting and problem-solving skills. Ability to work independently and in a team. Excellent communication and documentation skills. Note: If interested, please share your updated resume and preferred time for a discussion. If shortlisted, our HR team will contact you. Kandi Srinivasa Reddy Delivery Manager Integra Technologies

Posted 1 month ago

Apply

5.0 - 8.0 years

12 - 16 Lacs

Chennai, Bengaluru

Work from Office

Who We Are Applied Materials is the global leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. We design, build and service cutting-edge equipment that helps our customers manufacture display and semiconductor chips the brains of devices we use every day. As the foundation of the global electronics industry, Applied enables the exciting technologies that literally connect our world like AI and IoT. If you want to work beyond the cutting-edge, continuously pushing the boundaries ofscience and engineering to make possiblethe next generations of technology, join us to Make Possible a Better Future. What We Offer Location: Bangalore,IND, Chennai,IND At Applied, we prioritize the well-being of you and your family and encourage you to bring your best self to work. Your happiness, health, and resiliency are at the core of our benefits and wellness programs. Our robust total rewards package makes it easier to take care of your whole self and your whole family. Were committed to providing programs and support that encourage personal and professional growth and care for you at work, at home, or wherever you may go. Learn more about our benefits . Youll also benefit from a supportive work culture that encourages you to learn, develop and grow your career as you take on challenges and drive innovative solutions for our customers.We empower our team to push the boundaries of what is possiblewhile learning every day in a supportive leading global company. Visit our Careers website to learn more about careers at Applied. Technical Lead - Software About Applied Applied Materials is the leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. Our expertise in modifying materials at atomic levels and on an industrial scale enables customers to transform possibilities into reality. At Applied Materials, our innovations make possible the technology shaping the future. Our Team Our team is developing a high-performance computing solution for low-latency and high throughput image processing and deep-learning workloads that will enable our Chip Manufacturing process control equipment to offer differentiated value to our customers. Your Opportunity As a technical lead, you will get the opportunity to grow in the field of high-performance computing, complex system design and low-level optimizations for better cost of ownership. Roles and Responsibility As a technical lead, you will be responsible for designing and implementing High performance computing software solutions for our organization. You will work closely with cross-functional teams, including software engineers, product managers, and business stakeholders, to understand requirements and translate them into architectural/software designs that meet business needs. You will be a subject Matter expert to unblock software engineers in the HPC domain. You will be expected to profile systems to understand bottlenecks, optimize workflows and code and processes to improve cost of ownership. Identify and mitigate technical risks and issues throughout the software development lifecycle. Lead the design and implementation of complex software components and systems. Ensure that software systems are scalable, reliable, and maintainable. Mentor and coach junior software engineers. Your primary focus will be on implementing features of high quality with maintainable and extendable code following software development best practices Our Ideal Candidate Someone who has the drive and passion to learn quickly, has the ability to multi-task and switch contexts based on business needs. Qualifications 5 to 10 years of experience in Design and coding in C/C++ preferably in Linux Environment. Very good knowledge of Data structures, Algorithms and Complexity analysis. In depth experience in Multi-threading, Thread Synchronization, Inter process communication, and Distributed computing fundamentals. Very Good knowledge of Operating systems internals (Linux Preferred), Networking and Storage systems. Experience in performance profiling at application and system level (e.g. vtune, Oprofiler, perf, Nividia Nsight etc.) Experience in low level code optimization techniques using Vectorization and Intrinsics, cache-aware programming, lock free data structures etc. Excellent problem-solving and analytical skills. Strong communication and collaboration abilities. Ability to mentor and coach junior team members. Experience in Agile development methodologies. Additional Qualifications: Experience in GPU programming using CUDA, OpenMP, OpenACC, OpenCL etc. Good Knowledge of Work-flow orchestration Software like Apache Airflow, Apache Spark, Apache storm or Intel TBB flowgraph etc. Experience in developing Distributed High Performance Computing software using Parallel programming frameworks like MPI, UCX etc. Experience in HPC Job-Scheduling and Cluster Management Software (SLURM, Torque, LSF etc.) Good knowledge of Low-latency and high-throughput data transfer technologies (RDMA, RoCE, InfiniBand) Familiarity with microservices architecture and containerization technologies (docker/singularity) and low latency Message queues. Experience in Java and Python programming Education : Bachelor's Degree or higher in Computer science or related Disciplines. Applied Materials is committed to diversity in its workforce including Equal Employment Opportunity for Minorities, Females, Protected Veterans and Individuals with Disabilities. Additional Information Time Type: Full time Employee Type: Assignee / Regular Travel: Yes, 10% of the Time Relocation Eligible: Yes Applied Materials is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, ancestry, religion, creed, sex, sexual orientation, gender identity, age, disability, veteran or military status, or any other basis prohibited by law.

Posted 1 month ago

Apply

8.0 - 13.0 years

6 - 10 Lacs

Chandigarh, Dadra & Nagar Haveli, Daman

Work from Office

We have immediate job openings forBoomi SME with SQL 8+ years of overall experience4+ years of experience with Boomi with EDI(X12) Integration experience.Good knowledge of Trading Partner SetupGood knowledge of EDI Validations and compliance checksWorking Knowledge on SQL, Databases (Oracle/SQL Server etc).Experience in setup of Atom/Molecule.Strong understanding on writing complex Business Rules in Business Rule Component.Development experience in Core Java, XML technologiesStrong understanding of using various protocols in Boomi/methods of communication (FTP, SFTP, FTPS, AS2, Database (MySQL, Oracle), JMS, MQSeries, Mail, HTTP/HTTPS)Strong hands-on understanding of scalability, security, high availability, and operational requirementsStrong understanding of Batch Processing, Parallel Processing, Performance analysis and tuning of Interfaces or integrationsExperience with SOAP, Webservices, and REST protocols. Knowledge of using SOAP UI tool. Location - Chandigarh,Dadra & Nagar Haveli,Daman,Diu,Goa,Haveli,Hyderabad,Jammu,Lakshadweep,Nagar,New Delhi,Puducherry,Sikkim

Posted 1 month ago

Apply

6.0 years

0 Lacs

India

Remote

Location: India (Hybrid/Remote Options) About RediMinds We don’t just develop AI—we redefine its boundaries . At RediMinds, we turn theoretical breakthroughs into solutions that revolutionize healthcare, climate resilience, robotics, and human potential. If you’ve ever dreamed of leading research that leaves labs to transform lives, this is your moment. The Role - AI Research Pioneer You’ll lead a nimble, brilliant team of 5–8 researchers to tackle audacious questions: How do we make LLMs diagnose diseases before symptoms appear? Can AI simulate protein folding to cure the incurable? What does ethical superintelligence look like for 8 billion people? This isn’t management—it’s orchestrating a rebellion against the possible . What You’ll Ignite Moonshot Leadership : Direct high-risk/high-reward research in generative AI, neuro-symbolic systems, and causal inference—with resources to match your ambition. Industry-Defining Publications : Publish in Nature , NeurIPS, or ICML while shipping code that runs in hospitals, factories, and farms. Talent Alchemy : Recruit and mentor geniuses (PhDs, rebels, dreamers) into a team that outthinks Google DeepMind. Impact Translation : Partner with RediMinds’ product teams to turn embeddings into empathy, and algorithms into action. Global Thought Leadership : Represent RediMinds at Davos, TED, or the UN—your research will shape policy and humanity’s trajectory. Ideal Profile A PhD in CS, Physics, Neurobiology, Quantum Computing, or any field where you’ve hacked the universe’s rules. 5–6 years leading research at a top-tier lab (OpenAI, DeepMind, MIT, Max Planck) or bleeding-edge tech giant. Published or perished : A track record in venues that make peers gasp (e.g., Science , NeurIPS, CVPR). Engineering grit : Fluency in PyTorch, JAX, HF, CUDA—and battle scars from scaling ideas to petabyte scale. Philosophical depth : You debate AI alignment over chai, and see "ethics" as a technical constraint. Deep experience in one or more of the following domains: LLM training and fine-tuning Knowledge retrieval, vector databases, and RAG pipelines Scientific machine learning (e.g., computational chemistry, physics-informed ML) Vision-Language and Multimodal AI Strong ability to bridge academic theory with real-world AI product development . Excellent communication skills and the ability to inspire, guide, and grow a high-performing research team. Why Choose RediMinds? Build Your Legacy : Lead a team of most ambitious AI researchers from Day 1. Unshackled Resources : No grant applications—just a blank check for compute, talent, and wild creativity. Elite Team: Collaborate with a global network of researchers, product thinkers, and AI engineers across RediMinds’ initiatives. Global Advantage : Shape AI revolution from its epicenter, with proximity to IITs/ISRO and global collaborators. Compensation : Competitive salary in a company valued at tomorrow’s numbers. Relocation support, healthcare for family, and a moonshot budget for conferences, courses, or curiosity. Apply Now If you’re ready to trade incremental papers for planet-scale impact , apply now only if you’re a PhD holder and send us: Your CV (highlight research that bent reality). A 1-page manifesto: “The One Problem I’d Solve with Unlimited Compute.” Links to 2 publications that reveal your intellectual signature. Email to talent@rediminds.com Subject line: “Pioneer Application: [Your Name] - [Your Boldest Idea]” “The universe is not made of atoms—it’s made of courage.” Join us. Rewrite physics.

Posted 1 month ago

Apply

0 years

0 Lacs

Mumbai, Maharashtra, India

On-site

We’re Hiring: AI Artist – Open-Source Generative AI for Advertising 🎨🤖 Studio Skwer is expanding its AI division to push the boundaries of AI-driven content creation for advertising and filmmaking. We are looking for an AI Artist who is passionate about open-source AI models, generative workflows, and cutting-edge creative automation. 📍 Location : Mumbai 📆 Full-time / Contract Who We Are: Studio Skwer is a leading post-production and color grading studio, known for working with top filmmakers, brands, and creative agencies. As we venture deeper into AI-powered storytelling, our AI team will focus on developing advanced tools for creative workflows, transforming advertising and post-production. What You’ll Do: - Research and develop AI-driven tools for ad visuals, animation, and creative workflows - Work with generative AI models like Stable Diffusion, SDXL, and ControlNet - Comfortable working in ComfyUI and building efficient node-based workflows - Build AI-assisted motion graphics, video synthesis, and structured image generation - Fine-tune models for advertising and filmmaking applications - Experiment with multi-modal AI: text-to-image, image-to-video, AI-based 3D modeling - Collaborate with developers, artists, and post teams to bring AI into real-world projects - Stay current with the latest in generative AI, diffusion models, and visual innovation Required Expertise & AI Tools: - Stable Diffusion, Flux, SDXL – For high-quality AI visuals and branding - Deforum, AnimateDiff – For AI-powered video and animation - ControlNet, T2I-Adapter – For structure-aware image generation - RIFE, LCM, Stable Video Diffusion – For AI-driven motion and smooth transitions - DeepFloyd IF – For photorealistic ad campaigns - Meshy AI – For product visualization using AI-generated 3D modeling Technical Skills & Research Capabilities: - Strong foundation in deep learning, diffusion models, and latent consistency techniques - Hands-on experience with PyTorch, TensorFlow, and open-source AI libraries - Familiarity with custom model training (LoRA, DreamBooth, ControlNet) - Understanding of GPU optimization (CUDA/OpenCL) for AI workflows - Ability to interpret and apply AI research to practical creative tools Why Join Studio Skwer’s AI Team? - Be part of a forward-thinking team at the intersection of technology and storytelling - Collaborate with renowned directors, cinematographers, and creative studios - Work on real-world campaigns that reimagine the role of AI in advertising - Access to the latest open-source tools and freedom to experiment 📩 Interested? Apply now by sending your CV, GitHub/portfolio, and research work to amplify@studioskwer.com with the subject: " AI Artist – Studio Skwer ". Let’s innovate the future of AI-powered advertising and filmmaking! #AIResearch #GenerativeAI #AIforAdvertising #StableDiffusion #ComfyUI #CreativeAI #StudioSkwer #WeAreHiring

Posted 1 month ago

Apply

8.0 years

0 Lacs

Noida

On-site

Our Company Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen. We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours! Position Summary: Adobe Firefly Gen AI Models and services group seeks passionate machine learning engineers to deliver groundbreaking generative AI experiences. In this role, you'll: Optimize and scale state-of-the-art generative AI models. Deploy AI-driven creative tools that empower millions globally. Solve real-world challenges in AI scalability and production readiness. Job Responsibilities Help architect and optimize large-scale foundation model pipelines in Generative AI Design and develop the GenAI backend services for Firefly, creating GPU optimized, efficient model pipelines that power the generative AI features on Firefly website, PPro, Photoshop, Illustrator, Express, Stock and other applications/surfaces Collaborate with outstanding Applied researchers and engineers to bring ideas to production Provide technical leadership and mentorship for junior team members Explore and research new and emerging ML and MLOps technologies to continuously improve Adobe’s GenAI engineering effectiveness and efficiency Review and provide feedback on features, technology, architecture, designs and test strategies. What you'll need to succeed Masters or Ph.D. in Computer Science, AI/ML, or related fields or B.Tech and strong experience in AI/ML 8+ years of experience Excellent communication and technical leadership skills Experience in tech working with a large number of contributors on time-sensitive and business-critical GenAI projects Experience in the latest Generative AI technologies, such as GAN, diffusion, transformer models Strong hands-on experience with large-scale GenAI model pipelines and/or shipping ML features Strong collaboration skills. Experience with working in a Matrix organization, driving alignment, drawing conclusions, and getting things done Preferred experience: Experience training or optimizing models (CUDA, Triton, TRT, AOT) Experience converting models from various frameworks like PyTorch and TensorFlow to other target formats to ensure compatibility and optimized performance across different platforms Good publication record in Computer Science, AI/ML, or related fields #FireflyGenAI Adobe is proud to be an Equal Employment Opportunity employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other applicable characteristics protected by law. Learn more. Adobe aims to make Adobe.com accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email accommodations@adobe.com or call (408) 536-3015.

Posted 1 month ago

Apply

10.0 years

0 Lacs

Chennai, Tamil Nadu, India

On-site

Exp : 15yrs to 23yrs Primary skills :- Vision AI Solution, Nvidia, Computer Vision, Media, Open Stack. Key Responsibilities Define and lead the end-to-end technical architecture for vision-based AI systems across edge and cloud. Design and optimize large-scale video analytics pipelines using NVIDIA DeepStream, TensorRT, and Triton Inference Server. Architect distributed AI systems, including model training, deployment, inferencing, monitoring, and continuous learning. Collaborate with product, research, and engineering teams to translate business requirements into scalable AI solutions. Lead efforts in model optimization (quantization, pruning, distillation) for real-time performance on devices like Jetson Orin/Xavier. Drive the integration of multi-modal AI (vision + language, 3D, audio) where applicable. Guide platform choices (e.g., edge AI vs cloud AI trade-offs), ensuring cost-performance balance. Mentor senior engineers and promote best practices in MLOps, system reliability, and AI observability. Stay current with emerging technologies (e.g., NeRF, Diffusion Models, Vision Transformers, synthetic data). Contribute to internal innovation strategy, including IP generation, publications, and external presentations. ________________________________________ 🛠️ Required Technical Skills Deep expertise in computer vision, deep learning, and multi-modal AI. Proven hands-on experience with: NVIDIA Jetson, DeepStream SDK, TensorRT, Triton Inference Server TAO Toolkit, Isaac SDK, CUDA, cuDNN Strong in PyTorch, TensorFlow, OpenCV, GStreamer, and GPU-accelerated pipelines. Experience deploying vision AI models at large scale (e.g., 1000+ cameras/devices or multi-GPU clusters). Skilled in cloud-native ML infrastructure: Docker, Kubernetes, CI/CD, MLflow, Seldon, Airflow Proficiency in Python, C++, CUDA (or PyCUDA), and scripting. Familiar with 3D vision, synthetic data pipelines, and generative models (e.g., SAM, NeRF, Diffusion). Experience in multi modal (LVM/VLM), SLMs, small LVM/ VLM, Time series Gen AI models, Agentic AI, LLMOps/Edge LLMOps, Guardrails, Security in Gen AI, YOLO/Vision Transformers ________________________________________ 🤝 Soft Skills & Leadership 10+ years in AI/ML/Computer Vision, with 8+ years in technical leadership or architect roles Strong leadership skills with experience mentoring technical teams and driving innovation. Excellent communicator with the ability to engage stakeholders across engineering, product, and business. Strategic thinker with a practical mindset—able to balance innovation with production-readiness. Experience interfacing with enterprise customers, researchers, and hardware partners. ________________________________________ 🧩 Preferred Qualifications MS or PhD in Computer Vision, Machine Learning, Robotics, or a related technical field ( Added Advantage ) Experience with NVIDIA Omniverse, Clara, or MONAI for healthcare or simulation environments. Experience in domains like smart cities, robotics, retail analytics, or medical imaging. Contributions to open-source projects or technical publications. Certifications: NVIDIA Jetson Developer, AWS/GCP AI/ML Certifications.

Posted 1 month ago

Apply

5.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

Primary skill :- NVIDIA Solution Architect, GEN / AI Architect, Azure or AWS cloud. Relevant Exp :- NVIDIA ( 2 to 3 yrs ) Location :- Chennai / Noida. As an NVIDIA Generative AI Solution Architect at , you will lead the design, development, and deployment of AI solutions leveraging NVIDIA’s Edge AI, Computer Vision, Generative AI, and Metropolis technologies . You will collaborate with cross-functional teams and customers to architect scalable, high-performance AI systems integrating real-time computer vision, generative AI workflows, and industrial digital twins on edge, cloud, and metaverse platforms. Key Responsibilities Architect and deliver end-to-end AI solutions using NVIDIA’s AI Enterprise software, NeMo framework, Triton Inference Server, and GPU-accelerated platforms. Design and implement AI pipelines optimized for edge devices (NVIDIA Jetson, Clara), cloud infrastructure (AWS, Azure, GCP), and data centers (NVIDIA DGX). Develop and showcase proof-of-concept solutions using large language models (LLMs), retrieval-augmented generation (RAG), and advanced computer vision models for object detection, segmentation, and video analytics. Utilize NVIDIA Metropolis platform capabilities to architect AI-powered video analytics and smart city solutions, leveraging edge-to-cloud pipelines for real-time insights and automation. Optimize AI inference workloads using CUDA, TensorRT, mixed precision, and model quantization to meet stringent latency and throughput SLAs. Collaborate with company engineering, product, and client teams to embed NVIDIA AI technologies into enterprise workflows and industrial applications. Provide technical leadership, training, and mentorship on NVIDIA SDKs, AI best practices, and solution deployment strategies. Stay abreast of NVIDIA’s product roadmap, AI research trends, and industrial AI innovations to drive continuous solution improvement. Support customer engagements including technical workshops, solution demonstrations, and architectural reviews. Ensure adherence to data privacy, security, and ethical AI standards throughout the solution lifecycle. Required Qualifications Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or related technical field. 5+ years of experience architecting and deploying AI/ML solutions with strong expertise in NVIDIA AI platforms (NeMo, Triton, CUDA, TensorRT). Proven experience with generative AI technologies including large language models, prompt engineering, and RAG workflows. Strong background in computer vision applications, including object detection, segmentation, and video analytics frameworks. Hands-on experience deploying AI solutions on edge devices (NVIDIA Jetson, Clara), cloud platforms (Azure, AWS, GCP), and data center GPU infrastructure. Familiarity with NVIDIA Metropolis platform for AI-powered video analytics and smart infrastructure solutions. Proficiency in Python, C++, and deep learning frameworks such as PyTorch or TensorFlow. Experience with container orchestration (Kubernetes, Docker) and MLOps practices including CI/CD pipelines for AI workloads. Excellent communication skills for engaging technical teams and business stakeholders. Willingness to travel up to 15% for client and NVIDIA events. Preferred Skills Experience optimizing AI inference with TensorRT, mixed precision, and model quantization. Knowledge of AI ethics, bias mitigation, and responsible AI principles. Prior experience in industrial, manufacturing, smart cities, or healthcare domains. Certifications related to NVIDIA AI technologies or cloud platforms (AWS, Azure, GCP). Experience working in global, cross-cultural teams.

Posted 1 month ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies