Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
0.6 - 5.0 years
0 Lacs
Noida, Uttar Pradesh, India
On-site
Company: Omnipresent Robot Technologies Pvt. Ltd. Location: Noida Sector-80| Type: Full-Time About Us: Omnipresent Robot Tech Pvt. Ltd. is an innovative startup pushing the boundaries of robotics, drones, and space tech. We recently contributed to ISRO’s Chandrayaan-3 mission by developing the perception and navigation module for the Pragyaan rover. Join our dynamic team to work on satellite-based defense projects and grow your career! Position Overview: We are looking for an AI/ML Engineers for Senior and junior role to assist in the development of AI models and algorithms for our satellite-based defense project. You will work with a skilled team to train, test, and deploy ML models, gaining hands-on experience in cutting-edge AI applications. Key Responsibilities: • Assist in designing and developing AI models using ML/DL techniques. • Implement, test, and fine-tune ML models using popular frameworks (e.g., TensorFlow, PyTorch). • Load and deploy models on embedded platforms (like Jetson Orin NX). • Analyze datasets, preprocess data, and extract features for training. • Support code compatibility and optimization on embedded systems. • Monitor and evaluate model performance, suggesting improvements. • Collaborate with senior engineers to integrate AI models into production environments. • Stay updated on the latest AI trends and apply new techniques to projects. Qualifications: • B.Tech. in Computer Science, IT, or related field. • 0.6-5 years of experience in ML model development. • Proficiency in Python and familiarity with ML frameworks (e.g., TensorFlow, PyTorch, Scikit-learn). • Understanding of data preprocessing, model training, and deployment. • Basic knowledge of GPU acceleration (CUDA) and embedded platforms (Jetson Orin NX). • Familiarity with data processing tools (e.g., NumPy, Pandas). • Strong problem-solving and analytical skills. • Effective communication and team collaboration abilities. Why Join Us? • Be part of high-impact satellite defense projects. • Learn from experts in AI and embedded systems. • Work in a start-up environment that fosters innovation and creativity.
Posted 4 days ago
5.0 years
0 Lacs
Bengaluru, Karnataka, India
Remote
About Company : They balance innovation with an open, friendly culture and the backing of a long-established parent company, known for its ethical reputation. We guide customers from what’s now to what’s next by unlocking the value of their data and applications to solve their digital challenges, achieving outcomes that benefit both business and society. Job Title: C++ Developer Location: Pan India Experience: 5+ yrs. Employment Type: Contract to hire Work Mode: Remote Notice Period: - Immediate joiners Required Skills: 5+ years of overall work experience, with at least 3 years of relevant experience in Python and 2+ years in CUDA/C++. Strong hands-on experience with Python, especially in scientific computing using PyTorch and NumPy. Solid understanding of CUDA programming concepts and C++ fundamentals. Demonstrated ability to analyze CUDA kernels and accurately reproduce them in Python. Familiarity with GPU computation, parallelism, and performance-aware coding practices. Strong debugging skills and attention to numerical consistency when porting logic across languages. Experience evaluating AI-generated code or participating in LLM tuning is a plus.
Posted 4 days ago
5.0 years
0 Lacs
India
Remote
About Company: They balance innovation with an open, friendly culture and the backing of a long-established parent company, known for its ethical reputation. We guide customers from what’s now to what’s next by unlocking the value of their data and applications to solve their digital challenges, achieving outcomes that benefit both business and society. Job Description: Job Title: LLM CUDA/C++ and Python Developer Location: Pan India Experience: 6+ yrs. Employment Type: Contract to hire Work Mode: Remote Notice Period: Immediate joiners Role Overview: This role is part of a project supporting leading LLM companies. The primary objective is to help these foundational LLM companies improve their Large Language Models.We support companies in enhancing their models by offering high-quality proprietary data. This data can be used as a basis for fine-tuning models or as an evaluation set to benchmark the performance. In an SFT data generation workflow, you might have to put together a prompt that contains code and questions, then elaborate model responses, and translate the provided CUDA/C++ code into equivalent Python code using PyTorch and NumPy to replicate the algorithm's behavior.For RLHF data generation, you may need to create a prompt or use one provided by the customer, ask the model questions, and evaluate the outputs generated by different versions of the LLM, comparing it and providing feedback, which is then used in fine-tune processes. Please note that this role does not involve building or fine-tuning LLMs. What does day-to-day look like: ● Translate CUDA/C++ code into equivalent Python implementations using PyTorch and NumPy, ensuring logical and performance parity. ● Analyze CUDA kernels and GPU-accelerated code for structure, efficiency, and function before translation. ● Evaluate LLM-generated translations of CUDA/C++ code to Python, providing technical feedback and corrections. ● Collaborate with prompt engineers and researchers to develop test prompts that reflect real-world CUDA/PyTorch tasks. ● Participate in RLHF workflows, ranking LLM responses and justifying ranking decisions clearly. ● Debug and review translated Python code for correctness, readability, and consistency with industry standards. ● Maintain technical documentation to support reproducibility and code clarity. ● Propose enhancements to prompt structure or conversion approaches based on common LLM failure patterns. Requirements: ● 5+ years of overall work experience, with at least 3 years of relevant experience in Python and 2+ years in CUDA/C++. ● Strong hands-on experience with Python, especially in scientific computing using PyTorch and NumPy. ● Solid understanding of CUDA programming concepts and C++ fundamentals. ● Demonstrated ability to analyze CUDA kernels and accurately reproduce them in Python. ● Familiarity with GPU computation, parallelism, and performance-aware coding practices. ● Strong debugging skills and attention to numerical consistency when porting logic across languages. ● Experience evaluating AI-generated code or participating in LLM tuning is a plus. ● Ability to communicate technical feedback clearly and constructively. ● Fluent in conversational and written English communication skills.
Posted 4 days ago
5.0 years
0 Lacs
India
Remote
About Company: Our client organization's mission is to empower people to participate in global conversations through communities. They are responsible for the consumer-facing application on the Web, Android, and iOS platform. In this role, you'll work with a specific team within this organization to drive related technical & product strategy, operations, architecture, and execution for one of the largest sites in the world. Poster Experience specifically focuses on the user journey, which is the main source of user content for the product. We aim to make it easier, faster, and smarter to create and participate in conversations, and we drive several core product metrics for the entire ecosystem. This specific role will involve migrating legacy Python microservice code to one or more existing Go microservices. Successful candidates have prior experience in these migrations at large scale (think millions of actions per day) and understand how to instrument and monitor their code for parity and consistency during rollout. Job Description: Job Title: Python Developer Location: Pan India Experience: 5+ yrs. Employment Type: Contract to hire Work Mode: Remote Notice Period: - Immediate joiners Roles and Responsibilities: 5+ years of overall work experience, with at least 3 years of relevant experience in Python and 2+ years in CUDA/C++. Strong hands-on experience with Python, especially in scientific computing using PyTorch and NumPy. Solid understanding of CUDA programming concepts and C++ fundamentals. Demonstrated ability to analyze CUDA kernels and accurately reproduce them in Python. Familiarity with GPU computation, parallelism, and performance-aware coding practices. Strong debugging skills and attention to numerical consistency when porting logic across languages. Experience evaluating AI-generated code or participating in LLM tuning is a plus. Ability to communicate technical feedback clearly and constructively. Fluent in conversational and written English communication skills.
Posted 4 days ago
3.0 - 6.0 years
0 Lacs
Hyderabad, Telangana, India
On-site
Greeting from Leadsoc Technologies Position: AI Model Validation Engineer Strong background in machine learning fundamentals, including deep learning,large language models , and recommender systems. Strong background in validation, defect and software development life cycle Strong knowledge on ubuntu / yocto linux Experience working with opensource frameworks such as PyTorch, TensorFlow, and ONNX-Runtime. Experience in profiling ML workloads Prior experience in executing validation plans for AI/ML compute stacks s uch as HIP, CUDA, OpenCL, OpenVINO, Strong background in python programming. Experience:3- 6 years Notice period: 0-15 days Regrads Murali
Posted 4 days ago
5.0 - 9.0 years
0 Lacs
hyderabad, telangana
On-site
We are looking for a highly experienced Voice AI /ML Engineer to take the lead in designing and deploying real-time voice intelligence systems. This position specifically involves working on ASR, TTS, speaker diarization, wake word detection, and developing production-grade modular audio processing pipelines to support next-generation contact center solutions, intelligent voice agents, and high-quality audio systems. You will be operating at the convergence of deep learning, streaming infrastructure, and speech/NLP technology, with a focus on creating scalable, low-latency systems that cater to diverse audio formats and real-world applications. Your responsibilities will include: - Building, fine-tuning, and deploying ASR models such as Whisper, wav2vec2.0, and Conformer for real-time transcription. - Developing high-quality TTS systems using VITS, Tacotron, FastSpeech for natural-sounding voice generation. - Implementing speaker diarization to segment and identify speakers in multi-party conversations using embeddings and clustering techniques. - Designing wake word detection models with ultra-low latency and high accuracy even in noisy conditions. In addition to the above, you will also be involved in: - Architecting bi-directional real-time audio streaming pipelines utilizing WebSocket, gRPC, Twilio Media Streams, or WebRTC. - Integrating voice AI models into live voice agent solutions, IVR automation, and AI contact center platforms. - Building scalable microservices for audio processing, encoding, and streaming across various codecs and containers. - Leveraging deep learning and NLP techniques for speech and language tasks. Furthermore, you will be responsible for: - Developing reusable modules for different voice tasks and system components. - Designing APIs and interfaces for orchestrating voice tasks across multi-stage pipelines. - Writing efficient Python code, optimizing models for real-time inference, and deploying them on cloud platforms. Join us to be part of impactful work, tremendous growth opportunities, and an innovative environment at Tanla, where diversity is championed and inclusivity is valued.,
Posted 6 days ago
7.0 - 11.0 years
0 Lacs
chennai, tamil nadu
On-site
You will be part of the AI/HPC engineering team specializing in platform standardization initiatives, innovation, testing, and optimization of various AI technologies. Your role will involve installation, administration, troubleshooting, and analytical skills in technology stacks such as Linux, Kubernetes, SLUM, Nvidia BCM, and open-source infrastructure tools like Ansible and scripting. As a qualified candidate with a B.E/B.Tech degree and over 7+ years of experience in the IT Infrastructure industry, including 7 to 8 years in HPC and/or AI technology, you should possess a strong knowledge of scripting and Linux, with a minimum of 2 years in Kubernetes. Your responsibilities will include managing, installing, configuring, deploying, troubleshooting, and administrating open-source HPC software like BCM, SLUM, Ansible, and ELK. Additionally, you should have a good grasp of Linux OS with scripting, knowledge in BCM, Nvidia GPU, and Cuda, and experience with Ansible playbook and managing HPC environments. Exposure to Python scripting and familiarity with at least one of the LLM/Generative AI and GPU offerings on public clouds such as AWS, Azure, or Google Cloud will be beneficial. You should also have experience in using DevOps tools for deploying and managing tools like Jenkins, Git, SonarQube, Bugzilla, and Harbor Registry. It would be advantageous if you have knowledge in networking concepts like VLAN, VXLAN, InfiniBand, IP Subnetting, routing, and firewall, as well as in storage technologies such as DDN, Parallel FS, object storage, and NFS. Familiarity with infrastructure components like HP/Dell rack servers and GPU, and management/monitoring tools like Zabbix, Promotus Grafana, and SNow will also be valued.,
Posted 6 days ago
5.0 - 7.0 years
6 - 8 Lacs
Kanpur
Work from Office
* Translate CUDA/C++ code into equivalent Python implementations using PyTorch and NumPy, ensuring logical and performance parity. * Analyze CUDA kernels and GPU-accelerated code for structure, efficiency, and function before translation. Work from home
Posted 6 days ago
4.0 years
0 Lacs
Pune, Maharashtra, India
On-site
We are looking for experienced Systems SW Compiler Engineers for an exciting role in our PTX (Parallel Thread Execution) Compiler Development team. Join the PTX Compiler team and help drive PTX language design and PTX compiler evolution. PTX enables all GPU Computing applications including HPC, Deep Learning and Autonomous Driving. PTX provides a stable programming model and portable instruction set Architecture (ISA) for NVIDIA GPUs and used by all Compute programming languages compiled to NVIDIA GPUs. PTX is also used as a compiler target by various non-NVIDIA compilers. Work with NVIDIA GPU Architecture and CUDA Programming model teams to build abstractions to expose new GPU features in portable and performant ways in PTX ISA. PTX Compiler (PTXAS) apart from implementing PTX ISA is responsible for PTX Compiler Front End, interaction with optimizer and runtime aspects involving object files, debug information, linkers, loaders and Driver Compiler Interface. As a senior member of the team, you will be responsible for leading efforts to enhance PTX Compiler infrastructure to enhance it to support new compilation models for DL and Generative AI codes. You will be contributing towards evolving programming model for Generative AI and DL applications on GPUs. What You Will Be Doing Provide stewardship for PTX ISA and PTX Compiler infrastructure for Generative AI and DL. Collaborating with architecture and programming model teams to design and implement programming models for next generation GPUs. Working closely with others to help design compilation stack and strategies for AI and DL workloads. Collaborate closely with teams developing other related components to ensure compatibility, robustness and high-quality code generation. What We Need To See BS (or equivalent experience), MS or Ph.D. in Computer Science, Computer Engineering, or related fields. 4+ years of experience in the area of compiler front end, programming language designs, Compilers/Linkers. Superb analytical and C/C++ programming skills. Experience in any one area of compiler development including feature support, code generation and compiler infrastructure. Excellent and strong interactive, verbal and written communications skills. Understanding of any Processor ISA (GPU ISA a plus). Good track record of developing, driving and delivering software products. Ways To Stand Out From The Crowd Experience in Programming Languages design and drafting programming language standards. Knowledge of GPU development and compute APIs such as CUDA, and OpenCL. Development experience in LLVM IR, MLIR JR2000842
Posted 6 days ago
3.0 years
0 Lacs
Pune, Maharashtra, India
On-site
We are looking for an accomplished Engineering Manager to lead the PTX Compiler Team. Join the PTX Compiler team and help drive PTX language design and PTX compiler evolution. PTX enables all GPU Computing applications including Generative AI, ML/DL, HPC. PTX provides a stable programming model and portable instruction set Architecture (ISA) for NVIDIA GPUs and used by all Compute programming languages compiled to NVIDIA GPUs. PTX is also used as a compiler target by various non-NVIDIA compilers. You will lead a team that develops PTX ISA for new GPUs. Work with NVIDIA GPU Architecture and CUDA Programming model teams to build abstractions to expose new GPU features in portable and performant ways in PTX ISA. You will be contributing towards evolving programming model for Generative AI and DL applications on GPUs. You will be solving challenging problems working alongside some of the top minds in GPU computing and systems software. See your leadership efforts in action as HPC and DL developers use PTX to program new GPUs. What You Will Be Doing You will provide administrative and technical direction to a team of 3-6 system software development engineers, including planning, scheduling and execution of projects and activities. Provide stewardship for PTX ISA and PTX Compiler infrastructure for Generative AI and DL. Coordinate cross functional development with rest of the compiler stack. Working with customers/partners to gather feedback and drive innovative ideas and features to incorporate into the product. Drive schedule execution and quality, software engineering practices. Recommend changes to policies and establish procedures that affect immediate organization. Communicate with senior management for team vision and development progress. Groom future engineering leaders and mentor junior engineers. What We Need To See BS or MS degree in Computer Science, Computer Engineering , or related fields with a minimum of 10 overall years of experience with 3 years as manager in the area of low level system SW development related to compiler, linkers, loaders, binary tools. Superb analytical and C/C++ programming skills Experience in any one area of compiler development including feature support, code generation and compiler infrastructure Excellent and strong interactive, verbal and written communications skills. Understanding of Assembly Language / Processor ISA (GPU ISA not mandatory but a plus) Good track record of developing, driving and delivering software products. Ways To Stand Out From The Crowd Experience in Programming Languages design and drafting programming language standards. Knowledge of GPU development and compute APIs such as CUDA, and OpenCL. Development experience in LLVM IR, MLIR JR2000826
Posted 6 days ago
5.0 - 9.0 years
0 Lacs
karnataka
On-site
As an AI Infrastructure Engineer at Cisco, you will join an innovative team with a mission to revolutionize how enterprises utilize AI. Operating with the agility of a startup and the focus of an incubator, we are building a close-knit group of AI and infrastructure experts. Our team is driven by bold ideas and a shared goal: to rethink systems from the ground up and deliver breakthrough solutions that redefine what is possiblefaster, leaner, and smarter. In this dynamic environment, where experimentation is abundant and new technologies are not just welcome but expected, you will collaborate with seasoned engineers, architects, and thinkers. Together, we craft iconic products that have the potential to reshape industries and unlock entirely new operational models for enterprises. If you are energized by solving challenging problems, enjoy pushing the boundaries of what is achievable, and aspire to shape the future of AI infrastructure, we are eager to meet you. Your role as an AI Infrastructure Engineer at Cisco will be instrumental in designing and implementing next-generation AI products. You will be focused on delivering high-performance, efficient, and reliable solutions that power AI workloads across Cisco's ecosystem. Your work will directly impact the performance, efficiency, reliability, and availability of AI systems for Cisco's customers, as well as drive advancements in AI and machine learning infrastructure. Key Responsibilities: - Design and develop node-level infrastructure components to support high-performance AI workloads. - Benchmark, analyze, and optimize the performance of AI infrastructure, including CUDA kernels and memory management for GPUs. - Minimize downtime through seamless configuration and upgrade architecture for software components. - Manage the installation and deployment of AI infrastructure on Kubernetes clusters, utilizing CRDs and operators. - Develop and deploy efficient telemetry collection systems for nodes and hardware components without impacting workload performance. - Work with distributed system fundamentals to ensure scalability, resilience, and reliability. - Collaborate across teams and time zones to shape the overall direction of AI infrastructure development and achieve shared goals. Minimum Qualifications: - Proficiency in programming languages such as Rust, C/C++, Golang, Python, or eBPF. - Strong understanding of Linux operating systems, including user space and kernel-level components. - Experience with Linux user space development, including packaging, logging, telemetry, and lifecycle management of processes. - Strong understanding of Kubernetes (K8s) and related technologies, such as custom resource definitions (CRDs). - Strong debugging and problem-solving skills for complex system-level issues. - Bachelor's degree or higher and a minimum of 5 years of relevant engineering work experience. Preferred Qualifications: - Linux kernel and device driver hands-on expertise is a plus. - Experience in GPU programming and optimization, including CUDA, UCX is a plus. - Experience with high-speed data transfer technologies such as RDMA. - Use of Nvidia GPU operators and Nvidia container toolkit and Nsight, CUPTI. - Nvidia MIG and MPS concepts for managing GPU consumption. At Cisco, we are dedicated to fostering an inclusive future where every individual brings their unique skills and perspectives together. Our employees celebrate diversity and focus on unlocking potential. We prioritize learning and development at every stage, offering opportunities for growth and multiple career paths. Our technology, tools, and culture support hybrid work trends, allowing everyone to excel and thrive. As a company, we recognize our responsibility to bring communities together, and our people are at the heart of that mission. One-third of our employees collaborate in our 30 employee resource organizations, known as Inclusive Communities, to connect, foster belonging, advocate for inclusivity, and make a positive impact. We provide dedicated paid time off for volunteering, allowing our employees to give back to causes they are passionate aboutnearly 86% of our employees actively participate. At Cisco, our purpose is driven by our people, making us a global leader in technology that powers the internet. We help our customers reimagine their applications, secure their enterprise, transform their infrastructure, and achieve their sustainability goals. Every step we take is a step towards a more inclusive future for all. Join us in taking your next step and being your authentic self with Cisco.,
Posted 1 week ago
10.0 - 20.0 years
40 - 45 Lacs
Navi Mumbai, Bengaluru
Work from Office
Deep Learning and Computer Vision Expertise Architectural Design Hardware Utilisation Strategy Edge Computing Design Cross-functional Collaboration Required Candidate profile Bachelor''s or Master''s degree in Computer Science, Electrical Engineering 10-20 years experience Expertise in Visual Inertial SLAM, edge computing and deploying solutions on embedded devices
Posted 1 week ago
0 years
0 Lacs
Hyderabad, Telangana, India
On-site
About the Role: We are seeking a highly experienced Voice AI /ML Engineer to lead the design and deployment of real-time voice intelligence systems. This role focuses on ASR, TTS, speaker diarization, wake word detection, and building production-grade modular audio processing pipelines to power next-generation contact centre solutions, intelligent voice agents, and telecom-grade audio systems. You will work at the intersection of deep learning, streaming infrastructure, and speech/NLP technology, creating scalable, low-latency systems across diverse audio formats and real-world applications. Key Responsibilities: Voice & Audio Intelligence: Build, fine-tune, and deploy ASR models (e.g., Whisper, wav2vec2.0, Conformer) for real-time transcription. Develop and finetune high-quality TTS systems using VITS, Tacotron, FastSpeech for lifelike voice generation and cloning. Implement speaker diarization for segmenting and identifying speakers in multi-party conversations using embeddings (x-vectors/d-vectors) and clustering (AHC, VBx, spectral clustering). Design robust wake word detection models with ultra-low latency and high accuracy in noisy conditions. Real-Time Audio Streaming & Voice Agent Infrastructure: Architect bi-directional real-time audio streaming pipelines using WebSocket, gRPC, Twilio Media Streams, or WebRTC. Integrate voice AI models into live voice agent solutions, IVR automation, and AI contact center platforms. Optimize for latency, concurrency, and continuous audio streaming with context buffering and voice activity detection (VAD). Build scalable microservices to process, decode, encode, and stream audio across common codecs (e.g., PCM, Opus, μ-law, AAC, MP3) and containers (e.g., WAV, MP4). Deep Learning & NLP Architecture: Utilize transformers, encoder-decoder models, GANs, VAEs, and diffusion models, for speech and language tasks. Implement end-to-end pipelines including text normalization, G2P mapping, NLP intent extraction, and emotion/prosody control. Fine-tune pre-trained language models for integration with voice-based user interfaces. Modular System Development: Build reusable, plug-and-play modules for ASR, TTS, diarization, codecs, streaming inference, and data augmentation. Design APIs and interfaces for orchestrating voice tasks across multi-stage pipelines with format conversions and buffering. Develop performance benchmarks and optimize for CPU/GPU, memory footprint, and real-time constraints. Engineering & Deployment: Writing robust, modular, and efficient Python code Experience with Docker, Kubernetes, cloud deployment (AWS, Azure, GCP) Optimize models for real-time inference using ONNX, TorchScript, and CUDA, including quantization, context-aware inference, model caching. On device voice model deployment. Why join us? Impactful Work: Play a pivotal role in safeguarding Tanla's assets, data, and reputation in the industry. Tremendous Growth Opportunities: Be part of a rapidly growing company in the telecom and CPaaS space, with opportunities for professional development. Innovative Environment: Work alongside a world-class team in a challenging and fun environment, where innovation is celebrated. Tanla is an equal opportunity employer. We champion diversity and are committed to creating an inclusive environment for all employees. www.tanla.com
Posted 1 week ago
7.0 years
0 Lacs
Chennai, Tamil Nadu, India
On-site
About The Company Tata Communications Redefines Connectivity with Innovation and IntelligenceDriving the next level of intelligence powered by Cloud, Mobility, Internet of Things, Collaboration, Security, Media services and Network services, we at Tata Communications are envisaging a New World of Communications Broad Role Description This role is part of AI/HPC engineering that specializes in Platform standardization initiatives, innovation, Testing and Optimization of different AI technologies. Specific role requires Installation, Administration, troubleshooting and analytical skills in the technology stacks covering Linux, Kubernetes, SLUM and Nvidia BCM OpenSource Infrastructure Tools Ansible and scripting Candidate should be B.E / B. Tech with over 7+ Years of experience in IT Infrastructure industry, 7 to 8 years in HPC and or AI technology with strong knowledge on Scripting and Linux with at least 2 years in Kubernetes. Skills Required. Managing, Installing, Configuring, Deploying, Troubleshooting and administration of opensource HPC software’s like BCM, SLUM Ansible, ELK Good experience in Linux OS with scripting Knowledge in BCM, Nvidia GPU, Cuda is preferred. Experience in Ansible playbook, managing HPC environment. Exposure to Python Scripting Knowledge in at least one of the LLM / Generative AI and GPU offering provided on public clouds like AWS /Azure/ Google cloud Devops Tools Experience in deploying and managing tools like Jenkins, Git, SonarQube, Bugzilla, Harbor Registry Good to know. Networking: VLAN, VXLAN, InfiniBand, IP Subnetting, Routing, Firewall Storage: DDN, Parallel FS, Object storage and NFS Infrastructure: HP/Dell/ rack servers /GPU Management /Monitoring tools: Zabbix, Promotus Grafana and SNow,
Posted 1 week ago
15.0 years
0 Lacs
Thane, Maharashtra, India
On-site
We are looking for a Director of Engineering (AI Systems & Secure Platforms) to join our Core Engineering team at Thane ( Maharashtra – India). The ideal candidate should have 12–15+ years of experience in architecting and deploying AI systems at scale, with deep expertise in agentic AI workflows, LLMs, RAG, Computer Vision, and secure mobile/wearable platforms. Top 3 Daily Tasks: ● Architect, optimize, and deploy LLMs, RAG pipelines, and Computer Vision models for smart glasses and other edge devices. ● Design and orchestrate agentic AI workflows—enabling autonomous agents with planning, tool usage, error handling, and closed feedback loops. ● Collaborate across AI, Firmware, Security, Mobile, Product, and Design teams to embed “invisible intelligence” within secure wearable systems. Minimum Work Experience Required: 12–15+ years of experience in Applied AI, Deep Learning, Edge AI deployment, Secure Mobile Systems, and Agentic AI Architecture . Top 5 Skills You Should Possess: ● Expertise in TensorFlow, PyTorch, HuggingFace, ONNX, and optimization tools like TensorRT, TFLite. ● Strong hands-on experience with LLMs, Retrieval-Augmented Generation (RAG), and Vector Databases (FAISS, Milvus). ● Deep understanding of Android/iOS integration, AOSP customization , and secure communication (WebRTC, SIP, RTP). ● Experience in Privacy-Preserving AI (Federated Learning, Differential Privacy) and secure AI APIs. ● Proven track record in architecting and deploying agentic AI systems—multi-agent workflows, adaptive planning, tool chaining, and MCP (Model Context Protocol). Cross-Functional Collaboration Excellence: ● Partner with Platform & Security teams to define secure MCP server blueprints exposing device tools, sensors, and services with strong governance and traceability. ● Coordinate with Mobile and AI teams to integrate agentic workflows across Android, iOS, and AOSP environments. ● Work with Firmware and Product teams to define real-time sensor-agent interactions, secure data flows, and adaptive behavior in smart wearables. What You’ll Be Creating: ● Agentic, MCP-enabled pipelines for smart glasses—featuring intelligent agents for vision, context, planning, and secure execution. ● Privacy-first AI systems combining edge compute, federated learning, and cloud integration. ● A scalable, secure wearable AI platform that reflects our commitment to building purposeful and conscious technology. Preferred Skills: ● Familiarity with secure real-time protocols: WebRTC, SIP, RTP. ● Programming proficiency in C, C++, Java, Python, Swift, Kotlin, Objective-C, Node.js, Shell Scripting, CUDA (preferred). ● Experience designing AI platforms for wearables/XR with real-time and low-latency constraints. ● Deep knowledge of MCP deployment patterns—secure token handling, audit trails, permission governance. ● Proven leadership in managing cross-functional tech teams across AI, Firmware, Product, Mobile, and Security.
Posted 1 week ago
7.0 - 12.0 years
15 - 20 Lacs
Chennai
Work from Office
Broad Role Description This role is part ofAI/HPC engineering that specializes in Platform standardization initiatives,innovation, Testing and Optimization of different AI technologies. Specific role requires Installation,Administration, troubleshooting and analytical skills in the technology stackscovering Linux, Kubernetes, SLUM and Nvidia BCM OpenSource Infrastructure ToolsAnsible and scripting Candidate shouldbe B.E / B. Tech with over 7+ Years of experience in IT Infrastructureindustry, 7 to 8 years in HPC and or AItechnology with strong knowledge on Scripting and Linux with at least 2 years in Kubernetes. skills required. Managing,Installing, Configuring, Deploying, Troubleshooting and administration of opensourceHPC softwares like BCM, SLUM Ansible, ELK Good experiencein Linux OS with scripting Knowledge in BCM,Nvidia GPU, Cuda is preferred. Experience in Ansibleplaybook, managing HPC environment. Exposure toPython Scripting Knowledge in at least one of the LLM /Generative AI and GPU offering provided on public clouds like AWS /Azure/Google cloud Devops Tools Experience indeploying and managing tools like Jenkins, Git, SonarQube, Bugzilla, Harbor Registry Good to know. Networking: VLAN, VXLAN, InfiniBand, IPSubnetting, Routing, Firewall Storage: DDN, Parallel FS, Object storage and NFS Infrastructure: HP/Dell/ rack servers /GPU Management /Monitoring tools: Zabbix, PromotusGrafana and SNow,
Posted 1 week ago
16.0 years
40 - 65 Lacs
Chennai, Tamil Nadu, India
On-site
📍 Location: Chennai, India (Relocation Required) 💼 Domain: Information Technology (IT) 💰 Salary Range: ₹40 – 65 LPA 🧑💼 Experience Required: 8–16 Years Position: Manager – Algorithm Engineering (Computer Vision / AI / ML) Seeking a seasoned Algorithm Engineering Manager with strong expertise in Machine Learning, Deep Learning, and Computer Vision , to lead a high-performing team working on cutting-edge solutions. This is a hands-on leadership role ideal for professionals from product-based or semiconductor/manufacturing tech companies . Key Responsibilities Lead and mentor a team of algorithm engineers to drive innovation and deliver high-performance solutions. Develop and maintain scalable infrastructure for deploying algorithms in production environments. Collaborate cross-functionally with data scientists, software engineers, and product managers to design robust, scalable systems. Optimize algorithm efficiency, performance, and resource utilization. Stay abreast of advancements in algorithm engineering and apply them strategically. Drive continuous process improvement in tools, practices, and methodologies. Required Qualifications Education: Ph.D. with minimum 6 years industry experience, or M.Tech with minimum 8 years, or B.Tech with minimum 10 years experience Minimum 3 years of managerial experience (Technical Lead experience alone is not sufficient). 8+ years of hands-on experience with at least one of the following: Python, C++, CUDA 8+ years of experience in AI/ML/Deep Learning 2–3 years of experience in Image Processing & Computer Vision Experience with high-performance computing, parallel programming, and distributed systems Background in product companies is mandatory Strong problem-solving, analytical, and communication skills Preferred Qualifications Tier-1 college background (IITs, NITs, IIITs, VIT, etc.) 8+ CGPA (preferred) Familiarity with ML libraries and frameworks such as TensorFlow, PyTorch, Scikit-learn Experience with GPU architecture and deployment tools like Docker, Apptainer Prior work in semiconductor, hardware, or manufacturing tech companies (nice to have Skills: parallel programming,high-performance computing,docker,computer vision,semiconductor,python,manufacturing,machine learning,learning,distributed systems,apptainer,image processing,scikit-learn,c++,cuda,algorithms,deep learning,drive,ml,gpu architecture,tensorflow,pytorch
Posted 1 week ago
5.0 - 9.0 years
0 Lacs
karnataka
On-site
As a Senior AI Engineer at Avathon, you will be part of a cutting-edge team revolutionizing industrial AI by developing groundbreaking solutions that shape the future. Your role will involve designing, training, and deploying computer vision models using frameworks like TensorFlow, PyTorch, or ONNX to harness the full potential of operational data. You will utilize your expertise in model optimization techniques such as quantization, pruning, distillation, and structured sparsity to enhance performance on edge devices and low-power hardware. Hands-on experience with state-of-the-art architectures like YOLO, Faster R-CNN, and Vision Transformers will be essential for optimizing models for deployment in industrial environments. Your strong understanding of image preprocessing, feature extraction, traditional computer vision techniques, and end-to-end model pipelines will enable you to create real-time virtual replicas of physical assets for predictive maintenance, performance simulation, and operational optimization. Proficiency in Python and C++ for developing AI solutions, along with experience in parallel processing and hardware-aware optimizations, will be key in driving AI-driven projects that have a meaningful impact across industries. Furthermore, your expertise in profiling and optimizing model inference speed, memory usage, and throughput for resource-constrained environments, as well as practical experience in deploying AI models on embedded systems and low-power hardware, will be crucial for anomaly detection, performance forecasting, and asset lifetime extension in industrial settings. Familiarity with MLOps practices, version control with Git, and collaborative workflows will ensure efficient management of AI workflows and seamless collaboration within cross-functional teams. Join Avathon in Bengaluru and thrive in a high-growth environment where agility, collaboration, and rapid professional growth are the norm. Make a difference by working on AI-driven projects that drive real change across industries and improve lives. If you are a forward-thinking AI Engineer with a passion for innovation and a drive to create scalable solutions in industrial AI, we invite you to be a part of our team and contribute to the revolutionizing of industrial AI.,
Posted 1 week ago
2.0 - 4.0 years
1 - 3 Lacs
Chennai
On-site
Job Title: Python Developer – AI/ML & LLM Application Development Experience: 2–4 Years Location: Shollinganallur, Chennai Employment Type: Full-Time Job Summary: We are looking for a skilled Python Developer with 2–4 years of experience who is passionate about LLM (Large Language Model) applications , API development , and Machine Learning . The ideal candidate should have a strong foundation in Python, experience working with modern AI/ML frameworks, and a solid understanding of LLM architecture and deployment. You will be involved in building intelligent systems leveraging cutting-edge tools like LangChain , Hugging Face , and vector databases , and contribute to full-cycle development from design to deployment. Key Responsibilities: Programming & Development Write clean, modular, and well-documented Python code using OOP principles . Manage version control using Git , GitHub, or GitLab. Perform unit testing , debugging, and maintain high code quality and documentation. API & Backend Development Build and maintain REST APIs using Django and Flask . Integrate external APIs including OpenAI , Anthropic , and open-source model APIs . LLM Application Development Develop Retrieval-Augmented Generation (RAG) systems. Work with vector databases like Pinecone , Weaviate , or Chroma . Build and scale LLM apps using LangChain and LlamaIndex . Design and implement prompt engineering strategies and few-shot learning setups. Machine Learning & Deep Learning Apply ML algorithms using Scikit-learn , XGBoost , LightGBM , etc. Handle data preprocessing, feature engineering, and model evaluation. Build deep learning models using PyTorch (preferred) or TensorFlow/Keras . Implement neural architectures like CNNs , RNNs , Transformers , and optimize using backpropagation and regularization techniques. LLM & NLP Integration Work with LLM architectures like GPT , BERT , and LLaMA . Use the Hugging Face Transformers library and tokenizers. Perform LoRA/QLoRA fine-tuning and experiment with open-source model deployments. Required Skills & Qualifications: 2–4 years of hands-on experience in Python development . Experience in REST API development using Django/Flask. Strong knowledge of OOP , version control systems, and debugging techniques. Familiarity with LangChain , LlamaIndex , and vector stores . Practical experience in machine learning model development and deployment. Working knowledge of LLM concepts , Hugging Face, and prompt engineering. Exposure to GPU computing and CUDA basics is a plus. Nice to Have: Experience deploying ML models on cloud platforms (AWS/GCP/Azure). Knowledge of CI/CD pipelines for ML systems. Open-source contributions or AI research exposure. Job Type: Full-time Pay: ₹10,393.72 - ₹30,000.00 per month Schedule: Monday to Friday Work Location: In person
Posted 1 week ago
2.0 - 6.0 years
0 Lacs
surat, gujarat
On-site
The primary responsibility of this role is to design, develop, and implement cutting-edge image and video generation systems leveraging deep learning models. You will take the lead in exploring and prototyping diffusion, GAN, and transformer-based architectures for generative tasks. Your expertise will be instrumental in optimizing models for quality, speed, and scalability through accelerated compute technologies such as CUDA and TensorRT. Collaboration with cross-functional teams including Product, Design, and Frontend will be essential to seamlessly integrate AI pipelines into production applications and platforms. Additionally, you will play a key role in contributing to system architecture, ensuring reproducibility, versioning, and model evaluation, while also staying updated on the latest advancements in generative AI to facilitate the transition from research and development to production. To excel in this role, you should possess a minimum of 2 years of hands-on experience in the field of AI/ML with a strong emphasis on generative models. Your track record should include practical experience with video generation models like Sora, Gen-2 by Runway, Synthesia, or custom pipelines. A solid background in image generation using Diffusion Models (e.g., Stable Diffusion, DALLE, Imagen) or GANs (e.g., StyleGAN2/3) is essential. Proficiency in Python and deep learning libraries such as PyTorch, TensorFlow, or JAX is required, along with experience in training large-scale models using multi-GPU setups like DDP, DeepSpeed, or Hugging Face Accelerate. A sound understanding of computer vision, image processing, and neural rendering techniques is crucial, as well as practical skills in model fine-tuning and related methodologies like LoRA/PEFT, ControlNet, DreamBooth, and others. Preferred tools and frameworks for this role include Stable Diffusion, DALLE, MidJourney, Sora, Gen-2, VQ-GAN, Pix2Pix, CycleGAN, AnimateDiff, ControlNet, T2I-Adapter, VideoCrafter, Pika Labs, ZeroScope, and ModelScope. Proficiency in FastAPI, Flask, or gRPC for model serving and Streamlit, Gradio, or React for rapid prototyping is advantageous. Experience with cloud platforms such as AWS, GCP, or Azure, particularly with GPU instances, and serving models using TorchServe, NVIDIA Triton, or Vertex AI, will be beneficial in ensuring scalable model deployment. This is a full-time position with a flexible schedule and a day shift from Monday to Friday. The ideal candidate will have a minimum of 2 years of experience in machine learning. The work location is in person, and the expected start date is 01/08/2025.,
Posted 1 week ago
5.0 - 8.0 years
4 - 7 Lacs
Mumbai, Navi Mumbai
Work from Office
Job Title : C++ Developer Duration : 1-year contractual position Experience Range : 5 to 8 years Notice Period : Within 20 days Location : Mumbai (Only local candidates of Mumbai are acceptable) Education : B.Tech, B.E Interview Process : 1st- Technical, 2nd - Technical round & 3rd - HR Round Mandatory : End-to-end C++ skills Skills Required : - C, C++ - Qt/QML - OOPs - STL, Data Structures - JavaScript - Automotive Product Development - Android Application Development - Java - API - GitLab CI/CD - GitHub, Gerrit - Jira, Zoho - PostgreSQL, SQLite, JSON - MVVM Architecture - Testing - Debugging - Linux, Unix Job Description : We are seeking an experienced Developer with a strong background in C++, CUDA programming, and Linux to guide our development team in building cutting-edge solutions for device integration and high-performance computing tasks. This is a hands-on leadership position that combines technical expertise with team management skills to deliver high-quality software products. Primary responsibilities : Software Development : - Develop and maintain high-performance applications using C++ and CUDA. - Design and implement parallel algorithms for GPUs to accelerate computational workloads. Performance Optimization : - Optimize CUDA kernels for performance, scalability, and memory efficiency. - Analyze performance bottlenecks and propose innovative solutions. Code Review and Testing : - Conduct code reviews to ensure adherence to coding standards and best practices. - Develop and execute test cases to validate functionality and performance. Collaboration : - Work closely with the software engineering and research teams to understand requirements and deliver robust solutions. - Provide technical guidance and mentoring to junior team members when necessary. Documentation : - Write and maintain technical documentation, including design specifications and user manuals. Required Skills : - C++ : Strong proficiency in modern C++ (C++11/14/17/20). - CUDA Programming : Extensive experience in developing, debugging, and optimizing CUDA applications. - GPU Optimization : Familiarity with memory hierarchy, shared memory, streams, and warp-level operations in CUDA. - Parallel Computing : Solid understanding of parallel algorithms and multi-threaded programming. - Mathematical and Analytical Skills : Strong foundation in linear algebra, calculus, and numerical methods. - Tools : Experience with debugging/profiling tools like Nsight, CUDA Memcheck, or similar.
Posted 1 week ago
0 years
0 Lacs
Chennai, Tamil Nadu, India
On-site
About Us: TangoEye, a Chennai based deep tech startup now owned by Lenskart, is an AI – Powered video analytics SaaS company that helps brands gain shopper behavior and staff efficiency at the stores. Our diverse team brings expertise in Computer vision, AI, Data Analytics, Application building with a focus on helping brands improve store performance. Job Title : Computer Vision Engineer Experience : 3+yrs Job Location : Chennai Job Summary/Overview: Strong experience in development of Computer Vision based projects is a must Should be highly proficient in Image Processing, Deep Learning & Machine Learning Proficient in Python, object-oriented design and development. Must have worked with CUDA, GPU, OpenCV. Strong knowledge in computer vision algorithms and models, including image/video recognition and content understanding. Experienced working on human detection, face recognition with large amounts of data will be an added advantage Experience in algorithms such as image and video separation, synthesis, and intelligent layout with engineering platform implementation capability. Strong track record of solution development from scratch. Experience of using/customizing cloud providers and Al/ML services. Good analytical and problem-solving skills
Posted 1 week ago
6.0 years
40 - 65 Lacs
Chennai, Tamil Nadu, India
On-site
Key Responsibilities Lead, mentor, and manage a team of algorithm engineers focused on AI/ML, deep learning, and computer vision solutions Architect and maintain scalable infrastructure for deploying algorithms at scale Collaborate with cross-functional teams—data scientists, software engineers, and product managers—to build impactful algorithmic solutions Optimize algorithms for performance, accuracy, and computational efficiency Implement best practices in code reviews, testing, and version control for research-to-production handoff Continuously improve development tools, pipelines, and deployment workflows Stay up-to-date with advancements in algorithm engineering, HPC, and infrastructure technologies Required Qualifications ✅ Academic: Ph.D. in Computer Science, AI/ML, or related field with 6+ years industry experience OR M.Tech with 8+ years of experience OR B.Tech with 10+ years of experience Tier 1 Engineering College background (IIT, NIT, IIIT, VIT) preferred ✅ Experience & Skills Minimum 3 years in a Managerial Role (must include direct team management; only Lead experience not sufficient) 8+ years of hands-on experience in Python, C++, or CUDA 8+ years in Machine Learning, Deep Learning, or Artificial Intelligence 2–3 years in Computer Vision and Image Processing Strong exposure to high-performance computing (HPC), parallel programming, and distributed systems Solid understanding of GPU architecture and familiarity with containerization tools like Docker or Apptainer Proficient in ML/DL frameworks such as TensorFlow, PyTorch, or Scikit-learn Proven ability to scale algorithms for production use Preferred Qualifications Experience in semiconductor, manufacturing, or hardware product companies Candidates with 8+ CGPA in academics International or pan-India candidates willing to relocate to Chennai Exposure to DevOps for ML/AI pipelines Must-Haves Summary Criteria Requirement Degree + Exp. PhD (6 yrs), M.Tech (8 yrs), or B.Tech (10 yrs) Managerial Role Min 3 yrs managing engineering teams Programming Python / C++ / CUDA (8+ yrs) AI/ML/DL 8+ years CV/Image Processing 2–3 years Company Type Product-based companies only Academic Stability No frequent job changes; min 2 years in each role Target Companies 🎯 Preferred Industry Backgrounds: Product-based tech companies Semiconductor firms (e.g., Intel, AMD, Qualcomm, Micron) Electrical/Hardware/Manufacturing product companies Nice To Haves Experience in semicon design, firmware, or hardware integration with AI Projects using embedded AI, real-time vision systems, or robotics Skills: high-performance computing (hpc),algorithms,infrastructure,docker,deep learning,pytorch,parallel programming,cuda,python,scikit-learn,ml,apptainer,distributed systems,engineers,image processing,gpu architecture,ai/ml,learning,c++,tensorflow,computer vision,c
Posted 1 week ago
2.0 - 5.0 years
7 - 11 Lacs
Gurugram
Work from Office
Company: Mercer Description: About the Role: We are based in Gurgaon and looking for a Senior Computer Vision Engineer to join our team and help our team to improve and create new technologies. You'll work on projects which makes online assessment more secure and cheating proof. If you're a seasoned computer vision expert with a passion for innovation and a track record of delivering impactful solutions, we would be happy to meet you. Role Senior Computer Vision Engineer Functional Area AI Educational Qualification: BTech/MS/MTech/PhD in Computer Science/Computer vision/Signal Processing/Deep Learning or equivalent.Should have worked in an academic or professional setting in the field of computer vision/signal processing. Experience: 2-5 years Location Gurgaon Key Responsibilities: Develop and optimize advanced computer vision algorithms for image and video analysis tasks.Design, implement and train deep learning models for object detection, face processing, activity recognition and other related tasks.Test and refine models and systems based on real-world data and feedback.Evaluate project requirements, plan and manage the roadmap of a project.Present findings and insights in a clear and concise manner to stakeholders.Collaborate and help to integrate and deploy computer vision systems into broader product architecture.Conduct research to stay updated on emerging computer vision technologies and trends.Automate data preprocessing and annotation processes to streamline workflow efficiency.Maintain comprehensive documentation for algorithms, implementations, and evaluations.Mentor junior engineers and provide strategic guidance on project development. Requirements and skills: Proficiency in Python and knowledge of C++, Java and JS is plus.Solid understanding of neural networks, especially convolutional neural networks (CNNs). Knowledge of RCNNs and vision transformers.Proficient in understanding, designing and implementing deep learning models using frameworks such as TensorFlow, PyTorch and Keras.Understanding of fundamental image processing techniques like image filtering, edge detection, image segmentation and image augmentation.Experience in evaluating computer vision models using relevant metrics and performance indicators.Familiarity with GPU and related technologies which is utilized for improved computational efficiency such as CUDA, CUDNN, tensorRT etc.Familiarity with Python libraries such as OpenCV, NumPy, Pandas and scikit-learn etc.Basic knowledge of linear algebra, calculus, and statistics.Strong critical thinking, analytical, and problem-solving skillsSelf-motivated, quick learner and strong team player with ability to work with minimal supervision. Marsh McLennan is committed to embracing a diverse, inclusive and flexible work environment. We aim to attract and retain the best people and embrace diversity of age, background, caste, disability, ethnic origin, family duties, gender orientation or expression, gender reassignment, marital status, nationality, parental status, personal or social status, political affiliation, race, religion and beliefs, sex/gender, sexual orientation or expression, skin color, or any other characteristic protected by applicable law. Marsh McLennan is committed to hybrid work, which includes the flexibility of working remotely and the collaboration, connections and professional development benefits of working together in the office. All Marsh McLennan colleagues are expected to be in their local office or working onsite with clients at least three days per week. Office-based teams will identify at least one anchor day per week on which their full team will be together in person.
Posted 1 week ago
5.0 - 10.0 years
14 - 24 Lacs
Bengaluru
Hybrid
Responsibilities : Test development, integration, debugging and maintenance of a C++ based video subsystem with a focus on Accelerated User Space Libraries (USL). Maintaining the C++ based middleware stack. Debugging the video subsystem and ability to run the tests on the ECU architecture. Ownership of the end-to-end technology stack and drivers. Alignment of requirements, changes and roadmap with the internal stakeholders. Must have Experience: Expertise in developing automotive/EE tests in C++. Additional experience in Python is an added value. Experience in computer vision, image processing or video codec H.264/H.265. Handson experience with video coding, OpenCV or OpenGL (or similar technologies). Familiarity with high-level accelerated APIs for GPU/DSP(s) (Based on public interfaces/SDK from supplier) e.g. open standards like OpenCL/Vulkan and ONNX. Deep knowledge about the standards is good to have but not essential. Background in ISO 26262 and ASIL based safety analysis would be an added plus. Able to review and fix integration build scripts (Based on bazel build system) Able to identify design issues in high level C/C++ API(s) Able to read coredumps and backtraces from crashes Familiarity with POSIX based operating systems including Linux and QNX Experience with Boardnet technologies such as Ethernet, SomeIP and CAN. Experience with automotive Diagnostic Log and Trace (DLT) and Automotive Diagnosis. Experience in the ADAS domain preferred. Ability to work independently and take corresponding decisions.
Posted 1 week ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
39581 Jobs | Dublin
Wipro
19070 Jobs | Bengaluru
Accenture in India
14409 Jobs | Dublin 2
EY
14248 Jobs | London
Uplers
10536 Jobs | Ahmedabad
Amazon
10262 Jobs | Seattle,WA
IBM
9120 Jobs | Armonk
Oracle
8925 Jobs | Redwood City
Capgemini
7500 Jobs | Paris,France
Virtusa
7132 Jobs | Southborough