Jobs
Interviews

355 Onnx Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

2.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

About ImageVision.AI : ImageVision.AI is an AI-driven company specializing in cutting-edge computer vision and deep learning solutions for various industries, including healthcare, security, retail, and autonomous systems. In 2024, CIO Tech Outlook recognized ImageVision.AI as one of the "10 Most Promising Computer Vision Startups." Our mission is to leverage state-of-the-art AI models to develop innovative, real-world applications that transform businesses. Join us and be part of a team pushing the boundaries of AI-powered vision technology! About the Role: We are seeking a skilled Computer Vision Engineer with 2+ years of experience to join our team. You will be responsible for developing and optimizing cutting-edge computer vision algorithms, working on real-world applications involving image processing, object detection, recognition, and deep learning. If you are passionate about AI, machine learning, and solving challenging problems in computer vision, we would love to hear from you! Key Responsibilities: · Develop and Optimize CV Algorithms: . Design, implement, and optimize computer vision and deep learning models for tasks such as object detection, image segmentation, and tracking. o Improve image classification, feature extraction, and OCR performance. · Machine Learning & Deep Learning Implementation: o Train, fine-tune, and deploy deep learning models using TensorFlow, PyTorch, OpenCV, and Scikit-learn. o Work with CNNs, transformers, GANs, and self-supervised learning techniques. · Data Processing & Model Training: o Process and augment large-scale image and video datasets. o Optimize model performance for real-time applications on edge devices or cloud-based solutions. · Software Development & Deployment: o Develop efficient Python pipelines for real-time image processing and have C++ plus. o Deploy AI models into production environments, mobile devices, or embedded systems. · Collaboration & Research: o Work closely with AI researchers, software engineers, and product teams to develop innovative solutions. o Stay updated with the latest research in computer vision, deep learning, and AI. Required Qualifications: · Education: o Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, Artificial Intelligence, or Data Science a related field. · Technical Skills: o 2+ years Strong programming skills in Python and plus to have C++. o Experience with OpenCV, TensorFlow, PyTorch, Keras, and Scikit-learn. o Proficiency in deep learning architectures like CNNs (SSD,Faster-Rcnn,YOLO), RNNs, and Vision Transformers. o Understanding image processing techniques, feature extraction, and object recognition, Model Parameters tuning. o Experience with cloud computing platforms (AWS, Google Cloud). o Familiarity with MLOps tools (Docker, Kubernetes, ONNX, TensorRT, or OpenVINO) · Soft Skills: o Strong analytical and problem-solving abilities. o Ability to work independently and within a team. o Excellent communication skills in English. Why Join ImageVision.AI? ✅ Work on cutting-edge AI and computer vision projects. ✅ Opportunity to contribute to real-world applications. ✅ Collaborative and fast-paced AI-driven environment. ✅ Competitive salary and benefits. If you're passionate about AI and computer vision, apply now and be part of our innovative team!

Posted 21 hours ago

Apply

8.0 years

0 Lacs

Delhi

On-site

Resolve to Save Lives (RTSL) is a global health organization that partners locally and globally to create and scale solutions to the world's deadliest health threats. Millions of people die from preventable health threats. We collaborate to close the gap between proven, life-saving solutions and the people who need them. Since 2017, we've worked with governments and other partners in more than 60 countries to save millions of lives. We work toward a future where people live longer, healthier lives, communities flourish, and economies thrive. This is an ambitious vision, and it inspires us and our partners to make progress every day. The Digital Team at RTSL is implementing or helping to implement cutting-edge digital tools such as the Simple app, DHIS2 and Africa Covid Dashboard. We work with national and regional health organizations to accelerate progress and advancing the use of digital technologies to save lives through our approach of simplicity, speed, and scale. Position Purpose We are looking for a passionate Applied AI Engineer to join our innovative team, focusing on the exciting field of Large Language Models (LLMs) in the context of Public Health. In this role, you will lead the design, fine-tuning, deployment, and evaluation of AI/ML systems based on pre-trained models (e.g., LLaMA, Mistral, GPT, Phi) that help ease the lives of healthcare workers and clinicians. You will work closely with back-end and mobile engineers to bring cutting-edge AI capabilities to life. The ideal candidate will possess the expertise to leverage existing Large Language Models (LLMs) to train and evaluate models using program-specific clinical data (e.g., patient notes, SMS interactions, training materials or health worker feedback) and deploy within RTSL's digital health tools and global EHRs (e.g., Simple, BP Passport). Additionally, there is a strong likelihood of developing an open-source, locally runnable, adapted LLM to address cost and confidentiality concerns. You'll be working at the intersection of cutting-edge AI and grassroots public health. This is an opportunity to shape the future of digital health tools that are open source, impactful, real-world solutions for some of the most underserved populations globally. Our primary use cases for LLMs are anticipated to include (not limited to): Generating patient summaries specifically tailored for healthcare workers. A chatbot for appointment scheduling. Develop a predictive model to enhance and automate existing workflows. Optimized worklists for frontline workers. On-the-job training and ready-reckoner tools for healthcare professionals. Length of Engagement: This is a two-year fixed-term appointment with the possibility of extension based on available funding and mutual interest. Core Responsibilities The ideal candidate will perform duties and responsibilities such as, but not limited to, the following: Research, evaluate, and implement state-of-the-art LLMs. Fine-tune pre-trained models for specific tasks and datasets. Develop and deploy AI applications using Python. Perform data manipulation and analysis using Pandas to prepare data for model training and evaluation. Design and evaluate prompt engineering strategies for optimizing LLM outputs in specific public health contexts. Collaborate with cross-functional teams to integrate AI solutions into existing products and workflows. Stay up to date with the latest advancements in AI, particularly in the LLM space. Apply responsible AI principles, including fairness, privacy, and transparency, especially in clinical and community health settings. Manage and lead the AI pilots/projects at RTSL. Train and upskill other engineers on the team. Qualifications Education Bachelor's or Master's degree in Computer Science, Engineering, Machine Learning or a related field Experience 8 years of software development experience 3-5 years of experience in training and using AI models. Proven track record of using Large Language Models (LLMs) and building Predictive Models to meet user requirements Experience collaborating with multi-disciplinary and cross-functional teams Delivered LLM-based solutions in resource-constrained environments Hands-on experience with pre-trained AI models. Experience working in healthcare or public health settings (strong plus) Contributed to or maintained open-source AI/ML projects (strong plus). Familiarity with MLOps including model serving, performance monitoring, and lifecycle management, particularly in low-bandwidth or edge environments i(s a plus). Skills & A bilities Strong proficiency in Python programming. Strong experience in data manipulation using Pandas, NumPy, and data preprocessing techniques Familiarity with pre-train models Skilled in machine learning frameworks (e.g., TensorFlow, PyTorch). Familiarity with AI Tools (Hugging Face, LangChain, ONNX, etc.) Strong understanding of AI ethics, data privacy, and bias mitigation techniques Excellent analytical and problem-solving skills Ability to communicate complex technical ideas clearly to non-technical stakeholders Ability to prototype and iterate quickly Comfortable working in agile, interdisciplinary teams across geographies Compensation and Benefits The salary for this role is competitive and set according to national labor rates for the international NGO sector in India. The exact offer will be determined by various factors, such as the candidate's location, skills and experience relative to the requirements of the role. In addition to a competitive salary, Resolve to Save Lives provides a generous package of benefits, including: Health insurance for you and your dependents Contributions toward retirement Paid annual leave and sick leave, in addition to public holidays Two paid, week-long organization-wide breaks at mid-year and end-of-year Professional development and home office setup benefits Up-to-date computer equipment RTSL believes its programs are strengthened when they are developed and supported by individuals with diverse life experiences whose understanding of social and cultural issues can help make our work and workforce more inclusive. We encourage applications from and provide equal employment opportunities to all qualified applicants without regard to race, color, religion, gender, gender identity or expression, ancestry, sexual orientation, national origin, age, disability, marital status, organ donor status, or status as a veteran. Resolve to Save Lives complies with all applicable US EEO laws.

Posted 21 hours ago

Apply

4.0 years

3 - 5 Lacs

Vadodara

On-site

Role & Responsibilities 4+ years of experience applying AI to practical uses Develop and train computer vision models for tasks like: Object detection and tracking (YOLO, Faster R-CNN, etc.) Image classification, segmentation, OCR (e.g., PaddleOCR, Tesseract) Face recognition/blurring, anomaly detection, etc. Optimize models for performance on edge devices (e.g., NVIDIA Jetson, OpenVINO, TensorRT). Process and annotate image/video datasets; apply data augmentation techniques. Proficiency in Large Language Models. Strong understanding of statistical analysis and machine learning algorithms. Hands-on implementing various machine learning algorithms such as linear regression, logistic regression, decision trees, and clustering algorithms. Understanding of image processing concepts (thresholding, contour detection, transformations, etc.) Experience in model optimization, quantization, or deploying to edge (Jetson Nano/Xavier, Coral, etc.) Strong programming skills in Python (or C++), with expertise in: Implement and optimize machine learning pipelines and workflows for seamless integration into production systems. Hands-on experience with at least one real-time CV application (e.g., surveillance, retail analytics, industrial inspection, AR/VR). OpenCV, NumPy, PyTorch/TensorFlow Computer vision models like YOLOv5/v8, Mask R-CNN, DeepSORT Engage with multiple teams and contribute on key decisions. Expected to provide solutions to problems that apply across multiple teams. Lead the implementation of large language models in AI applications. Research and apply cutting-edge AI techniques to enhance system performance. Contribute to the development and deployment of AI solutions across various domains Requirements Design, develop, and deploy ML models for: OCR-based text extraction from scanned documents (PDFs, images) Table and line-item detection in invoices, receipts, and forms Named entity recognition (NER) and information classification Evaluate and integrate third-party OCR tools (e.g., Tesseract, Google Vision API, AWS Textract, Azure OCR,PaddleOCR, EasyOCR) Develop pre-processing and post-processing pipelines for noisy image/text data Familiarity with video analytics platforms (e.g., DeepStream, Streamlit-based dashboards). Experience with MLOps tools (MLflow, ONNX, Triton Inference Server). Background in academic CV research or published papers. Knowledge of GPU acceleration, CUDA, or hardware integration (cameras, sensors).

Posted 21 hours ago

Apply

4.0 years

0 Lacs

Vadodara, Gujarat, India

On-site

Role & Responsibilities 4+ years of experience applying AI to practical uses Develop and train computer vision models for tasks like: Object detection and tracking (YOLO, Faster R-CNN, etc.) Image classification, segmentation, OCR (e.g., PaddleOCR, Tesseract) Face recognition/blurring, anomaly detection, etc. Optimize models for performance on edge devices (e.g., NVIDIA Jetson, OpenVINO, TensorRT). Process and annotate image/video datasets; apply data augmentation techniques. Proficiency in Large Language Models. Strong understanding of statistical analysis and machine learning algorithms. Hands-on implementing various machine learning algorithms such as linear regression, logistic regression, decision trees, and clustering algorithms. Understanding of image processing concepts (thresholding, contour detection, transformations, etc.) Experience in model optimization, quantization, or deploying to edge (Jetson Nano/Xavier, Coral, etc.) Strong programming skills in Python (or C++), with expertise in: Implement and optimize machine learning pipelines and workflows for seamless integration into production systems. Hands-on experience with at least one real-time CV application (e.g., surveillance, retail analytics, industrial inspection, AR/VR). OpenCV, NumPy, PyTorch/TensorFlow Computer vision models like YOLOv5/v8, Mask R-CNN, DeepSORT Engage with multiple teams and contribute on key decisions. Expected to provide solutions to problems that apply across multiple teams. Lead the implementation of large language models in AI applications. Research and apply cutting-edge AI techniques to enhance system performance. Contribute to the development and deployment of AI solutions across various domains Requirements Design, develop, and deploy ML models for: OCR-based text extraction from scanned documents (PDFs, images) Table and line-item detection in invoices, receipts, and forms Named entity recognition (NER) and information classification Evaluate and integrate third-party OCR tools (e.g., Tesseract, Google Vision API, AWS Textract, Azure OCR,PaddleOCR, EasyOCR) Develop pre-processing and post-processing pipelines for noisy image/text data Familiarity with video analytics platforms (e.g., DeepStream, Streamlit-based dashboards). Experience with MLOps tools (MLflow, ONNX, Triton Inference Server). Background in academic CV research or published papers. Knowledge of GPU acceleration, CUDA, or hardware integration (cameras, sensors).

Posted 23 hours ago

Apply

5.0 - 9.0 years

0 Lacs

noida, uttar pradesh

On-site

You are an innovation technology company focused on helping companies accelerate their digital initiatives from strategy and planning through execution. Leveraging deep technical expertise, Agile methodologies, and data-driven intelligence, you modernize systems of engagement and simplify human/tech interaction. Amazing things happen in environments where everyone feels a true sense of belonging and has the skills and opportunities to succeed. Investing in talent and supporting career growth is a priority, always looking for amazing talent to contribute to growth by delivering top results for clients. Join the team to challenge yourself and accomplish meaningful work. As a highly experienced Computer Vision Architect with deep expertise in Python, you will design and lead the development of cutting-edge vision-based systems. Architecting scalable solutions leveraging advanced image and video processing, deep learning, and real-time inference, you will collaborate with cross-functional teams to deliver high-performance, production-grade computer vision platforms. Key Responsibilities: - Architect and design end-to-end computer vision solutions for real-world applications like object detection, tracking, OCR, facial recognition, and scene understanding. - Lead R&D initiatives and prototype development using modern CV frameworks such as OpenCV, PyTorch, TensorFlow. - Optimize computer vision models for performance, scalability, and deployment on cloud, edge, or embedded systems. - Define architecture standards and best practices for Python-based CV pipelines. - Collaborate with product teams, data scientists, and ML engineers to translate business requirements into technical solutions. - Stay updated with the latest advancements in computer vision, deep learning, and AI. - Mentor junior developers and contribute to code reviews, design discussions, and technical documentation. Required Skills & Qualifications: - Bachelors or Masters degree in Computer Science, Electrical Engineering, or related field (PhD is a plus). - 8+ years of software development experience, with 5+ years in computer vision and deep learning. - Proficiency in Python and libraries such as OpenCV, NumPy, scikit-image, Pillow. - Experience with deep learning frameworks like PyTorch, TensorFlow, or Keras. - Strong understanding of CNNs, object detection (YOLO, SSD, Faster R-CNN), semantic segmentation, and image classification. - Knowledge of MLOps, model deployment strategies (e.g., ONNX, TensorRT), and containerization (Docker/Kubernetes). - Experience working with video analytics, image annotation tools, and large-scale dataset pipelines. - Familiarity with edge deployment (Jetson, Raspberry Pi, etc.) or cloud AI services (AWS SageMaker, Azure ML, GCP AI).,

Posted 1 day ago

Apply

3.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

About Apna: Apna is India's largest jobs and professional networking platform for frontline workers. We're building the infrastructure to power hiring, skill-building, and career growth for 300 million+ working Indians. As we expand our AI-first platform across voice, text, and multimodal workflows — we're looking for a bold and curious AI Data Scientist who wants to shape the future of applied Gen AI. Requirement: 1 Location: Bengaluru (Work from Office - Domlur) Team: AI & Machine Learning Experience: 3-5 years Requirements What You'll do: Fine-tune and deploy LLMs, TTS, STT, and voice models for use in real-time conversations with millions of users Convert unstructured, messy real-world audio/text data into clean, high-quality datasets for training and evaluation Build inference pipelines optimized for low-latency, high-accuracy voice agents and multimodal interfaces Work closely with infra and product teams to ship production-grade GenAI models with observability, fallback, and monitoring Experiment with GANs, diffusion models, audio generation, and multimodal fusion to power next-gen AI agents Own the full model lifecycle — from research and training to deployment, testing, and iteration. What we're Looking for: 3-5 years of hands-on experience in AI / ML roles, ideally in startups or product-driven teams. Strong grasp of LLM fine-tuning, instruction tuning, or pretraining techniques Familiarity with TTS/STT systems, Whisper, Tacotron, VITS, or commercial tools like ElevenLabs Experience with multimodal architectures, generative audio, GANs, or diffusion-based models Ability to work with real-world messy data, design training pipelines, and debug model failure modes Fluency in frameworks like PyTorch, HuggingFace, TensorFlow, and ecosystem tools (ONNX, Triton, LangChain, etc.) Passion for building high-impact AI features that ship to real customers Benefits Why Join Us: Work at the cutting edge of LLMs, voice AI, and generative models — and ship real products, not just prototypes Directly impact millions of users by powering AI agents that help with hiring, learning, and career growth Collaborate with a world-class team of AI engineers, researchers, and product minds who move fast and ship boldly Freedom to explore: Own experiments, propose architecture, or contribute to foundational model training Startup speed, enterprise scale — best of both worlds. Rapid iteration and direct customer feedback Multilingual India - first problems that push the boundaries of speech, reasoning, and personalization

Posted 1 day ago

Apply

0 years

0 Lacs

India

On-site

Launch Your Career in AI/ML – We’re Hiring Freshers! Location: Ahmedabad (Work from Office) Company: MPIRIC Web Services Role: AI/ML Intern Type: Full-Time | Internship | In-office Are you a recent graduate eager to explore the world of Artificial Intelligence & Machine Learning? Join our AI/ML Internship Program and gain real-world experience working on live projects, guided by industry experts. If you’re passionate about AI and love coding in Python, this is the opportunity for you! What You’ll Work With: Programming Languages: Python (must-have) (Bonus) C++ for performance-heavy tasks ML Frameworks: PyTorch or TensorFlow/Keras (hands-on with one is enough) Scikit-learn for traditional ML models Data Handling & Visualization: NumPy, Pandas, Matplotlib, Seaborn Comfortable with cleaning, analyzing & visualizing data Computer Vision (optional but cool): OpenCV for image processing YOLOv5/v8, MediaPipe (nice to have!) Deployment (basic exposure): Flask / FastAPI for APIs Docker (bonus) ONNX / TensorRT (only if you’re curious) Cloud & Tools: Google Colab (Required) Jupyter Notebooks ( Must-know) AWS/GCP/Azure (not required, but great if you’ve tried!) What You Should Know (Even at Basic Level): Linear Algebra, Calculus, Probability Cost Functions, Backpropagation Basics ML Concepts: Supervised vs Unsupervised CNNs, RNNs, Transformers – basic awareness Model evaluation: Accuracy, Precision, Recall, F1 Score Projects to Show (If You Have): Image classification / object detection YOLO-based mini projects Flask-deployed ML model Kaggle competition participation Anything cool on GitHub! What We’re Looking For: ✔ Strong desire to learn & grow ✔ Clear understanding of Python basics ✔ Enthusiasm for AI & solving problems ✔ Ability to explain your thought process ✔ Based in Ahmedabad (In-office internship only) What You’ll Get: Hands-on project experience Internship Certificate + LOR Mentorship from experienced developers Pre-placement opportunity for top performers Supportive, learning-focused environment Job Types: Full-time, Fresher Schedule: Monday to Friday Work Location: In person

Posted 1 day ago

Apply

0 years

0 Lacs

Ahmedabad, Gujarat, India

On-site

We're Hiring: AI/ML Intern @ Atomo Location: Gandhinagar ,Gujarat Duration: 6 months At Atomo , we’re building AI-powered edge devices for automation, smart infrastructure, and industrial IoT — made in India, designed for the world. We’re looking for a passionate AI/ML Intern to join our team and work on cutting-edge applications like: On-device inference (NPU optimization) Predictive maintenance models Energy efficiency & anomaly detection Real-time data processing on the edge,video and image analysis What You’ll Do: Build, train, and optimize ML models for embedded deployment Work closely with our firmware and hardware teams Evaluate model performance across edge devices (Electron, Proton, Neutron) Contribute to real-world, production-grade features You Should Have: Background in Computer Science, AI/ML, or related field Experience with Python, TensorFlow / PyTorch Familiarity with edge ML (TensorFlow Lite, ONNX, etc.) Interest in embedded systems, IoT, or edge computing Why Atomo? Hands-on work with real devices Learn how AI meets hardware Be part of India’s deep-tech innovation story Strong chance of PPO (pre-placement offer) for top performers #AIIntern #MLIntern #EdgeAI #IoT #Internship #Hiring #MadeInIndia #DeepTech #CareersAtAtomo

Posted 1 day ago

Apply

10.0 years

0 Lacs

Gurugram, Haryana, India

On-site

Job Title: Senior CV Engineer Location: Gurugram Experience: 6–10 Years Industry: AI Product Overview: We are hiring for our esteemed client, a Series-A funded deep-tech company building a first-of-its-kind app-based operating system for Computer Vision. The team specializes in real-time video/image inference, distributed processing, and high-throughput data handling using advanced technologies and frameworks. Key Responsibilities: Lead design and implementation of complex CV pipelines (object detection, instance segmentation, industrial anomaly detection). Own major modules from concept to deployment ensuring low latency and high reliability. Transition algorithms from Python/PyTorch to optimized C++ edge GPU implementations using TensorRT, ONNX, and GStreamer. Collaborate with cross-functional teams to refine technical strategies and roadmaps. Drive long-term data and model strategies (synthetic data generation, validation frameworks). Mentor engineers and maintain high engineering standards. Required Skills & Qualifications: 6–10 years of experience in architecting and deploying CV systems. Expertise in multi-object tracking, object detection, semi/unsupervised learning. Proficiency in Python, PyTorch/TensorFlow, Modern C++, CUDA. Experience with real-time, low-latency model deployment on edge devices. Strong systems-level design thinking across ML lifecycles. Familiarity with MLOps (CI/CD for models, versioning, experiment tracking). Bachelor’s/Master’s degree in CS, EE, or related fields with strong ML and algorithmic foundations. (Preferred) Experience with NVIDIA DeepStream, GStreamer, LLMs/VLMs, open-source contributions.

Posted 2 days ago

Apply

3.0 - 7.0 years

0 Lacs

karnataka

On-site

As a Backend + ML Engineer at our company, you will be responsible for building scalable APIs, managing secure patient data flow, and supporting AI model integration. Your role will involve collaborating with AI engineers, front-end developers, and field researchers to operationalize models for real-time use, particularly on edge devices. Your key responsibilities will include designing and developing robust backend systems to handle real-time patient data and ML outputs, integrating machine learning models with APIs and the main O-Health app, optimizing model serving pipelines, managing data pipelines for de-identified OPD datasets, implementing data encryption and anonymization, setting up backend support for multilingual voice and text processing, and supporting versioning and A/B testing of health algorithms. To excel in this role, you should have expertise in Backend Engineering, including strong proficiency in Python with frameworks like FastAPI, Flask, or Django, experience with RESTful APIs, WebSockets, and asynchronous data flows, familiarity with databases such as PostgreSQL, MongoDB, or time-series databases, and working knowledge of Docker, Git, and CI/CD pipelines. Additionally, you should possess skills in Machine Learning Ops, such as hands-on experience with PyTorch, Scikit-learn, or TensorFlow, model optimization, quantization, and edge deployment formats, familiarity with language models and multilingual NLP, and knowledge of data preprocessing and feature engineering for clinical/NLP tasks. Preferred skills for this role include an understanding of HIPAA/GDPR compliance and experience working on healthcare, social impact, or AI-for-good projects. In this position, you will have a significant impact on connecting machine learning research with field-ready healthcare tools, scaling diagnosis support systems to underserved patients, and enabling multilingual health consultations in real-time. Join us in making a difference in the healthcare industry and contributing to the improvement of patient care.,

Posted 2 days ago

Apply

12.0 - 15.0 years

0 Lacs

Thane, Maharashtra, India

On-site

We are looking for a Director of Engineering (AI Systems & Secure Platforms) to join our client&aposs Core Engineering team at Thane (Maharashtra - India). The ideal candidate should have 12-15+ years of experience in architecting and deploying AI systems at scale, with deep expertise in agentic AI workflows, LLMs, RAG, Computer Vision, and secure mobile/wearable platforms. Top 3 Daily Tasks: Architect, optimize, and deploy LLMs, RAG pipelines, and Computer Vision models for smart glasses and other edge devices. Design and orchestrate agentic AI workflowsenabling autonomous agents with planning, tool usage, error handling, and closed feedback loops. Collaborate across AI, Firmware, Security, Mobile, Product, and Design teams to embed "invisible intelligence" within secure wearable systems. Must have 12-15+ years of experience in Applied AI, Deep Learning, Edge AI deployment, Secure Mobile Systems, and Agentic AI Architecture. Must have: Programming languages: Python C/C++ Java Kotlin JavaScript/Node.js Swift Objective-C CUDA Shell scripting Expert in: TensorFlow PyTorch ONNX HuggingFace model optimization with TensorRT TFLite Deep experience with: LLMs RAG pipelines vector DBs (FAISS, Milvus) Proficient in agentic AI workflowsmulti-agent orchestration, planning, feedback loops Strong in privacy-preserving AI (federated learning, differential privacy) Secure real-time comms (WebRTC, SIP, RTP) Nice to have: Experience with MCP or similar protocol frameworks Background in wearables/XR or smart glass AI platforms Expertise in platform security architectures (sandboxing, auditability) Show more Show less

Posted 2 days ago

Apply

5.0 years

4 - 6 Lacs

Hyderābād

Remote

Job Description We are seeking an experienced Staff Software Engineer (AI). In this role, you will be part of the Pathfinding Studio within the Innovation Office. The team is exploring AI/machine learning techniques for various domain-specific problems related to engineering and robotics. Responsibilities: This role requires you to research and develop AI/machine learning models. You will train, fine-tune, or perform in-context learning to develop state-of-the-art AI/machine learning-based models. Activities will involve the identification and preparation of data sets, maintenance of data sets, identifying, using, and developing appropriate model architecture, and delivering the model along with model usage documentation and examples. Additionally, you will work on developing AI models targeting robotics utilizing relevant virtual modeling frameworks. Minimum Qualifications: Experience with multimodal machine learning model development based on structured and unstructured data targeting robotics. Experience with the end-to-end deep learning development life cycle. Ability to understand, analyze, test model behavior, and summarize model performance. Ability to work with AI/ML flow in CPU, embedded CPU, and GPU-based infrastructure in Windows and Linux environments. Ability to select and use appropriate models from the open-source environment in memory and compute-constrained infrastructure. Experience in training, fine-tuning, and in-context learning with multi-modal models and datasets using Python/C/C++. Knowledge of traditional and state-of-the-art machine learning techniques related to NLP, Video, Audio, Generative AI, Code understanding, and generation. Experience with virtual model development using IsaacSim and MuJoCo for robotics. Qualifications Education: Master’s degree in computer science, Electrical Engineering, Computer Engineering, or a similar field with 5 years of relevant experience in end-to-end machine learning development or a PhD with 3 years of relevant experience. Preferred Qualifications: Hands-on expertise in deep learning model development and tuning. Experience with structured and unstructured data manipulation and management for machine learning pipelines. Experience with virtual and physical modeling environments targeting robotics. Experience in Python and ML frameworks (e.g., PyTorch, TensorFlow, ONNX, etc.). Experience in using open-source model development (e.g., Huggingface, Langchain, etc.). Experience with data processing and UI frameworks (e.g., Pandas, Plotly, SciPy, Flask, Streamlit, or similar). Ability to understand and explain state-of-the-art AI models. Ability to understand and summarize the hardware and software complexity of various AI models. Knowledge of applying AI algorithms in semiconductor design and verification. Company Description Renesas is one of the top global semiconductor companies in the world. We strive to develop a safer, healthier, greener, and smarter world, and our goal is to make every endpoint intelligent by offering product solutions in the automotive, industrial, infrastructure and IoT markets. Our robust product portfolio includes world leading MCUs, SoCs, Analog and power products, plus Winning Combination solutions that curate these complementary products. We are a key supplier to the world’s leading manufacturers of electronics you rely on every day; you may not see our products, but they are all around you. Renesas employs roughly 21,000 people in more than 30 countries worldwide. As a global team, our employees actively embody the Renesas Culture, our guiding principles based on five key elements: Transparent, Agile, Global, Innovative, and Entrepreneurial. Renesas believes in, and has a commitment to, diversity and inclusion, with initiatives and a leadership team dedicated to its resources and values. At Renesas, we want to build a sustainable future where technology helps make our lives easier. Join us and build your future by being part of what’s next in electronics and the world. Additional Information Renesas is an embedded semiconductor solution provider driven by its Purpose ‘ To Make Our Lives Easier .’ As the industry’s leading expert in embedded processing with unmatched quality and system-level know-how, we have evolved to provide scalable and comprehensive semiconductor solutions for automotive, industrial, infrastructure, and IoT industries based on the broadest product portfolio, including High Performance Computing, Embedded Processing, Analog & Connectivity, and Power. With a diverse team of over 21,000 professionals in more than 30 countries, we continue to expand our boundaries to offer enhanced user experiences through digitalization and usher into a new era of innovation. We design and develop sustainable, power-efficient solutions today that help people and communities thrive tomorrow, ‘ To Make Our Lives Easier .’ At Renesas, you can: Launch and advance your career in technical and business roles across four Product Groups and various corporate functions. You will have the opportunities to explore our hardware and software capabilities and try new things. Make a real impact by developing innovative products and solutions to meet our global customers' evolving needs and help make people’s lives easier, safe and secure. Maximize your performance and wellbeing in our flexible and inclusive work environment. Our people-first culture and global support system, including the remote work option and Employee Resource Groups, will help you excel from the first day. Are you ready to own your success and make your mark? Join Renesas. Let’s Shape the Future together. Renesas Electronics is an equal opportunity and affirmative action employer, committed to supporting diversity and fostering a work environment free of discrimination on the basis of sex, race, religion, national origin, gender, gender identity, gender expression, age, sexual orientation, military status, veteran status, or any other basis protected by law. For more information, please read our Diversity & Inclusion Statement . Job title Staff Software Engineer Department Engineering Location Hyderabad Remote No Requisition ID 20020176_2025-07-03

Posted 2 days ago

Apply

5.0 years

0 Lacs

New Delhi, Delhi, India

Remote

Location: Remote (India-based preferred) Type: Full-time | Founding Team | High Equity Company: Flickd (www.flickd.in) About the Role We’re building India’s most advanced virtual try-on engine — think Doji meets TryOnDiffusion, but optimized for real-world speed, fashion, and body diversity. As our ML Engineer (Computer Vision + Try-On) , you’ll own the end-to-end pipeline : from preprocessing user/product images to generating hyper-realistic try-on results with preserved pose, skin, texture, and identity. You’ll have full autonomy to build, experiment, and ship — working directly with React, Spring Boot, DevOps, and design folks already in place. This is not a junior researcher role. This is one person building the brain of the system - and setting the foundation for India's biggest visual shopping innovation. What You’ll Build Stage 1: User Image Preprocessing Human parsing (face, body, hair), pose detection, face/limb alignment Auto orientation, canvas resizing, brightness/contrast normalization Stage 2: Product Image Processing Background removal, garment segmentation (SAM/U^2-Net/YOLOv8) Handle occlusions, transparent clothes, long sleeves, etc. Stage 3: Try-On Engine Implement and iterate on CP-VTON / TryOnDiffusion / FlowNet Fine-tune on custom data for realism, garment drape, identity retention Inference Optimisation TorchScript / ONNX, batching, inference latency minimization Collaborate with DevOps for Lambda/EC2 + GPU deployment Postprocessing Alpha blending, edge smoothing, fake shadows, cloth-body warps You’re a Fit If You: Have 2–5 years in ML/CV with real shipped work (not just notebooks) Have worked on: human parsing, pose estimation, cloth warping, GANs Are hands-on with PyTorch , OpenCV, Segmentation Models, Flow or ViT Can replicate models from arXiv fast, and care about output quality Want to own a system seen by millions , not just improve metrics Stack You’ll Use PyTorch, ONNX, TorchScript, Hugging Face DensePose, OpenPose, Segment Anything, Diffusion Models Docker, Redis, AWS Lambda, S3 (infra is already set up) MLflow or DVC (can be implemented from scratch) For exceptional talent, we’re flexible on cash vs equity split. Why This Is a Rare Opportunity Build the core AI product that powers a breakout consumer app Work in a zero BS, full-speed team (React, SpringBoot, DevOps, Design all in place) Be the founding ML brain and shape all future hires Ship in weeks, not quarters — and see your output in front of users instantly Apply now, or DM Dheekshith (Founder) on LinkedIn with your GitHub or project links. Let’s build something India’s never seen before.

Posted 2 days ago

Apply

1.0 - 5.0 years

0 Lacs

hyderabad, telangana

On-site

Qualcomm India Private Limited is seeking a candidate to join their Multimedia Audio Systems Group as a Voice AI Engineer. As part of the team, you will be responsible for prototyping and productizing Voice AI Models for tasks such as Automatic Speech Recognition (ASR), Text-to-Speech (TTS), NLP, Multilingual Translation, Summarization, Language modeling, and other Speech/text generation tasks. You will work closely with a team of engineers to develop, train, and optimize Voice AI models for efficient offload to NPU, GPU, and CPU. Additionally, you will conduct model evaluation studies, competitive analysis, and collaborate with other R&D and Systems teams for system integration, use case validation, efficient offload to HW accelerators, and commercialization support. The ideal candidate should have strong programming skills in C/C++ and Python, along with experience in ML inference optimizations. Proficiency in designing, implementing, and training DL models using high-level languages/frameworks such as PyTorch, TensorFlow, and ONNX is required. Knowledge of ML architectures and operators like Transformers, LSTM, GRUs, and familiarity with recent trends in machine learning and traditional statistical modeling/feature extraction techniques are essential. Experience in Speech-to-text, Text-to-Speech, Speech-to-Speech, NLP applications, model quantization, compression techniques, software development on embedded platforms, software design patterns, multi-threaded programming, computer architecture, operating systems, data structures, algorithms, fixed-point coding, and AI HW accelerators (NPU or GPU) is a plus. Candidates should hold a Bachelor's/Masters/PhD degree in Engineering, Electronics and Communication, Computer Science, or related field, along with 3+ years of experience in Audio Systems engineering, Audio Signal Processing modules, ML Model development, or related work. Minimum qualifications include a Bachelor's degree in Engineering, Information Systems, Computer Science, or related field with 2+ years of Systems Engineering or related work experience, or a Master's degree with 1+ year of experience, or a PhD in a related field. Qualcomm is an equal opportunity employer committed to providing accessible processes for individuals with disabilities. Individuals seeking accommodation during the application/hiring process can contact Qualcomm for support. The company expects its employees to adhere to all applicable policies and procedures, including security requirements regarding protection of confidential information. Please note that Qualcomm does not accept unsolicited resumes or applications from agencies. Staffing and recruiting agencies are not authorized to submit profiles, applications, or resumes on behalf of individuals. For more information about this role, please contact Qualcomm Careers.,

Posted 3 days ago

Apply

12.0 - 15.0 years

0 Lacs

Thane, Maharashtra, India

On-site

We are looking for a Director of Engineering (AI Systems & Secure Platforms) to join our client&aposs Core Engineering team at Thane (Maharashtra India). The ideal candidate should have 1215+ years of experience in architecting and deploying AI systems at scale, with deep expertise in agentic AI workflows, LLMs, RAG, Computer Vision, and secure mobile/wearable platforms. 1. Top 3 Daily Tasks: - Architect, optimize, and deploy LLMs, RAG pipelines, and Computer Vision models for smart glasses and other edge devices. - Design and orchestrate agentic AI workflowsenabling autonomous agents with planning, tool usage, error handling, and closed feedback loops. - Collaborate across AI, Firmware, Security, Mobile, Product, and Design teams to embed invisible intelligence within secure wearable systems. 2. Must have: - Must have 1215+ years of experience in Applied AI, Deep Learning, Edge AI deployment, Secure Mobile Systems, and Agentic AI Architecture. Programming languages: Python C/C++ Java Kotlin JavaScript/Node.js Swift Objective-C CUDA Shell scripting - Expert in: TensorFlow PyTorch ONNX HuggingFace model optimization with TensorRT TFLite - Deep experience with: LLMs RAG pipelines vector DBs (FAISS, Milvus) - Proficient in agentic AI workflowsmulti-agent orchestration, planning, feedback loops - Strong in privacy-preserving AI (federated learning, differential privacy) - Secure real-time comms (WebRTC, SIP, RTP) 3. Nice to have: - Experience with MCP or similar protocol frameworks - Background in wearables/XR or smart glass AI platforms - Expertise in platform security architectures (sandboxing, auditability) Show more Show less

Posted 3 days ago

Apply

8.0 years

0 Lacs

Noida, Uttar Pradesh, India

On-site

About Company, Droisys is an innovation technology company focused on helping companies accelerate their digital initiatives from strategy and planning through execution. We leverage deep technical expertise, Agile methodologies, and data-driven intelligence to modernize systems of engagement and simplify human/tech interaction. Amazing things happen when we work in environments where everyone feels a true sense of belonging and when candidates have the requisite skills and opportunities to succeed. At Droisys, we invest in our talent and support career growth, and we are always on the lookout for amazing talent who can contribute to our growth by delivering top results for our clients. Join us to challenge yourself and accomplish work that matters We are seeking a highly experienced Computer Vision Architect with deep expertise in Python to design and lead the development of cutting-edge vision-based systems. The ideal candidate will architect scalable solutions that leverage advanced image and video processing, deep learning, and real-time inference. You will collaborate with cross-functional teams to deliver high-performance, production-grade computer vision platforms. Key Responsibilities: Architect and design end-to-end computer vision solutions for real-world applications (e.g., object detection, tracking, OCR, facial recognition, scene understanding, etc.) Lead R&D initiatives and prototype development using modern CV frameworks(OpenCV, PyTorch, TensorFlow, etc.) Optimize computer vision models for performance, scalability, and deployment on cloud, edge, or embedded systems Define architecture standards and best practices for Python-based CV pipelines Collaborate with product teams, data scientists, and ML engineers to translate business requirements into technical solutions Stay updated with the latest advancements in computer vision, deep learning, and AI Mentor junior developers and contribute to code reviews, design discussions, and technical documentation Required Skills & Qualifications: Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or related field (PhD is a plus) 8+ years of software development experience, with 5+ years in computer vision and deep learning Proficient in Python and libraries such as OpenCV, NumPy, scikit-image, Pillow Experience with deep learning frameworks like PyTorch, TensorFlow, or Keras Strong understanding of CNNs, object detection (YOLO, SSD, Faster R-CNN), semantic segmentation, and image classification Knowledge of MLOps, model deployment strategies (e.g., ONNX, TensorRT), and containerization (Docker/Kubernetes) Experience working with video analytics, image annotation tools, and large-scale dataset pipelines Familiarity with edge deployment (Jetson, Raspberry Pi, etc.) or cloud AI services(AWS SageMaker, Azure ML, GCP AI) Droisys is an equal opportunity employer. We do not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. Droisys believes in diversity, inclusion, and belonging, and we are committed to fostering a diverse work environment.

Posted 3 days ago

Apply

5.0 years

0 Lacs

Hyderabad, Telangana, India

Remote

Company Description Renesas is one of the top global semiconductor companies in the world. We strive to develop a safer, healthier, greener, and smarter world, and our goal is to make every endpoint intelligent by offering product solutions in the automotive, industrial, infrastructure and IoT markets. Our robust product portfolio includes world leading MCUs, SoCs, Analog and power products, plus Winning Combination solutions that curate these complementary products. We are a key supplier to the world’s leading manufacturers of electronics you rely on every day; you may not see our products, but they are all around you. Renesas employs roughly 21,000 people in more than 30 countries worldwide. As a global team, our employees actively embody the Renesas Culture, our guiding principles based on five key elements: Transparent, Agile, Global, Innovative, and Entrepreneurial. Renesas believes in, and has a commitment to, diversity and inclusion, with initiatives and a leadership team dedicated to its resources and values. At Renesas, we want to build a sustainable future where technology helps make our lives easier. Join us and build your future by being part of what’s next in electronics and the world. Job Description We are seeking an experienced Staff Software Engineer (AI). In this role, you will be part of the Pathfinding Studio within the Innovation Office. The team is exploring AI/machine learning techniques for various domain-specific problems related to engineering and robotics. Responsibilities This role requires you to research and develop AI/machine learning models. You will train, fine-tune, or perform in-context learning to develop state-of-the-art AI/machine learning-based models. Activities will involve the identification and preparation of data sets, maintenance of data sets, identifying, using, and developing appropriate model architecture, and delivering the model along with model usage documentation and examples. Additionally, you will work on developing AI models targeting robotics utilizing relevant virtual modeling frameworks. Minimum Qualifications Experience with multimodal machine learning model development based on structured and unstructured data targeting robotics. Experience with the end-to-end deep learning development life cycle. Ability to understand, analyze, test model behavior, and summarize model performance. Ability to work with AI/ML flow in CPU, embedded CPU, and GPU-based infrastructure in Windows and Linux environments. Ability to select and use appropriate models from the open-source environment in memory and compute-constrained infrastructure. Experience in training, fine-tuning, and in-context learning with multi-modal models and datasets using Python/C/C++. Knowledge of traditional and state-of-the-art machine learning techniques related to NLP, Video, Audio, Generative AI, Code understanding, and generation. Experience with virtual model development using IsaacSim and MuJoCo for robotics. Qualifications Education: Master’s degree in computer science, Electrical Engineering, Computer Engineering, or a similar field with 5 years of relevant experience in end-to-end machine learning development or a PhD with 3 years of relevant experience. Preferred Qualifications Hands-on expertise in deep learning model development and tuning. Experience with structured and unstructured data manipulation and management for machine learning pipelines. Experience with virtual and physical modeling environments targeting robotics. Experience in Python and ML frameworks (e.g., PyTorch, TensorFlow, ONNX, etc.). Experience in using open-source model development (e.g., Huggingface, Langchain, etc.). Experience with data processing and UI frameworks (e.g., Pandas, Plotly, SciPy, Flask, Streamlit, or similar). Ability to understand and explain state-of-the-art AI models. Ability to understand and summarize the hardware and software complexity of various AI models. Knowledge of applying AI algorithms in semiconductor design and verification. Additional Information Renesas is an embedded semiconductor solution provider driven by its Purpose ‘ To Make Our Lives Easier .’ As the industry’s leading expert in embedded processing with unmatched quality and system-level know-how, we have evolved to provide scalable and comprehensive semiconductor solutions for automotive, industrial, infrastructure, and IoT industries based on the broadest product portfolio, including High Performance Computing, Embedded Processing, Analog & Connectivity, and Power. With a diverse team of over 21,000 professionals in more than 30 countries, we continue to expand our boundaries to offer enhanced user experiences through digitalization and usher into a new era of innovation. We design and develop sustainable, power-efficient solutions today that help people and communities thrive tomorrow, ‘ To Make Our Lives Easier .’ At Renesas, You Can Launch and advance your career in technical and business roles across four Product Groups and various corporate functions. You will have the opportunities to explore our hardware and software capabilities and try new things. Make a real impact by developing innovative products and solutions to meet our global customers' evolving needs and help make people’s lives easier, safe and secure. Maximize your performance and wellbeing in our flexible and inclusive work environment. Our people-first culture and global support system, including the remote work option and Employee Resource Groups, will help you excel from the first day. Are you ready to own your success and make your mark? Join Renesas. Let’s Shape the Future together. Renesas Electronics is an equal opportunity and affirmative action employer, committed to supporting diversity and fostering a work environment free of discrimination on the basis of sex, race, religion, national origin, gender, gender identity, gender expression, age, sexual orientation, military status, veteran status, or any other basis protected by law. For more information, please read our Diversity & Inclusion Statement.

Posted 3 days ago

Apply

3.0 - 6.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Greeting from Leadsoc Technologies _ Hyderabad Position: AI Engineer & Validation Skills: Machine learning fundamentals, including deep learning,large language models , and recommender systemsLLLM. Strong background in validation, defect and software development life cycle Strong knowledge on ubuntu / yocto linux Experience working with opensource frameworks such as PyTorch, TensorFlow, and ONNX-Runtime. Experience in profiling ML workloads Prior experience in executing validation plans for AI/ML compute stacks s uch as HIP, CUDA, OpenCL, OpenVINO, Strong background in python programming. Experience : 3-6 Years Notice period: 0- 15 days Regards Murali

Posted 4 days ago

Apply

2.0 - 6.0 years

0 Lacs

noida, uttar pradesh

On-site

You will be joining our team as a skilled Deep Learning Engineer with expertise in object detection and segmentation models. Your primary responsibilities will include implementing and refining object detection models such as YOLOvX, Faster R-CNN, EfficientDet, SSD, and Mask R-CNN. Additionally, you will work on real-time computer vision applications, optimize performance, annotate and prepare datasets, and collaborate on research and development projects to enhance model performance and robustness. As a Deep Learning Engineer, you will be expected to deploy models using Docker on Linux/Windows systems, with experience in edge deployment considered a plus. It will be essential for you to document code, experiments, and deployment processes while collaborating with cross-functional teams. Strong Python programming skills, knowledge of TensorFlow, PyTorch, OpenCV, and ONNX, as well as hands-on experience with Docker and familiarity with model optimization techniques like quantization and pruning are required for this role. An advantage would be your experience in edge deployments using platforms such as NVIDIA Jetson, TensorRT, and OpenVINO. Additionally, familiarity with experiment tracking tools like MLflow or Weights & Biases is a plus. The qualifications we are looking for include a Bachelors or Masters degree in Computer Science, AI, Data Science, or a related field, along with strong analytical, problem-solving, and team collaboration skills. This is a full-time position with a day shift schedule that requires in-person work at our location.,

Posted 4 days ago

Apply

12.0 - 15.0 years

0 Lacs

Thane, Maharashtra, India

On-site

We are looking for a Director of Engineering (AI Systems & Secure Platforms) to join our client&aposs Core Engineering team at Thane (Maharashtra India). The ideal candidate should have 1215+ years of experience in architecting and deploying AI systems at scale, with deep expertise in agentic AI workflows, LLMs, RAG, Computer Vision, and secure mobile/wearable platforms. Top 3 Daily Tasks: ? Architect, optimize, and deploy LLMs, RAG pipelines, and Computer Vision models for smart glasses and other edge devices. ? Design and orchestrate agentic AI workflowsenabling autonomous agents with planning, tool usage, error handling, and closed feedback loops. ? Collaborate across AI, Firmware, Security, Mobile, Product, and Design teams to embed invisible intelligence within secure wearable systems. Must have 1215+ years of experience in Applied AI, Deep Learning, Edge AI deployment, Secure Mobile Systems, and Agentic AI Architecture. Must have: -Programming languages: Python, C/C++, Java (Android), Kotlin, JavaScript/Node.js, Swift, Objective-C, CUDA, Shell scripting -Expert in TensorFlow, PyTorch, ONNX, HuggingFace; model optimization with TensorRT, TFLite -Deep experience with LLMs, RAG pipelines, vector DBs (FAISS, Milvus) -Proficient in agentic AI workflowsmulti-agent orchestration, planning, feedback loops -Strong in privacy-preserving AI (federated learning, differential privacy) -Secure real-time comms (WebRTC, SIP, RTP) Nice to have: -Experience with MCP or similar protocol frameworks -Background in wearables/XR or smart glass AI platforms -Expertise in platform security architectures (sandboxing, auditability) Industry Technology, Information and Internet Employment Type Full-time Show more Show less

Posted 4 days ago

Apply

6.0 years

0 Lacs

Gurgaon, Haryana, India

On-site

Job Description: Senior MLOps Engineer Position: Senior MLOps Engineer Location: Gurugram Relevant Experience Required: 6+ years Employment Type: Full-time About The Role We are seeking a Senior MLOps Engineer with deep expertise in Machine Learning Operations, Data Engineering, and Cloud-Native Deployments . This role requires building and maintaining scalable ML pipelines , ensuring robust data integration and orchestration , and enabling real-time and batch AI systems in production. The ideal candidate will be skilled in state-of-the-art MLOps tools , data clustering , big data frameworks , and DevOps best practices , ensuring high reliability, performance, and security for enterprise AI workloads. Key Responsibilities MLOps & Machine Learning Deployment Design, implement, and maintain end-to-end ML pipelines from experimentation to production. Automate model training, evaluation, versioning, deployment, and monitoring using MLOps frameworks. Implement CI/CD pipelines for ML models (GitHub Actions, GitLab CI, Jenkins, ArgoCD). Monitor ML systems in production for drift detection, bias, performance degradation, and anomaly detection. Integrate feature stores (Feast, Tecton, Vertex AI Feature Store) for standardized model inputs. Data Engineering & Integration Design and implement data ingestion pipelines for structured, semi-structured, and unstructured data. Handle batch and streaming pipelines with Apache Kafka, Apache Spark, Apache Flink, Airflow, or Dagster. Build ETL/ELT pipelines for data preprocessing, cleaning, and transformation. Implement data clustering, partitioning, and sharding strategies for high availability and scalability. Work with data warehouses (Snowflake, BigQuery, Redshift) and data lakes (Delta Lake, Lakehouse architectures). Ensure data lineage, governance, and compliance with modern tools (DataHub, Amundsen, Great Expectations). Cloud & Infrastructure Deploy ML workloads on AWS, Azure, or GCP using Kubernetes (K8s) and serverless computing (AWS Lambda, GCP Cloud Run). Manage containerized ML environments with Docker, Helm, Kubeflow, MLflow, Metaflow. Optimize for cost, latency, and scalability across distributed environments. Implement infrastructure as code (IaC) with Terraform or Pulumi. Real-Time ML & Advanced Capabilities Build real-time inference pipelines with low latency using gRPC, Triton Inference Server, or Ray Serve. Work on vector database integrations (Pinecone, Milvus, Weaviate, Chroma) for AI-powered semantic search. Enable retrieval-augmented generation (RAG) pipelines for LLMs. Optimize ML serving with GPU/TPU acceleration and ONNX/TensorRT model optimization. Security, Monitoring & Observability Implement robust access control, encryption, and compliance with SOC2/GDPR/ISO27001. Monitor system health with Prometheus, Grafana, ELK/EFK, and OpenTelemetry. Ensure zero-downtime deployments with blue-green/canary release strategies. Manage audit trails and explainability for ML models. Preferred Skills & Qualifications Core Technical Skills Programming: Python (Pandas, PySpark, FastAPI), SQL, Bash; familiarity with Go or Scala a plus. MLOps Frameworks: MLflow, Kubeflow, Metaflow, TFX, BentoML, DVC. Data Engineering Tools: Apache Spark, Flink, Kafka, Airflow, Dagster, dbt. Databases: PostgreSQL, MySQL, MongoDB, Cassandra, DynamoDB. Vector Databases: Pinecone, Weaviate, Milvus, Chroma. Visualization: Plotly Dash, Superset, Grafana. Tech Stack Orchestration: Kubernetes, Helm, Argo Workflows, Prefect. Infrastructure as Code: Terraform, Pulumi, Ansible. Cloud Platforms: AWS (SageMaker, S3, EKS), GCP (Vertex AI, BigQuery, GKE), Azure (ML Studio, AKS). Model Optimization: ONNX, TensorRT, Hugging Face Optimum. Streaming & Real-Time ML: Kafka, Flink, Ray, Redis Streams. Monitoring & Logging: Prometheus, Grafana, ELK, OpenTelemetry.

Posted 4 days ago

Apply

10.0 - 15.0 years

18 - 22 Lacs

Bengaluru

Work from Office

Job Area: Engineering Group, Engineering Group > Systems Engineering General Summary: We are seeking a passionate and skilled AI/ML Engineer to join our cutting-edge Extended Reality (XR) Software team. In this role, you will work on next-generation XR products that blend the physical and digital worlds, leveraging artificial intelligence and machine learning to create immersive, intelligent, and responsive experiences. You will collaborate with cross-functional teams of researchers, engineers, and designers to build real-time AI/ML software optimized for XR platforms. A strong background in C++ or embedded firmware development is essential, as you will be working close to hardware and performance-critical systems. Key Responsibilities Design, develop, and optimize AI/ML models for XR applications such as computer vision, sensor fusion, gesture recognition, and spatial understanding. Implement real-time inference pipelines on embedded or edge devices. Collaborate with firmware and hardware teams to integrate ML models into XR systems. Analyze system performance and optimize for latency, power, and memory. Stay up to date with the latest research and trends in AI/ML and XR technologies. Contribute to the full lifecycle of product development"”from prototyping to production. Required Qualifications Bachelors or Masters degree in Computer Science, Electrical Engineering, or a related field. 1"“10 years of industry experience in AI/ML engineering or embedded systems. Proficiency in C++ and/or embedded firmware development . Solid understanding of machine learning fundamentals and experience with frameworks like TensorFlow , PyTorch , or ONNX . Experience with deploying ML models on edge devices Familiarity with XR technologies (AR/VR/MR), sensor data processing, or 3D spatial computing. Minimum Qualifications: Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Systems Engineering or related work experience. OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Systems Engineering or related work experience. OR PhD in Engineering, Information Systems, Computer Science, or related field and 1+ year of Systems Engineering or related work experience. Applicants Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries). Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law. To all Staffing and Recruiting Agencies Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications. If you would like more information about this role, please contact Qualcomm Careers.

Posted 4 days ago

Apply

0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Job Title: AI/ML Validation Engineer Location: Bangalore (Onsite) Experience: 5-8 yrs Requirements: · Strong background in machine learning fundamentals, including deep learning,large language models, and recommender systems. · Strong background in validation, defect and software development life cycle · Strong knowledge on ubuntu / yocto linux · Experience working with opensource frameworks such as PyTorch, TensorFlow, and ONNX-Runtime. · Experience in profiling ML workloads · Prior experience in executing validation plans for AI/ML compute stacks such as HIP, CUDA, OpenCL, OpenVINO, ONNX Runtime and TensorFlow/PyTorch integrations. · Prior experience in validating end-to-end AI pipelines, for e.g. model conversion (e.g., PyTorch à ONNX), Inference runtimes (e.g, ONNX Runtime, TensorRT, ROCm/HIP), compilers/toolchains (e.g. TVM, Vitis AI, XDNA, XLA), kernel execution, memory transfer and inference results · Strong background in python programming. · Excellent problem-solving skills and willingness to think outside the box. · Experience with production software quality assurance practices, methodologies, and procedures · Strong ownership of deliverables, Excellent communication skills and experience working with global teams

Posted 4 days ago

Apply

0.0 - 3.0 years

0 Lacs

Bengaluru, Karnataka

On-site

Job Description – AI Developer (Agentic AI Frameworks, Computer Vision & LLMs) Location (Hybrid - Bangalore) About the Role We’re seeking an AI Developer who specializes in agentic AI frameworks —LangChain, LangGraph, CrewAI, or equivalents—and who can take both vision and language models from prototype to production. You will lead the design of multi‑agent systems that coordinate perception (image classification & extraction), reasoning, and action, while owning the end‑to‑end deep‑learning life‑cycle (training, scaling, deployment, and monitoring). Key Responsibilities Scope What You’ll Do Agentic AI Frameworks (Primary Focus) Architect and implement multi‑agent workflows using LangChain, LangGraph, CrewAI, or similar. Design role hierarchies, state graphs, and tool integrations that enable autonomous data processing, decision‑making, and orchestration. Benchmark and optimize agent performance (cost, latency, reliability). Image Classification & Extraction Build and fine‑tune CNN/ViT models for classification, detection, OCR, and structured data extraction. Create scalable data‑ingestion, labeling, and augmentation pipelines. LLM Fine‑Tuning & Retrieval‑Augmented Generation (RAG) Fine‑tune open‑weight LLMs with LoRA/QLoRA, PEFT; perform SFT, DPO, or RLHF as needed. Implement RAG pipelines using vector databases (FAISS, Weaviate, pgvector) and domain‑specific adapters. Deep Learning at Scale Develop reproducible training workflows in PyTorch/TensorFlow with experiment tracking (MLflow, W&B). Serve models via TorchServe/Triton/KServe on Kubernetes, SageMaker, or GCP Vertex AI. MLOps & Production Excellence Build robust APIs/micro‑services (FastAPI, gRPC). Establish CI/CD, monitoring (Prometheus, Grafana), and automated retraining triggers. Optimize inference on CPU/GPU/Edge with ONNX/TensorRT, quantization, and pruning. Collaboration & Mentorship Translate product requirements into scalable AI services. Mentor junior engineers, conduct code and experiment reviews, and evangelize best practices. Minimum Qualifications B.S./M.S. in Computer Science, Electrical Engineering, Applied Math, or related discipline. 5+ years building production ML/DL systems with strong Python & Git . Demonstrable expertise in at least one agentic AI framework (LangChain, LangGraph, CrewAI, or comparable). Proven delivery of computer‑vision models for image classification/extraction. Hands‑on experience fine‑tuning LLMs and deploying RAG solutions. Solid understanding of containerization (Docker) and cloud AI stacks (AWS/Azure). Knowledge of distributed training, GPU acceleration, and performance optimization. ---------------------------------------------------------------------------------------------------------------------------------------------------------- Job Type: Full-time Pay: Up to ₹1,200,000.00 per year Experience: AI, LLM, RAG: 4 years (Preferred) Vector database, Image classification: 4 years (Preferred) containerization (Docker): 3 years (Preferred) ML/DL systems with strong Python & Git: 3 years (Preferred) LangChain, LangGraph, CrewAI: 3 years (Preferred) Location: Bangalore, Karnataka (Preferred) Work Location: In person

Posted 4 days ago

Apply

10.0 - 15.0 years

11 - 14 Lacs

Hyderabad, Telangana, India

On-site

THE ROLE AMD is looking for a talented, self-driven and motivated engineer to technically lead AIG s Vitis AI Compiler projects working on AMD s XDNA (AI Engine) architecture and the Vitis AI family of software tools. The XDNA is an industry leading NPU (Neural Processing Engine) architecture in terms of performance per watt and is used in AMD s client and embedded devices as the primary engine for Machine Learning workloads. It is the hardware engine behind Windows Co-pilot on AMD devices. The team provides a fast-paced environment offering each of its members immense opportunity to interact with a wide variety of people including from other organizations like hardware designers, marketing, support, and even direct customer interaction, and truly learn and grow their skills and capabilities. THE PERSON: The ideal candidate should be passionate about software engineering and possess leadership skills to drive sophisticated technical issues to resolution. They should have demonstrated ability to identify technical problems, explore and propose viable options, and apply technical solutions. They should be able to excel in a global team environment with strong verbal and written communication skills. KEY RESPONSIBILITIES: Vitis AI is AMD s primary SDK that enables users to compile and run their ML models on the XDNA architecture which forms the basis for AMD s. As a senior member of this high-performance team, the selected candidate will have the opportunity to work on integrating the ML tool chain into frameworks like ONNX, Pytorch, TensorFlow etc. Candidate will have opportunity to work on orchestrating the compilation of ML model through different phases Candidate will integrate runtime execution of ML model on the NPU hardware through the runtime and driver. Candidate will collaborate with compiler and runtime teams to bring up latest AI models like CNNs, Transformers, Stable Diffusion, NLPs etc. on the XDNA simulator. Candidates would develop a deeper understanding of the various ML models, and how they are executed, identify performance bottlenecks and enable faster development. PREFERRED EXPERIENCE: Minimum 10 years of relevant work experience. Strong background in large scale based development and debug, including Design Patterns Experience with multi-threaded programming infrastructure and performance optimization Experience in the software development environment on both Linux and Windows is required. Experience in any one of the ML Framework like ONNX, Pytorch etc is strongly desired. Experience with scalable builds and code versioning through github, docker, CMake, artifactory is highly desired. ACADEMIC CREDENTIALS: Bachelor s or Masters degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

Posted 5 days ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies