66 Vllm Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 12.0 years

0 Lacs

mumbai, maharashtra, india

On-site

Dear All, Hiring for Agentic AI & Multi-Agent Solutions track, LangGraph experience is a must-have . Exp: 5 to 12yrs Location: Mumbai/Gurugram Mode: Hybrid Notice period: Immediate to 15days 1. Lead Developer 10+ years Experience Mandatory: LangGraph, multi-agent architecture, Python, Kubernetes, VLLM 2. Mid-Level Developers) 7 years experience Strong experience with LangGraph, Python, knowledge graphs, agent workflows 3. Mid-Junior Developers 5 years experience Good understanding of agentic systems, Python, and exposure to LangGraph Roles Covered Developer Agent Performance Evaluation & Intelligence Enhancement Lead Developer Agentic AI & Multi-Agent Solutions Developer Agentic AI & Multi-A...

Posted 1 day ago

AI Match Score
Apply

6.0 - 12.0 years

0 Lacs

chennai, all india

On-site

As an experienced AI/ML Architect, your role will involve leading the design and development of scalable, real-time AI systems. You will collaborate with product, data, and engineering teams to architect end-to-end solutions, from model development to deployment, system integration, and production monitoring. Key Responsibilities: - Design and architect AI/ML systems that are scalable, low-latency, and production-ready. - Lead the development of real-time inference pipelines for voice, vision, or NLP use cases. - Select and integrate appropriate tools, frameworks, and infrastructure such as Kubernetes, Kafka, TensorFlow, PyTorch, ONNX, Triton, and VLLM. - Collaborate with data scientists and...

Posted 3 days ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Company Qualcomm India Private Limited Job Area Engineering Group, Engineering Group > Software Engineering General Summary Job Overview: The Qualcomm Cloud Computing team is developing hardware and software for Machine Learning solutions spanning the data center, edge, infrastructure, automotive market. We are seeking ambitious, bright, and innovative engineers with experience in machine learning framework development. Job activities span the whole product life cycle from early design to commercial deployment. The environment is fast-paced and requires cross-functional interaction daily so good communication, planning and execution skills are a must. Key Responsibilities Analyze software re...

Posted 5 days ago

AI Match Score
Apply

2.0 - 5.0 years

0 Lacs

mumbai, maharashtra, india

On-site

About Us Zycus, recognized by leading analyst firms in procurement technology, empowers teams to unlock deep value through its comprehensive Source-to-Pay (S2P) solutions. At the heart of our S2P solution is the Merlin Agentic Platform, which orchestrates intelligent AI agents to deliver simplified, efficient, and compliant processes. The Merlin Intake Agent Offers Business Users Unparalleled Ease Of Use, Increasing Adoption Rates And Significantly Reducing Non-compliant Spending. For Procurement Teams, The Merlin Autonomous Negotiation Agent Handles Tail Spend Autonomously, Securing Additional Savings; The Merlin Contract Agent Helps Draft Compliant Contracts And Reduces Risks By Actively M...

Posted 5 days ago

AI Match Score
Apply

4.0 - 6.0 years

0 Lacs

thiruvananthapuram, kerala, india

On-site

Job Family Data Science & Analysis (India) Travel Required Up to 10% Clearance Required None What You Will Do Design, train, and fine-tune advanced foundational models (text, audio, vision) using healthcare-and other relevant datasets, focusing on accuracy and context relevance. Collaborate with cross-functional teams (Business, engineering, IT) to seamlessly integrate AI/ML technologies into our solution offerings. Deploy, monitor, and manage AI models in a production environment, ensuring high availability, scalability, and performance. Continuously research and evaluate the latest advancements in AI/ML and industry trends to drive innovation. Develop and maintain comprehensive documentati...

Posted 5 days ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

kerala

On-site

As a highly skilled Senior Machine Learning Engineer with expertise in Deep Learning, Large Language Models (LLMs), and MLOps/LLMOps, your role will involve designing, optimizing, and deploying cutting-edge AI solutions. You will be responsible for tasks such as model development, fine-tuning, deployment, and scalability. The ideal candidate will have hands-on experience in developing and scaling deep learning models, fine-tuning LLMs, and implementing deployment pipelines for production environments. Key Responsibilities: - Design, train, fine-tune, and optimize deep learning models (CNNs, RNNs, Transformers) for NLP, computer vision, or multimodal applications. - Fine-tune and adapt Large ...

Posted 1 week ago

AI Match Score
Apply

0.0 years

0 Lacs

india

On-site

About Salvo Software Salvo Software is a global technology company specializing in custom software development and advanced engineering solutions. With distributed teams across the US, LATAM, and India, we partner with clients to build high-performance, scalable systems that solve complex technical challenges. Our culture values innovation, ownership, and engineering excellence. We're growing our AI capabilities and are looking for a backend-focused AI Developer to join our team Role Description We are seeking a highly skilled AI Developer with a strong backend and machine learning engineering background to design, train, optimize, and deploy LLM models in on-prem and offline environments. T...

Posted 1 week ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

As a Machine Learning Systems Architect, your role involves leading the architecture, development, and deployment of scalable machine learning systems with a focus on real-time inference for LLMs serving multiple concurrent users. You will optimize inference pipelines using high-performance frameworks such as vLLM, Groq, ONNX Runtime, Triton Inference Server, and TensorRT to minimize latency and cost. Additionally, you will design and implement agentic AI systems utilizing frameworks like LangChain, AutoGPT, and ReAct for autonomous task orchestration. Key Responsibilities: - Fine-tune, integrate, and deploy foundation models including GPT, LLaMA, Claude, Mistral, Falcon, and others into int...

Posted 2 weeks ago

AI Match Score
Apply

4.0 - 9.0 years

13 - 15 Lacs

noida, greater noida

Work from Office

Strong NLP, Speech, Deep Learning, and GenAI skills with . Expertise in Python, PyTorch, TensorFlow, STT/TTS, open-source LLMs, ONNX, vLLM, FAISS/Pinecone, FastAPI, Docker, Linux, GPU infra, multilingual NLP, and model training. Apply - 9136606172

Posted 2 weeks ago

AI Match Score
Apply

3.0 - 5.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Company Qualcomm India Private Limited Job Area Engineering Group, Engineering Group > Software Engineering General Summary Job Overview: The Qualcomm Cloud Computing team is developing hardware and software for Machine Learning solutions spanning the data center, edge, infrastructure, automotive market. We are seeking ambitious, bright, and innovative engineers with experience in machine learning framework development. Job activities span the whole product life cycle from early design to commercial deployment. The environment is fast-paced and requires cross-functional interaction daily so good communication, planning and execution skills are a must. Key Responsibilities Analyze software re...

Posted 2 weeks ago

AI Match Score
Apply

1.0 - 3.0 years

0 Lacs

gurugram, haryana, india

On-site

Job Title: Artificial Intelligence (AI) Engineer Location: Gurugram, India (Onsite Only) Company: Rx One Care Pvt. Ltd. Type: Full-Time | Immediate Joining Preferred Role Overview deployments, and scalable backend services. You will play a key role in building, fine-tuning, and deploying AI-powered solutions (voice, NLP, automation) that power RxOne's next-gen patient engagement and clinical workflow intelligence platform. Key Responsibilities Build, fine-tune, and optimize open-source LLMs , ASR/TTS models, and other ML components. Develop and maintain containerized ML pipelines using Docker & Kubernetes. Deploy inference services in cloud-native environments (AWS/GCP/Azure). Collaborate wi...

Posted 3 weeks ago

AI Match Score
Apply

0.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Job Requirements Work on latest machine learning technologies Work on supporting for latest Linux operating system Work on AMD next generation GPUs/Accelerators Work on optimizing latest Rocm drivers and improve performance Design new machine learning technologies Work Experience MS/BS degree in Computer Science or an equivalent Deep Knowledge of C/C++ and Python programming Experience with Linux Commands is must Experience with Scripting language like bash/powershell Understanding of various python ML frameworks like Pytorch, Transformers etc Understanding of various language and compiler for writing highly efficient custom Deep-Learning GPU Kernels. like Triton/Jax Hands on Debugging Exper...

Posted 3 weeks ago

AI Match Score
Apply

7.0 - 9.0 years

0 Lacs

chennai, tamil nadu, india

On-site

Job Title: Site Reliability Engineer (SRE) Azure & AI Experience: 7+ years Work Mode: Hybrid Work Location: Chennai/Mumbai/Gurgaon Job Summary: We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure , AI infrastructure , and automation . The ideal candidate will have a solid background in managing cloud environments using GitHub/Azure DevOps , and hands-on experience in AI model deployment and scaling . This role involves working closely with engineering teams to deliver reliable, secure, and scalable cloud infrastructure that supports AI workloads and enterprise applications. Key Responsibilities: Design, build, and maintain scalable cloud...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

all india, gurugram

On-site

As a Senior Software Engineer at Carelon Global Solutions India, your role will involve developing and maintaining microservice architecture and API management solutions for seamless deployment of AI solutions. You will collaborate with cross-functional teams to acquire, process, and manage data for AI/ML model integration, design robust data pipelines, optimize machine learning models, and create CI/CD pipelines using Git-based platforms. Operating container orchestration platforms like Kubernetes for scalable ML workload deployments will be a key part of your responsibilities. You will also engage in advanced prompt engineering, document model architectures, and implement cutting-edge LLM ...

Posted 4 weeks ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

pune, all india

On-site

As a Machine Learning Systems Architect, your role will involve leading the architecture, development, and deployment of scalable machine learning systems with a focus on real-time inference for LLMs serving multiple concurrent users. You will be responsible for optimizing inference pipelines using high-performance frameworks such as vLLM, Groq, ONNX Runtime, Triton Inference Server, and TensorRT to minimize latency and cost. Your key responsibilities will include: - Designing and implementing agentic AI systems utilizing frameworks like LangChain, AutoGPT, and ReAct for autonomous task orchestration. - Fine-tuning, integrating, and deploying foundation models including GPT, LLaMA, Claude, M...

Posted 4 weeks ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

pune, maharashtra, india

On-site

Key Responsibilities: Design, build, and maintain CI/CD pipelines for ML model training, validation, and deployment Automate and optimize ML workflows, including data ingestion, feature engineering, model training, and monitoring Deploy, monitor, and manage LLMs and other ML models in production (on-premises and/or cloud) Implement model versioning, reproducibility, and governance best practices Collaborate with data scientists, ML engineers, and software engineers to streamline end-to-end ML lifecycle Ensure security, compliance, and scalability of ML/LLM infrastructure Troubleshoot and resolve issues related to ML model deployment and serving Evaluate and integrate new MLOps/LLMOps tools a...

Posted 1 month ago

AI Match Score
Apply

5.0 - 7.0 years

0 Lacs

chennai, tamil nadu, india

On-site

We're looking for a skilled Al/ML lead ( 5+ years) based out of Chennai, for a global computer and network security company. Deep experience in training and fine-tuning Large Language Models (LLMs) such as LLaMA 3 using frameworks like vLLM . The ideal candidate will bring a strong background in machine learning and a practical understanding of the cybersecurity domainespecially around threat intelligence, vulnerabilities, exploits, and configuration analysis . You will lead the development and implementation of models that understand, process, and generate insights across a wide range of cybersecurity content. You will guide a team of ML engineers and collaborate closely with cybersecurity ...

Posted 1 month ago

AI Match Score
Apply

8.0 - 15.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Job Description Publicis Sapient is looking for an experienced Manager, Machine Learning Engineering to lead our talented team of AI and data science experts. In this influential role, you will be responsible for developing and implementing solutions that address complex business challenges across a wide range of industries. Your expertise will empower clients to revolutionize their businesses by harnessing the potential of advanced technology. As a Manager, Machine Learning Engineering, you will collaborate with cross-functional teams to strategize, develop, and deliver machine learning models tailored to meet specific business objectives. You will be responsible for overseeing the entire l...

Posted 1 month ago

AI Match Score
Apply

4.0 - 5.0 years

13 - 15 Lacs

noida

Work from Office

Hiring Voicebot Developer (4+ yrs) – Onsite Noida Sec-16A. Work on real-time AI voice systems using WebRTC, LiveKit, Whisper, ASR/TTS & open-source models. Strong Python required. Share CV: kshitij.gawali@enlinkit.com

Posted 1 month ago

AI Match Score
Apply

3.0 - 7.0 years

0 Lacs

maharashtra

On-site

As a Senior Machine Learning Engineer at GoCommotion, you will lead the design, development, and scaling of the AI Worker platform, a multimodal, agentic system that handles voice, vision, and language in real time. In addition to this, you will mentor junior engineers and collaborate across teams to drive innovation and enhance customer experience. Key Responsibilities: - Build AI systems powering AI Workers, enabling agentic, persistent, and multimodal interactions across voice, text, and image. - Lead the full ML lifecycle, including data pipelines, architecture design, model training, deployment, and monitoring. - Design speech-to-speech models for low-latency voice interactions without ...

Posted 1 month ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

pune, maharashtra

On-site

As a Machine Learning Systems Architect, your primary responsibility will be to lead the architecture, development, and deployment of scalable machine learning systems with a focus on real-time inference for Large Language Models (LLMs) to serve multiple concurrent users. To achieve this, you will: - Optimize inference pipelines using high-performance frameworks such as vLLM, Groq, ONNX Runtime, Triton Inference Server, and TensorRT to minimize latency and cost. - Design and implement agentic AI systems using frameworks like LangChain, AutoGPT, and ReAct for autonomous task orchestration. - Fine-tune, integrate, and deploy foundation models like GPT, LLaMA, Claude, Mistral, Falcon, and other...

Posted 1 month ago

AI Match Score
Apply

0.0 years

0 Lacs

india

Remote

We're building production-grade LLM/AI capabilities into our Cloud Operating System and we need someone who can ship. If you like taking AI from prototype ? scalable product, this is for you. What you'll do Design and ship AI/LLM features that run in production Build RAG-style systems (embeddings, vector search) that actually perform Own model serving, observability, and reliability across our AI stack What we're looking for Strong Go (Golang) for production ML services LLMs, RAG, embeddings, vector search (FAISS / pgvector) PyTorch or TensorFlow and experience with model serving/inference MLOps : CI/CD, monitoring, evaluations, Docker/Kubernetes, GPUs Inference engine experience: e.g. vLLM,...

Posted 1 month ago

AI Match Score
Apply

0.0 years

0 Lacs

bengaluru, karnataka, india

Remote

Red Hat OpenShift AI is a flexible, scalable artificial intelligence (AI) and machine learning (ML) platform that enables enterprises to create and deliver AI-enabled applications at scale across hybrid cloud environments. Built using open-source technologies, OpenShift AI provides trusted, operationally consistent capabilities for teams to experiment, serve models, and deliver innovative apps. The OpenShift AI team seeks a Software Engineer with Kubernetes and Model Inference Runtimes experience to join our rapidly growing engineering team. Our team focuses on making machine learning model deployment and monitoring seamless and scalable across the hybrid cloud and the edge. This is a fascin...

Posted 1 month ago

AI Match Score
Apply

0.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Lead the architecture, development, and deployment of scalable machine learning systems, focusing on real-time inference for LLMs serving multiple concurrent users. Optimize inference pipelines using high-performance frameworks like vLLM, Groq, ONNX Runtime, Triton Inference Server, and TensorRT to minimize latency and cost. Design and implement agentic AI systems utilizing frameworks such as LangChain, AutoGPT, and ReAct for autonomous task orchestration. Fine-tune, integrate, and deploy foundation models including GPT, LLaMA, Claude, Mistral, Falcon, and others into intelligent applications. Develop and maintain robust MLOps workflows to manage the full model lifecycle including training, ...

Posted 1 month ago

AI Match Score
Apply

3.0 - 7.0 years

0 Lacs

vadodara, gujarat

On-site

Role Overview: As a GPU Infrastructure Engineer at Dharmakit Networks, you will play a crucial role in building, optimizing, and scaling the GPU and AI compute infrastructure for Project Ax1. Your responsibilities will include managing cloud and on-prem clusters, setting up model CI/CD pipelines, and ensuring efficient utilization of GPUs to support AI systems. Key Responsibilities: - Design, deploy, and optimize GPU infrastructure for large-scale AI workloads. - Manage GPU clusters across cloud platforms such as AWS, Azure, and GCP, as well as on-prem setups. - Set up and maintain model CI/CD pipelines to streamline training and deployment processes. - Optimize LLM inference using technolog...

Posted 1 month ago

AI Match Score
Apply
Page 1 of 3
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies