42 Vllm Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 - 7.0 years

0 Lacs

vadodara, gujarat

On-site

Role Overview: As a GPU Infrastructure Engineer at Dharmakit Networks, you will play a crucial role in building, optimizing, and scaling the GPU and AI compute infrastructure for Project Ax1. Your responsibilities will include managing cloud and on-prem clusters, setting up model CI/CD pipelines, and ensuring efficient utilization of GPUs to support AI systems. Key Responsibilities: - Design, deploy, and optimize GPU infrastructure for large-scale AI workloads. - Manage GPU clusters across cloud platforms such as AWS, Azure, and GCP, as well as on-prem setups. - Set up and maintain model CI/CD pipelines to streamline training and deployment processes. - Optimize LLM inference using technolog...

Posted 4 days ago

AI Match Score
Apply

3.0 - 5.0 years

0 Lacs

noida, uttar pradesh, india

On-site

Artificial Intelligence Overview You will collaborate closely with cross-functional teams to design, develop, and deploy innovative solutions that harness the power of AI and generative technologies . Your technical expertise, leadership skills, and passion for pushing boundaries will be instrumental in achieving our goals. Responsibilities Lead the design, architecture, and development of Gen AI applications using cloud technologies. Collaborate with product managers, software engineers, and end user to define application requirements, user experiences, and technical specifications. Drive technical excellence, code quality, and best practices within the team. Stay abreast of the latest adva...

Posted 4 days ago

AI Match Score
Apply

3.0 - 5.0 years

0 Lacs

chandigarh, india

On-site

AIOps Lead Location: Chandigarh (On-site) Experience: 3 to 5 years (AI/ML + DevOps + Observability) Employment Type: Full-time About the Role We are looking for a next-generation AIOps Engineer to design and operate AI-driven, self-healing, and intelligent infrastructure systems. In this role, you'll fuse MLOps, DevOps, and agentic AI systems leveraging technologies like Ray, vLLM, SGLang, and PyTorch Lightning to build predictive, autonomous, and scalable operational pipelines. You will develop intelligent observability systems capable of detecting, diagnosing, and resolving issues in real time powered by distributed AI and LLM-based automation. Key Responsibilities Design, implement, and s...

Posted 4 days ago

AI Match Score
Apply

14.0 - 20.0 years

0 Lacs

haryana

On-site

Role Overview: As a Senior Technical Architect specializing in Data Science & Agentic AI, you will play a crucial role in leading the end-to-end modeling lifecycle. Your main responsibilities will include problem framing, experiment design, production deployment, and monitoring. You will be tasked with setting up the technical architecture for ML/GenAI and agentic systems. While collaborating with DE/Platform teams, your primary focus will be on modeling excellence, MLOps, and AI solution architecture to impact business KPIs positively. Key Responsibilities: - Own the technical vision for data-science initiatives and translate ambiguous business goals into modellable problems, KPIs, and NFRs...

Posted 6 days ago

AI Match Score
Apply

5.0 - 7.0 years

0 Lacs

bengaluru, karnataka, india

On-site

About us: Hiver offers teams the simplest way to offer outstanding and personalized customer service. As a customer service solution built on Gmail, Hiver is intuitive, super easy to learn, and delightful to use. Hiver is used by thousands of teams at some of the best-known companies in the world to provide attentive, empathetic, and human service to their customers at scale. We're a top-rated product on G2 and rank very highly on customer satisfaction.At Hiver, we obsess about being world-class at everything we do. Our product is loved by our customers, our content engages a very wide audience, our customer service is one of the highest rated in the industry, and our sales team is as driven...

Posted 2 weeks ago

AI Match Score
Apply

3.0 - 5.0 years

0 Lacs

noida, uttar pradesh, india

On-site

Company Overview: We are a dynamic and innovative technology company leveraging cutting-edge cloud technologies and generative AI to create groundbreaking solutions that address complex challenges across various industries. We are seeking a skilled AI Engineer to to drive the architecture and development of our AI/ML applications, contributing to our growth and success. Role Overview: You will collaborate closely with cross-functional teams to design, develop, and deploy innovative solutions that harness the power of AI and generative technologies . Your technical expertise, leadership skills, and passion for pushing boundaries will be instrumental in achieving our goals. Responsibilities: L...

Posted 2 weeks ago

AI Match Score
Apply

0.0 - 4.0 years

0 Lacs

noida, uttar pradesh

On-site

As an AI Research Intern at our company in Noida, you will be part of a dynamic team for a 6-month internship. Your role involves the following key responsibilities: - Benchmark open-source LLMs for information extraction, document reasoning, and OCR on different NVIDIA GPUs and Google TPUs - Tune infrastructure and vLLM configurations to maximize tokens-per-second per dollar spent - Continuously improve system performance through prompt refinement and rival prompt strategies To excel in this position, you should possess the following technical skills: - Proficiency in Python and experience with deep learning frameworks (PyTorch/TensorFlow) - Familiarity with LLM serving frameworks (vLLM, Hu...

Posted 3 weeks ago

AI Match Score
Apply

4.0 - 6.0 years

0 Lacs

bengaluru, karnataka, india

On-site

About the Role - We are looking for a Senior Software Engineer AI who is passionate about building intelligent systems that solve real-world problems. You'll work at the intersection of machine learning, large language models (LLMs), and backend engineering, turning research into production-ready systems. This is a hands-on engineering role with a strong emphasis on scalable AI integration, prompt engineering, RAG (retrieval-augmented generation), and building intelligent APIs and microservices. What You'll Do - ? Design and build AI-powered applications using LLMs (OpenAI, LLaMA, Mistral, etc.), vector databases, and embedding models ? Build scalable backend systems and APIs (Python, FastAP...

Posted 3 weeks ago

AI Match Score
Apply

3.0 - 6.0 years

0 Lacs

hyderabad, telangana, india

On-site

Job Title : AI Systems Engineer GPU/ROCm/CUDA | ML Frameworks Optimization Location : : 3-6 [Mid-Senior] Job Description We are looking for a passionate and experienced AI Systems Engineer to join our team to work on next-generation Machine Learning technologies and optimize performance across AMD GPU accelerators. This role involves low-level GPU programming, custom ML kernel development, and working with state-of-the-art inference engines. Key Responsibilities Develop and optimize custom Deep Learning GPU kernels using ROCm/CUDA or shader languages Support and enhance ML model deployment on Linux platforms Optimize performance of ROCm drivers and inferencing engines for AI/ML workloads Col...

Posted 3 weeks ago

AI Match Score
Apply

6.0 - 8.0 years

0 Lacs

pune, maharashtra, india

On-site

Role: Gen AI Developer Total Experience: 6+ years with 2+ years working on GenAI initiatives Employment Type: Permanent & Full time Working Model: Hybrid (3 days work from office) Job Summary: We are seeking a Senior AI Developer with proven expertise in Generative AI technologies, a solid foundation in machine learning, and a strong understanding of data governance. The ideal candidate will have hands-on experience with both cloud-based LLM platforms, on-premise, open-source LLMs like Ollama, Llama.cpp, and GGUF-based models. You should also have good knowledge in Model Context Protocol (MCP). You will help architect and implement GenAI-powered products that are secure, scalable, and enterp...

Posted 3 weeks ago

AI Match Score
Apply

0.0 years

0 Lacs

noida, uttar pradesh, india

On-site

Data Scientist (GenAI - 0-2 Years Experience) We are seeking a highly driven Data Scientist - Generative AI with 0-2 years of experience , who is passionate about building state-of-the-art AI solutions using LLMs, VLMs, and cutting-edge prompting strategies. You will work closely with our product, engineering, and research teams to prototype, finetune, and deploy GenAI models for real-world use-cases. Responsibilities Design and develop GenAI systems using prompt engineering, retrieval-augmented generation (RAG), and finetuning of LLMs/VLMs Build, evaluate, and improve prompting techniques (zero-shot, few-shot, chain-of-thought, self-consistency, etc.) Develop and maintain scalable model pip...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

As an AI Expert (Senior Engineer) at Grid Dynamics, your primary responsibility will be to develop local Large Language Models (LLMs) for Personally Identifiable Information (PII) detection across various data formats. You will be designing and fine-tuning custom LLMs to identify PII in documents, databases, knowledge graphs, source code, and other platforms. Your role will involve handling data at scale (50+ PB) and ensuring continuous retraining of the models for optimal performance. Key Responsibilities: - Design, train, fine-tune, and optimize local LLMs or other NLP models for PII detection in diverse data types. - Develop generative AI agents for schema- and metadata-based PII detectio...

Posted 1 month ago

AI Match Score
Apply

4.0 - 9.0 years

6 - 10 Lacs

chennai

Remote

Roles and Responsibilities Design, test, and refine prompts to optimize LLM performance across diverse tasks and domains Build and maintain a library of prompt templates and reusable components Collaborate with engineers, analysts, and business teams to embed prompt engineering into broader AI workflows Analyse model outputs and user feedback to iteratively improve prompt accuracy, consistency, and relevance Monitor developments in the fields of prompt engineering, generative AI, and LLM capabilities to inform internal practices Contribute to documentation, internal guidelines, and knowledge-sharing initiatives related to prompt engineering Required Background Demonstrable experience with pr...

Posted 1 month ago

AI Match Score
Apply

4.0 - 6.0 years

0 Lacs

bengaluru, karnataka, india

On-site

About the Role - We are looking for a Senior Software Engineer AI who is passionate about building intelligent systems that solve real-world problems. You'll work at the intersection of machine learning, large language models (LLMs), and backend engineering, turning research into production-ready systems. This is a hands-on engineering role with a strong emphasis on scalable AI integration, prompt engineering, RAG (retrieval-augmented generation), and building intelligent APIs and microservices. What You'll Do - ? Design and build AI-powered applications using LLMs (OpenAI, LLaMA, Mistral, etc.), vector databases, and embedding models ? Build scalable backend systems and APIs (Python, FastAP...

Posted 1 month ago

AI Match Score
Apply

3.0 - 5.0 years

0 Lacs

pune, maharashtra, india

On-site

Company Description pi-labs provides cutting-edge cybersecurity and intelligence solutions to governments and enterprises, helping them stay ahead of emerging cyber threats driven by the rapid adoption of AI. We specialize in developing advanced tools that safeguard digital ecosystems and ensure trust, safety, and authenticity. Our flagship product, Authentify , is a market-leading deepfake detection solution designed to combat sophisticated AI-driven manipulations. At pi-labs, you'll work with a team of passionate experts at the forefront of AI-powered security technologies. Join us to build impactful solutions, solve complex challenges, and contribute to protecting the digital world as it ...

Posted 1 month ago

AI Match Score
Apply

10.0 - 14.0 years

0 Lacs

chennai, tamil nadu

On-site

As an experienced AI/ML Architect, your role will involve leading the design and development of scalable, real-time AI systems. You will collaborate closely with product, data, and engineering teams to architect end-to-end solutions, encompassing model development, deployment, system integration, and production monitoring. Key Responsibilities: - Design and architect AI/ML systems that are scalable, low-latency, and production-ready - Lead the development of real-time inference pipelines for various use cases such as voice, vision, or NLP - Select and integrate appropriate tools, frameworks, and infrastructure like Kubernetes, Kafka, TensorFlow, PyTorch, ONNX, Triton, VLLM, etc. - Collaborat...

Posted 1 month ago

AI Match Score
Apply

0.0 years

0 Lacs

chennai, tamil nadu, india

On-site

Have you built and delivered complete products / applications in AI or otherwise Do you want co-founder-level ownership We'd love to talk to you. About the role This is not a standard job. It's an invitation to become a Partner and Co-founder, working with Australian founders that have a proven track record in running successful businesses in varied sectors. Established in 2006, the company now wishes to build the AI Division from the ground up this means you will be hands on in setting engineering standards, making early stack decisions, developing and launching the first product(s) and hiring and managing the AI Tech team. What's in it for you Equity and Cash Compensation: You will receive...

Posted 1 month ago

AI Match Score
Apply

3.0 - 8.0 years

8 - 12 Lacs

mumbai, delhi / ncr, bengaluru

Work from Office

About the Role: We are looking for a hands-on AI/ML Developer with experience in Large Language Models (LLMs), Prompt Engineering, and AI model integration. The ideal candidate should have practical experience working with AI models, fine-tuning them, optimizing prompts, and integrating them into real-world applications. This role is perfect for someone who has already worked on AI-driven applications and wants to expand their expertise by researching and implementing new AI advancements. You will have the opportunity to experiment with different LLM architectures, improve AI model efficiency, and contribute to AI-driven solutions. Key Responsibilities: LLM Development & Implementation: Work...

Posted 1 month ago

AI Match Score
Apply

12.0 - 14.0 years

0 Lacs

pune, maharashtra, india

On-site

Come work at a place where innovation and teamwork come together to support the most exciting missions in the world! Job Title - Technical Program Manager, AI/ML Enterprise TruRisk management (ETM) Platform, the foundation of the industry's first Risk Operations Center (ROC), aggregates exposures to quantify, communicate, and reduce cyber risk in business terms. Now powered with built in AI fabric, it empowers organizations with advanced automation, intelligent decision-making, and enhanced resilience at enterprise scale. About The Role We are seeking a highly motivated & hands-on Program management professional to join our Data Science team to manage AI/ML, GenAI initiatives. As a Technical...

Posted 1 month ago

AI Match Score
Apply

3.0 - 5.0 years

16 - 20 Lacs

noida, uttar pradesh, india

On-site

Position Overview We are hiring an experienced AI/ML Engineer to lead and shape our AI/ML initiatives. The ideal candidate will have hands-on experience in machine learning and artificial intelligence, with strong leadership capabilities and a passion for delivering production-ready solutions. This role involves end-to-end ownership of AI/ML projects, from strategy development to deployment and optimization of large-scale systems. Key Responsibilities Lead and mentor a high-performing AI/ML team. Design and execute AI/ML strategies aligned with business goals. Collaborate with product and engineering teams to identify impactful AI opportunities. Build, train, fine-tune, and deploy ML models ...

Posted 1 month ago

AI Match Score
Apply

3.0 - 7.0 years

0 Lacs

vadodara, gujarat

On-site

Dharmakit Networks is a premium global IT solutions partner dedicated to innovation and success worldwide. Specializing in website development, SaaS, digital marketing, AI Solutions, and more, we help brands turn their ideas into high-impact digital products. Known for blending global standards with deep Indian insight, we are now stepping into our most exciting chapter yet. Project Ax1 is our next-generation Large Language Model (LLM), a powerful AI initiative designed to make intelligence accessible and impactful for Bharat and the world. Built by a team of AI experts, Dharmakit Networks is committed to developing cost-effective, high-performance AI tailored for India and beyond, enabling ...

Posted 2 months ago

AI Match Score
Apply

4.0 - 6.0 years

0 Lacs

mumbai, maharashtra, india

On-site

Requirements 45 years of experience as an ML Engineer or Applied Scientist in production Strong Python skills (FastAPI/Flask), with PyTorch or TensorFlow Proficient in building data pipelines and deploying ML systems Experience with LLMs, embeddings, RAG systems, and vector DBs (PGVector/Postgres preferred) Hands-on with multiple LLM providers (OpenAI, Anthropic, Google, open-source) Knowledge of containerization and deployment (Docker, Kubernetes, AWS/GCP, CI/CD) Bonus: Familiarity with LangGraph/LangChain, vLLM, Ray, LLM eval tools, or event-driven systems Responsibilities Build and scale ML pipelines for Novas AI teammates Implement agentic AI workflows with LLMs and orchestration framewo...

Posted 2 months ago

AI Match Score
Apply

8.0 - 12.0 years

0 Lacs

karnataka

On-site

As an Engineering Manager specializing in Data & Insights at a funded Fintech company in Bangalore, India, you will play a pivotal role in reimagining financial products for the next billion users. You will be tasked with addressing the challenge of enabling access to financial services for the population that is currently underserved by traditional channels like insurance, credit, and investments. In this role, you will lead a high-performance team focused on building data-driven products that accelerate time-to-market and foster innovation in credit services. You will be responsible for driving end-to-end product delivery, making critical technical decisions, and nurturing your team of eng...

Posted 2 months ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

chennai, tamil nadu

On-site

As an engineer in this role, you will be responsible for building and optimizing high-throughput, low-latency LLM inference infrastructure. This will involve using open-source models such as Qwen, LLaMA, and Mixtral on multi-GPU systems like A100/H100. Your main areas of focus will include performance tuning, model hosting, routing logic, speculative decoding, and cost-efficiency tooling. To excel in this position, you must have deep experience with vLLM, tensor/pipe parallelism, and KV cache management. A strong understanding of CUDA-level inference bottlenecks, FlashAttention2, and quantization is essential. Additionally, familiarity with FP8, INT4, and speculative decoding (e.g., TwinPilo...

Posted 2 months ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

bengaluru, karnataka, india

Remote

Senior Manager - Senior Data Scientist (NLP & Generative AI) Location: PAN India / Remote Employment Type: Full-time About the Role We are seeking a highly experienced Senior data scientist with 8+ years of expertise in machine learning, focusing on NLP, Generative AI, and advanced LLM ecosystems. This role demands leadership in designing and deploying scalable AI systems leveraging the latest advancements such as Google ADK, Agent Engine, and Gemini LLM. You will spearhead building real-time inference pipelines and agentic AI solutions that power complex, multi-user applications with cutting-edge technology. Key Responsibilities Lead the architecture, development, and deployment of scalable...

Posted 2 months ago

AI Match Score
Apply
Page 1 of 2
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies