6 LLM Deployment Jobs

JobPe aggregates listings for easy access; applications are submitted directly on the original job portal.

12.0 - 18.0 years

0 Lacs

Chennai, Tamil Nadu

On-site

Role Overview: Join a dynamic team working towards creating a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all. As a Principal Member of Technical Staff (PMTS), your expertise in designing, developing, debugging, and maintaining AI-powered applications and data engineering workflows for both local and cloud environments will be crucial. You will be involved in large-scale projects, optimizing AI/ML pipelines, and ensuring scalable data infrastructure.

Key Responsibilities:
- Develop AI-driven applications, microservices, and automation workflows using FastAPI, Flask, or Django, ensuring cloud-native deployment and performance optimization.
- Integ...

Posted 1 week ago


0.0 - 4.0 years

0 Lacs

Noida, Uttar Pradesh

On-site

As an AI Engineer at our company, you will design, develop, and deploy an in-house AI assistant powered by LLaMA 3 and integrated with our MS SQL-based ERP system (4QT ERP). You will be responsible for setting up LLM infrastructure, implementing voice input (Whisper), translating natural language to SQL, and ensuring accurate, context-aware responses to ERP-related queries.

Key Responsibilities:
- Set up and deploy LLaMA 3 (8B/FP16) models using llama-cpp-python or Hugging Face
- Integrate the AI model with FastAPI to create secure REST endpoints
- Implement prompt engineering or fine-tuning (LoRA) to enhance SQL generation accuracy
- Develop a user-facing interface (Re...

Posted 3 weeks ago


4.0 - 6.0 years

6 - 8 Lacs

Chennai

Work from Office

Key Responsibilities:

LLM Deployment & Optimization
- Deploy, fine-tune, and optimize open-source LLMs (e.g., LLaMA, Mistral, CodeS, DeepSeek).
- Implement quantization (e.g., 4-bit, 8-bit) and pruning for efficient inference on commodity hardware.
- Build and manage inference APIs (REST/gRPC) for production use.

Infrastructure Management
- Set up and manage on-premise GPU servers and VM-based deployments.
- Build scalable cloud-based LLM infrastructure using AWS (SageMaker, EC2), Azure ML, or GCP Vertex AI.
- Ensure cost efficiency by choosing appropriate hardware and job scheduling strategies.

MLOps & Reliability Engineering
- Develop CI/CD pipelines for model training, testing, evaluation, and deployme...

Posted 1 month ago


5.0 - 9.0 years

0 Lacs

Jaipur, Rajasthan

On-site

As a Senior Data Engineer + AI, you will play a crucial role in designing and optimizing distributed data pipelines using PySpark, Apache Spark, and Databricks to cater to both analytics and AI workloads. Your expertise in PySpark, Apache Spark, and Databricks for batch and streaming data pipelines will be instrumental in contributing to high-impact programs with clients. Your strong SQL skills for data analysis, transformation, and modeling will enable you to drive data-driven decision-making and facilitate rapid insight generation. Your responsibilities will involve supporting RAG pipelines, embedding generation, and data pre-processing for LLM applications, as well as creating and maintai...

Posted 2 months ago


4.0 - 6.0 years

6 - 8 Lacs

Chennai

Work from Office

Key Responsibilities:

LLM Deployment & Optimization
- Deploy, fine-tune, and optimize open-source LLMs (e.g., LLaMA, Mistral, CodeS, DeepSeek).
- Implement quantization (e.g., 4-bit, 8-bit) and pruning for efficient inference on commodity hardware.
- Build and manage inference APIs (REST/gRPC) for production use.

Infrastructure Management
- Set up and manage on-premise GPU servers and VM-based deployments.
- Build scalable cloud-based LLM infrastructure using AWS (SageMaker, EC2), Azure ML, or GCP Vertex AI.
- Ensure cost efficiency by choosing appropriate hardware and job scheduling strategies.

MLOps & Reliability Engineering
- Develop CI/CD pipelines for model training, testing, evaluation, and deployme...

Posted 2 months ago


17.0 - 18.0 years

17 - 18 Lacs

Bengaluru, Karnataka, India

On-site

We are looking for an experienced Data Scientist with expertise in Generative AI and Large Language Models (LLMs) to join our innovation team. In this role, you will work at the cutting edge of AI technologies, leveraging LLMs to develop intelligent applications that can generate, analyze, and process text data. You will collaborate with data scientists, machine learning engineers, and product teams to build and deploy state-of-the-art AI solutions that solve complex real-world problems.

Skills:
- Programming: Python
- GenAI technologies: understanding and evaluation
- LLM-LMM integration, optimization, and deployment
- RAG, knowledge graphs, transformers
- AWS Bedrock

We're hiring Sr. Data Scientist for one of our Lea...

Posted 3 months ago

