9 Llm Engineer Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 - 7.0 years

0 Lacs

hyderabad, all india

On-site

As an experienced LLM Engineer, your role will involve leading the deployment and optimization of self-hosted Large Language Models (LLMs) such as LLaMA, Mistral, and Falcon on on-premise GPU servers. You will be responsible for deploying and fine-tuning LLMs for low-latency, high-efficiency inference, setting up GPU-accelerated servers for AI workloads, implementing model quantization, developing APIs for model inference, automating model deployment, fine-tuning and training models, monitoring system performance, and collaborating with cross-functional teams. Key Responsibilities: - Deploy and optimize self-hosted LLMs for low-latency, high-efficiency inference. - Set up GPU-accelerated ser...

Posted 2 weeks ago

AI Match Score
Apply

4.0 - 7.0 years

6 - 10 Lacs

chennai

Hybrid

We are seeking a skilled AI Engineer with 47 years of hands-on experience in designing, developing, and deploying AI/ML models The ideal candidate must have strong practical exposure to Large Language Models (LLMs), machine learning frameworks, and end-to-end model development The role involves building scalable AI-driven solutions, optimizing model performance, integrating intelligence into applications, and collaborating with data and engineering teams to drive automation and innovation The candidate should possess strong problem-solving skills, experience working with modern AI/ML pipelines, and a solid foundation in deploying models into production environments This is a full-time, perma...

Posted 3 weeks ago

AI Match Score
Apply

1.0 - 5.0 years

0 Lacs

hyderabad, all india

On-site

As a Generative AI/LLM Engineer at Soothsayer Analytics, your role will involve designing, developing, and deploying AI models using cutting-edge technologies like Azure OpenAI GPT-4 variants. Your focus will be on leveraging state-of-the-art tools such as GPT-4 Vision, GPT-4 Turbo, and Retrieval-Augmented Generation (RAG) techniques to create data-driven solutions tailored to specific business needs. Key Responsibilities: - Design, develop, and deploy generative AI models using GPT-4 variants like GPT-4 Vision and GPT-4 Turbo to address specific business requirements. - Implement and optimize RAG techniques for enhanced data-driven solutions. - Build and manage AI services using Python fram...

Posted 1 month ago

AI Match Score
Apply

7.0 - 10.0 years

14 - 18 Lacs

ahmedabad

Work from Office

Responsibilities: * Develop generative AI models using Hugging Face, Prompt Engineering & NLP DL. * Implement LLM strategies with PyTorch, Agentic AI & MLOps.

Posted 2 months ago

AI Match Score
Apply

7.0 - 10.0 years

12 - 18 Lacs

ahmedabad

Work from Office

Responsibilities: * Develop generative AI models using LLM, Hugging Face & Prompt Eng. * Optimize ML pipelines with MLOps & Agentic AI principles. * Design NLP solutions on Python, Transformer Models & LangChains.

Posted 2 months ago

AI Match Score
Apply

3.0 - 7.0 years

0 Lacs

hyderabad, telangana

On-site

As an experienced LLM Engineer, your role will involve leading the deployment and optimization of self-hosted Large Language Models (LLMs). You should have hands-on expertise in deploying, fine-tuning, and optimizing open-source models like LLaMA, Mistral, and Falcon on on-premise GPU servers. Key Responsibilities: - Deploy and optimize self-hosted LLMs for low-latency, high-efficiency inference. - Set up GPU-accelerated servers for AI workloads using CUDA, TensorRT, and vLLM. - Implement model quantization (GPTQ, AWQ, bitsandbytes) for efficient memory usage. - Develop APIs for model inference using FastAPI, Flask, or Hugging Face TGI. - Automate model deployment with Docker, Kubernetes, an...

Posted 3 months ago

AI Match Score
Apply

3.0 - 7.0 years

0 Lacs

hyderabad, telangana

On-site

As an experienced LLM Engineer, you will be responsible for leading the deployment and optimization of self-hosted Large Language Models (LLMs). Your expertise in deploying, fine-tuning, and optimizing open-source models like LLaMA, Mistral, and Falcon on on-premise GPU servers will be crucial for this role. Your key responsibilities will include deploying and optimizing self-hosted LLMs for low-latency, high-efficiency inference, setting up GPU-accelerated servers for AI workloads using CUDA, TensorRT, and vLLM, implementing model quantization for efficient memory usage, developing APIs for model inference, automating model deployment, fine-tuning and training models, monitoring system perf...

Posted 4 months ago

AI Match Score
Apply

1.0 - 5.0 years

0 Lacs

hyderabad, telangana

On-site

You will be working full-time at Soothsayer Analytics in Hyderabad as a Generative AI/LLM Engineer. Your primary responsibility will be to design, develop, and deploy generative AI models using cutting-edge technologies like Azure OpenAI GPT-4, GPT-4 Vision, or GPT-4 Omni. You should have a strong background in building and deploying AI models, with a focus on leveraging technologies such as Retrieval-Augmented Generation (RAG) and working with Vector Databases. While experience in fine-tuning large language models (LLMs) is beneficial, it is not mandatory. You are expected to have a general understanding of training or fine-tuning deep learning models and be able to quickly learn and implem...

Posted 5 months ago

AI Match Score
Apply

6.0 - 9.0 years

10 - 20 Lacs

Hyderabad

Work from Office

Note: 1. Immediate to 30 days serving notice period 2.Who are available for face to face and video can apply Please add more profile for LLM engineer for weekend drive, below is the mandatory skills which delivery is looking for: 5+ years of relevant experience in Python , AI and machine learning - 2+ years of relevant experience in Gen AI LLM Hands-on experience with at least 1 end-to-end GenAI project Worked with LLMs such as GPT, Gemini, Claude, LLaMA, etc LLM skills: RAG, LangChain, Transformers, TensorFlow, PyTorch, spaCy Experience with REST API integration (e.g. FastAPI, Flask) Proficient in prompt types: zero-shot, few-shot, chain-of-thought - Knowledge of model training, fine-tuning...

Posted 5 months ago

AI Match Score
Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies