Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
3.0 - 7.0 years
0 Lacs
hyderabad, all india
On-site
As an experienced LLM Engineer, your role will involve leading the deployment and optimization of self-hosted Large Language Models (LLMs) such as LLaMA, Mistral, and Falcon on on-premise GPU servers. You will be responsible for deploying and fine-tuning LLMs for low-latency, high-efficiency inference, setting up GPU-accelerated servers for AI workloads, implementing model quantization, developing APIs for model inference, automating model deployment, fine-tuning and training models, monitoring system performance, and collaborating with cross-functional teams. Key Responsibilities: - Deploy and optimize self-hosted LLMs for low-latency, high-efficiency inference. - Set up GPU-accelerated ser...
Posted 2 weeks ago
4.0 - 7.0 years
6 - 10 Lacs
chennai
Hybrid
We are seeking a skilled AI Engineer with 47 years of hands-on experience in designing, developing, and deploying AI/ML models The ideal candidate must have strong practical exposure to Large Language Models (LLMs), machine learning frameworks, and end-to-end model development The role involves building scalable AI-driven solutions, optimizing model performance, integrating intelligence into applications, and collaborating with data and engineering teams to drive automation and innovation The candidate should possess strong problem-solving skills, experience working with modern AI/ML pipelines, and a solid foundation in deploying models into production environments This is a full-time, perma...
Posted 3 weeks ago
1.0 - 5.0 years
0 Lacs
hyderabad, all india
On-site
As a Generative AI/LLM Engineer at Soothsayer Analytics, your role will involve designing, developing, and deploying AI models using cutting-edge technologies like Azure OpenAI GPT-4 variants. Your focus will be on leveraging state-of-the-art tools such as GPT-4 Vision, GPT-4 Turbo, and Retrieval-Augmented Generation (RAG) techniques to create data-driven solutions tailored to specific business needs. Key Responsibilities: - Design, develop, and deploy generative AI models using GPT-4 variants like GPT-4 Vision and GPT-4 Turbo to address specific business requirements. - Implement and optimize RAG techniques for enhanced data-driven solutions. - Build and manage AI services using Python fram...
Posted 1 month ago
7.0 - 10.0 years
14 - 18 Lacs
ahmedabad
Work from Office
Responsibilities: * Develop generative AI models using Hugging Face, Prompt Engineering & NLP DL. * Implement LLM strategies with PyTorch, Agentic AI & MLOps.
Posted 2 months ago
7.0 - 10.0 years
12 - 18 Lacs
ahmedabad
Work from Office
Responsibilities: * Develop generative AI models using LLM, Hugging Face & Prompt Eng. * Optimize ML pipelines with MLOps & Agentic AI principles. * Design NLP solutions on Python, Transformer Models & LangChains.
Posted 2 months ago
3.0 - 7.0 years
0 Lacs
hyderabad, telangana
On-site
As an experienced LLM Engineer, your role will involve leading the deployment and optimization of self-hosted Large Language Models (LLMs). You should have hands-on expertise in deploying, fine-tuning, and optimizing open-source models like LLaMA, Mistral, and Falcon on on-premise GPU servers. Key Responsibilities: - Deploy and optimize self-hosted LLMs for low-latency, high-efficiency inference. - Set up GPU-accelerated servers for AI workloads using CUDA, TensorRT, and vLLM. - Implement model quantization (GPTQ, AWQ, bitsandbytes) for efficient memory usage. - Develop APIs for model inference using FastAPI, Flask, or Hugging Face TGI. - Automate model deployment with Docker, Kubernetes, an...
Posted 3 months ago
3.0 - 7.0 years
0 Lacs
hyderabad, telangana
On-site
As an experienced LLM Engineer, you will be responsible for leading the deployment and optimization of self-hosted Large Language Models (LLMs). Your expertise in deploying, fine-tuning, and optimizing open-source models like LLaMA, Mistral, and Falcon on on-premise GPU servers will be crucial for this role. Your key responsibilities will include deploying and optimizing self-hosted LLMs for low-latency, high-efficiency inference, setting up GPU-accelerated servers for AI workloads using CUDA, TensorRT, and vLLM, implementing model quantization for efficient memory usage, developing APIs for model inference, automating model deployment, fine-tuning and training models, monitoring system perf...
Posted 4 months ago
1.0 - 5.0 years
0 Lacs
hyderabad, telangana
On-site
You will be working full-time at Soothsayer Analytics in Hyderabad as a Generative AI/LLM Engineer. Your primary responsibility will be to design, develop, and deploy generative AI models using cutting-edge technologies like Azure OpenAI GPT-4, GPT-4 Vision, or GPT-4 Omni. You should have a strong background in building and deploying AI models, with a focus on leveraging technologies such as Retrieval-Augmented Generation (RAG) and working with Vector Databases. While experience in fine-tuning large language models (LLMs) is beneficial, it is not mandatory. You are expected to have a general understanding of training or fine-tuning deep learning models and be able to quickly learn and implem...
Posted 5 months ago
6.0 - 9.0 years
10 - 20 Lacs
Hyderabad
Work from Office
Note: 1. Immediate to 30 days serving notice period 2.Who are available for face to face and video can apply Please add more profile for LLM engineer for weekend drive, below is the mandatory skills which delivery is looking for: 5+ years of relevant experience in Python , AI and machine learning - 2+ years of relevant experience in Gen AI LLM Hands-on experience with at least 1 end-to-end GenAI project Worked with LLMs such as GPT, Gemini, Claude, LLaMA, etc LLM skills: RAG, LangChain, Transformers, TensorFlow, PyTorch, spaCy Experience with REST API integration (e.g. FastAPI, Flask) Proficient in prompt types: zero-shot, few-shot, chain-of-thought - Knowledge of model training, fine-tuning...
Posted 5 months ago
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
192783 Jobs | Dublin
Wipro
61786 Jobs | Bengaluru
EY
49321 Jobs | London
Accenture in India
40642 Jobs | Dublin 2
Turing
35027 Jobs | San Francisco
Uplers
31887 Jobs | Ahmedabad
IBM
29626 Jobs | Armonk
Capgemini
26439 Jobs | Paris,France
Accenture services Pvt Ltd
25841 Jobs |
Infosys
25077 Jobs | Bangalore,Karnataka