Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Jobs

Interviews

Home
>
Jobs in Maharashtra
>
The Value Maximizer
>
AI Engineer/ Data Scientist

AI Engineer/ Data Scientist

The Value Maximizer

5 years

0 Lacs

Maharashtra India

Posted:1 month ago| Platform:

Apply

Skills Required

ai data multilingual personalization retrieval latency optimization vision technology research ml design inference layers mobile strategies evaluation metrics engineering pytorch orchestration collaboration ux training deployment tuning writing

Work Mode

On-site

Job Type

Full Time

Job Description

About The Role

We are building next-generation AI assistants that combine real-time responsiveness, multilingual multimodal understanding, and deep personalization. As an

LLM Engineer - AI Assistant & RAG Systems

, you'll be leading initiatives that bridge Retrieval-Augmented Generation (RAG), vector search, latency optimization, and long-term memory for highly scalable consumer-facing applications. You'll work on architecting intelligent, efficient, and privacy-aware voice and text-based assistants.We're proud to share that

Lenskart is now our strategic investor

, backing our vision to make conscious technology accessible at scale. If you're someone who thrives at the intersection of research and product, we want you on our team.

Minimum Work Experience Required

5+ years

of experience in ML/NLP roles with strong hands-on expertise in building large-scale AI/LLM systems for production.

Top 3 Daily Tasks

Design and optimize

LLM-powered assistant systems

including RAG, vector databases, rerankers, and latency-aware inference pipelines.Build feedback loops and observability layers to

evaluate and improve assistant quality

in production.Collaborate with product, mobile, and infra teams to enable seamless

multilingual + multimodal assistant experiences

minimal latency.

Top 5 Skills You Should Possess

Proven experience working with

LLMs

RAG pipelines

, and

vector search systems

(e.g., FAISS, Qdrant, Milvus).Deep understanding of

latency optimization

, streaming token responses, and caching strategies in LLM deployment.Experience with

retriever-reranker tuning

, LLM evaluation metrics, prompt engineering, and hallucination mitigation techniques.Strong foundation in

Python

PyTorch/TensorFlow

FastAPI

, and orchestration tools like

Airflow

Docker

, and

Kubernetes

.Ability to design memory modules using

long-term embeddings

, user vectors, and strategies like

memory decay

and

context truncation

for scalable personalization.

Cross-Functional Collaboration Excellence

Work closely with front-end, infra, and product teams to deliver cohesive assistant interactions.Collaborate with UX teams to define

feedback capture

user adaptation mechanisms

, and

privacy-aware memory usage

.Interface with Data and MLOps teams for scalable training, evaluation, and deployment pipelines.

Bonus Points For

Experience in

Agentic systems

autonomous workflows

, or

fine-tuning LLMs with LORA/QLORA

.Publications or writing in the domain of

LLMs

GenAI

, or

retrieval architectures

.Contributions to open-source projects in

RAG/LLM/prompt engineering

or published tools for LLM deployment.Exposure to building

voice interfaces

multimodal input pipelines

using tools like Whisper or CLIP.

What You'll Be Creating

A real-time,

multimodal and multilingual assistant

that adapts to user preferences and evolves with usage.Low-latency, scalable backend for LLM-powered interactions under minimal

latency SLA

.A robust feedback and retraining loop enabling

continuous improvement of LLM outputs

.Privacy-aware

long-term memory system

with vectorized personalization and memory decay.

More Jobs at The Value Maximizer

Senior Dot Net Developer

Thiruvananthapuram, Kerala, India

8.0 - 8.0 yrs

Salary: Not disclosed

Enterprise Architect

Thiruvananthapuram, Kerala, India

10.0 - 10.0 yrs

Salary: Not disclosed

Data Architect

Trivandrum, Kerala, India

10.0 - 10.0 yrs

Salary: Not disclosed

Enterprise Architect

Kochi, Kerala, India

10.0 - 10.0 yrs

Salary: Not disclosed

Data Architect

Kochi, Kerala, India

10.0 - 10.0 yrs

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

The Value Maximizer

Before You Leave... Find Your Perfect Job!

Login to

Please Verify Your Phone or Email

Confirm Action

Contact Us

Search

Profile

Upskill and Grow with AI

Personal Settings

AI Engineer/ Data Scientist

Experience & Salary

Skills Required

Work Mode

Job Type

Job Description

About The Role

LLM Engineer - AI Assistant & RAG Systems

Lenskart is now our strategic investor

Minimum Work Experience Required

5+ years

Top 3 Daily Tasks

LLM-powered assistant systems

evaluate and improve assistant quality

multilingual + multimodal assistant experiences

Top 5 Skills You Should Possess

LLMs

RAG pipelines

vector search systems

latency optimization

retriever-reranker tuning

Python

PyTorch/TensorFlow

FastAPI

Airflow

Docker

Kubernetes

long-term embeddings

memory decay

context truncation

Cross-Functional Collaboration Excellence

feedback capture

user adaptation mechanisms

privacy-aware memory usage

Bonus Points For

Agentic systems

autonomous workflows

fine-tuning LLMs with LORA/QLORA

LLMs

GenAI

retrieval architectures

RAG/LLM/prompt engineering

voice interfaces

multimodal input pipelines

What You'll Be Creating

multimodal and multilingual assistant

latency SLA

continuous improvement of LLM outputs

long-term memory system

More Jobs at The Value Maximizer