Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in india
>
CareerXperts Consulting
>
SDE III - AI Software Engineer- RAG- Vector Database

SDE III - AI Software Engineer- RAG- Vector Database

CareerXperts Consulting

5 years

0 Lacs

india

Posted:14 hours ago| Platform:

Apply

Skills Required

ai software database sql latency design mlflow kubernetes aws vertex azure openai server evaluation checks regression model drift inference reliability optimization strategies retrieval orchestration support leadership development github code engineering ml python integration monitoring tracking testing developer

Work Mode

On-site

Job Type

Full Time

Job Description

What You’ll Do

Architect, build, and scale
agentic RAG and text-to-SQL copilots
supporting
50K+ daily queries
, delivering
99.9% uptime
, low latency, and high semantic accuracy.
Design, operate, and continuously optimize a
production-grade LLMOps platform
, leveraging
LangGraph, LangSmith, MLflow, Kubernetes, async inference
, and leading cloud LLM providers such as
AWS Bedrock, Google Vertex AI, Azure OpenAI, and Anthropic
.
Develop and own
MCP server integrations
, ensuring reliable, efficient, and secure runtime execution across multi-agent workflows and toolchains.
Implement
evaluation and guardrail frameworks
(AI-as-a-Judge, grounding checks, safety filters, regression tests) to minimize hallucinations, control model drift, and
reduce token usage and inference costs by 30%+
.
Own
end-to-end system observability and performance
, including latency, throughput, reliability, cost optimization, caching strategies, and retrieval quality.
Optimize
inference, retrieval, and orchestration pipelines
to support high-traffic, enterprise-scale workloads.
Partner closely with
product, infrastructure, and leadership teams
to define SLAs, unblock customer requirements, and deliver robust, enterprise-ready AI capabilities.
Leverage
AI-assisted development tools
(GitHub Copilot, MCP-enabled IDEs, Claude, GPT, etc.) to improve development velocity, code quality, and system reliability.

What We’re Looking For

5+ years of experience
in software engineering or ML engineering, with hands-on ownership of
production-grade LLM, RAG, or agent-based systems
.
Strong
Python engineering expertise
, with deep experience building
RAG pipelines, agent architectures, tool-calling workflows, and text-to-SQL copilots
.
Proven experience working with
MCP servers, vector databases, and retrieval-augmented system architectures
.
Strong understanding of
agent development
, LLM integration patterns, prompt engineering, and
runtime orchestration frameworks
.
Hands-on experience with
cloud-native infrastructure
, including
Kubernetes, async workers, queueing systems, and observability/monitoring stacks
.
Demonstrated ability to build
LLM evaluation pipelines
, guardrails, monitoring, experiment tracking, and regression testing for AI systems.
Experience with multiple
agent SDKs
, such as:
Anthropic SDK
ClaudeAgent SDK
Google ADK (Agent Developer Kit)
Bonus: LangChain, LlamaIndex, AutoGen, or custom agent runtimes
Strong
ownership mindset
, with a track record of taking AI prototypes from concept to
scalable, reliable, high-traffic production systems
.

Write to shruthi.s@careerxperts.com to get connected.

More Jobs at CareerXperts Consulting

Senior Human Resources Manager

India

4 - 4 yrs

Salary: Not disclosed

Head of Sales

India

Experience: Not specified

Salary: Not disclosed

Senior Accountant

India

Experience: Not specified

Salary: Not disclosed

Technical Support Engineer

India

Experience: Not specified

Salary: Not disclosed

Javascript Developer

India

Experience: Not specified

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

CareerXperts Consulting

Staffing and Recruiting

Bangalore Karnataka

RecommendedJobs for You

SDE III - AI Software Engineer- RAG- Vector Database

CareerXperts Consulting

india

SDE III - AI Software Engineer- RAG- Vector Database

CareerXperts Consulting

india

Before You Leave... Find Your Perfect Job!

Login to

Please Verify Your Phone or Email

Confirm Action

Contact Us

SDE III - AI Software Engineer- RAG- Vector Database

Experience & Salary

Skills Required

Work Mode

Job Type

Job Description

agentic RAG and text-to-SQL copilots

50K+ daily queries

99.9% uptime

production-grade LLMOps platform

LangGraph, LangSmith, MLflow, Kubernetes, async inference

AWS Bedrock, Google Vertex AI, Azure OpenAI, and Anthropic

MCP server integrations

evaluation and guardrail frameworks

reduce token usage and inference costs by 30%+

end-to-end system observability and performance

inference, retrieval, and orchestration pipelines

product, infrastructure, and leadership teams

AI-assisted development tools

5+ years of experience

production-grade LLM, RAG, or agent-based systems

Python engineering expertise

RAG pipelines, agent architectures, tool-calling workflows, and text-to-SQL copilots

MCP servers, vector databases, and retrieval-augmented system architectures

agent development

runtime orchestration frameworks

cloud-native infrastructure

Kubernetes, async workers, queueing systems, and observability/monitoring stacks

LLM evaluation pipelines

agent SDKs

ownership mindset

scalable, reliable, high-traffic production systems

More Jobs at CareerXperts Consulting