Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in chennai
>
Tenth Planet Technologies
>
Vector Database & Embedding Engineer RAG Pipeline Development

Vector Database & Embedding Engineer RAG Pipeline Development

Tenth Planet Technologies

3 - 8 years

12 - 18 Lacs

chennai

Posted:2 days ago| Platform:

Apply

Skills Required

embedding retrieval augmented generation vector

Work Mode

Work from Office

Job Type

Full Time

Job Description

Job Summary

Vector Database & Embedding Engineer

vector DBs (pgvector, Pinecone, Chroma, Milvus, Weaviate)

high-accuracy, high-recall retrieval systems

Key Responsibilities

1. Vector Database Design & Management

Setup, configure and manage vector DBs such as:

pgvector
,
FAISS
,
Pinecone
,
Weaviate
,
Chroma
,
Milvus

Design schemas for:

Multi-embedding storage
Metadata storage
Document-level and chunk-level indexing

Implement filtering, similarity search, MMR, reranking, and index optimization.

2. Embedding Pipeline Development

Select, fine-tune, or run embedding models such as:

Sentence-BERT, BGE, GTE, Instructor, FlagEmbedding
OpenAI Embeddings / Azure OpenAI
HuggingFace Transformers

Build:

Batch embedding pipelines
Real-time embedding APIs
Multi-encoder architecture for hybrid search

Evaluate embedding quality, dimensionality, and vector drift.

3. Chunking, Indexing & Document Processing

Design advanced
chunking strategies
:

Fixed window chunking
Sliding window
Semantic chunking
Layout-aware chunking (tables, lists, multi-column)

Extract content from:

PDFs, HTML pages, Office files, emails, scanned docs

Build a complete indexing pipeline:

Preprocessing Chunking Embedding Vector DB upsert Metadata linking

4. RAG Optimization & Retrieval Tuning

Optimize retrieval for:

Accuracy
Latency
Recall / diversity

Implement hybrid search:

Vector + Keyword
Vector + Graph (GraphRAG)

Build ranking stacks using rerankers (Cross-Encoders).

5. Backend & API Development

Build APIs for:

Document ingestion
Embedding generation
Retrieval & context merging

Serve embedding + vector workflows using Python/FastAPI or Node.js.
Integrate vector search with LLM prompt templates.

6. Monitoring, Evaluation & Scaling

Evaluate retrieval metrics (precision@k, recall@k, MRR).
Implement observability for indexing, failures, and accuracy degradation.
Scale vector DBs horizontally & vertically based on dataset size.

7. Collaboration & Documentation

Work with LLM engineers to design end-to-end RAG pipelines.
Maintain documentation for:

Embedding configs
Chunking logic
Vector schemas
Retrieval settings

Train internal teams on best practices.

Required Technical Skills

Vector Databases

Strong hands-on with:

pgvector
(must-have for enterprise)
Pinecone
,
Chroma
,
Weaviate
,
Milvus
, or
FAISS

Deep knowledge of:

Index types (HNSW, IVFFlat, PQ, IVF-PQ)
Similarity metrics (cosine, dot, euclidean)
Index tuning (ef_search, ef_construction, cluster size)

Embeddings

Experience generating and evaluating embeddings using:

OpenAI / Azure OpenAI
InstructorXL, BGE, GTE, FlagEmbedding
Sentence-BERT / HF embeddings

Knowledge of:

Embedding dimensionality
Tokenization & vector normalization
Multi-embedding pipelines

Chunking & Preprocessing

Strong experience with document processing libraries:

PDFPlumber, PyMuPDF, Textract, Tika

Designing chunking strategies for:

PDFs
Web pages
Product catalogs
Emails & logs

Metadata creation and linking strategies.

Backend / Engineering

Python (preferred), Node.js
FastAPI / Flask
SQL & NoSQL
ETL pipelines (Airflow / custom)
Docker, Linux environments

Experience Required

Total Experience:
26 years
Relevant Vector Search / Embedding Experience:
1–3 years
Experience in building real RAG systems (highly preferred).

Preferred Skills

Knowledge of:

LangChain or LlamaIndex
Rerankers (Cross-Encoders)
Hybrid retrieval
Graph + Vector hybrid search

Experience in:

OCR processing
Data extraction
Enterprise search systems

Familiarity with:

RedisSearch
ElasticSearch vector search

More Jobs at Tenth Planet Technologies

SAP SD Lead (Order-to-Cash)

Chennai

7 - 12 yrs

INR 16 - 25 Lacs

SAP MM Lead (Procure To Pay)

Chennai

7 - 12 yrs

INR 16 - 20 Lacs

SAP Program Lead Manufacturing

Chennai

12 - 15 yrs

INR 18 - 27 Lacs

SAP FICO Lead

Chennai

7 - 12 yrs

INR 16 - 25 Lacs

Senior Manager SAP Program

Chennai

12 - 15 yrs

INR 25 - 32 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

Tenth Planet Technologies

Software Development

Innovation City

RecommendedJobs for You

Vector Database & Embedding Engineer RAG Pipeline Development

Tenth Planet Technologies

chennai

Aws Data Engineer

Acesoft Labs

chennai, coimbatore

Data Engineer (Databricks, SQL, Pyspark, )

Gainwell Technologies

bengaluru

Associate - Data Engineer

Conde Nast India

chennai

Data Engineer

Principal Global Services

pune

Informatica+ Snowflake ( Matillion)Technical Lead-Data Engg

Birlasoft

hyderabad

Technical Lead-Data Engg

Birlasoft

hyderabad

Matillion Sr Technical Lead-Data Engg

Birlasoft

bengaluru

Sr Technical Lead-Data Engg

Birlasoft

noida

Technical Specialist-Data Engg

Birlasoft

bengaluru

Before You Leave... Find Your Perfect Job!

Login to

Please Verify Your Phone or Email

Confirm Action

Contact Us

Vector Database & Embedding Engineer RAG Pipeline Development

Experience & Salary

Skills Required

Work Mode

Job Type

Job Description

Job Summary

Vector Database & Embedding Engineer

vector DBs (pgvector, Pinecone, Chroma, Milvus, Weaviate)

high-accuracy, high-recall retrieval systems

Key Responsibilities

1. Vector Database Design & Management

pgvector

FAISS

Pinecone

Weaviate

Chroma

Milvus

2. Embedding Pipeline Development

3. Chunking, Indexing & Document Processing

chunking strategies

4. RAG Optimization & Retrieval Tuning

Vector + Keyword

Vector + Graph (GraphRAG)

5. Backend & API Development

6. Monitoring, Evaluation & Scaling

7. Collaboration & Documentation

Required Technical Skills

Vector Databases

pgvector

Pinecone

Chroma

Weaviate

Milvus

FAISS

Embeddings

Chunking & Preprocessing

Backend / Engineering

Experience Required

Total Experience:

Relevant Vector Search / Embedding Experience:

Preferred Skills

More Jobs at Tenth Planet Technologies