Posted:1 day ago|
Platform:
Work from Office
Full Time
The Senior Data Engineer will build and maintain the core data infrastructure for an enterprise AI platform. This role focuses on designing scalable data pipelines, developing knowledge graphs, and preparing structured and unstructured data for AI and LLM-based applications.
- Design ontologies and graph schemas for complex enterprise relationships
- Implement entity resolution and relationship inference across data sources- Build APIs and query interfaces for graph traversal- Optimize graph storage and query performance for large-scale usage
- Extract and model enterprise metadata such as business rules and data dictionaries
- Parse and semantically index documents and code artifacts- Build integrations with enterprise APIs and internal platforms
- Prepare structured and contextual data for LLM consumption
- Design embedding strategies and manage vector databases for semantic search- Build memory and context management systems for stateful AI applications
- 5+ years of Data Engineering experience with production-grade pipelines
- Strong Python skills (clean, testable, maintainable code) - MongoDB expertise (schema design, aggregation pipelines, indexing, performance tuning)- Vector databases experience (Qdrant, Pinecone, Weaviate, pgvector)- Document processing experience (chunking, metadata extraction, PDFs/Word/HTML; LangChain or similar)- Strong SQL Skills (complex Queries, Joins, Window Functions, Optimization)- ETL/ELT at scale (incremental loads, CDC, idempotent pipelines)- Pipeline orchestration tools (Airflow, Dagster, Prefect, or similar)
- Experience building production RAG pipelines
- Deep understanding of embedding models and dimensionality- Graph databases (Neo4j) and Cypher query expertise- LLM application development using LangChain or Lang Graph- Streaming systems (Kafka, Flink) for real-time pipelines
- Hybrid search (vector + keyword/metadata filtering)
- Apache Spark for large-scale transformations
pipelines, rag, cdc, vector databases, metadata,flink,kafka,design,data Warehouse
Aifa Labs
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python Now
bengaluru
15.0 - 20.0 Lacs P.A.
navi mumbai, pune, mumbai (all areas)
13.0 - 20.0 Lacs P.A.
4.0 - 6.0 Lacs P.A.
8.65 - 10.15 Lacs P.A.
bhopal
10.0 - 15.0 Lacs P.A.
hyderabad, chennai
7.0 - 11.0 Lacs P.A.
Salary: Not disclosed
0.00019 - 0.00021 Lacs P.A.
noida, uttar pradesh, india
Salary: Not disclosed
hyderabad, telangana, india
Salary: Not disclosed