Jobs
Interviews

27 Vector Db Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

4.0 - 8.0 years

13 - 18 Lacs

Chennai

Work from Office

Handson exp with LLM-based solution experience using OpenAI, Gemini, etc. Must have skills in fine-tuning, RAG, agents, vector DBs (ChromaDB, FAISS, Pinecone), LangChain, Hugging Face, TensorFlow, PyTorch. 1-2 GenAI implementations & 2+ yrs in ML/DL.

Posted 4 days ago

Apply

2.0 - 5.0 years

9 - 16 Lacs

Pune

Work from Office

Responsibilities: * Develop AI solutions (Agentic AI) using Python, RAG, LLM, Vector DB & MC Protocol * Optimize agent performance through continuous finetuning * Collaborate with cross-functional teams on project delivery Health insurance

Posted 4 days ago

Apply

4.0 - 9.0 years

15 - 30 Lacs

Pune

Work from Office

Experience : 4+ ,6+,9 + Yrs Skills : Python , Rag, Vector DB, AI/ML ,Langchain/Langgraph , LLM, AWS , Java , Fullstack , Sql , Nosql Face -Face interview - Pune Balewadi

Posted 1 week ago

Apply

6.0 - 10.0 years

12 - 22 Lacs

Bengaluru

Hybrid

Job Title: Python, Machine Learning & GenAI Specialist Company: Tekskills Location: Bengaluru, India Contact: shailender.b@tekskills.in Job Summary We are seeking a dynamic and experienced professional with 5+ years of hands-on expertise in Python, Machine Learning, and advanced experience in Deep Learning and Generative AI (GenAI) . This role is ideal for candidates passionate about solving complex problems, driving innovation, and implementing next-gen AI solutions for real-world applications. Key Responsibilities Design, develop, and deploy robust solutions leveraging Core Python , standard libraries, and frameworks. Build and optimize end-to-end machine learning and deep learning models for production use. Architect and implement GenAI-powered applications utilizing: Vector Databases, Langchain, Langgraph, RAG, and MCP. Integrate external data sources and APIs, and build scalable pipelines for AI model deployment. Collaborate with internal teams to translate business requirements into cutting-edge AI features. Conduct model evaluations, fine-tuning, and support ongoing model monitoring. Mentor and guide team members and contribute to best practices within the AI community. Required Skills & Qualifications Minimum 5 years of professional experience in Python development, with a focus on machine learning and deep learning. Strong understanding of core ML concepts , with proven project delivery using frameworks such as TensorFlow, PyTorch, or scikit-learn. Hands-on experience in deep learning (CNNs, RNNs, LLMs/Transformers, or similar architectures). Demonstrable expertise in GenAI tools (Vector Databases, Langchain, Langgraph, Retrieval Augmented Generation (RAG), MCP). Introductory knowledge of AI Agents and their integration in larger workflows. Solid understanding of model deployment, API integration, version control, and containerization. Excellent communication, problem-solving skills, and a collaborative mindset. Preferred Qualifications Experience deploying AI/ML solutions in production at scale. Familiarity with cloud AI services (AWS Sagemaker, Azure ML, GCP, etc.). Exposure to LLM fine-tuning, prompt engineering, or knowledge of autonomous agent systems. How to Apply Interested candidates should submit their CV to shailender.b@tekskills.in with the subject line: Application for Python, Machine Learning & GenAI Specialist Bengaluru Join our innovative team at Tekskills and transform the future of AI-driven solutions!

Posted 1 week ago

Apply

4.0 - 8.0 years

0 Lacs

karnataka

On-site

You are a Data Science Engineer who will be contributing to the development of intelligent, autonomous AI systems. The ideal candidate should have a strong background in agentic AI, LLMs, SLMs, vector DB, and knowledge graphs. Your responsibilities will include deploying AI solutions that leverage technologies such as Retrieval-Augmented Generation (RAG), multi-agent frameworks, and hybrid search techniques to enhance enterprise applications. As part of the flexible scheme, you will enjoy various benefits such as a best-in-class leave policy, gender-neutral parental leaves, childcare assistance benefit reimbursement, sponsorship for industry-relevant certifications, employee assistance program, comprehensive hospitalization insurance, accident and term life insurance, and health screening. Your key responsibilities will involve designing and developing Agentic AI Applications using frameworks like LangChain, CrewAI, and AutoGen, implementing RAG Pipelines, fine-tuning Language Models, training NER Models, developing Knowledge Graphs, collaborating cross-functionally, and optimizing AI workflows. To excel in this role, you should have at least 4 years of professional experience in AI/ML development, proficiency in Python, Python API frameworks, SQL, and familiarity with AI/ML frameworks like TensorFlow or PyTorch. Experience in deploying AI models on cloud platforms, understanding of LLMs, SLMs, semantic technologies, and MLOps tools is required. Additionally, hands-on experience with vector databases, embedding techniques, and developing AI solutions for specific industries will be beneficial. You will receive support through training, coaching, and a culture of continuous learning to aid in your career progression. The company strives for a culture of empowerment, responsibility, commercial thinking, initiative, and collaboration. They promote a positive, fair, and inclusive work environment for all individuals. For further information about the company and its teams, please visit the company website at https://www.db.com/company/company.htm. Join a team that celebrates success and fosters a culture of excellence and inclusivity.,

Posted 2 weeks ago

Apply

6.0 - 11.0 years

9 - 18 Lacs

Gurugram

Hybrid

About Nirvna Nirvna Solutions is a financial technology & services provider that delivers integrated and modular front, middle, and back-office solutions to a wide array of financial firms, including hedge funds, private equity firms, asset managers, prime brokers, and fund administrators. Nirvna 's ability to electronically ingest data from the inception of a portfolio and seamlessly integrate its day-to-day workflow from front to back office makes it stand out from the crowd. The complexity of the application poses interesting challenges and facilitates multitude of learning opportunities to the one who wants to dive in. At Nirvna, we strive to build a close- knit competitive team environment. We believe in team players. A successful team gives better results than an accomplished individual. For further information about us, please visit our website www.nirvanasolutions.com. Nirvna Solutions headquarter is in the financial capital of the world - Manhattan, NY, USA. Our offshore development centre is in Gurugram and is a wholly owned subsidiary of the U.S Entity. The offshore development and client service Centre Is a critical piece to the company's overall success and will continue to play an increasingly important role in the future We are hiring for Senior Python Developer. Job Description Summary: We are seeking an experienced Senior Python Developer (6+ years) to join our team in building and scaling our AI-driven Copilot platform. This platform leverages LangChain, LangGraph, LangMem, vector databases, and advanced agentic flows to provide intelligent assistance to users across our investment management solutions. The role focuses on designing, developing, and implementing modern AI techniques to build and maintain multi-modal conversational systems enabling voice, text, and potentially image-based interactions while planning future enhancements with MCP (Model Control Protocol) to deliver exceptional user experiences that simplify workflows and fulfill user queries with engaging UX. Employment Type: Permanent Job Location: Gurugram (Hybrid Work Model) Salary Offered : As per industry standard Qualifications: BE/B.Tech in Computer Science from a top IT college or a Masters degree in a relevant field such as Statistics, Data Science, or Applied Mathematics. Experience: 6+ years of professional Python development experience in complex applications or AI/ML platforms. Job Responsibilities: Build and refine the design and development of scalable, modular Python applications for the Copilot platform. Architect, implement, and refine LangChain/LangGraph agentic flows and long-term memory structures (using LangMem). Build robust API layers using FastAPI and integrate with internal/external APIs (e.g., NirvanaOne API, MCP Servers). Collaborate with product managers, UX designers, and engineering teams to translate requirements into technical designs. Provide technical mentoring, and code reviews for developers. Design and document clear architecture diagrams and technical specifications. Refine and integrate secure Azure-based authentication mechanisms across tenants. Ensure code quality through automated testing, CI/CD, and adherence to secure coding practices. Monitor system performance, identify bottlenecks, and proactively propose improvements. Stay updated with AI/ML advancements and advise on their integration into our architecture. Requirements: Strong expertise in Python and related libraries. Experience with LangChain, LangGraph, LangMem, vector DBs and embeddings-based search. Solid knowledge of backend api frameworks like FastAPI for building high-performance APIs. Hands-on experience designing agentic or conversational AI architectures. Experience integrating with identity providers (Azure AD preferred) and working with refresh/access tokens. Proficiency with RDBMS (PostgreSQL, SQL Server) and NoSQL databases. Familiarity with cloud services (Azure preferred) and containerization (Docker). Knowledge of modern AI/ML techniques, including LLM-based applications. Experience working on multi-tenant architectures and secure API integrations. Strong software engineering principles: version control, automated testing, CI/CD. Excellent problem-solving skills and ability to think strategically. Strong communication skills and ability to work in cross-functional teams. Nice to Have: Exposure to the investment management domain or financial services applications. Experience with MCP (or equivalent frameworks for multi-modal/agentic systems). Familiarity with Kafka-based event-driven architectures. Why Nirvna Work on the Test environment which gives you an opportunity to enhance and modify with respect to your desired skills. Opportunity to become a subject matter expert by way of certifications and relevant assignments. Get early opportunity to take product ownership for fast-paced growth. Latest software engineering practices. Opportunity to directly work with top leadership (including the CEO) and be recognized for the good work. Take the initiative to implement new technology/frameworks/processes to delight our clients with a wonderful product. Exposure to the finance domain (Security Markets) is one of our distinctive advantages. A conducive working environment with several employee benefits. Friendly Culture. 5 Days Working with flexibility of work from home / hybrid working model

Posted 2 weeks ago

Apply

2.0 - 6.0 years

0 Lacs

kolkata, west bengal

On-site

At EY, youll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And were counting on your unique voice and perspective to help EY become even better, too. Join us and build an exceptional experience for yourself, and a better working world for all. EY-Consulting AI Enabled Automation Developer Staff -Python We are looking to hire people with strong AI Enabled Automation skills and who are interested in learning new technologies in the process automation space Azure . GenAI , large Lang Models(LLM). RAG ,Vector DB , Graph DB ,Python At EY, youll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture, and technology to become the best version of you. And were counting on your unique voice and perspective to help EY become even better, too. Join us and build an exceptional experience for yourself, and a better working world for all. Responsibilities Development and implementation of AI enabled automation solutions, ensuring alignment with business objectives. Design and deploy Proof of Concepts (POCs) and Points of View (POVs) across various industry verticals, demonstrating the potential of AI enabled automation applications. Ensure seamless integration of optimized solutions into the overall product or system Collaborate with cross-functional teams to understand requirements, to integrate solutions into cloud environments (Azure, GCP, AWS, etc.) and ensure it aligns with business goals and user needs Educate team on best practices and keep updated on the latest tech advancements to bring innovative solutions to the project Requirements 2 to 3 years of relevant professional experience Expertise in Python programming including experience with Al/machine learning frameworks like TensorFlow, PyTorch, Keras, Langchain, MLflow, Promtflow(Good to have) 1-2 years of working knowledge of NLP and LLMs like BERT, GPT-3/4, T5, etc. Knowledge of how these models work and how to fine-tune them Expertise in prompt engineering principles and techniques like chain of thought, in-context learning, tree of thought, etc. Knowledge of retrieval augmented generation (RAG) Knowledge of Knowledge Graph RAG Strong analytical and problem-solving skills with the ability to think critically and troubleshoot issues Excellent communication skills, both verbal and written in English What we look for A Team of people with commercial acumen, technical experience and enthusiasm to learn new things in this fast-moving environment An opportunity to be a part of market-leading, multi-disciplinary team of 1400 + professionals, in the only integrated global transaction business worldwide. Opportunities to work with EY Advisory practices globally with leading businesses across a range of industries What working at EY offers At EY, were dedicated to helping our clients, from startups to Fortune 500 companies and the work we do with them is as varied as they are. You get to work with inspiring and meaningful projects. Our focus is education and coaching alongside practical experience to ensure your personal development. We value our employees and you will be able to control your own development with an individual progression plan. You will quickly grow into a responsible role with challenging and stimulating assignments. Moreover, you will be part of an interdisciplinary environment that emphasizes high quality and knowledge exchange. Plus, we offer: Support, coaching and feedback from some of the most engaging colleagues around Opportunities to develop new skills and progress your career The freedom and flexibility to handle your role in a way thats right for you EY | Building a better working world EY exists to build a better working world, helping to create long-term value for clients, people and society and build trust in the capital markets. Enabled by data and technology, diverse EY teams in over 150 countries provide trust through assurance and help clients grow, transform and operate. Working across assurance, consulting, law, strategy, tax and transactions, EY teams ask better questions to find new answers for the complex issues facing our world today.,

Posted 3 weeks ago

Apply

1.0 - 4.0 years

6 - 24 Lacs

Ahmedabad

Work from Office

Responsibilities: * Collaborate with cross-functional teams on data science projects * Ensure code quality through testing and documentation * Develop scalable Python applications using Vector DB and LLM techniques

Posted 3 weeks ago

Apply

7.0 - 10.0 years

10 - 15 Lacs

Gurugram

Work from Office

Hiring a Senior GenAI Engineer with 7-12 years of experience in Python, Machine Learning, and Large Language Models (LLMs) for a 6-month engagement based in Gurugram This hands-on role involves building intelligent systems using Langchain and RAG, developing agent workflows, and defining technical roadmaps The ideal candidate will be proficient in LLM architecture, prompt engineering, vector databases, and cloud platforms (AWS, Azure, GCP) The position demands strong collaboration skills, a system design mindset, and a focus on production-grade AI/ML solutions

Posted 3 weeks ago

Apply

8.0 - 13.0 years

30 - 45 Lacs

Bhopal, Pune, Gurugram

Work from Office

We're Hiring | AI Lead GenAI & LLMs | 8-10 Yrs | Multiple Xebia Locations | Hybrid Locations: Bangalore | Hyderabad | Chennai | Pune | Bhopal | Jaipur | Gurugram (Hybrid 3 days/week in office) Experience: 8 to 10 years Joiners: Immediate to Max 2 Weeks Notice Period ONLY About the Role: Are you a visionary in the AI/ML space looking to build cutting-edge Generative AI solutions ? We are seeking an accomplished AI Lead who will architect and drive enterprise-grade AI systems—leveraging LLMs, deep learning models, vision AI, and GenAI . You’ll lead from the front, mentor top talent, and build transformative solutions that shape the future of intelligent applications. Key Responsibilities: Architect and implement scalable Generative AI (GenAI) and RAG solutions using LLMs , vision models , and vector databases Lead the design, development, and deployment of AI/ML and deep learning systems Integrate AI platforms with Azure AI Studio , SharePoint , and Power BI Guide the team in leveraging Agentic frameworks like LlamaIndex Align with business stakeholders to craft and deliver high-impact AI strategies Drive innovation, technical mentorship, and excellence across the AI team Establish coding best practices, performance tuning, and scalable solution design Must-Have Skills: 8–10 years of AI/ML experience, with 3+ years in leadership or architecture roles Proficiency in Python , TensorFlow , PyTorch , Scikit-learn Strong experience in LLMs , OCR , Vision AI , vector DBs , RAG Familiarity with Azure AI Studio , Azure Cloud Architecture , and Azure DevOps Exposure to Agentic frameworks and tools like LlamaIndex Basic working knowledge of Power BI and SharePoint Proven leadership and team management skills Good-to-Have: Experience with AWS or GCP Knowledge of DevOps practices , CI/CD , Kubernetes Awareness of AI ethics , governance , and compliance Exposure to advanced BI/visualization tools To Apply: Send your resume to vijay.s@xebia.com with the following details: Full Name Total Experience Current CTC Expected CTC Current Location Preferred Xebia Location (from above) Notice Period / Last Working Day (if serving) Primary Skills LinkedIn Profile Important: Apply only if you’re available to join immediately or within 2 weeks and are not currently in process with other roles at Xebia. #AIJobs #GenerativeAI #LLMs #VisionAI #XebiaHiring #ImmediateJoiners #Python #AzureAI #AIML #LeadershipHiring #DataScienceJobs #AgenticFrameworks #ChennaiJobs #PuneJobs #BangaloreJobs #HyderabadJobs #GurugramJobs #JaipurJobs #BhopalJobs #TechHiring #AILead

Posted 4 weeks ago

Apply

7.0 - 10.0 years

8 - 12 Lacs

Gurugram

Work from Office

Hiring a Senior GenAI Engineer with 712 years of experience in Python, Machine Learning, and Large Language Models (LLMs) for a 6-month engagement based in Gurugram. This hands-on role involves building intelligent systems using Langchain and RAG, developing agent workflows, and defining technical roadmaps. The ideal candidate will be proficient in LLM architecture, prompt engineering, vector databases, and cloud platforms (AWS, Azure, GCP). The position demands strong collaboration skills, a system design mindset, and a focus on production-grade AI/ML solutions.

Posted 1 month ago

Apply

5.0 - 10.0 years

12 - 22 Lacs

Pune, Chennai, Bengaluru

Hybrid

Role : Gen AI developer/ AI ML / ML operations/Data science Experience: 4 Years - 11 Years Locations: Bangalore/Chennai/Pune/Kolkata Notice Period: Immediate to 30 Days Mandatory Skills : Gen AI, LLM, RAG, Lang chain, Mistral,Llama, Vector DB, Azure/GCP/ Lambda, Python, Tensorflow, Pytorch Preferred Skills : GPT-4, NumPy, Pandas, Keras, Databricks, Pinecone/Chroma/Weaviate, Scale/Labelbox, Job Description / Roles & Responsibilities (in Detail) : We are looking for a good Python Developer with Knowledge of Machine learning and deep learning framework. Take care of entire prompt life cycle like prompt design, prompt template creation, prompt tuning/optimization for various GenAI base models Design and develop prompts suiting project needs Stakeholder management across business and domains as required for the projects Evaluating base models and benchmarking performance Implement prompt guardrails to prevent attacks like prompt injection, jail braking and prompt leaking Develop, deploy and maintain auto prompt solutions Design and implement minimum design standards for every use case involving prompt engineering You will be responsible for training the machine learning and deep learning model. Writing reusable, testable, and efficient code using Python Design and implementation of low-latency, high-availability, and performant applications Implementation of security and data protection Integration of data storage solutions and API Gateways Production change deployment and related support Interested candidates can share their updated CV to pravallika@wrootsglobal.in

Posted 1 month ago

Apply

5.0 - 8.0 years

14 - 18 Lacs

Mumbai

Work from Office

Role & responsibilities Develop, test, and maintain backend services using Python and frameworks such as FastAPI or Flask . Implement and optimize LLM-based applications using models like OpenAI (GPT-4o) , Gemini , LLaMA , etc. Work on RAG implementations , including integration with vector databases and prompt engineering strategies. Design, build, and maintain database connectivity using SQL for infrastructure-level applications. Develop and deploy containerized applications using Docker , Git , GitHub , and integrate with CI/CD pipelines . Deploy and manage applications on cloud platforms ( AWS , Azure , GCP ). Ensure clean code practices including unit testing , error handling , Python best practices , and design patterns . Collaborate with cross-functional teams including Product, Data Science, and DevOps for end-to-end solution delivery. Preferred candidate profile Strong proficiency in Python programming . Familiarity with FastAPI , Flask , or similar web frameworks. Experience with SQL databases and connection management. Hands-on with LLMs and RAG workflows (OpenAI GPT, Gemini, LLaMA, etc.). Understanding of vector databases such as FAISS, Pinecone, or similar. Proficiency with Git , GitHub , and CI/CD pipelines . Experience with Docker for containerization. Exposure to any of the major cloud platforms AWS , Azure , or GCP . Ability to write unit test cases , implement robust error handling , and follow design patterns . Clear understanding of enterprise software development best practices . Someone who have experience working with Banking or Financial Services projects/clients. Immediate Joiner or someone serving notice period.

Posted 1 month ago

Apply

3.0 - 8.0 years

14 - 16 Lacs

Gurugram, Bengaluru

Hybrid

Roles and Responsibilities Develop and maintain Microservice architecture and API management solutions using REST and gRPC for seamless deployment of AI solutions. Collaborate with cross-functional teams, including data scientists and product managers, to acquire, process, and manage data for AI/ML model integration and optimization. Design and implement robust, scalable, and enterprise-grade data pipelines to support state-of-the-art AI/ML models. Debug, optimize, and enhance machine learning models, ensuring quality assurance and performance improvements. Familiarity with tools like Terraform, CloudFormation, and Pulumi for efficient infrastructure management. Create and manage CI/CD pipelines using Git-based platforms (e.g., GitHub Actions, Jenkins) to ensure streamlined development workflows. Operate container orchestration platforms like Kubernetes, with advanced configurations and service mesh implementations, for scalable ML workload deployments. Design and build scalable LLM inference architectures, employing GPU memory optimization techniques and model quantization for efficient deployment. Engage in advanced prompt engineering and fine-tuning of large language models (LLMs), focusing on semantic retrieval and chatbot development. Document model architectures, hyperparameter optimization experiments, and validation results using version control and experiment tracking tools like MLflow or DVC. Research and implement cutting-edge LLM optimization techniques, such as quantization and knowledge distillation, ensuring efficient model performance and reduced computational costs. Collaborate closely with stakeholders to develop innovative and effective natural language processing solutions, specializing in text classification, sentiment analysis, and topic modeling. Design and execute rigorous A/B tests for machine learning models, analyzing results to drive strategic improvements and decisions. Stay up-to-date with industry trends and advancements in AI technologies, integrating new methodologies and frameworks to continually enhance the AI engineering function. Contribute to creating specialized AI solutions in healthcare, leveraging domain-specific knowledge for task adaptation and deployment. Technical Skills: Advanced proficiency in Python . Extensive experience with LLM frameworks (Hugging Face Transformers, LangChain) and prompt engineering techniques Experience with big data processing using Spark for large-scale data analytics Version control and experiment tracking using Git and MLflow Software Engineering & Development: Advanced proficiency in Python, familiarity with Go or Rust, expertise in microservices, test-driven development, and concurrency processing. DevOps & Infrastructure: Experience with Infrastructure as Code (Terraform, CloudFormation), CI/CD pipelines (GitHub Actions, Jenkins), and container orchestration (Kubernetes) with Helm and service mesh implementations. LLM Infrastructure & Deployment: Proficiency in LLM serving platforms such as vLLM and FastAPI, model quantization techniques, and vector database management. MLOps & Deployment: Utilization of containerization strategies for ML workloads, experience with model serving tools like TorchServe or TF Serving, and automated model retraining. Cloud & Infrastructure: Strong grasp of advanced cloud services (AWS, GCP, Azure) and network security for ML systems. LLM Project Experience: Expertise in developing chatbots, recommendation systems, translation services, and optimizing LLMs for performance and security. General Skills: Python, SQL, knowledge of machine learning frameworks (Hugging Face, TensorFlow, PyTorch), and experience with cloud platforms like AWS or GCP. Experience in creating LLD for the provided architecture. Experience working in microservices based architecture. Domain Expertise: Deep understanding of ML and LLM development lifecycle, including fine-tuning and evaluation Expertise in feature engineering, embedding optimization, and dimensionality reduction Advanced knowledge of A/B testing, experimental design, and statistical hypothesis testing Experience with RAG systems, vector databases, and semantic search implementation Proficiency in LLM optimization techniques including quantization and knowledge distillation Understanding of MLOps practices for model deployment and monitoring Professional Competencies: Strong analytical thinking with ability to solve complex ML challenges Excellent communication skills for presenting technical findings to diverse audiences Experience translating business requirements into data science solutions Project management skills for coordinating ML experiments and deployments Strong collaboration abilities for working with cross-functional teams Dedication to staying current with latest ML research and best practices Ability to mentor and share knowledge with team members

Posted 1 month ago

Apply

11.0 - 15.0 years

20 - 30 Lacs

Pune, Chennai, Bengaluru

Work from Office

ONLY 30 DAYS JOINERS Hello I hope you're doing well. We are currently hiring for an exciting opportunity with one of our top clients in the field of Generative AI . Based on your background, I believe this role could be a strong match for your expertise. Role Highlights: Location: Bangalore/Chennai/Kolkata/Pune/Hyderabad Experience Required: 11 -16 Years Notice Period: Maximum 30 Days CTC: As per market standards Mandatory Skills: Gen AI, LLM, ML/DL/NLP, RAG, LangChain, Mistral, Llama, Hugging Face, Python, TensorFlow, PyTorch, Django, Vector DB Preferred Skills: GCP/Azure/AWS, Databricks, MLOps (Kubeflow/Mlflow), Kubernetes, GitHub/Bitbucket, ADO, GPT-4 Key Responsibilities: Develop and implement advanced Generative AI models. Apply ML/DL algorithms for real-world applications across NLP and vision domains. Collaborate on data preprocessing, model training, and deployment. Drive innovation by staying current with industry trends and R&D in Gen AI. Ensure responsible AI practices and performance monitoring post-deployment. Mentor junior members and lead AI-driven project initiatives. If you're interested or know someone who fits this role, please share your updated resume and current CTC/Notice Period details. WhatsApp: 987153039 Email: shweta.gupta@sspearhead.com Warm Regards SHWETA GUPTA || Senior Recruitment Specialist Spearhead Professional Services Call/ WhatsApp: 9871530393 LinkedIn Profile: shwetagupta1810

Posted 1 month ago

Apply

11.0 - 16.0 years

25 - 32 Lacs

Pune, Chennai, Bengaluru

Work from Office

ONLY 30 DAYS JOINERS Hello I hope you're doing well. We are currently hiring for an exciting opportunity with one of our top clients in the field of Generative AI . Based on your background, I believe this role could be a strong match for your expertise. Role Highlights: Location: Bangalore/Chennai/Kolkata/Pune/Hyderabad Experience Required: 1118 Years Notice Period: Maximum 30 Days CTC: As per market standards Mandatory Skills: Gen AI, LLM, ML/DL/NLP, RAG, LangChain, Mistral, Llama, Hugging Face, Python, TensorFlow, PyTorch, Django, Vector DB Preferred Skills: GCP/Azure/AWS, Databricks, MLOps (Kubeflow/Mlflow), Kubernetes, GitHub/Bitbucket, ADO, GPT-4 Key Responsibilities: Develop and implement advanced Generative AI models. Apply ML/DL algorithms for real-world applications across NLP and vision domains. Collaborate on data preprocessing, model training, and deployment. Drive innovation by staying current with industry trends and R&D in Gen AI. Ensure responsible AI practices and performance monitoring post-deployment. Mentor junior members and lead AI-driven project initiatives. If you're interested or know someone who fits this role, please share your updated resume and current CTC/Notice Period details. WhatsApp: 9871530393 Email: shweta.gupta@sspearhead.com Warm Regards SHWETA GUPTA || Senior Recruitment Specialist Spearhead Professional Services Call/ WhatsApp: 9871530393 LinkedIn Profile: shwetagupta1810

Posted 1 month ago

Apply

5.0 - 10.0 years

5 - 10 Lacs

Pune, Maharashtra, India

On-site

Proven experience in developing and implementing Generative AI models and algorithms, with a strong understanding of deep learning fundamentals. Experience with training and fine-tuning Generative models on high-performance computing infrastructure. Experience in working with Vector DB and Embedding Proficiency in programming languages such as Python, NLP, TensorFlow, PyTorch, or similar frameworks for building and deploying AI models. Strong analytical and problem-solving and collaboration skills A passion for exploring new ideas, pushing the boundaries of AI technology, and making a meaningful impact through your work.

Posted 1 month ago

Apply

4.0 - 8.0 years

0 Lacs

, India

On-site

Job Description Senior Java Developer Location- Pune/Chennai/Bangalore/Coimbatore Exp- 4 yrs to 8 yrs Technical Skills (Must have): Core Java Development Design Patterns Message queues (Kafka/RabitMQ) SQL - NoSQL - Vector DB Cloud services (Any of Azure/GCP/AWS) Docker basics and Maven Optional Skills : Servlet > Tomcat server > Python basics Check Your Resume for Match Upload your resume and our tool will compare it to the requirements for this job like recruiters do.

Posted 1 month ago

Apply

2.0 - 4.0 years

6 - 9 Lacs

Mumbai

Work from Office

Seeking AI Engineer to build intelligent, task-driven agents using React & FastAPI. Must blend AI/ML expertise with software skills to create scalable, modular systems for API/UI interaction. Required Candidate profile 1. 2+ yrs in AI dev 2. Strong in FastAPI, React, Python 3. Built LLM-based agent workflows 4. Used vector DB, LangChain, OpenAI 5. Deployed on cloud(Azure, AWS, GCP) 6. Scalable, UI-integrated systems

Posted 1 month ago

Apply

2.0 - 4.0 years

6 - 9 Lacs

Mumbai

Work from Office

Seeking Full-Stack Developer to build intelligent, task-driven AI agents using React & FastAPI. Must blend AI/ML expertise with software skills to create scalable, modular systems for API/UI interaction. Required Candidate profile 1. 2+ yrs in software (Full-Stack) development 2. Strong in Next.js 3. Built scalable web apps 4. Well-versed with prompt engineering

Posted 1 month ago

Apply

6.0 - 8.0 years

6 - 8 Lacs

Hyderabad / Secunderabad, Telangana, Telangana, India

On-site

Key Responsibilities Design and implement GenAI architectures leveraging Google Cloud and Gemini AI models Lead solution architecture and integration of generative AI models into enterprise applications Collaborate with data scientists engineers and business stakeholders to define AI use cases and technical strategy Develop and optimize prompt engineering, model fine tuning, and deployment pipelines Design scalable data storage and retrieval layers using PostgreSQL BigQuery and vector databases e.g.Vertex AI Search Pinecone or FAISS Evaluate third party GenAI APIs and tools for integration Ensure compliance with data security privacy and responsible AI guidelines Support performance tuning monitoring and optimization of AI solutions in production Stay updated with evolving trends in GenAI and GCP offerings especially related to Gemini and Vertex AI Required Skills and Qualifications Proven experience architecting AI and ML or GenAI systems on Google Cloud Platform Hands-on experience with Google Gemini Vertex AI and related GCP AI tools Strong understanding of LLMs, prompt engineering and text generation frameworks Proficiency in PostgreSQL, including advanced SQL and performance tuning Experience with MLOps, CI and CD pipelines, and AI model lifecycle management Solid knowledge of Python, APIs, RESTful services, and cloud native architecture Familiarity with vector databases and semantic search concepts Strong communication and stakeholder management skills Preferred Qualifications GCP certifications e.g., Professional Cloud Architect Machine Learning Engineer Experience in model fine-tuning and custom LLM training Knowledge of LangChain, RAG Retrieval Augmented Generation frameworks Exposure to data privacy regulations GDPR, HIPAA, etc. Background in natural language processing NLP and deep learning

Posted 1 month ago

Apply

6.0 - 10.0 years

10 - 17 Lacs

Pune, Gurugram, Bengaluru

Work from Office

Job Description: We are looking for a skilled Data / Analytics Engineer with hands-on experience in vector databases and search optimization techniques . You will help build scalable, high-performance infrastructure to support AI-powered applications like semantic search , recommendation systems , and RAG pipelines . Key Responsibilities: Optimize vector search algorithms for performance and scalability. Build pipelines to process high-dimensional embeddings (e.g., BERT , CLIP , OpenAI ). Implement ANN indexing techniques like HNSW , IVF , PQ . Integrate vector search with data platforms and APIs . Collaborate with cross-functional teams (data scientists, engineers, product). Monitor and resolve latency , throughput , and scaling issues. Must-Have Skills: Python AWS Vector Databases (e.g., Elasticsearch , FAISS , Pinecone ) Vector Search / Similarity Search ANN Search Algorithms HNSW , IVF , PQ Snowflake / Databricks Embedding Models – BERT , CLIP , OpenAI Kafka / Flink for real-time data pipelines REST APIs , GraphQL , or gRPC for integration Good to Have: Knowledge of semantic caching and hybrid retrieval Experience with distributed systems and high-performance computing Familiarity with RAG (Retrieval-Augmented Generation) workflows Apply Now if You: Enjoy solving performance bottlenecks in AI infrastructure Love working with cutting-edge ML models and search technologies Thrive in collaborative , fast-paced environments

Posted 1 month ago

Apply

10.0 - 18.0 years

16 - 31 Lacs

Chennai, Bengaluru

Hybrid

V2Soft India Pvt Ltd Experience: 10+ years of experience in the field of AI and ML ( Machine Learning ) Overall Experience : Location: Bangalore and Chennai Work Mode: Hybrid Mode: Notice Period: Immediate to 30 days Experience : 3 Years of relevant experience and previous work in any related AI is a plus. Gen AI Experience Text embeddings Vector DB AI Frameworks [ Lang chain / Lama Index/ Language Graph ] Prompt Engineering & Techniques RAG ( Retrieve Augment and Generate ) MCP ( Model Context Protocol ) Agentic AI LLM ( Large Language Models , both open source and proprietary ) Measuring Model Accuracy Deploying Models on-prem and on the cloud

Posted 1 month ago

Apply

4.0 - 8.0 years

15 - 30 Lacs

Gurugram

Hybrid

We are seeking a highly skilled and innovative Data Engineer to join our dynamic team. This role is ideal for professionals passionate about building scalable data pipelines and enabling machine learning and analytics at scale. Youll play a key role in designing and implementing systems that support real-time and batch data processing for advanced AI-driven use cases, including Retrieval-Augmented Generation (RAG). Key Responsibilities: Develop, maintain, and scale data pipelines using Airflow (preferably Metaflow ) and other orchestration tools. Design microservices-based data solutions to support AI/ML initiatives. Work with vector databases to optimize data retrieval for RAG and generative AI models. Collaborate with data scientists, ML engineers, and software engineers to integrate data engineering solutions into broader systems. Ensure data quality, reliability, and governance across all data pipelines and storage layers. Required Skills: Proficiency in Python for data processing, scripting, and service development. Experience with Airflow (Metaflow preferred) or other workflow orchestration tools. Strong knowledge of Microservices architecture and REST APIs. Familiarity with RAG (Retrieval-Augmented Generation) frameworks and concepts. Experience with Vector databases (e.g., Pinecone, FAISS, Weaviate, Qdrant). Solid understanding of data modeling, ETL/ELT processes, and cloud data platforms.

Posted 2 months ago

Apply

4.0 - 6.0 years

25 - 30 Lacs

Hyderabad

Hybrid

Senior AI Developer Experience: 4 - 6 Years Exp Salary : INR 20-30 Lacs per annum Preferred Notice Period : Within 60 Days Shift : 2:30PM to 11:30PM IST Opportunity Type: Hybrid (Hyderabad) Placement Type: Permanent (*Note: This is a requirement for one of Uplers' Clients) Must have skills required : AI RAG, Fast or Flask API, Vector DB, postgresql, Python, Langchain OR Llama Good to have skills : Grafana or Prometheus, Docker, Kubernetes K-3 Innovations (One of Uplers' Clients) is Looking for: Senior AI Developer who is passionate about their work, eager to learn and grow, and who is committed to delivering exceptional results. If you are a team player, with a positive attitude and a desire to make a difference, then we want to hear from you. Role Overview Description About K3-Innovations K3-Innovations, Inc. is building a cutting-edge, AI-driven SaaS platform that automates critical workflows in the biopharma industry. We are expanding our team with an AI RAG Engineer who will help us design and optimize retrieval-augmented generation (RAG) pipelines for knowledge workflows. This role combines deep expertise in database design, vector search optimization, backend architecture, and LLM (Large Language Model) integration. You will play a key role in building a scalable AI platform, bridging structured and unstructured biopharma data with next-generation AI. If you're passionate about building intelligent retrieval systems, fine-tuning prompt pipelines, and optimizing LLM-based applications for real-world datasets, we want to hear from you! Key Responsibilities 1. RAG Pipeline Design and Optimization (Priority #1) Architect and implement retrieval-augmented generation pipelines integrating document retrieval and LLM response generation. Design and maintain knowledge bases and vector stores using tools like FAISS, Weaviate, or PostgreSQL PGVector. Optimize retrieval mechanisms (chunking, indexing strategies, reranking) to maximize response accuracy and efficiency. Integrate context-aware querying from structured (Postgres) and unstructured (text/PDF) sources. 2. Database and Embedding Management Design relational schemas to support knowledge base metadata and chunk-level indexing. Manage embeddings pipelines using open-source models (e.g., HuggingFace sentence transformers) or custom embedding services. Optimize large-scale vector search performance (indexing, sharding, partitioning). 3. LLM and Prompt Engineering Develop prompt engineering strategies for retrieval-augmented LLM pipelines. Experiment with prompt chaining, memory-augmented generation, and adaptive prompting techniques. Fine-tune lightweight LLMs or integrate APIs from OpenAI, Anthropic, or open-source models (e.g., LlamaIndex, LangChain). 4. Backend API and Workflow Orchestration Build scalable, secure backend services (FastAPI/Flask) to serve RAG outputs to applications. Design orchestration workflows integrating retrieval, generation, reranking, and response streaming. Implement system monitoring for LLM-based applications using observability tools (Prometheus, OpenTelemetry). 5. Collaboration and Platform Ownership Work closely with platform architects, AI scientists, and domain experts to evolve the knowledge workflows. Take ownership from system design to model integration and continuous improvement of RAG performance. Required Skills AI RAG Engineering (Most Critical) Knowledge Retrieval: o Experience building RAG architectures in production environments. o Expertise with vector stores (e.g., FAISS, Weaviate, Pinecone, PGVector). o Experience with embedding models and retrieval optimization strategies. Prompt Engineering: o Deep understanding of prompt construction for factuality, context augmentation, and reasoning. o Familiarity with frameworks like LangChain, LlamaIndex, or Haystack. Database and Backend Development (Essential) PostgreSQL Expertise: o Strong proficiency in relational and vector extension design (PGVector preferred). o SQL optimization, indexing strategies for large datasets. Python Development: o Experience building backend services using FastAPI or Flask. o Proficiency with async programming and API integrations. Observability and DevOps (Supportive) System monitoring for AI workflows using Prometheus, Grafana, OpenTelemetry. Familiarity with Docker, Kubernetes-based deployment pipelines. Preferred Experience (Bonus but not Required) Working with large-scale scientific or healthcare datasets. Exposure to clinical standards like SDTM, ADaM (advantageous for biopharma workflows). Experience integrating domain-specific ontologies into retrieval systems. Familiarity with fine-tuning LLMs on private knowledge bases. What Were Looking For AI Problem Solver: You are excited by combining retrieval, reasoning, and generative capabilities to solve real-world problems. Backend and Data Specialist: You understand database performance and scalable architectures for retrieval and serving. Builder's Mindset: You thrive in dynamic, evolving environments where you can architect and implement end-to-end solutions. What We Offer Meaningful Impact: Build AI systems that accelerate workflows in the critical biopharma space. Technical Growth: Deepen your expertise in retrieval-augmented generation and scalable AI systems. Remote Flexibility: Results-driven work culture with location flexibility. Competitive Compensation: Attractive salary, benefits, and learning opportunities. Join Us Help us revolutionize how biopharma manages and accesses knowledge through the power of AI. How to apply for this opportunity: Easy 3-Step Process: 1. Click On Apply! And Register or log in on our portal 2. Upload updated Resume & Complete the Screening Form 3. Increase your chances to get shortlisted & meet the client for the Interview! About Our Client: K3-Innovations is redefining clinical research with a strategic scaling approach, blending AI-powered automation, adaptive clinical resourcing, and advanced data science. As a next-generation CRO, we provide flexible FSP models, regulatory-compliant statistical programming, and AI-driven analytics to accelerate clinical trial execution and regulatory submissions. About Uplers: Our goal is to make hiring and getting hired reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant product and engineering job opportunities and progress in their career. (Note: There are many more opportunities apart from this on the portal.) So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

Posted 2 months ago

Apply
Page 1 of 2
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies