Senior Gen AI Developer

6 - 11 years

14 - 18 Lacs

Posted: 3 days ago | Platform: Naukri


Work Mode

Work from Office

Job Type

Full Time

Job Description

Role:

Gen AI Developer

Total Experience:

6+ years with 2+ years working on GenAI initiatives

Employment Type:

Permanent & Full time

Working Model:

Hybrid (3 days work from office)

Job Summary:

We are seeking a Senior AI Developer with proven expertise in Generative AI technologies, a solid foundation in machine learning, and a strong understanding of data governance. The ideal candidate will have hands-on experience with both cloud-based LLM platforms and on-premise, open-source LLM tooling such as Ollama, llama.cpp, and GGUF-based models. You should also have good knowledge of the Model Context Protocol (MCP). You will help architect and implement GenAI-powered products that are secure, scalable, and enterprise-ready.

Key Responsibilities:

  • Design, build, and deploy GenAI solutions using both cloud-hosted and on-prem LLMs.
  • Work with frameworks like Hugging Face, LangChain, LangGraph, and LlamaIndex to enable RAG and prompt orchestration.
  • Implement private LLM deployments using tools such as Ollama, LM Studio, llama.cpp, GPT4All, and vLLM.
  • Design retrieval-augmented generation (RAG) pipelines with context-aware orchestration using MCP.
  • Implement and manage Model Context Protocol (MCP) for dynamic context injection, chaining, memory management, and secure prompt orchestration across GenAI workflows.
  • Fine-tune open-source models for specific enterprise tasks and optimize inference performance.
  • Integrate LLMs into real-world applications via REST, gRPC, or local APIs.
  • Ensure secure data flows and proper context management in RAG pipelines.
  • Collaborate across data, product, and infrastructure teams to operationalize GenAI.
  • Incorporate data governance and responsible AI practices from design through deployment.

Required Skills and Qualifications:

  • 6+ years of experience in AI/ML; 2+ years working on GenAI initiatives.
  • Experience with OpenAI, Claude, Gemini, cloud-based LLMs (AWS/GCP/Azure), and open-source LLMs like Mistral, LLaMA 2/3, Falcon, and Mixtral.
  • Strong hands-on expertise with on-premise LLM frameworks (Ollama, llama.cpp, GGUF models, etc.).
  • Hands-on experience with Model Context Protocol (MCP) for structured prompt orchestration, context injection and tool execution.
  • Proven experience in building and optimizing Retrieval-Augmented Generation (RAG) pipelines, including document chunking, embedding generation, and vector search integration.
  • Proficiency in Python and libraries such as Hugging Face Transformers, LangChain, and PyTorch.
  • Experience with embedding models and vector DBs (FAISS, Pinecone, Weaviate, Qdrant, etc.).
  • Familiarity with MLOps, GPU optimization, containerization, and deployment in secure environments.
  • Good understanding of data governance: access control, lineage, auditability, and privacy.

Nice to Have:

  • Exposure to multi-modal models (image, speech) and Toolformer-style agents.
  • Experience integrating AI into enterprise platforms (e.g., ServiceNow, Salesforce, Jira).
  • Awareness of inference acceleration tools (vLLM, DeepSpeed, TensorRT).


Neova Solutions

Information Technology

Tech City
