Posted:2 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

About The Opportunity

A fast-scaling team in the Generative AI and Enterprise AI sector, building production-grade LLM-powered features for search, assistants, and knowledge automation across business applications. We ship end-to-end GenAI capabilities—from prompt design and fine-tuning to scalable inference and observability—deployed into cloud-native environments for enterprise customers.

Primary title:

Generative AI Engineer |

Location:

India (On-site)Role & Responsibilities
  • Design, implement, and productionize generative AI components (prompting, fine-tuning, RAG pipelines, embeddings) for customer-facing applications.
  • Build robust ML pipelines for data ingestion, embedding generation, model training/finetuning, and automated deployment using containerized workflows.
  • Integrate vector search and retrieval systems (FAISS/Milvus/Pinecone) with LLM stacks to enable low-latency, relevance-optimized inference.
  • Develop REST/gRPC APIs and microservices for inference at scale, ensuring reliability, monitoring, and cost-efficient autoscaling.
  • Collaborate with data scientists to iterate on instruction tuning, evaluation metrics, and model selection for production scenarios.
  • Establish engineering best practices: CI/CD for models, observability for ML services, model versioning, and reproducible experiments.

Skills & Qualifications

Must-Have

  • Strong production experience with Python and deep learning frameworks (PyTorch or TensorFlow).
  • Hands-on with Hugging Face Transformers and model fine-tuning workflows (LoRA/PEFT or equivalent).
  • Practical experience building RAG pipelines, embedding generation, and vector search integration (FAISS, Milvus, or Pinecone).
  • Experience deploying containerized ML services (Docker, Kubernetes) on public cloud (AWS/Azure/GCP).

Preferred

  • Familiarity with LangChain or similar orchestration frameworks and prompt engineering at scale.
  • Experience with MLOps tooling: CI/CD for models, model registry, monitoring, and cost optimization.
Benefits & Culture Highlights
  • Opportunity to work on cutting-edge GenAI products and influence model & architecture choices end-to-end.
  • Collaborative engineering culture with strong emphasis on quality, observability, and customer impact.
  • On-site collaboration with cross-functional product and data science teams in India.
We seek a hands-on engineer who thrives in a fast-paced environment, cares about robust production delivery, and is passionate about shipping impactful GenAI features. Apply if you want to build scalable LLM systems that power real business outcomes.
Skills: tensorflow,llm,sql,azure,ml,python,etl,genai developer,pytorch,docker,gcp,aws,kubernetes

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You

hyderabad, ahmedabad, bengaluru

bengaluru east, karnataka, india

bengaluru east, karnataka, india