Gen AI Engineer

0 years

13 - 15 Lacs

Posted:1 day ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

About The Opportunity

We’re a fast-growing

enterprise AI platform provider in the cloud services & software (SaaS) sector

, helping Fortune 500 clients modernize data pipelines, automate knowledge work and unlock new revenue with Generative AI. Backed by a deep bench of AI researchers and cloud architects, we build scalable, production-grade solutions on Microsoft Azure. Join our Pune-based hybrid team to shape the next generation of agentic AI products.Role & Responsibilities
  • Design, prototype and deploy GenAI applications (LLMs, RAG, multimodal) on Azure OpenAI, Cognitive Search and Kubernetes-based micro-services.
  • Build and orchestrate agentic frameworks (LangGraph / AutoGen) to enable multi-agent reasoning, tool-calling and workflow automation at scale.
  • Engineer robust data & prompt pipelines using Azure Data Factory, Event Hub and Cosmos DB, ensuring low-latency, high-throughput inference.
  • Optimize model performance & cost via fine-tuning, quantization and scalable caching on Azure ML and AKS.
  • Harden solutions for production with end-to-end CI/CD, observability (App Insights, Prometheus), security & responsible-AI guardrails.
  • Collaborate cross-functionally with product managers, designers and customer success to deliver measurable business impact.

Skills & Qualifications

Must-Have

  • 3-5 yrs hands-on in Generative AI / LLM engineering (GPT, Llama 2, Claude, etc.) with at least one product in production.
  • Proven expertise in Microsoft Azure services: Azure OpenAI, Functions, Data Factory, Cosmos DB, AKS.
  • Strong Python/TypeScript with agentic frameworks (LangChain, AutoGen, Semantic Kernel) and REST/GraphQL APIs.
  • Solid grounding in cloud MLOps: Docker, Helm, Terraform/Bicep, GitHub Actions or Azure DevOps.

Preferred

  • Experience benchmarking & scaling pipelines to >10 K QPS using Vector DBs (Qdrant, Pinecone) and distributed caching.
  • Familiarity with prompt-engineering, fine-tuning & retrieval-augmented generation (RAG) best practices.
  • Knowledge of Kubernetes operators, Dapr, Service Mesh for fault-tolerant micro-services.
Skills: Generative AI,Azure,Python,LLMs,SQL Azure,agentic framework,Langgraph,autogen,CI,Cd,Kubernetes,Microsoft Azure,Cloud

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You

Gurgaon, Haryana, India

Hyderabad, Telangana, India

Pune, Maharashtra, India

Chennai, Tamil Nadu, India

Hyderabad, Telangana, India