Gen AI Lead Engineer

5 - 8 years

0 Lacs

Posted:1 week ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Gen AI Lead Engineer JD (5-8 years)



We are seeking a


Key Responsibilities


  • Design, build, and deploy ML/DL models for:
  • Tabular data

     (e.g., XGBoost)
  • NLP and GenAI

     (e.g., RAG, tool/function calling)
  • Fine-tune and serve LLMs using 

    Hugging Face

    PyTorch

    , and efficient-tuning techniques like 

    LoRA

    QLoRA

    , and 

    PEFT

    .
  • Architect and implement agent-based LLM solutions with tools like 

    LlamaIndex

    LangGraph

    , or 

    LangChain Agents

    .
  • Design multi-step tool-calling workflows and structured function-calling strategies for complex tasks.
  • Orchestrate agent memory, state management, and contextual conversations.
  • Develop and maintain 

    FastAPI microservices

     for low-latency GenAI inference, streaming, and secure system integration.
  • Deploy ML pipelines and training jobs in the cloud (preferably 

    Azure

    , also 

    AWS

    ).
  • Handle end-to-end MLOps: CI/CD, containerization (Docker), GPU/ACI deployment, observability, and cost governance.
  • Collaborate cross-functionally with product, data, and frontend teams to translate abstract ideas into tangible outcomes.


Required Skills & Qualifications

  • Strong Python programming skills; experience with 

    FastAPI

     and/or 

    Java

     is a plus.

Proven experience with:

  • XGBoost

     or similar ML models for tabular data.
  • YOLO

    OCR

    , and PyTorch for vision and text extraction tasks.

Deep knowledge in NLP / GenAI:

  • LLM fine-tuning, prompt engineering, and RAG design.
  • Proficiency with 

    Hugging Face Transformers

    PEFT

    , and vector databases.
  • Implementation of agent frameworks like 

    LlamaIndex

    LangGraph

    , or 

    LangChain Agents

    .

MLOps & Deployment:

  • Experience with Docker, CI/CD pipelines, experiment tracking, model versioning, and rollback mechanisms.

Cloud Proficiency:

  • Azure ML

    , Azure Functions, AKS (preferred) or 

    AWS SageMaker

    , Lambda.
  • Bonus

    : experience with Triton / vLLM, streaming websockets, or GPU cost-optimization.


Benefits of Working with Us:

  • Best of Both Worlds: Enjoy the enthusiasm and learning curve of a startup combined with the deliveries and performance of an enterprise service provider.
  • Flexible Working Hours: We offer a delivery-oriented approach with flexible working hours to help you maintain a healthy work-life balance.
  • Limitless Growth Opportunities: The sky is not the limit when it comes to learning, growth, and sharing ideas. We encourage continuous learning and personal development.
  • Flat Organizational Structure: We don't follow the typical corporate hierarchy ladder, fostering an open and collaborative work environment where everyone's voice is heard.


As part of our dedication to an inclusive and diverse workforce, TechChefz Digital is committed to Equal Employment Opportunity without regard to race, color, national origin, ethnicity, gender, protected veteran status, disability, sexual orientation, gender identity, or religion. If you need assistance, you may contact us at joinus@techchefz.com

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
TechChefz Digital logo
TechChefz Digital

Information Technology

Silicon Valley

RecommendedJobs for You