5 - 10 years

10 - 20 Lacs

Posted:2 hours ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Role Overview

Senior AI/ML Engineer

This role involves building and optimizing large-scale AI systems, working closely with cross-functional engineering teams, and delivering production-grade AI solutions for video, speech, and language-based applications.

Key Responsibilities

  • Design, develop, and optimize

    CV, NLP, speech, and multimodal transformer models

    for production environments.
  • Implement

    GPU-accelerated training pipelines

    and optimize model performance across distributed frameworks such as

    DeepSpeed

    and

    Horovod

    .
  • Build and scale

    real-time/near-real-time video processing

    pipelines for inference and analytics.
  • Develop and maintain

    ML workflows

    using

    MLflow, Ray, and Airflow

    .
  • Deploy scalable model-serving pipelines using

    KServe

    and

    Triton Inference Server

    .
  • Implement RAG pipelines and integrate

    vector databases

    (e.g., Pinecone, Weaviate, Milvus) with LLM frameworks such as

    LangChain

    .
  • Collaborate with platform teams, data scientists, and architects to deliver end-to-end AI solutions.
  • Ensure model reproducibility, performance monitoring, and adherence to responsible AI best practices.

Must-Have Skills

AI/ML Modeling

  • Expert in

    Computer Vision, NLP, Speech Models, and Multimodal Transformers

  • Experience building

    large transformer-based architectures

    (vision-language models, speech-to-text, text-to-video, etc.)

Performance & Training

  • Strong expertise in

    GPU optimization

  • Knowledge of

    distributed training

    tools:
    • DeepSpeed

    • Horovod

Video Processing

  • Hands-on experience with

    real-time / near-real-time video processing

    , encoding, and streaming pipelines

ML Engineering Tools

  • MLflow

    for experiment tracking
  • Ray

    for distributed workflows
  • Airflow

    for orchestration

Model Serving

  • Proficiency with

    KServe

  • Experience with

    Triton Inference Server

LLM/RAG & Vector Stores

  • Vector DBs (Pinecone, Weaviate, Milvus, Chroma)
  • RAG pipeline development

  • LangChain

    for LLM workflow orchestration

Good-to-Have Skills

  • Experience working with

    broadcast/OTT platforms

    ,

    MAM (Media Asset Management)

    systems, or

    DAM (Digital Asset Management)

    workflows
  • Experience with

    generative imaging

    (Stable Diffusion, ControlNet)
  • Knowledge of

    video generation AI models

Qualifications

  • Bachelors or Master’s degree in Computer Science, Engineering, or related field
  • 6–12 years of experience in AI/ML engineering
  • Proven track record of building and deploying production-grade AI systems

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Techno Facts Solutions logo
Techno Facts Solutions

Information Technology Consulting

Tech City

RecommendedJobs for You

hyderabad, chennai, bengaluru

hyderabad, chennai, bengaluru