3 - 8 years

15 - 25 Lacs

Posted:3 weeks ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Job Overview

Senior AI/ML Engineer

Key Responsibilities

  • Design, build, and fine-tune

    CV, NLP, speech, and multimodal transformer models

    .
  • Develop high-performance pipelines for

    real-time and near-real-time video processing

    .
  • Optimize model training and inference across

    multi-GPU

    and

    distributed environments

    using DeepSpeed, Horovod, and CUDA-level improvements.
  • Implement scalable and reliable ML pipelines using

    Ray, Airflow

    , and

    MLflow

    for tracking and workflow management.
  • Deploy models using

    KServe

    ,

    Triton Inference Server

    , and other high-performance serving solutions.
  • Build and optimize

    RAG pipelines

    , vector embeddings, and integrate vector databases using FAISS, Pinecone, Weaviate, etc.
  • Use frameworks like

    LangChain

    to orchestrate multimodal and retrieval-enhanced AI systems.
  • Collaborate with product, engineering, and data teams to deliver end-to-end AI features.
  • Ensure performance, scalability, and robustness of deployed AI systems.

Must-Have Skills

AI/ML Expertise

  • Strong proficiency in

    Computer Vision

    ,

    NLP

    ,

    Speech models

    , and

    multimodal transformers

    .
  • Experience with SOTA models like CLIP, BLIP, Whisper, ViT, SAM, LLaVA, etc.

Performance Engineering

  • Deep understanding of

    GPU optimization

    , memory management, and inference tuning.
  • Hands-on experience with

    distributed training frameworks

    such as DeepSpeed, Horovod, or PyTorch Distributed.

Real-Time Processing

  • Experience building

    real-time or near-real-time video analysis systems

    (streaming, processing, inferencing).

MLOps & Tools

  • Hands-on experience with:
    • MLflow

      experiment tracking, model registry
    • Ray

      – distributed job execution
    • Airflow

      – pipeline orchestration
  • Expertise deploying models with

    KServe

    and

    Triton Inference Server

    .
  • Experience with

    Vector DBs

    ,

    RAG frameworks

    , and

    LangChain

    .

Good-to-Have Skills

  • Experience working with

    broadcast/OTT platforms

    ,

    Media Asset Management (MAM)

    systems, or

    Digital Asset Management (DAM)

    workflows.
  • Exposure to

    generative AI for images or video

    , including

    Stable Diffusion

    or diffusion-based video generation models.
  • Understanding of media processing pipelines, compression, codecs, or streaming technologies.

Soft Skills

  • Strong problem-solving abilities and system-thinking mindset.
  • Excellent communication for explaining complex AI architectures.
  • Ability to collaborate effectively with cross-functional teams.
  • Strong documentation and design articulation skills.

Mock Interview

Practice Video Interview with JobPe AI

Start Artificial Intelligence Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Techno Facts Solutions logo
Techno Facts Solutions

Information Technology Consulting

Tech City

RecommendedJobs for You

pune, bengaluru, mumbai (all areas)