Sr AI Engineers

0 - 5 years

0 Lacs

Posted:1 month ago| Platform: Indeed logo

Apply

Work Mode

On-site

Job Description

About the Role
We are seeking a skilled and hands-on Sr. AI Engineer with 4–5 years of experience in developing, fine-tuning, and deploying machine learning and deep learning models, including Generative AI systems. The ideal candidate has a strong foundation in classification, anomaly detection, and time-series modeling, along with experience in Transformer-based architectures. Expertise in model optimization, quantization, and Retrieval-Augmented Generation (RAG) pipelines is highly desirable.


Responsibilities


  • Design, train, and evaluate ML models for classification, anomaly detection, forecasting, and natural language understanding tasks.


  • Build and fine-tune deep learning models, including RNNs, GRUs, LSTMs, and Transformer architectures (e.g., BERT, T5, GPT).


  • Develop and deploy Generative AI solutions, including RAG pipelines for applications such as document search, Q&A, and summarization.


  • Apply model optimization techniques, including quantization, to improve latency and reduce memory/compute overhead in production.


  • Fine-tune large language models (LLMs) using Supervised Fine-Tuning (SFT) and Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA or QLoRA (optional).


  • Define, track, and report relevant evaluation metrics; monitor model drift and retrain models as required.


  • Collaborate with cross-functional teams (data engineering, backend, DevOps) to productionize ML models using CI/CD pipelines.


  • Maintain clean, reproducible code, and proper documentation and versioning of experiments.


Required Skills & Qualifications


  • 4–5 years of hands-on experience in machine learning, deep learning, or data science roles.


  • Proficiency in Python and ML/DL libraries: scikit-learn, pandas, PyTorch, TensorFlow.


  • Strong understanding of traditional ML and deep learning, particularly for sequence and NLP tasks.


  • Experience with Transformer models and open-source LLMs (e.g., Hugging Face Transformers).


  • Familiarity with Generative AI tools and RAG frameworks (e.g., LangChain, LlamaIndex).


  • Experience in model quantization (dynamic/static, INT8) and deploying models in resource-constrained environments.


  • Knowledge of vector stores (e.g., FAISS, Pinecone, Azure AI Search), embeddings, and retrieval techniques.


  • Proficiency in evaluating models using statistical and business metrics.


  • Experience with model deployment, monitoring, and performance tuning in production.


  • Familiarity with Docker, MLflow, and CI/CD practices.


Preferred Qualifications


  • Experience fine-tuning LLMs (SFT, LoRA, QLoRA) on domain-specific datasets.


  • Exposure to MLOps platforms (e.g., SageMaker, Vertex AI, Kubeflow).


  • Familiarity with distributed data processing frameworks (e.g., Spark) and orchestration tools (e.g., Airflow).


  • Contributions to research papers, blogs, or open-source projects in ML/NLP/Generative AI.


    Job Opening ID

    RRF_6145

    Job Type

    Permanent

    Industry

    IT Services

    Date Opened

    18/11/2025

    City

    Pune City

    Province

    Maharashtra

    Country

    India

    Postal Code

    411001

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You