Posted:1 week ago| Platform: Foundit logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview

hands-on Testing Lead

clear documentation is as critical as good testing

What You'll Do

Own Quality & Documentation End-to-End

  • Define testing strategy for

    LLMs, VLMs, and DL pipelines

    .
  • Create and maintain

    clear, lightweight documentation

    covering:
    • Model testing strategies and assumptions
    • Evaluation metrics and acceptance criteria
    • Known limitations, risks, and failure modes
    • Release readiness and quality sign-off
  • Ensure documentation evolves with models, data, and prompts.

LLM / GenAI Testing

  • Design tests for:
    • Prompt templates and prompt changes
    • RAG pipelines (retrieval quality, grounding, hallucination control)
    • Multi-turn conversations and long-context behaviour
  • Maintain

    golden datasets

    , regression test suites, and test result summaries.
  • Document prompt behaviour, edge cases, and known model quirks.

Vision & Multimodal Testing

  • Test VLMs for image-text alignment, OCR, captioning, and reasoning.
  • Document model performance across different image types, quality levels, and domains.
  • Track and publish

    model behaviour changes

    between versions.

Automation, MLOps & Reporting

  • Build Python-based automation for evaluation and regression testing.
  • Integrate tests into

    CI/CD and MLOps pipelines

    .
  • Produce

    readable quality reports and dashboards

    for engineers and leadership.
  • Monitor and document production issues such as

    model/data drift and degradation

    .

Build a Quality-First Culture

  • Establish QA and documentation standards that scale with a startup.
  • Mentor engineers on writing testable code and meaningful documentation.
  • Act as the

    single source of truth

    for AI quality, testing, and known risks.

What we're looking For

Must-Have

  • Strong background in

    software testing with lead or ownership experience

    .
  • Hands-on experience testing

    LLMs, DL models, or GenAI systems

    .
  • Strong

    Python

    skills for test automation and data validation.
  • Proven ability to write

    clear, structured technical documentation

    .
  • Understanding of:
    • Transformer-based models and DL workflows
    • Model evaluation metrics and non-deterministic system testing
  • Comfortable working in ambiguity and moving fast in a startup.

Nice-to-Have

  • Experience with

    VLMs, multimodal models, or computer vision

    .
  • Exposure to

    RAG architectures

    , vector databases, and embeddings.
  • Familiarity with tools like LangChain, LlamaIndex, MLflow, or similar.
  • Experience documenting AI risks, limitations, or compliance requirements.

[HIDDEN TEXT]

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You

bengaluru, karnataka

mumbai metropolitan region