QA Engineer, GenAI

0 years

0 Lacs

Posted: 2 weeks ago | Platform: LinkedIn


Work Mode

On-site

Job Type

Full Time

Job Description

Make our Should-Costing & Negotiation Copilot answers correct, safe, and grounded, every time. You’ll define and own the quality bar for GenAI at Numberz.ai.


Outcomes:

Evaluation harness:


Safety


Grounding


Slice coverage:

Drift & staleness: Canary tests in place; MTTD < 24h for model/data drift; materials/FX freshness within SLA.


Observability

Cost/latency:


Skills:

Must-Haves

Python + Pytest; JSON Schema/strict parsers; CI (GitHub Actions).

LLM/RAG Testing

Grounding checks for RAG; retrieval coverage & freshness controls.

Safety Testing

Data Craftsmanship: Curate golden datasets, create slices, measure inter-annotator agreement (IAA); basic SQL; Docker.


Competencies:

How You Work

Pragmatism over dogma

Systems thinking

Ownership & clarity:

Collaboration:

Security & ethics mindset: Handle data and evals responsibly.


