AI Evaluation & Data Engineering Specialist

8 years

0 Lacs

Posted:1 day ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview
We are looking for AI Evaluation & Data Engineering Specialists to design, curate, and operationalize datasets and evaluation frameworks for AI product performance assessment. This role involves working with large language models (LLMs), human raters, and automation tools to measure model accuracy, correctness, and usability.Key Responsibilities
  • Build and maintain evaluation datasets for AI models across programming languages (Python, Golang, JavaScript, Java).
  • Develop and apply data labeling and scoring guidelines based on Google’s evaluation framework.
  • Implement LLM-judge calibration workflows to align automated and human evaluations.
  • Perform error analysis, drift detection, and regression testing of AI model outputs.
  • Collaborate with automation engineers to integrate datasets into evaluation pipelines.
  • Support rater training, inter-rater reliability checks, and dataset validation reviews.
  • Manage data quality assurance and documentation for contributions to Google-maintained repositories.

Required Skills & Experience

  • 4–8 years of experience in AI/ML data operations, evaluation, or data engineering.
  • Proficiency in Python (mandatory) for dataset manipulation, analysis, and scripting.
  • Experience with LLM evaluation, prompt engineering, or text generation quality assessment.
  • Familiarity with Gemini CLI, Vertex AI, or LangChain evaluation tools.
  • Strong understanding of data curation, annotation workflows, and labeling quality metrics.
  • Hands-on with Git-based repositories and CI/CD data workflows.
  • Excellent analytical and problem-solving skills with attention to detail.

Preferred Qualifications

  • Experience evaluating code-generation or NLP-based AI products.
  • Exposure to data governance and privacy compliance frameworks.
  • Background in computer science, data science, or linguistics preferred

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Virtusa logo
Virtusa

Information Technology and Services

Southborough

RecommendedJobs for You