Automation & Benchmarking Engineer

4 - 8 years

5 - 9 Lacs

Posted:15 hours ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Automation & Benchmarking Engineer (2 Roles)

Role Overview

We are seeking

Automation & Benchmarking Engineers

to build scalable pipelines and tools that automate dataset ingestion, evaluation, and benchmarking of AI systems. This role blends

software engineering

,

automation scripting

, and

AI performance analysis

to accelerate evaluation workflows and generate insights across Googles AI tools.

Key Responsibilities

  • Design and develop

    end-to-end automation pipelines

    for evaluation workflowsprompt submission, response collection, result aggregation, and reporting.
  • Integrate evaluation tooling with developer surfaces like

    Gemini CLI, VS Code, and GitHub

    .
  • Conduct

    competitive benchmarking

    against peer AI tools to measure correctness, verbosity, and usefulness.
  • Build dashboards and visualization reports using

    Looker Studio, BigQuery, or Python-based tools

    .
  • Optimize system performance, automate error logging, and maintain reproducibility across evaluations.
  • Collaborate with TPM and data specialists to deliver evaluation automation at scale.
  • Ensure source code management and deployment compliance in

    GitLab / Bitbucket

    environments.

Required Skills & Experience

  • 4-8 years of experience in

    software engineering, test automation, or AI evaluation tooling

    .
  • Proficiency in

    Python, JavaScript, or Go

    for automation and data handling.
  • Experience building pipelines or automation frameworks (Airflow, Beam, or custom orchestration).
  • Strong understanding of

    REST APIs

    ,

    LLM integration

    , and

    evaluation metrics computation

    .
  • Hands-on experience with

    data visualization tools

    (Looker, Tableau, Plotly).
  • Familiarity with

    cloud platforms (GCP preferred)

    and

    source control workflows

    (GitHub/GitLab).

Preferred Qualifications

  • Experience benchmarking AI-assisted developer tools (e.g., Copilot, TabNine, Replit).
  • Knowledge of

    ML model evaluation metrics

    and

    comparative performance analysis

    .
  • Background in computer science, applied ML, or automation frameworks.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Virtusa logo
Virtusa

Information Technology and Services

Southborough

RecommendedJobs for You