GE Vernova is seeking a highly skilled AI QA Engineer to lead the development of benchmark frameworks for deployed AI systems and use cases. This role is critical to ensuring the accuracy, reliability, safety, and compliance of AI models driving innovation at GE Vernova. The ideal candidate will design and implement rigorous testing frameworks tailored to AI use cases unique to GE Vernova.

Job Description

Design, develop, and execute comprehensive benchmark frameworks to evaluate AI use cases, prompt performance, and robustness across a diverse array of use cases.
Develop automated testing suites to validate AI functionalities such as prediction accuracy, response consistency, case handling, bias detection, and model degradation over time.
Collaborate closely with Prompt Engineers, AI Agent Engineers, and product owners to define quality standards, acceptance criteria, and key performance indicators (KPIs) for AI deployments.
Conduct quantitative and qualitative evaluations of AI outputs, ensuring alignment with business objectives, regulatory standards, and ethical AI principles.
Monitor the performance of AI use cases in production, systematically identify issues, and recommend improvements to maintain high standards of AI quality and safety.
Document testing methodologies, framework designs, and quality metrics, and ensure thorough reporting to stakeholders.
Support continuous improvement initiatives by incorporating feedback loops and new benchmarking techniques as AI technologies evolve.
Provide training and mentorship on AI testing best practices within GE Vernova's Prompt Engineering and AI Agent Engineering team.
Stay current with emerging AI technologies, contribute to platform and tooling improvements, and share knowledge within the team.

Required Qualifications and Skills:

Master's degree in Computer Science, AI Engineering, Data Science, or a related field.
3+ years of experience in software quality assurance, with at least 2 years focused on AI/ML systems.
Proven track record designing and implementing AI benchmark frameworks or quality assurance strategies.
Strong programming skills in Python and experience with automated testing frameworks (e.g., pytest, Selenium).
Familiarity with RAG systems, vector databases, and GenAI architectures.
Deep understanding of machine learning models, LLMs, and AI validation methodologies.
Experience with AI fairness, bias detection, and responsible AI practices.
Knowledge of cloud environments (AWS, Azure, or GCP), containerization, and deployment pipelines.
Ability to analyze complex datasets and performance metrics quantitatively and qualitatively.
Strong communication skills, with the ability to document and present technical information clearly.
Hands-on experience with LLMs, prompt engineering, and natural language processing (NLP).
Knowledge of agent orchestration platforms and multi-agent systems (e.g., AutoGen, LangGraph, Model Context Protocol (MCP)).

AI QA Engineer - GenAI