SDET - AI

2 - 5 years

12 - 17 Lacs

Posted:3 weeks ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

In this role, you'll be at the forefront of testing cutting-edge technologies, including Large Language Models (LLMs), AI agents, and Generative AI systems. you'll play a critical role in validating the performance, reliability, fairness, and transparency of AI-powe'red applications ensuring they meet high standards for both quality and responsible use.

If you think like a tester, code like a developer, and break systems like a hacker Resilinc is your proving ground.
 

What You Will Do

    • Develop and implement QA strategies for AI-powe'red applications, focusing on accuracy, bias, fairness, robustness, and performance.
    • Design and execute automated and manual test cases to validate AI Agents/LLM models, APIs, and data pipelines and good understanding of data integrity, data models, etc
    • Assess AI models using quality metrics such as precision/recall and hallucination detection.
    • Test AI models for bias, fairness, explainability (XAI), drift, and adversarial robustness.
    • Validate prompt engineering, fine-tuning techniques, and model-generated responses for accuracy and ethical AI considerations.
    • Service/tool development.
    • Conduct scalability, latency, and performance testing for AI-driven applications.
    • Collaborate with data engineers to validate data pipelines, feature engineering processes, and model outputs.
    • Design, develop, and maintain automation scripts using Selenium and Playwright for API and web testing
    • Work closely with cross-functional teams to integrate automation best practices into the development lifecycle.
    • Identify, document, and track bugs while conducting detailed regression testing to ensure product quality.

What You Will Bring

    • Proven expertise in testing AI models, LLMs, and Generative AI applications, with hands-on experience in AI evaluation metrics and testing tools like Arize, MAIHEM, and LangTest.
    • Strong proficiency in Python for writing test scripts and automating model validation, along with a deep understanding of AI bias detection, adversarial testing, model explainability (XAI), and AI robustness.
    • Demonstrate strong SQL expertise for validating data integrity and backend processes, particularly in PostgreSQL and MySQL.
    • Strong analytical and problem-solving skills with keen attention to detail, along with excellent communication and documentation abilities to convey complex testing processes and results.

Why You Will Love It Here

    • Next-Level QA Go beyond traditional testing to challenge AI agents, LLMs, and GenAI systems with intelligent, self-evolving test strategies
    • Agentic AI Frontier Be at the forefront of validating autonomous, ethical AI in high-impact applications trusted by global enterprises
    • Full-Stack Test Engineering Combine Python, SQL, and tools like LangTest, Arize, Selenium Playwright to test everything from APIs to AI fairness
    • Purpose-Driven Mission Join a remote-first team that protects critical supply chains ensuring vital products reach people when they need them most

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Resilinc logo
Resilinc

Supply Chain Management

Walnut Creek

RecommendedJobs for You

hyderabad, pune, chennai, bengaluru

chennai, tamil nadu