Posted:3 days ago|
Platform:
Work from Office
Full Time
The QA Engineer will own quality assurance across the ML lifecyclefrom raw data validation through feature engineering checks, model training/evaluation verification, batch prediction/optimization validation, and end-to-end (E2E) workflow testing. The role is hands-on with Python automation, data profiling, and pipeline test harnesses in Azure ML and Azure DevOps. Success means provably correct data, models, and outputs at production scale and cadence.
Test Strategy & Governance
Define an ML-specific Test Strategy covering data quality KPIs, feature consistency
checks, model acceptance gates (metrics + guardrails), and E2E run acceptance (timeliness, completeness, integrity).
Establish versioned test datasets & golden baselines for repeatable regression of
features, models, and optimizers.
Data Quality & Transformation
Validate raw data extracts and landed datalake data: schema/contract checks, null/outlier thresholds, time-window completeness, duplicate detection, site/material coverage.
Validate transformed/feature datasets: deterministic feature generation, leakage detection, drift vs. historical distributions, feature parity across runs (hash or statistical similarity tests).
Implement automated data quality checks (e.g., Great Expectations/pytest + Pandas/SQL) executed in CI and AML pipelines.
Model Training & Evaluation
Verify training inputs (splits, windowing, target leakage prevention) and
hyperparameter configs per site/cluster.
Automate metric verification (e.g., MAPE/MAE/RMSE, uplift vs. last model, stability
tests) with acceptance thresholds and champion/challenger logic.
Validate feature importance stability and sensitivity/elasticity sanity checks (price-
volume monotonicity where applicable).
Gate model registration/promotion in AML based on signed test artifacts and
reproducible metrics.
Predictions, Optimization & Guardrails
Validate batch predictions: result shapes, coverage, latency, and failure handling.
Test model optimization outputs and enforced guardrails: detect violations and prove idempotent writes to DB.
Verify API push to third party system (idempotency keys, retry/backoff, delivery receipts).
Pipelines & E2E
Build pipeline test harnesses for AML pipelines (data-gen nightly, training weekly,
prediction/optimization) including orchestrated synthetic runs and fault injection (missing slice, late competitor data, SB backlog).
Run E2E tests from raw data store -> ADLS -> AML -> RDBMS -> APIM/Frontend; assert
freshness SLOs and audit event completeness (Event Hubs -> ADLS immutable).
Automation & Tooling
Develop Python-based automated tests (pytest) for data checks, model metrics, and API contracts; integrate with Azure DevOps (pipelines, badges, gates).
Implement data-driven test runners (parameterized by site/material/model-version) and store signed test artifacts alongside models in AML Registry.
Create synthetic test data generators and golden fixtures to cover edge cases (price gaps, competitor shocks, cold starts).
Reporting & Quality Ops
Publish weekly test reports and go/no-go recommendations for promotions; maintain a
defect taxonomy (data vs. model vs. serving vs. optimization).
5 to 7+ years in QA with 3+ years focused on ML/Data systems (data pipelines + model validation).
Python automation (pytest, pandas, NumPy), SQL (PostgreSQL/Snowflake), and CI/CD (Azure
DevOps) for fully automated ML QA.
Strong grasp of ML validation: leakage checks, proper splits, metric selection
(MAE/MAPE/RMSE), drift detection, sensitivity/elasticity sanity checks.
Experience testing AML pipelines (pipelines/jobs/components), and message-driven integrations
(Service Bus/Event Hubs).
API test skills (FastAPI/OpenAPI, contract tests, Postman/pytest-httpx) + idempotency and retry
patterns.
Familiar with feature stores/feature engineering concepts and reproducibility.
Solid understanding of observability (App Insights/Log Analytics) and auditability requirements.
Bachelors or Masters degree in Computer Science, Information Technology, or related field.
Certification in Azure Data or ML Engineer Associate is a plus.
Dmag Marketing
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python Now0.6 - 0.8 Lacs P.A.
6.0 - 10.0 Lacs P.A.
bengaluru
7.0 - 7.0 Lacs P.A.
bengaluru
8.0 - 15.0 Lacs P.A.
pune, bengaluru
3.0 - 7.0 Lacs P.A.
mohali
3.0 - 7.0 Lacs P.A.
9.0 - 13.0 Lacs P.A.
pune
3.0 - 7.0 Lacs P.A.
bengaluru
9.0 - 13.0 Lacs P.A.
bengaluru
7.0 - 11.0 Lacs P.A.