Posted: 1 week ago
Work from Office
Full Time
We are seeking a skilled and forward-thinking Data Quality Engineer to advance the data trust, governance, and certification framework for our enterprise Data Lakehouse platform built on Databricks, Apache Iceberg, AWS (Glue, Glue Catalog, SageMaker Studio), Dremio, Atlan, and Power BI.
This role is critical in ensuring that data across Bronze (raw), Silver (curated), and Gold (business-ready) layers is certified, discoverable, and AI/BI-ready. You will design data quality pipelines, semantic layers, and governance workflows, enabling both Power BI dashboards and Conversational Analytics leveraging LLMs (Large Language Models).
Your work will ensure that all 9 dimensions of data quality (accuracy, completeness, consistency, timeliness, validity, uniqueness, integrity, conformity, reliability) are continuously met, so both humans and AI systems can trust and use the data effectively.
Build and maintain automated validation frameworks across Bronze, Silver, and Gold pipelines.
Develop tests for schema drift, anomalies, reconciliation, timeliness, and referential integrity (a minimal illustrative sketch follows this group of responsibilities).
Integrate validation into Databricks (Delta Lake, Delta Live Tables, Unity Catalog) and Iceberg-based pipelines.
Define data certification workflows ensuring only trusted data is promoted for BI/AI consumption.
Leverage Atlan and AWS Glue Catalog for metadata management, lineage, glossary, and access control.
Utilize Iceberg's schema evolution and time travel to ensure reproducibility and auditability.
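To give a flavour of the validation work above, here is a minimal PySpark sketch of a pre-promotion quality gate. The table names (silver.orders, silver.customers), column names, and the 24-hour timeliness SLA are hypothetical placeholders for illustration, not details taken from this role or its stack.

# Minimal sketch of a Silver-to-Gold quality gate; all names below are assumed.
from datetime import datetime, timedelta
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("silver-quality-gate").getOrCreate()

orders = spark.table("silver.orders")        # hypothetical Silver table
customers = spark.table("silver.customers")  # hypothetical reference table

total = orders.count()
checks = {}

# Completeness: key business column must not be null
checks["completeness_order_id"] = orders.filter(F.col("order_id").isNull()).count() == 0

# Uniqueness: primary key must be unique
checks["uniqueness_order_id"] = orders.select("order_id").distinct().count() == total

# Referential integrity: every order must point at an existing customer
checks["integrity_customer_fk"] = (
    orders.join(customers, "customer_id", "left_anti").count() == 0
)

# Timeliness: newest record must fall within an assumed 24-hour SLA
latest = orders.agg(F.max("ingested_at").alias("latest")).first()["latest"]
checks["timeliness_24h"] = latest is not None and latest >= datetime.utcnow() - timedelta(hours=24)

failed = [name for name, ok in checks.items() if not ok]
if failed:
    raise ValueError(f"Quality gate failed, blocking promotion to Gold: {failed}")
print("All checks passed; dataset can be promoted to Gold.")

In practice, checks like these would typically be expressed as Delta Live Tables expectations or a Great Expectations/Deequ/Soda suite so the results feed directly into the certification workflow.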
Build a governed semantic layer on gold data to support BI and AI-driven consumption.
Enable Power BI dashboards and self-service reporting with certified KPIs and metrics.
Partner with data stewards to align semantic models with business glossaries in Atlan.
Prepare and certify datasets that fuel conversational analytics experiences.
Collaborate with AI/ML teams to integrate LLM-based query interfaces (e.g., natural language to SQL) with Dremio, Databricks SQL, and Power BI.
Ensure LLM responses are grounded in high-quality, certified datasets, reducing hallucinations and maintaining trust (an illustrative grounding check appears below).
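One possible way to keep generated SQL grounded in certified data, sketched below, is to whitelist certified Gold tables and refuse to run any query that references other tables. The table names and the simple regex-based check are illustrative assumptions, not an actual API from Dremio, Databricks SQL, or Power BI.

# Illustrative grounding guard for NL-to-SQL output; names are assumed.
import re

CERTIFIED_TABLES = {"gold.sales_daily", "gold.customer_kpis"}  # assumed certified set

def is_grounded(sql: str) -> bool:
    """Allow the query only if every table it references is certified."""
    referenced = {
        t.lower()
        for t in re.findall(r"\b(?:from|join)\s+([\w.]+)", sql, flags=re.IGNORECASE)
    }
    return bool(referenced) and referenced.issubset(CERTIFIED_TABLES)

# generated_sql stands in for the output of an NL-to-SQL model
generated_sql = "SELECT region, SUM(revenue) FROM gold.sales_daily GROUP BY region"
if is_grounded(generated_sql):
    print("Query references only certified tables; safe to execute.")
else:
    raise ValueError("Generated SQL references non-certified tables; refusing to run.")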
Provide certified, feature-ready datasets for ML training and inference in SageMaker Studio.
Collaborate with ML engineers to ensure input data meets all 9 quality dimensions.
Establish monitoring for data drift and model reliability (see the drift-check sketch after this list).
Continuously enforce all 9 dimensions of data quality:
Accuracy, Completeness, Consistency, Timeliness, Validity, Uniqueness, Integrity, Conformity, Reliability.
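For the data drift monitoring mentioned above, one commonly used (here assumed) technique is the Population Stability Index; the sketch below computes PSI for a single feature between a training baseline and a new batch, using synthetic data in place of real SageMaker pipelines.

# PSI drift check sketch; thresholds and data are illustrative assumptions.
import numpy as np

def psi(baseline: np.ndarray, current: np.ndarray, bins: int = 10) -> float:
    """Population Stability Index over shared quantile bins; > 0.2 is a common alert threshold."""
    edges = np.quantile(baseline, np.linspace(0, 1, bins + 1))
    edges[0], edges[-1] = -np.inf, np.inf
    b_frac = np.histogram(baseline, bins=edges)[0] / len(baseline)
    c_frac = np.histogram(current, bins=edges)[0] / len(current)
    b_frac = np.clip(b_frac, 1e-6, None)
    c_frac = np.clip(c_frac, 1e-6, None)
    return float(np.sum((c_frac - b_frac) * np.log(c_frac / b_frac)))

# Synthetic stand-ins for a training baseline and a drifted inference batch
rng = np.random.default_rng(0)
baseline = rng.normal(0, 1, 10_000)
current = rng.normal(0.3, 1, 10_000)
print(f"PSI = {psi(baseline, current):.3f}")  # values above ~0.2 would trigger an alert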
5–10 years of experience in data engineering, data quality, or data governance roles.
Strong skills in Python, PySpark, and SQL.
Hands-on with Databricks (Delta Lake, Unity Catalog, Delta Live Tables) and Apache Iceberg.
Expertise in AWS data stack (S3, Glue ETL, Glue Catalog, Athena, EMR, Redshift, SageMaker Studio).
Experience with Power BI semantic modeling, DAX, and dataset certification.
Familiarity with Dremio or similar query engines (Trino, Presto).
Knowledge of Atlan or equivalent catalog/governance tools.
Experience with data quality testing frameworks (Great Expectations, Deequ, Soda).
Exposure to Conversational Analytics platforms or LLM-powered BI (e.g., natural language query over a Lakehouse or Power BI).
Experience integrating LLM pipelines (LangChain, OpenAI, AWS Bedrock, etc.) with enterprise data.
Familiarity with data observability tools (Monte Carlo, Bigeye, Datadog, Grafana).
Knowledge of data compliance frameworks (GDPR, CCPA, HIPAA).
Cloud certifications: AWS Data Analytics Specialty, Databricks Certified Data Engineer.
 
                Western Digital