Senior Data Engineer (Data Lake, Forecasting & Governance) - 9+ yrs - Immediate

9 - 12 years


Posted: 1 day ago | Platform: LinkedIn


Work Mode

On-site

Job Type

Full Time

Job Description


Senior Data Engineer

The ideal candidate will be highly proficient in AWS data services, PySpark, and versioned storage formats such as Apache Hudi or Iceberg. A strong understanding of data quality, observability, governance, and metadata management in large-scale analytical systems is critical.


Roles & Responsibilities


  • Design and implement data lake zoning (Raw → Clean → Modeled) using Amazon S3, AWS Glue, and Athena.
  • Ingest structured and unstructured datasets including POS, USDA, Circana, and internal sales data.
  • Build versioned and upsert-ready ETL pipelines using Apache Hudi or Iceberg.
  • Create forecast-ready datasets with lagged, rolling, and trend features for revenue and occupancy modeling.
  • Optimize Athena datasets with partitioning, CTAS queries, and S3 metadata tagging.
  • Implement S3 lifecycle policies, intelligent file partitioning, and audit logging for performance and compliance.
  • Build reusable transformation logic using dbt-core or PySpark to support KPIs and time series outputs.
  • Integrate data quality frameworks such as Great Expectations, custom logs, and AWS CloudWatch for field-level validation and anomaly detection.
  • Apply data governance practices using tools like OpenMetadata or Atlan, enabling lineage tracking, data cataloging, and impact analysis.
  • Establish QA automation frameworks for pipeline validation, data regression testing, and UAT handoff.
  • Collaborate with BI, QA, and business teams to finalize schema design and deliverables for dashboard consumption.
  • Ensure compliance with enterprise data governance policies and enable discovery and collaboration through metadata platforms.
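
The forecast-ready dataset responsibility above (lagged, rolling, and trend features) can be sketched as follows. In production this would typically use PySpark window functions (`lag()`, `avg().over(Window.rowsBetween(...))`); the plain-Python version below shows the same logic, and all column names are illustrative, not from the posting.

```python
# Sketch of lag / rolling-window / trend feature construction for a
# revenue series, as used to prepare forecast-ready datasets. In PySpark
# this maps onto window functions; shown in plain Python for clarity.

def build_features(revenue, lag=1, window=3):
    """Return per-row dicts with lagged, rolling-mean, and trend features."""
    rows = []
    for i, value in enumerate(revenue):
        # Lag feature: value `lag` rows back (None until enough history).
        lagged = revenue[i - lag] if i >= lag else None
        # Rolling mean over the trailing `window` rows (shorter at the start).
        window_vals = revenue[max(0, i - window + 1): i + 1]
        rolling_mean = round(sum(window_vals) / len(window_vals), 2)
        # Trend feature: first difference against the previous row.
        trend = value - revenue[i - 1] if i >= 1 else None
        rows.append({"revenue": value, "lag_1": lagged,
                     "rolling_mean_3": rolling_mean, "trend": trend})
    return rows

features = build_features([100.0, 110.0, 105.0, 120.0])
```

The same shape of output (one lag, one rolling aggregate, one difference per row) is what a downstream revenue or occupancy model would consume.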


Preferred Candidate Profile

  • 9-12 years of experience in data engineering.

  • Deep hands-on experience with AWS Glue, Athena, S3, Step Functions, and Glue Data Catalog.
  • Strong command over PySpark, dbt-core, CTAS query optimization, and advanced partition strategies.
  • Proven experience with versioned ingestion using Apache Hudi, Iceberg, or Delta Lake.

  • Experience in data lineage, metadata tagging, and governance tooling using OpenMetadata, Atlan, or similar platforms.

  • Proficiency in feature engineering for time series forecasting (lags, rolling windows, trends).
  • Expertise in Git-based workflows, CI/CD, and deployment automation (Bitbucket or similar).
  • Strong understanding of time series KPIs: revenue forecasts, occupancy trends, demand volatility, etc.
  • Knowledge of statistical forecasting frameworks (e.g., Prophet, GluonTS, Scikit-learn).
  • Experience with Superset or Streamlit for QA visualization and UAT testing.
  • Experience building data QA frameworks and embedding data validation checks at each stage of the ETL lifecycle.
  • Independent thinker capable of designing systems that scale with evolving business logic and compliance requirements.
  • Excellent communication skills for collaboration with BI, QA, data governance, and business stakeholders.

  • High attention to detail, especially around data accuracy, documentation, traceability, and auditability.
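
The QA-framework expectation above (embedding validation checks at each ETL stage) can be sketched minimally in plain Python, in the spirit of frameworks like Great Expectations. The field names and rules below are hypothetical examples, not requirements from the posting.

```python
# Minimal sketch of field-level validation embedded in an ETL stage:
# each rule is a predicate per field; records failing any rule are
# quarantined with their errors instead of flowing downstream.
# Field names and rules are hypothetical.

def validate_batch(records, rules):
    """Apply per-field rules; return (passed_records, failures)."""
    passed, failures = [], []
    for idx, rec in enumerate(records):
        errors = [f"{field}: {rec.get(field)!r}"
                  for field, check in rules.items()
                  if not check(rec.get(field))]
        if errors:
            failures.append((idx, errors))  # quarantine with reasons
        else:
            passed.append(rec)
    return passed, failures

rules = {
    "store_id": lambda v: isinstance(v, str) and v != "",
    "revenue": lambda v: isinstance(v, (int, float)) and v >= 0,
}
good, bad = validate_batch(
    [{"store_id": "S1", "revenue": 250.0},
     {"store_id": "", "revenue": -5}],
    rules,
)
```

A real pipeline would emit the failure list to audit logs or CloudWatch metrics rather than returning it, so anomalies surface before UAT handoff.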
