
Senior Data Engineer

Experience

7 - 12 years

Salary

15 - 30 Lacs

Posted: 18 hours ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

We are seeking a highly skilled Senior Data Engineer with deep expertise in AWS data services, data wrangling using Python & PySpark, and a solid understanding of data governance, lineage, and quality frameworks. The ideal candidate will have a proven track record of delivering end-to-end data pipelines for logistics, supply chain, enterprise finance, or B2B analytics use cases.

Role & responsibilities

  • Design, build, and optimize ETL pipelines using AWS Glue 3.0+ and PySpark (see the Glue job sketch after this list).

  • Implement scalable and secure data lakes using Amazon S3, following bronze/silver/gold zoning.

  • Write performant SQL using AWS Athena (Presto) with CTEs, window functions, and aggregations (see the Athena query sketch after this list).

  • Take full ownership from ingestion through transformation, validation, and metadata documentation to dashboard-ready output.

  • Build pipelines that are not just performant, but audit-ready and metadata-rich from the first version.

  • Integrate classification tags and ownership metadata into all columns using AWS Glue Catalog tagging conventions.

  • Ensure no pipeline moves to the QA or BI teams without completed validation logs and field-level metadata.

  • Develop job orchestration workflows using AWS Step Functions integrated with EventBridge or CloudWatch (see the orchestration sketch after this list).

  • Manage schemas and metadata using AWS Glue Data Catalog.

  • Enforce data quality using Great Expectations, with checks for null %, ranges, and referential rules (see the validation sketch after this list).

  • Track data lineage with OpenMetadata or Amundsen and add metadata classifications (e.g., PII, KPIs).

  • Collaborate with data scientists on ML pipelines, handling JSON/Parquet I/O and feature engineering.

  • Prepare flattened, filterable datasets for BI tools like Sigma, Power BI, or Tableau.

  • Interpret business metrics such as forecasted revenue, margin trends, occupancy/utilization, and volatility.

  • Work with consultants, QA, and business teams to finalize KPIs and logic.

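The Glue job sketch referenced in the list above. This is a minimal, hypothetical illustration only: the bucket paths, dataset name, and column names are invented placeholders, not details of this role's actual pipelines.

```python
# Minimal sketch of an AWS Glue 3.0+ PySpark job promoting raw bronze-zone
# data to a cleaned silver zone. All paths and columns are placeholders.
import sys

from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext
from pyspark.sql import functions as F

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext.getOrCreate())
spark = glue_context.spark_session
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read raw JSON landed in the bronze zone.
bronze = spark.read.json("s3://example-lake/bronze/shipments/")

# Basic cleanup: drop rows missing the key, fix types, stamp the load date.
silver = (
    bronze.dropna(subset=["shipment_id"])
    .withColumn("shipped_at", F.to_timestamp("shipped_at"))
    .withColumn("load_date", F.current_date())
)

# Write partitioned Parquet to the silver zone for Athena and BI consumption.
silver.write.mode("overwrite").partitionBy("load_date").parquet(
    "s3://example-lake/silver/shipments/"
)

job.commit()
```
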
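The Athena query sketch referenced above: a CTE feeding a window function, submitted through boto3. The database, table, and results bucket are hypothetical.

```python
# Sketch: run a CTE + window-function query on Athena via boto3.
# Database, table, and output location are illustrative placeholders.
import boto3

QUERY = """
WITH daily AS (
    SELECT shipment_date, region, SUM(revenue) AS daily_revenue
    FROM logistics.shipments
    GROUP BY shipment_date, region
)
SELECT shipment_date,
       region,
       daily_revenue,
       AVG(daily_revenue) OVER (
           PARTITION BY region
           ORDER BY shipment_date
           ROWS BETWEEN 6 PRECEDING AND CURRENT ROW
       ) AS revenue_7d_avg
FROM daily
"""

athena = boto3.client("athena", region_name="us-east-1")
response = athena.start_query_execution(
    QueryString=QUERY,
    QueryExecutionContext={"Database": "logistics"},
    ResultConfiguration={"OutputLocation": "s3://example-athena-results/"},
)
print(response["QueryExecutionId"])
```
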
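The orchestration sketch referenced above: a one-state Step Functions machine that runs a Glue job through the synchronous service integration, which an EventBridge schedule rule could then trigger. Every name and ARN here is a placeholder.

```python
# Sketch: register a minimal Step Functions state machine that runs a Glue
# job synchronously. State machine name, job name, and ARNs are placeholders.
import json

import boto3

DEFINITION = {
    "Comment": "Run the example ETL Glue job",
    "StartAt": "RunGlueJob",
    "States": {
        "RunGlueJob": {
            "Type": "Task",
            "Resource": "arn:aws:states:::glue:startJobRun.sync",
            "Parameters": {"JobName": "example-etl-job"},
            "End": True,
        }
    },
}

sfn = boto3.client("stepfunctions", region_name="us-east-1")
response = sfn.create_state_machine(
    name="example-etl-orchestration",
    definition=json.dumps(DEFINITION),
    roleArn="arn:aws:iam::123456789012:role/example-sfn-role",
)
print(response["stateMachineArn"])
```
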
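The validation sketch referenced above, assuming the classic (0.x-style) Great Expectations pandas API; newer releases expose a different entry point, and every column name and threshold here is a made-up example.

```python
# Sketch: null-%, range, and referential checks with Great Expectations
# (classic pandas-dataset API). Columns and thresholds are placeholders.
import great_expectations as ge
import pandas as pd

df = ge.from_pandas(pd.read_parquet("silver/shipments/"))

# Null % check: at least 99% of shipment_id values must be non-null.
df.expect_column_values_to_not_be_null("shipment_id", mostly=0.99)

# Range check: utilization should fall between 0 and 1.
df.expect_column_values_to_be_between("utilization", min_value=0, max_value=1)

# Referential rule: region codes must come from a known reference set.
df.expect_column_values_to_be_in_set("region", ["NORTH", "SOUTH", "EAST", "WEST"])

# Run the suite and keep the result for the validation log handed to QA.
results = df.validate()
print(results.success)
```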

Preferred candidate profile

  • Strong hands-on experience with AWS: Glue, S3, Athena, Step Functions, EventBridge, CloudWatch, Glue Data Catalog.

  • Programming skills in Python 3.x, PySpark, and SQL (Athena/Presto).

  • Proficient with Pandas and NumPy for data wrangling, feature extraction, and time series slicing.

  • Strong command of data governance tools such as Great Expectations and OpenMetadata / Amundsen.

  • Familiarity with tagging sensitive metadata (PII, KPIs, model inputs).

  • Capable of creating audit logs for QA and rejected data.

  • Experience in feature engineering: rolling averages, deltas, and time-window tagging (see the sketch after this list).

  • BI-readiness with Sigma, with exposure to Power BI / Tableau (nice to have).
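
A sketch of the feature-engineering patterns named above (rolling averages, deltas, time-window tagging) in plain pandas; the data and column names are invented for illustration.

```python
# Sketch: rolling average, day-over-day delta, and time-window tagging.
# The DataFrame contents are invented illustration data.
import pandas as pd

df = pd.DataFrame(
    {
        "date": pd.date_range("2024-01-01", periods=10, freq="D"),
        "volume": [120, 135, 128, 150, 160, 155, 170, 165, 180, 175],
    }
).set_index("date")

# Rolling 7-day average over the datetime index.
df["volume_7d_avg"] = df["volume"].rolling("7D").mean()

# Day-over-day delta.
df["volume_delta"] = df["volume"].diff()

# Time-window tagging: label each row with its week for slicing.
df["week_tag"] = df.index.to_period("W").astype(str)

print(df)
```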
