Azure Data Engineer (Lead)

14 years

0 Lacs

Posted: 3 days ago | Platform: LinkedIn

Work Mode

On-site

Job Type

Full Time

Job Description

Roles & Responsibilities

Key Responsibilities

  • Lead design and execution of the Dataproc-to-Databricks PySpark migration roadmap.
  • Define modernization strategy, including data ingestion, transformation, orchestration, and governance.
  • Architect scalable Delta Lake and Unity Catalog–based solutions.
  • Manage and guide teams on code conversion, dependency mapping, and data validation.
  • Collaborate with platform, infra, and DevOps teams to optimize compute costs and performance.
  • Own the automation & GenAI acceleration layer, integrating code parsers, lineage tools, and validation utilities.
  • Conduct performance benchmarking, cost optimization, and platform tuning (Photon, Auto-scaling, Delta Caching).
  • Mentor senior and mid-level developers, ensuring quality standards, documentation, and delivery timelines.

Technical Skills

  • Languages: Python, PySpark, SQL
  • Platforms: Databricks (Jobs, Workflows, Delta Live Tables, Unity Catalog), GCP Dataproc
  • Data Tools: Hadoop, Hive, Pig, Spark (RDD & DataFrame APIs), Delta Lake
  • Cloud & Integration: GCS, BigQuery, Pub/Sub, Cloud Composer, Airflow
  • Automation: GenAI-powered migration tools, custom Python utilities for code conversion
  • Version Control & DevOps: Git, Terraform, Jenkins, CI/CD pipelines
  • Other: Performance tuning, cost optimization, and lineage tracking with Unity Catalog

Preferred Experience

  • 10–14 years of data engineering experience with at least 3 years leading Databricks or Spark modernization programs.
  • Proven success in migration or replatforming projects from Hadoop or Dataproc to Databricks.
  • Exposure to AI/GenAI in code transformation or data engineering automation.
  • Strong stakeholder management and technical leadership skills.

Experience

  • 11-12 Years

Skills

  • Primary Skill: Data Engineering
  • Sub Skill(s): Data Engineering
  • Additional Skill(s): Python, Apache Hadoop, Apache Hive, Apache Airflow, Synapse, Databricks, SQL, Apache Spark, Azure Data Factory, PySpark, GenAI Fundamentals, Cloud Pub/Sub, BigQuery

Infogain

IT Services and IT Consulting

Los Gatos, CA
