Azure Data Engineer (Lead)

10 - 14 years

0 Lacs

Posted:2 days ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Senior Data Engineer at Infogain, you will be responsible for leading the design and execution of the Dataproc to Databricks PySpark migration roadmap. Your role will involve defining a modernization strategy encompassing data ingestion, transformation, orchestration, and governance. Additionally, you will architect scalable solutions using Delta Lake and Unity Catalog, ensuring optimal performance and cost efficiency. Key Responsibilities: - Lead the design and execution of Dataproc to Databricks PySpark migration roadmap. - Define a modernization strategy for data ingestion, transformation, orchestration, and governance. - Architect scalable Delta Lake and Unity Catalog-based solutions. - Manage and guide teams on code conversion, dependency mapping, and data validation. - Collaborate with platform, infrastructure, and DevOps teams to optimize compute costs and performance. - Own the automation & GenAI acceleration layer, integrating code parsers, lineage tools, and validation utilities. - Conduct performance benchmarking, cost optimization, and platform tuning (Photon, Auto-scaling, Delta Caching). - Mentor senior and mid-level developers, ensuring quality standards, documentation, and delivery timelines. Qualifications Required: - Languages: Python, PySpark, SQL - Platforms: Databricks (Jobs, Workflows, Delta Live Tables, Unity Catalog), GCP Dataproc - Data Tools: Hadoop, Hive, Pig, Spark (RDD & DataFrame APIs), Delta Lake - Cloud & Integration: GCS, BigQuery, Pub/Sub, Cloud Composer, Airflow - Automation: GenAI-powered migration tools, custom Python utilities for code conversion - Version Control & DevOps: Git, Terraform, Jenkins, CI/CD pipelines - Other: Performance tuning, cost optimization, and lineage tracking with Unity Catalog Preferred Experience: - 10-14 years of data engineering experience with at least 3 years leading Databricks or Spark modernization programs. - Proven success in migration or replatforming projects from Hadoop or Dataproc to Databricks. - Exposure to AI/GenAI in code transformation or data engineering automation. - Strong stakeholder management and technical leadership skills. About the Company: Infogain is a human-centered digital platform and software engineering company based out of Silicon Valley. They engineer business outcomes for Fortune 500 companies and digital natives in various industries using technologies such as cloud, microservices, automation, IoT, and artificial intelligence. Infogain is a Microsoft Gold Partner and Azure Expert Managed Services Provider. They have offices in California, Washington, Texas, the UK, the UAE, and Singapore, with delivery centers in multiple locations globally.,

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Infogain logo
Infogain

IT Services and IT Consulting

Los Gatos CA

RecommendedJobs for You