Posted: 2 months ago
Work from Office | Full Time
We are seeking a skilled and experienced Data Engineer Lead to join our team. The ideal candidate will have expertise in Apache Spark, PySpark, Python, and AWS services (particularly AWS Glue). You will be responsible for designing, building, and optimizing ETL processes and data workflows in the cloud, specifically on the AWS platform. Your work will focus on leveraging Spark-based frameworks, Python, and AWS services to efficiently process and manage large datasets.

Experience Range: 5 to 7 years

Key Responsibilities:
- Spark & PySpark Development: Design and implement scalable data processing pipelines using Apache Spark and PySpark to support large-scale data transformations.
- ETL Pipeline Development: Build, maintain, and optimize ETL processes for seamless data extraction, transformation, and loading across various data sources and destinations.
- AWS Glue Integration: Utilize AWS Glue to create, run, and monitor serverless ETL jobs for data transformations and integrations in the cloud.
- Python Scripting: Develop efficient, reusable Python scripts to support data manipulation, analysis, and transformation within the Spark and Glue environments.
- Data Pipeline Optimization: Ensure that all data workflows are optimized for performance, scalability, and cost-efficiency on the AWS Cloud platform.
- Collaboration: Work closely with data analysts, data scientists, and other engineering teams to create reliable data solutions that support business analytics and decision-making.
- Documentation & Best Practices: Maintain clear documentation of processes, workflows, and code while adhering to best practices in data engineering, cloud architecture, and ETL design.

Required Skills:
- Expertise in Apache Spark and PySpark for large-scale data processing and transformation.
- Hands-on experience with AWS Glue for building and managing ETL workflows in the cloud.
- Strong programming skills in Python, with experience in data manipulation, automation, and integration with Spark and Glue.
- In-depth knowledge of ETL principles and data pipeline design, including optimization techniques.
- Proficiency with AWS services such as S3, Glue, Lambda, and Redshift.
- Strong skills in writing optimized SQL queries, with a focus on performance tuning.
- Ability to translate complex business requirements into practical technical solutions.
- Familiarity with Apache Airflow for orchestrating data workflows.
- Knowledge of data warehousing concepts and cloud-native analytics tools.
UST
0.5 - 0.6 Lacs P.A.
Chennai, Tamil Nadu, India