Big Data Engineer - Python + PySpark + Spark

9 - 12 years

0 - 3 Lacs

Posted: 6 days ago | Platform: Naukri

Work Mode

Hybrid

Job Type

Full Time

Job Description

Experience: 9 - 12 years
Location: Mumbai / Chennai / Bangalore / Pune

  • Develop and maintain scalable data pipelines using PySpark and Spark SQL to process large datasets efficiently.
  • Write clean, reusable, and optimized Python code for data manipulation, analysis, and automation tasks.
  • Design and implement ETL workflows to extract, transform, and load data from various structured and unstructured sources.
  • Collaborate with data engineers, analysts, and stakeholders to understand data requirements and deliver solutions.
  • Tune Spark jobs for performance, efficient resource utilization, and minimal execution time.
  • Work with distributed computing frameworks to process and analyze big data in cloud or on-premises environments.
  • Develop and maintain unit tests to ensure the accuracy and reliability of data pipelines and transformations.
  • Use Spark SQL to query and manage large datasets stored in distributed systems such as Hadoop or cloud storage.
  • Monitor and troubleshoot data pipeline issues, ensuring reliability and timely delivery of data.
  • Stay current with advancements in PySpark, Spark SQL, and big data technologies to improve existing systems.

Hexaware Technologies

IT Services and IT Consulting

Navi Mumbai Maharashtra

10001 Employees

513 Jobs

    Key People

  • R Srikrishna, CEO
  • Hiten D. Tuli, CFO
