Sr PySpark Developer (Bigdata)

8 - 12 years

0 Lacs

Posted:2 days ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a highly skilled PySpark Developer with expertise in Distributed data processing, your role will involve optimizing Spark Jobs and ensuring efficient data processing in a Big Data platform. This requires a strong understanding of Spark performance tuning, distributed computing, and Big data architecture. Key Responsibilities: - Analyze and comprehend existing data ingestion and reconciliation frameworks - Develop and implement PySpark programs to process large datasets in Hive tables and Big data platforms - Perform complex transformations including reconciliation and advanced data manipulations - Fine-tune Spark jobs for performance optimization, ensuring efficient data processing at scale - Work closely with Data Engineers, Architects, and Analysts to understand data reconciliation requirements - Collaborate with cross-functional teams to improve data ingestion, transformation, and validation workflows Required Skills and Qualifications: - Extensive hands-on experience with Python, PySpark, and PyMongo for efficient data processing across distributed and columnar databases - Expertise in Spark Optimization techniques, and ability to debug Spark performance issues and optimize resource utilization - Proficiency in Python and Spark DataFrame API, and strong experience in complex data transformations using PySpark - Experience working with large-scale distributed data processing, and solid understanding of Big Data architecture and distributed computing frameworks - Strong problem-solving and analytical skills - Experience with CI/CD for data pipelines - Experience with SnowFlake for data processing and integration In addition to the technical skills required for this role, you should have 8+ years of relevant experience in Apps Development or systems analysis and the ability to adjust priorities quickly as circumstances dictate. Education: - Bachelors degree/University degree or equivalent experience in Computer science - Masters degree preferred Please note that Citi is an equal opportunity and affirmative action employer, and they invite all qualified interested applicants to apply for career opportunities. As a highly skilled PySpark Developer with expertise in Distributed data processing, your role will involve optimizing Spark Jobs and ensuring efficient data processing in a Big Data platform. This requires a strong understanding of Spark performance tuning, distributed computing, and Big data architecture. Key Responsibilities: - Analyze and comprehend existing data ingestion and reconciliation frameworks - Develop and implement PySpark programs to process large datasets in Hive tables and Big data platforms - Perform complex transformations including reconciliation and advanced data manipulations - Fine-tune Spark jobs for performance optimization, ensuring efficient data processing at scale - Work closely with Data Engineers, Architects, and Analysts to understand data reconciliation requirements - Collaborate with cross-functional teams to improve data ingestion, transformation, and validation workflows Required Skills and Qualifications: - Extensive hands-on experience with Python, PySpark, and PyMongo for efficient data processing across distributed and columnar databases - Expertise in Spark Optimization techniques, and ability to debug Spark performance issues and optimize resource utilization - Proficiency in Python and Spark DataFrame API, and strong experience in complex data transformations using PySpark - Experience working with large-scale distributed data processing, and solid understanding of Big Data architecture and distributed computing frameworks - Strong problem-solving and analytical skills - Experience with CI/CD for data pipelines - Experience with SnowFlake for data processing and integration In addition to the technical skills required for this role, you should have 8+ years of relevant experience in Apps Development or systems analysis and the ability to adjust priorities quickly as circumstances dictate. Education: - Bachelors degree/University degree or equivalent experience in Computer science - Masters degree preferred Please note that Citi is an equal opportunity and affirmative action employer, and they invite all qualified interested applicants to apply for career opportunities.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You