Posted: 2 hours ago
On-site | Full Time
Job Title: PySpark Developer
Location: Chennai, Hyderabad, Kolkata
Work Schedule: Monday to Friday (5 days work from office)
Experience: 5+ years in backend development
Notice Period: Immediate to 15 days
Must-Have Experience: Python, PySpark, Amazon Redshift, PostgreSQL
About the Role:
We are looking for an experienced PySpark Developer with strong data engineering capabilities to design, develop, and optimize scalable data pipelines for large-scale data processing. The ideal candidate must possess in-depth knowledge of PySpark, SQL, and cloud-based data ecosystems, along with strong problem-solving skills and the ability to work with cross-functional teams.
Roles & Responsibilities:
- Design and develop robust, scalable ETL/ELT pipelines using PySpark to process data from various sources such as databases, APIs, logs, and files.
- Transform raw data into analysis-ready datasets for data hubs and analytical data marts.
- Build reusable, parameterized Spark jobs for batch and micro-batch processing (a minimal sketch follows this list).
- Optimize PySpark job performance to handle large and complex datasets efficiently.
- Ensure data quality, consistency, and lineage, and maintain thorough documentation across all ingestion workflows.
- Collaborate with Data Architects, Data Modelers, and Data Scientists to implement ingestion logic aligned with business requirements.
- Work with AWS-based data platforms (S3, Glue, EMR, Redshift) for data movement and storage.
- Support version control, CI/CD processes, and infrastructure-as-code practices as required.
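To illustrate the kind of reusable, parameterized batch job described above, here is a minimal PySpark sketch. All paths, column names, and arguments are hypothetical placeholders, not details from this posting.

```python
# Minimal sketch of a parameterized PySpark batch job: read raw JSON,
# standardize it, and write partitioned Parquet. Paths, column names,
# and arguments are hypothetical placeholders.
import argparse

from pyspark.sql import SparkSession, functions as F


def run(source_path: str, target_path: str, run_date: str) -> None:
    spark = SparkSession.builder.appName("ingest_events").getOrCreate()

    raw = spark.read.json(source_path)  # semi-structured JSON input

    cleaned = (
        raw.dropDuplicates(["event_id"])                        # basic dedup
           .filter(F.col("event_id").isNotNull())               # drop bad rows
           .withColumn("event_ts", F.to_timestamp("event_ts"))  # typed column
           .withColumn("run_date", F.lit(run_date))             # lineage tag
    )

    # Partitioning by run_date lets downstream marts prune by date.
    cleaned.write.mode("overwrite").partitionBy("run_date").parquet(target_path)
    spark.stop()


if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument("--source-path", required=True)
    parser.add_argument("--target-path", required=True)
    parser.add_argument("--run-date", required=True)
    args = parser.parse_args()
    run(args.source_path, args.target_path, args.run_date)
```

A job like this can be submitted with spark-submit and scheduled per run date by an orchestrator such as AWS Step Functions.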
Must-Have Skills:
- 5+ years of data engineering experience, with a strong focus on PySpark/Spark.
- Proven experience building data pipelines and ingestion frameworks for relational, semi-structured (JSON, XML), and unstructured data (logs, PDFs).
- Strong knowledge of Python and related data processing libraries.
- Advanced SQL proficiency (Amazon Redshift, PostgreSQL, or similar); a short Spark SQL example follows this list.
- Hands-on expertise with distributed computing frameworks such as Spark on EMR or Databricks.
- Familiarity with workflow orchestration tools like AWS Step Functions or similar.
- Good understanding of data lake and data warehouse architectures, including fundamental data modeling concepts.
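As a hedged example of the "advanced SQL" expected here, the sketch below computes each customer's most recent order with a window function run through Spark SQL. The orders dataset and its columns are assumed for illustration; the same ROW_NUMBER pattern runs on Amazon Redshift and PostgreSQL as well.

```python
# Hedged example of window-function SQL run through Spark; the "orders"
# dataset and its columns are assumed for illustration only.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("sql_demo").getOrCreate()

# Register a (placeholder) Parquet dataset as a temp view for SQL access.
spark.read.parquet("s3://example-bucket/orders/").createOrReplaceTempView("orders")

# Latest order per customer via ROW_NUMBER() over a per-customer window.
latest_orders = spark.sql("""
    SELECT customer_id, order_id, order_ts
    FROM (
        SELECT customer_id, order_id, order_ts,
               ROW_NUMBER() OVER (
                   PARTITION BY customer_id
                   ORDER BY order_ts DESC
               ) AS rn
        FROM orders
    ) ranked
    WHERE rn = 1
""")
latest_orders.show()
```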
Good-to-Have Skills:
- Experience with AWS data services: Glue, S3, Redshift, Lambda, CloudWatch.
- Exposure to Delta Lake or similar large-scale storage technologies.
- Experience with real-time streaming tools such as Spark Structured Streaming or Kafka (see the streaming sketch after this list).
- Understanding of data governance, lineage, and cataloging tools (AWS Glue Catalog, Apache Atlas).
- Knowledge of DevOps/CI-CD pipelines using Git, Jenkins.
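For the streaming exposure mentioned above, a minimal Spark Structured Streaming sketch reading from Kafka might look like the following. The broker address, topic, schema, and output paths are placeholders, and running it requires the spark-sql-kafka connector package on the Spark classpath.

```python
# Hedged sketch of Spark Structured Streaming reading JSON events from
# Kafka. Broker, topic, schema, and paths are placeholders; the
# spark-sql-kafka connector must be on the Spark classpath.
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.types import StructType, StructField, StringType, TimestampType

spark = SparkSession.builder.appName("stream_demo").getOrCreate()

event_schema = StructType([
    StructField("event_id", StringType()),
    StructField("event_ts", TimestampType()),
])

events = (
    spark.readStream.format("kafka")
         .option("kafka.bootstrap.servers", "broker:9092")  # placeholder
         .option("subscribe", "events")                     # placeholder topic
         .load()
         # Kafka delivers bytes: decode the value and parse the JSON payload.
         .select(F.from_json(F.col("value").cast("string"), event_schema).alias("e"))
         .select("e.*")
)

query = (
    events.writeStream.format("parquet")
          .option("path", "s3://example-bucket/stream-out/")          # placeholder
          .option("checkpointLocation", "s3://example-bucket/ckpt/")  # required
          .start()
)
query.awaitTermination()
```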
Job Type: Full-time
Pay: ₹1,500,000.00 - ₹2,000,000.00 per year
Work Location: In person
Recruitment Hub 365