26274 PySpark Jobs

JobPe aggregates job listings for easy access, but you apply directly on the original job portal.

6.0 - 9.0 years

4 - 7 Lacs

chennai

On-site

We are seeking a highly skilled and experienced Data Engineer with 6-9 years of experience to design, develop, and maintain scalable data pipelines and infrastructure on AWS. The ideal candidate will have strong expertise in AWS Glue, Redshift, Athena, and related AWS services. The role demands end-to-end ownership of data workflows, performance optimization, and delivering reliable data solutions to support business intelligence, analytics, and machine learning initiatives. Design, build, and manage scalable and high-performance ETL/ELT pipelines using AWS Glue, PySpark, and Step Functions. Develop and optimize data warehousing solutions using Amazon Redshift for structured and semi-structu...

Posted 6 hours ago

12.0 years

0 Lacs

hyderabad, telangana, india

On-site

Job Title: Architect – Snowflake (Cloud Data) Job ID: POS-6710 Primary Skill: Snowflake Location: Hyderabad/Pune Primary Skills: Snowflake and Python Secondary Skills: AWS, Azure, GCP, Linux, Windows Environments, Redshift, and Databricks Job Location: Hyderabad Mode of work: Work from Office Experience: 12+ years of experience in Data Modeling/Data Warehousing and 5+ years in Snowflake. About The Job We are seeking a highly skilled Architect with expertise in Snowflake Data Modeling and Cloud Data solutions. The successful candidate will lead Snowflake optimizations at both warehouse and database levels, ensuring that Snowflake components are efficiently set up, configured, an...

Posted 7 hours ago

0 years

0 Lacs

bengaluru, karnataka, india

On-site

We are looking for a Commercial Data Analyst (x|f|m) for the Commercial Excellence Intelligence & Analytics Team based in Göttingen (Sartorius Stedim Biotech GmbH). In this role, you will support various Commercial Data & Analytics initiatives, providing sales and customer performance metrics, forecasts and insights to drive business growth and efficiency. If you're passionate about analytics and eager to contribute on a global level, we look forward to your application. The team consists of ten professionals and we are looking forward to shaping the future with you. Your tasks Work closely with sales teams to understand day-to-day challenges, data requirements, and overall behavior patterns...

Posted 8 hours ago

5.0 years

0 Lacs

hyderabad, telangana, india

On-site

Greetings from TCS!!! Come and join us for an exciting career with TCS!!! Date: 20th December 2025 Venue: Hyderabad - Synergy park, Non SEZ Role: Azure data Desired Experience Range: 5-10 years Location: Hyderabad Must Have Experiences and Skills: 3+ years of relevant experience in Pyspark and Azure Databricks. Proficiency in integrating, transforming, and consolidating data from various structured and unstructured data sources. Good experience in SQL or native SQL query languages. Strong experience in implementing Databricks notebooks using Python. Good experience in Azure Data Factory, ADLS, Storage Services, Serverless architecture, Azure functions. Advanced working knowledge in building ...

Posted 8 hours ago

5.0 years

0 Lacs

pune, maharashtra, india

On-site

Greetings from TCS!!! Come and join us for an exciting career with TCS!!! Role: DataStage Developer Desired Experience Range: 5-8 years Location: Pune/Hyderabad Must Have Experiences and Skills: 4+ years of experience in DataStage, 4+ years of experience in Delta Lake, 2+ years of experience in Teradata (BTEQ), strong in PySpark and SQL, experience in GCP cloud services

Posted 8 hours ago

0 years

0 Lacs

hyderabad, telangana, india

Remote

Working with Us Challenging. Meaningful. Life-changing. Those aren't words that are usually associated with a job. But working at Bristol Myers Squibb is anything but usual. Here, uniquely interesting work happens every day, in every department. From optimizing a production line to the latest breakthroughs in cell therapy, this is work that transforms the lives of patients, and the careers of those who do it. You'll get the chance to grow and thrive through opportunities uncommon in scale and scope, alongside high-achieving teams. Take your career farther than you thought possible. Bristol Myers Squibb recognizes the importance of balance and flexibility in our work environment. We offer a w...

Posted 10 hours ago

Experience not specified

0 Lacs

gurgaon, haryana, india

On-site

At American Express, our culture is built on a 175-year history of innovation, shared values and Leadership Behaviors, and an unwavering commitment to back our customers, communities, and colleagues. As part of Team Amex, you'll experience this powerful backing with comprehensive support for your holistic well-being and many opportunities to learn new skills, develop as a leader, and grow your career. Here, your voice and ideas matter, your work makes an impact, and together, you will help us define the future of American Express. How will you make an impact in this role? Launched in 2012, Amex Offers is a ...

Posted 13 hours ago

0 years

0 Lacs

jaipur, rajasthan, india

On-site

PF and Gratuity Medical Coverage included About Our Client The hiring company is a reputable player in the business services and recruitment industry. As a medium-sized enterprise, it focuses on leveraging technology to deliver exceptional results and innovative solutions. Job Description Design and develop data pipelines and workflows using Python, PySpark, and ADF. Collaborate with cross-functional teams to understand data requirements and implement solutions. Optimize and maintain SQL databases for efficient data storage and retrieval. Ensure data quality and accuracy through validation and cleansing processes. Develop and maintain documentation for data workflows and processes. Monitor a...

Posted 13 hours ago

6.0 - 9.0 years

0 Lacs

chennai, tamil nadu, india

On-site

We are seeking a highly skilled and experienced Data Engineer with 6-9 years of experience to design, develop, and maintain scalable data pipelines and infrastructure on AWS. The ideal candidate will have strong expertise in AWS Glue, Redshift, Athena, and related AWS services. The role demands end-to-end ownership of data workflows, performance optimization, and delivering reliable data solutions to support business intelligence, analytics, and machine learning initiatives. Design, build, and manage scalable and high-performance ETL/ELT pipelines using AWS Glue, PySpark, and Step Functions. Develop and optimize data warehousing solutions using Amazon Redshift for structured and semi-structu...

Posted 22 hours ago

4.0 - 8.0 years

0 Lacs

hyderabad, all india

On-site

As a Senior Data Engineer, you will be responsible for designing, developing, and optimizing high-performance data pipelines using Python, PySpark, SQL, and cloud platforms. Your role will involve optimizing distributed data processing, implementing best practices in data modeling and governance, ensuring data integrity and security, and automating data workflows. Additionally, you will provide technical leadership, mentor junior engineers, and drive best practices in data engineering. Key Responsibilities: - Design and develop high-performance, scalable data pipelines using PySpark, SQL, and cloud platforms. - Optimize distributed data processing for improved efficiency and reliability. - I...

Posted 23 hours ago

8.0 - 12.0 years

0 Lacs

all india, gurugram

On-site

Role Overview: YASH Technologies is a leading technology integrator focused on helping clients enhance competitiveness, optimize costs, and drive business transformation. They are currently seeking a Data Analytics Lead with 8+ years of experience to join their team. Key Responsibilities: - Build, manage, and nurture a high-performing team of data engineers and data analysts. - Collaborate with business and technical teams to prioritize platform ingestion requirements. - Lead the data analytics team, providing guidance and support for their professional growth. - Manage customer, partner, and internal data on cloud and on-premises platforms. - Evaluate current data technologies and trends, f...

Posted 23 hours ago

4.0 - 8.0 years

0 Lacs

hyderabad, all india

On-site

Role Overview: As a Data Engineer in our team on a contract basis, your primary responsibility will be to utilize your expertise in SQL, Python, and PySpark to work on data extraction, transformation, and processing. You will collaborate with internal teams and clients to ensure data accuracy, consistency, and integrity across platforms. Excellent communication skills are essential as the role involves direct client interaction. Key Responsibilities: - Develop and optimize advanced SQL queries for data extraction and transformation. - Utilize Python and PySpark to efficiently process large-scale data. - Collaborate with internal teams and clients to understand data requirements and implement...

Posted 23 hours ago

12.0 - 18.0 years

0 Lacs

chennai, all india

On-site

Role Overview: Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all. As a Principal Member of Technical Staff (PMTS), you will utilize your expertise in designing, developing, debugging, and maintaining AI-powered applications and data engineering workflows for both local and cloud environments. Your role will involve working on large-scale projects, optimizing AI/ML pipelines, and ensuring scalable data infrastructure. You will collaborate with AI/ML, Data Engineering, DevOps, and Product teams to deliver impactful solutions that enhance products and services. Key Responsibilities: - Develop AI-driven applications, micr...

Posted 23 hours ago

5.0 - 9.0 years

0 Lacs

all india, gurugram

On-site

Role Overview: As a Full Stack AI Developer at GlobalLogic, you will be responsible for developing and implementing AI solutions using a variety of programming languages and tools. You will work on projects that have a significant impact on clients around the world, contributing to innovative solutions and pushing the boundaries of what is possible in the AI field. Key Responsibilities: - Must Have: Proficiency in programming languages such as Python, Java/Scala - Must Have: Experience with data processing libraries like Pandas, NumPy, and Scikit-learn - Must Have: Proficient in distributed computing platforms such as Apache Spark (PySpark, Scala) and Torch - Must Have: Ability to develop AP...

Posted 23 hours ago

2.0 - 6.0 years

0 Lacs

all india, gurugram

On-site

Role Overview: You will be responsible for designing, developing, and maintaining high-quality software solutions using Python and Go. Collaborate with cross-functional teams to define, design, and ship new features. Architect and implement scalable and efficient APIs, ensuring they are well-documented and adhere to industry standards. You will optimize database queries for performance, work with large datasets to ensure data integrity, implement caching strategies, and participate in regular code reviews to provide constructive feedback to peers. Additionally, you will work closely with product managers, designers, and other engineers to deliver high-quality products. Key Responsibilities: ...

Posted 23 hours ago

2.0 - 6.0 years

0 Lacs

chennai, all india

On-site

As a Full Stack Data Engineer at our company, you will collaborate with Data Scientists and Product Development teams to create cutting-edge data products that align with our Company Objectives. Your responsibilities will include landing data, building new data products, enhancing existing ones, and collaborating with Analytics & Business partners to ensure solutions are production-ready. Key Responsibilities: - Utilize GCP services such as Big Query, Dataproc, Data Plex, DataFusion, Terraform, Tekton, Airflow, Cloud Storage, and Pub/Sub for data processing and management. - Demonstrate proficiency in Git or any other version control tool. - Possess 2+ years of coding experience in Python an...

Posted 23 hours ago

3.0 - 7.0 years

0 Lacs

pune, all india

On-site

Role Overview: YASH Technologies is a leading technology integrator focused on helping clients reimagine operating models, enhance competitiveness, optimize costs, foster exceptional stakeholder experiences, and drive business transformation. As an AWS Data Pipeline Professional, you will play a crucial role in designing, developing, and implementing cloud solutions on AWS, utilizing a wide range of AWS services. Your expertise in AWS core services, Python, and PySpark will be essential in analyzing business requirements and translating them into technical solutions for successful execution. You will also interact with customers, manage multiple cloud solution projects, and collaborate effec...

Posted 1 day ago

8.0 - 12.0 years

0 Lacs

chennai, all india

On-site

As a highly skilled PySpark Developer with expertise in Distributed data processing, your role will involve optimizing Spark Jobs and ensuring efficient data processing in a Big Data platform. This requires a strong understanding of Spark performance tuning, distributed computing, and Big data architecture. Key Responsibilities: - Analyze and comprehend existing data ingestion and reconciliation frameworks - Develop and implement PySpark programs to process large datasets in Hive tables and Big data platforms - Perform complex transformations including reconciliation and advanced data manipulations - Fine-tune Spark jobs for performance optimization, ensuring efficient data processing at sca...

Posted 1 day ago

5.0 - 9.0 years

0 Lacs

noida, all india

On-site

Role Overview: You will support system activities in a specific process area, including analysis, design, and overseeing the implementation of system changes to increase effectiveness and efficiency aligned with corporate strategies. Working in a cross-functional environment, you will collaborate closely with global and regional teams to optimize SAP and integrated non-SAP systems and processes, translating business needs into functional requirements. You will define what the capability must achieve and how success will be measured and ensure that solutions are robust, strategically aligned, and efficient to deliver and maintain. Additionally, you must understand integration points in an int...

Posted 1 day ago

5.0 - 9.0 years

0 Lacs

vadodara, all india

On-site

Role Overview: You will be responsible for leveraging your advanced expertise in time-series forecasting, predictive modeling, and deep learning to develop and implement reusable and scalable machine learning systems for time-series forecasting. Your role will involve analyzing, retraining, and fine-tuning machine learning models based on evolving data trends and business requirements to ensure long-term efficiency and reliability. Additionally, you will automate machine learning workflows using Azure Machine Learning as an MLOps technology and utilize PySpark for efficient data processing and analytics in large-scale environments. Collaboration on data engineering tasks in Azure Databricks ...

Posted 1 day ago

15.0 years

0 Lacs

bengaluru, karnataka, india

On-site

At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop robust data solutions for clients. They play a crucial role in transforming raw data into actionable insights, enabling informed decision-making and driving business growth. Those in artificial intelligence and machine learning at PwC will focus on developing and implementing advanced AI and ML solutions to drive innovation and enhance business processes. Your work will involve designing and optimising algorithms, models, and systems to enable intelligent decision-making and automation. Years of Experience: Candidates with 15+ years of hands on experience Must H...

Posted 1 day ago

Experience not specified

0 Lacs

bengaluru, karnataka, india

On-site

About PwC: PricewaterhouseCoopers (PwC) is a leading global consulting firm. For more than 160 years, PwC has worked to build trust in society and solve important problems for clients and the communities in which we live and work. Today we have more than 276,000 people across 157 countries working towards this goal. The US Advisory Bangalore Acceleration Center is a natural extension of our United States based consulting capabilities, providing support to a broad range of practice teams. Our US-owned ACs are fully integrated into our client facing teams and are key to PwC's success in the marketplace. Job Summary: At PwC, we are betting big on data, analytics, and a digital revolution to tra...

Posted 1 day ago

Exploring PySpark Jobs in India

PySpark, the Python API for Apache Spark, is in high demand in the Indian job market. As the need for big data processing and analysis grows, companies are actively seeking professionals with PySpark skills. If you are a job seeker looking to build a career in big data and analytics, PySpark roles in India are worth exploring.

Top Hiring Locations in India

Here are 5 major cities in India where companies are actively hiring for PySpark roles:

  1. Bangalore
  2. Pune
  3. Hyderabad
  4. Mumbai
  5. Delhi

Average Salary Range

The estimated salary range for PySpark professionals in India varies with experience. Entry-level professionals can expect to earn around INR 6-8 lakhs per annum, while experienced professionals can earn upwards of INR 15 lakhs per annum.

Career Path

In the field of PySpark, a typical career progression may look like this:

  1. Junior Developer
  2. Data Engineer
  3. Senior Developer
  4. Tech Lead
  5. Data Architect

Related Skills

In addition to PySpark, professionals in this field are often expected to have or develop skills in:

  • Python programming
  • Apache Spark
  • Big data technologies (Hadoop, Hive, etc.)
  • SQL
  • Data visualization tools (Tableau, Power BI)

Interview Questions

Here are 25 interview questions you may encounter when applying for PySpark roles:

  • Explain what PySpark is and its main features (basic)
  • What are the advantages of using PySpark over other big data processing frameworks? (medium)
  • How do you handle missing or null values in PySpark? (medium)
  • What is an RDD in PySpark? (basic)
  • What is a DataFrame in PySpark and how is it different from an RDD? (medium)
  • How can you optimize performance in PySpark jobs? (advanced)
  • Explain the difference between map and flatMap transformations in PySpark (basic)
  • What is the role of a SparkContext in PySpark? (basic)
  • How do you handle schema inference in PySpark? (medium)
  • What is a SparkSession in PySpark? (basic)
  • How do you join DataFrames in PySpark? (medium)
  • Explain the concept of partitioning in PySpark (medium)
  • What is a UDF in PySpark? (medium)
  • How do you cache DataFrames in PySpark for optimization? (medium)
  • Explain the concept of lazy evaluation in PySpark (medium)
  • How do you handle skewed data in PySpark? (advanced)
  • What is checkpointing in PySpark and how does it help in fault tolerance? (advanced)
  • How do you tune the performance of a PySpark application? (advanced)
  • Explain the use of Accumulators in PySpark (advanced)
  • How do you handle broadcast variables in PySpark? (advanced)
  • What are the different data sources supported by PySpark? (medium)
  • How can you run PySpark on a cluster? (medium)
  • What is the purpose of the PySpark MLlib library? (medium)
  • How do you handle serialization and deserialization in PySpark? (advanced)
  • What are the best practices for deploying PySpark applications in production? (advanced)
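
To make one of the basic questions above concrete, here is a minimal sketch of the difference between map and flatMap. It is written in plain Python so it runs without a Spark installation; the helper names `map_like` and `flat_map_like` are hypothetical stand-ins for illustration, not PySpark functions. In PySpark itself the corresponding calls would be `rdd.map(...)` and `rdd.flatMap(...)`.

```python
from itertools import chain


def map_like(func, records):
    """Mimic rdd.map(func): exactly one output element per input element."""
    return [func(record) for record in records]


def flat_map_like(func, records):
    """Mimic rdd.flatMap(func): func returns an iterable for each element,
    and the results are flattened by one level."""
    return list(chain.from_iterable(func(record) for record in records))


# Hypothetical sample data, assumed for illustration only.
lines = ["spark is fast", "pyspark is python"]

# map keeps one result per line, so the output is a list of word lists.
mapped = map_like(str.split, lines)

# flatMap flattens the per-line word lists into a single list of words.
flattened = flat_map_like(str.split, lines)

print(mapped)
print(flattened)
```

In an interview answer, the key point is that map preserves the number of elements while flatMap can produce zero, one, or many output elements per input element, which is why it is the usual choice for splitting lines into words.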

Closing Remark

As you explore PySpark jobs in India, remember to prepare thoroughly for interviews and showcase your expertise confidently. With the right skills and knowledge, you can excel in this field and advance your career in the world of big data and analytics. Good luck!

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
