Jobs
Interviews

110 Pyspark Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

Project Management Lead-Tech
Ameriprise India

8.0 - 10.0 years

Andhra pradesh

On-site

Project Manager will be responsible for developing and managing technology initiatives and their cost, time and scope. The primary responsibilities will include: project and task management, financial and resource management and delivery management. Responsibilities: Create & manage project plans for small to medium enhancement/initiatives. Define project schedules, allocate resources and monitor progress. Align project objectives with company goals, and make sure project team is clear on objectives. Deliver and install technology solutions. Tracking and managing financials. Help project team with the design and development tasks. Lead process of issue identification and resolution. Risk man...

Posted Just now

AI Match Score
Apply
Azure Data Engineer
Tata Consultancy Services

5.0 - 10.0 years

Varanasi, Uttar pradesh, India

On-site

Greetings From TCS! Azure Data Engineer 5 to 10 years PAN India Python/Pyspark Experience developing in Azure with key data technologies (e.g. ADLS, ADF, Azure Databricks etc.) Software development methodologies Experience working with at least one DevOps tool (GIT, Azure DevOps, Maven, Jenkins) Prior roles that demonstrate utilisation of high-quality Agile development best practices. In depth knowledge of at least one scheduling tool (e.g. Control-M, Oozie, ADF, etc.) Follow me for more Job updates!

Posted 4 hours ago

AI Match Score
Apply
Custom Software Engineer
Accenture

7.0 - 12.0 years

Bengaluru

Work from Office

Project Role : Custom Software Engineer Project Role Description : Develop custom software solutions to design, code, and enhance components across systems or applications. Use modern frameworks and agile practices to deliver scalable, high-performing solutions tailored to specific business needs. Must have skills : Databricks Unified Data Analytics Platform Good to have skills : Apache Airflow, PySpark Minimum 7.5 year(s) of experience is required Educational Qualification : 15 years full time education Summary : As a Custom Software Engineer, you will engage in a dynamic work environment where you will analyze, design, code, and test various components of application code for multiple clie...

Posted 20 hours ago

AI Match Score
Apply
Custom Software Engineer
Accenture

5.0 - 10.0 years

Bengaluru

Work from Office

Project Role : Custom Software Engineer Project Role Description : Develop custom software solutions to design, code, and enhance components across systems or applications. Use modern frameworks and agile practices to deliver scalable, high-performing solutions tailored to specific business needs. Must have skills : Databricks Unified Data Analytics Platform Good to have skills : Apache Airflow, PySpark Minimum 5 year(s) of experience is required Educational Qualification : 15 years full time education Summary : As a Custom Software Engineer, you will engage in a dynamic work environment where you will analyze, design, code, and test various components of application code for multiple client...

Posted 20 hours ago

AI Match Score
Apply
Project Management Lead-Tech
Ameriprise India

8.0 - 10.0 years

Andhra pradesh

On-site

Project Manager will be responsible for developing and managing technology initiatives and their cost, time and scope. The primary responsibilities will include: project and task management, financial and resource management and delivery management. Responsibilities: Create & manage project plans for small to medium enhancement/initiatives. Define project schedules, allocate resources and monitor progress. Align project objectives with company goals, and make sure project team is clear on objectives. Deliver and install technology solutions. Tracking and managing financials. Help project team with the design and development tasks. Lead process of issue identification and resolution. Risk man...

Posted 1 day ago

AI Match Score
Apply
Data Analyst-Data Management and Analytics-Data Ops/Data Analyst
EXL

5.0 - 9.0 years

Pune, Maharashtra, India

On-site

5 - 9 years of professional experience as a Data Analyst with good decision-making, analytical and problem-solving skills. Working knowledge / experience of Big Data frameworks like Hadoop, Hive and Spark. Hands-on experience in query languages like HQL or SQL (Spark SQL) for Data exploration. Data mapping: Determine the data mapping required to join multiple data sets together across multiple sources. Documentation - Data Mapping, Subsystem Design, Technical Design, Business Requirements. Exposure to Logical to Physical Mapping, Data Processing Flow to measure the consistency, etc. Data Asset design / build: Working with the data model / asset generation team to identify critical data eleme...

Posted 1 day ago

AI Match Score
Apply
SENIOR SOFTWARE ENGINEER - Test Lead Automation Engineer
Uplers

6.0 - 10.0 years

Chennai, All india

On-site

As a Test Lead Automation Engineer specializing in Python and Selenium, you will play a crucial role in ensuring the quality and reliability of software products. Your responsibilities will include: - Develop, troubleshoot, and debug application enhancements using Python, PySpark, Selenium, PlayWright, and SQL. - Create automation frameworks and scalable test suites across technologies. - Maintain test plans, procedures, and scripts, and perform product level integration tests. - Implement, execute, and debug automated test scripts using various technologies and tools. - Perform manual testing across all service functionalities before automation. - Collaborate with quality and development en...

Posted 1 day ago

AI Match Score
Apply
Data Engineer
CGI

6.0 - 8.0 years

Hyderabad

Hybrid

Job Title: Data Engineer Experience: 6 to 8 Years Job Id: J0226-0467 Location: Hyderabad CGI is looking for a skilled Data Engineer to design, build, and maintain scalable data pipelines and systems. The ideal candidate will work closely with data scientists, analysts, and business teams to ensure high-quality, reliable, and optimized data solutions. Key Responsibilities: Design, develop, and maintain scalable data pipelines and ETL processes Process and transform large volumes of structured and unstructured data Build and optimize data workflows using Python and PySpark Work with Hadoop ecosystem for distributed data processing Develop and optimize complex queries using SQL Ensure data qual...

Posted 1 day ago

AI Match Score
Apply
Databricks Tech Lead / Data Analyst / Engineer
Han Digital Solution

10.0 - 14.0 years

Chennai

On-site

As a senior-level Databricks Tech Lead / Data Engineer / Data Analyst, your primary role will involve designing, developing, and implementing scalable, secure, and cost-efficient data solutions on AWS using Databricks. Your responsibilities will include leading the migration of data assets from on-premises to AWS, developing PySpark applications, building data models, and generating actionable insights to support business solutions. You will also be responsible for identifying, gathering, and consolidating data from diverse sources, ensuring data integrity, and automating repetitive data preparation tasks for efficiency and scalability. Operating independently, you will maintain clear docume...

Posted 1 day ago

AI Match Score
Apply
Lead/Staff Data Scientist
Agoda

4.0 - 8.0 years

Delhi, All india

On-site

As a data scientist at Agoda in Bangkok, you will be part of the Data Science and Machine Learning (AI/ML) team working on challenging projects involving dynamic pricing, customer intent prediction, search result ranking, content classification, personalization, and algorithm-supported promotions. You will have the opportunity to leverage a large ML infrastructure to innovate the user experience. - Design, code, experiment, and implement models and algorithms to enhance customer experience, business outcomes, and infrastructure readiness. - Analyze a vast amount of customer data and user-generated events to derive actionable insights for improvements and innovation. - Collaborate with develo...

Posted 1 day ago

AI Match Score
Apply
Advisory Services Consultant - PowerBI, Data Modeling + SQL
Optum

7.0 years

4 - 6 Lacs

Noida

On-site

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together. This position will design, develop, implement, test, deploy, monitor, and maintain the delivery of high-performance...

Posted 2 days ago

AI Match Score
Apply
Project Management Lead-Tech
Ameriprise India

8.0 - 10.0 years

Andhra pradesh

On-site

Project Manager will be responsible for developing and managing technology initiatives and their cost, time and scope. The primary responsibilities will include: project and task management, financial and resource management and delivery management. Responsibilities: Create & manage project plans for small to medium enhancement/initiatives. Define project schedules, allocate resources and monitor progress. Align project objectives with company goals, and make sure project team is clear on objectives. Deliver and install technology solutions. Tracking and managing financials. Help project team with the design and development tasks. Lead process of issue identification and resolution. Risk man...

Posted 2 days ago

AI Match Score
Apply
Project Management Lead-Tech
Ameriprise India

8.0 - 10.0 years

Andhra pradesh

On-site

Project Manager will be responsible for developing and managing technology initiatives and their cost, time and scope. The primary responsibilities will include: project and task management, financial and resource management and delivery management. Responsibilities: Create & manage project plans for small to medium enhancement/initiatives. Define project schedules, allocate resources and monitor progress. Align project objectives with company goals, and make sure project team is clear on objectives. Deliver and install technology solutions. Tracking and managing financials. Help project team with the design and development tasks. Lead process of issue identification and resolution. Risk man...

Posted 4 days ago

AI Match Score
Apply
Senior Business Data Analyst, Capital Market
Citi

13.0 - 17.0 years

Pune, All india

On-site

Role Overview: You will be part of the enterprise data office and product solution team, with a focus on ensuring accurate, timely, and fit-for-purpose data for business, risk management, and regulatory reporting requirements. Your average day will involve collaborating with various teams to understand Markets products processing in Regulatory Reporting data flow and designing systematic solutions for business needs. Key Responsibilities: - Understand Derivatives and SFT data flows within CITI - Conduct data analysis for derivatives products across systems for target state adoption and resolution of data gaps/issues - Lead assessment of end-to-end data flows for all data elements used in Reg...

Posted 4 days ago

AI Match Score
Apply
EY - GDS Consulting - AI and DATA - AWS Databricks - Senior
EY

4.0 years

Calcutta

On-site

At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better, too. Join us and build an exceptional experience for yourself, and a better working world for all. EY GDS – Data and Analytics (D&A) – AWS Data Engineer with Databricks As part of our EY-GDS D&A (Data and Analytics) team, we help our clients solve complex business challenges with the help of data and technology. We dive deep into data to extract the greatest value and discover opportunities in key business and functions like Banking, Insur...

Posted 6 days ago

AI Match Score
Apply
Project Management Lead-Tech
Ameriprise India

8.0 - 10.0 years

Andhra pradesh

On-site

Project Manager will be responsible for developing and managing technology initiatives and their cost, time and scope. The primary responsibilities will include: project and task management, financial and resource management and delivery management. Responsibilities: Create & manage project plans for small to medium enhancement/initiatives. Define project schedules, allocate resources and monitor progress. Align project objectives with company goals, and make sure project team is clear on objectives. Deliver and install technology solutions. Tracking and managing financials. Help project team with the design and development tasks. Lead process of issue identification and resolution. Risk man...

Posted 6 days ago

AI Match Score
Apply
GCP Data Architect - PySpark
Workassist

3.0 - 10.0 years

Chennai, All india

On-site

As a GCP Data Architect with over 10 years of experience, you will play a crucial role in designing, architecting, and optimizing large-scale data platforms on Google Cloud Platform (GCP) for our organization. Your expertise in Databricks, PySpark, Unity Catalog, ETL processes, and Terraform will be essential in delivering scalable, secure, and high-performance data engineering solutions. **Key Responsibilities:** - Architect and design modern, scalable, and secure data engineering platforms on GCP, ensuring high availability and performance. - Lead the implementation and optimization of ETL/ELT pipelines using Databricks, PySpark, and Unity Catalog for data governance and cataloging. - Defi...

Posted 6 days ago

AI Match Score
Apply
Data Analysis Programmer
EXL

3.0 - 7.0 years

Chennai, All india

On-site

As a Software Engineer in this role, you will collaborate with project stakeholders (client) to identify product and technical requirements. You will develop, implement, and tune large-scale distributed systems and pipelines that process a large volume of data. Your responsibility will also include writing clean, maintainable, and testable code for data workflows. In addition, you will troubleshoot data issues and perform root cause analysis. Key Responsibilities: - Collaborate with project stakeholders (client) to identify product and technical requirements. - Develop, implement, and tune large-scale distributed systems and pipelines that process a large volume of data. - Write clean, maint...

Posted 6 days ago

AI Match Score
Apply
Pyspark+Databricks
Cognizant

4.0 - 13.0 years

Chennai, All india

On-site

As an experienced candidate with 4 to 13 years of experience and proficiency in Data Bricks and PySpark, your role will involve developing and optimizing data solutions using Azure Data Lake Store to meet business requirements. You will utilize Python to create efficient data processing scripts and manage Databricks SQL for querying and analyzing large datasets. Your responsibilities will include designing and maintaining Databricks Workflows for automating data processing tasks, leveraging PySpark for large-scale data transformations, and collaborating with cross-functional teams to deliver effective solutions. Key Responsibilities: - Develop and optimize data solutions using Azure Data Lak...

Posted 6 days ago

AI Match Score
Apply

9.0 - 12.0 years

18 - 25 Lacs

Pune, Bengaluru

Work from Office

I. Job Summary: The Technical Lead will be responsible for managing day-to-day operations of the Data Platform in Azure/AWS Databricks environments. This role includes designing and implementing data ingestion pipelines, ensuring data quality and availability, optimizing performance, and maintaining system stability. The candidate will lead architectural decisions, mentor team members, and ensure compliance with security and governance standards. II. Roles & Responsibilities: Design and implement data ingestion pipelines from multiple sources using Azure Databricks. • Ensure smooth and efficient execution of data pipelines. • Develop scalable and reusable frameworks for ingesting datasets. •...

Posted 6 days ago

AI Match Score
Apply
S&C Global Network - AI - T&O - Human Capital Analytics - Consultant
Accenture

2.0 years

5 - 8 Lacs

Calcutta

On-site

Entity: - Accenture Strategy & Consulting Team: - Strategy & Consulting – Global Network Practice: - Talent & organization - Human Capital Analytics Title: - Ind & Func AI Decision Science Consultant – Level 9 Job location: - Gurgaon/Bengaluru / Mumbai About Strategy & Consulting Global Network: - Accenture S&C Global Network - Data & AI practice help our clients grow their business in entirely new ways. Analytics enables our clients to achieve high performance through insights from data - insights that inform better decisions and strengthen customer relationships. From strategy to execution, Accenture works with organizations to develop analytic capabilities - from accessing and reporting o...

Posted 1 week ago

AI Match Score
Apply
Data Scientist
Ericsson

0 years

3 - 9 Lacs

Calcutta

On-site

About this opportunity: Welcome to an exciting opportunity at Ericsson, where you'll be stepping into the role of a Data Scientist. As part of our team, you'll have the opportunity to develop unique machine learning solutions that address complex business issues. You'll employ scientific methods, processes, and systems to reveal valuable insights and pave the way for the future of applied analytics. Direct your prowess in machine learning towards formulating innovative AI/ML solutions that are consistent with Ericsson's architectural and coding standards. What you will do: Implement technical requirements by analyzing and optimizing their needs while managing any constraints effectively. Bui...

Posted 1 week ago

AI Match Score
Apply
S&C Global Network - AI - Supply Chain Analytics - Consultant
Accenture

5.0 - 6.0 years

5 - 8 Lacs

Calcutta

On-site

Job Title - S&C Global Network - AI - Supply Chain Analytics - Consultant Management Level: 9-Team Lead/Consultant Location: Kolkata, KDC1A Must-have skills: Supply Chain Analytics Good to have skills: Ability to leverage design thinking, business process optimization, and stakeholder management skills. Job Summary: As an Ind & Func AI Decision Science Consultant in the Strategy & Consulting – Global Network team, you will leverage data science skills to solve supply chain business challenges. You will work on developing data-driven insights, applying statistical concepts, and optimizing supply chain operations through advanced analytics and AI/ML models. Roles & Responsibilities: Project De...

Posted 1 week ago

AI Match Score
Apply
ETL Developer
Virtusa

0 years

Andhra pradesh

On-site

.Responsibilities include translating requirements and data mapping documents into technical designs developing enhancing and maintaining code following best practices and standards supporting regression and system testing efforts debugging and resolving issues found during testing or production communicating status issues and blockers with the project team and supporting continuous improvement by identifying and addressing opportunities. Basic qualifications include a Bachelor’s degree or military experience in a related field preferably computer science with three to five years of ETL development experience in a data warehouse environment deep understanding of enterprise data warehousing b...

Posted 1 week ago

AI Match Score
Apply
IN_Senior Associate_Data Engineering_Data & Analytics_Advisory_Mumbai
PwC India

5.0 years

Goregaon, Maharashtra, India

On-site

Line of Service Advisory Industry/Sector Not Applicable Specialism Data, Analytics & AI Management Level Senior Associate Job Description & Summary At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop robust data solutions for clients. They play a crucial role in transforming raw data into actionable insights, enabling informed decision-making and driving business growth. Those in artificial intelligence and machine learning at PwC will focus on developing and implementing advanced AI and ML solutions to drive innovation and enhance business processes. Your work will involve designing and optimising algorithms, mo...

Posted 1 week ago

AI Match Score
Apply
Page 1 of 5

Exploring PySpark Jobs in India

PySpark, a powerful data processing framework built on top of Apache Spark and Python, is in high demand in the job market in India. With the increasing need for big data processing and analysis, companies are actively seeking professionals with PySpark skills to join their teams. If you are a job seeker looking to excel in the field of big data and analytics, exploring PySpark jobs in India could be a great career move.

Top Hiring Locations in India

Here are 5 major cities in India where companies are actively hiring for PySpark roles: 1. Bangalore 2. Pune 3. Hyderabad 4. Mumbai 5. Delhi

Average Salary Range

The estimated salary range for PySpark professionals in India varies based on experience levels. Entry-level positions can expect to earn around INR 6-8 lakhs per annum, while experienced professionals can earn upwards of INR 15 lakhs per annum.

Career Path

In the field of PySpark, a typical career progression may look like this: 1. Junior Developer 2. Data Engineer 3. Senior Developer 4. Tech Lead 5. Data Architect

Related Skills

In addition to PySpark, professionals in this field are often expected to have or develop skills in: - Python programming - Apache Spark - Big data technologies (Hadoop, Hive, etc.) - SQL - Data visualization tools (Tableau, Power BI)

Interview Questions

Here are 25 interview questions you may encounter when applying for PySpark roles:

  • Explain what PySpark is and its main features (basic)
  • What are the advantages of using PySpark over other big data processing frameworks? (medium)
  • How do you handle missing or null values in PySpark? (medium)
  • What is RDD in PySpark? (basic)
  • What is a DataFrame in PySpark and how is it different from an RDD? (medium)
  • How can you optimize performance in PySpark jobs? (advanced)
  • Explain the difference between map and flatMap transformations in PySpark (basic)
  • What is the role of a SparkContext in PySpark? (basic)
  • How do you handle schema inference in PySpark? (medium)
  • What is a SparkSession in PySpark? (basic)
  • How do you join DataFrames in PySpark? (medium)
  • Explain the concept of partitioning in PySpark (medium)
  • What is a UDF in PySpark? (medium)
  • How do you cache DataFrames in PySpark for optimization? (medium)
  • Explain the concept of lazy evaluation in PySpark (medium)
  • How do you handle skewed data in PySpark? (advanced)
  • What is checkpointing in PySpark and how does it help in fault tolerance? (advanced)
  • How do you tune the performance of a PySpark application? (advanced)
  • Explain the use of Accumulators in PySpark (advanced)
  • How do you handle broadcast variables in PySpark? (advanced)
  • What are the different data sources supported by PySpark? (medium)
  • How can you run PySpark on a cluster? (medium)
  • What is the purpose of the PySpark MLlib library? (medium)
  • How do you handle serialization and deserialization in PySpark? (advanced)
  • What are the best practices for deploying PySpark applications in production? (advanced)

Closing Remark

As you explore PySpark jobs in India, remember to prepare thoroughly for interviews and showcase your expertise confidently. With the right skills and knowledge, you can excel in this field and advance your career in the world of big data and analytics. Good luck!

cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Featured Companies