28415 Pyspark Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

3.0 - 8.0 years

7 - 17 Lacs

mumbai, pune

Hybrid

Role: Senior Data Engineer Location: Mumbai & Pune Experience: 3yrs to 8yrs Technologies / Skills: Advanced SQL, Python and associated libraries like Pandas, NumPy etc., Pyspark , Shell scripting, Data-Modelling, Big data, Hadoop, Hive, ETL pipelines. Responsibilities: • Proven success in communicating with users, other technical teams, and senior management to collect requirements, describe data modeling decisions and develop data engineering strategy. • Ability to work with business owners to define key business requirements and convert to user stories with required technical specifications. • Communicate results and business impacts of insight initiatives to key stakeholders to collaborat...

Posted -1 days ago

AI Match Score
Apply

8.0 - 12.0 years

7 - 14 Lacs

pune, mumbai (all areas)

Hybrid

Job Title: Lead Data Engineer Location: Mumbai / Pune Experience: 8+ yrs Job Summary: We are seeking a technically strong and delivery-focused Lead Engineer to support and enhance enterprise-grade data and application products under the Durables model. The ideal candidate will act as the primary technical interface for the client, ensuring high system availability, performance, and continuous improvement. This role requires a hands-on technologist with strong team management experience, cloud (AWS) expertise, and excellent communication skills to handle client interactions and drive technical decisions. Key Responsibilities: Support & Enhancement Leadership Act as the primary technical lead ...

Posted -1 days ago

AI Match Score
Apply

5.0 - 10.0 years

15 - 30 Lacs

hyderabad, pune, bengaluru

Hybrid

Job description Hiring for Spark Developer Mandatory Skills: Spark, Python, SQL Responsibilities A day in the life of an Infoscion As part of the Infosys consulting team, your primary role would be to actively aid the consulting team in different phases of the project including problem definition, effort estimation, diagnosis, solution generation and design and deployment You will explore the alternatives to the recommended solutions based on research that includes literature surveys, information available in public domains, vendor evaluation information, etc. and build POCs You will create requirement specifications from the business needs, define the to-be-processes and detailed functional...

Posted -1 days ago

AI Match Score
Apply

5.0 - 10.0 years

15 - 30 Lacs

hyderabad, chennai, bengaluru

Hybrid

Job description Hiring for Pyspark Develper-Female Diversity Mandatory Skills: Pyspark, Python, Big data, ETL Responsibilities A day in the life of an Infoscion As part of the Infosys consulting team, your primary role would be to actively aid the consulting team in different phases of the project including problem definition, effort estimation, diagnosis, solution generation and design and deployment You will explore the alternatives to the recommended solutions based on research that includes literature surveys, information available in public domains, vendor evaluation information, etc. and build POCs You will create requirement specifications from the business needs, define the to-be-pro...

Posted -1 days ago

AI Match Score
Apply

5.0 - 10.0 years

15 - 30 Lacs

bengaluru

Hybrid

Job description Hiring for Pyspark Develper-Female Candidates Mandatory Skills: Pyspark, Python, Big data, ETL Responsibilities A day in the life of an Infoscion As part of the Infosys consulting team, your primary role would be to actively aid the consulting team in different phases of the project including problem definition, effort estimation, diagnosis, solution generation and design and deployment You will explore the alternatives to the recommended solutions based on research that includes literature surveys, information available in public domains, vendor evaluation information, etc. and build POCs You will create requirement specifications from the business needs, define the to-be-pr...

Posted -1 days ago

AI Match Score
Apply

5.0 - 10.0 years

15 - 30 Lacs

hyderabad, pune, bengaluru

Hybrid

Job description Hiring for Databrick Engineer-Female candidate Mandatory Skills: Databrick, Data lake, Azure Responsibilities A day in the life of an Infoscion As part of the Infosys consulting team, your primary role would be to actively aid the consulting team in different phases of the project including problem definition, effort estimation, diagnosis, solution generation and design and deployment You will explore the alternatives to the recommended solutions based on research that includes literature surveys, information available in public domains, vendor evaluation information, etc. and build POCs You will create requirement specifications from the business needs, define the to-be-proc...

Posted -1 days ago

AI Match Score
Apply

6.0 - 8.0 years

13 - 23 Lacs

indore, pune, delhi / ncr

Work from Office

TCS is inviting applications!!! Role: AWS Data Engineer EXP: 6 - 8 YEARS LOCATION: Chennai/ Pune / Bangalore/ Hyderabad / Kolkata/ Delhi/ Kochi/ Indore Job Description Professional experience in data integration and management 3+ years of hands-on development experience in AWS Glue, Lambda functions, AWS Athena, Redshift, Pyspark is a Must Hands on working experience in AWS GLUE ETL tool and having good knowledge on any of the ETL tools (Informatica cloud, Talend etc.) Must have working experience with SQL, and databases (e.g.: Postgres, Redshift) implementing Data Models. Expertise with Python Language and Apache Spark. Have working experience in projects involving these. Strong hands-on ex...

Posted Just now

AI Match Score
Apply

6.0 - 8.0 years

14 - 24 Lacs

hyderabad, chennai, bengaluru

Work from Office

TCS is inviting applications!!! Role: AWS Data Engineer EXP: 6 - 8 YEARS LOCATION: Chennai/ Pune / Bangalore/ Hyderabad / Kolkata/ Delhi/ Kochi/ Indore Job Description Professional experience in data integration and management 3+ years of hands-on development experience in AWS Glue, Lambda functions, AWS Athena, Redshift, Pyspark is a Must Hands on working experience in AWS GLUE ETL tool and having good knowledge on any of the ETL tools (Informatica cloud, Talend etc.) Must have working experience with SQL, and databases (e.g.: Postgres, Redshift) implementing Data Models. Expertise with Python Language and Apache Spark. Have working experience in projects involving these. Strong hands-on ex...

Posted Just now

AI Match Score
Apply

4.0 - 9.0 years

9 - 16 Lacs

gurugram

Work from Office

Dear Candidate, We have an urgent requirement for a Data Science / AI-ML Engineer (Agentic AI Focus) for our client based in Gurgaon . If you are interested, please share your updated profile at: peddollu.d@honeybeetechsolutions.com Job Details Job Location: Gurgaon Experience: 4+ Years Role: Data Science / AI-ML Engineer (Agentic AI Focus) Skills: Data Science, AI/ML, Agentic AI Interview Process: Round 1: Technical Virtual Round 2: Technical Virtual / Face-to-Face Round 3: HR Role Overview We are looking for an innovative and proactive AI/ML Engineer with deep specialization in Agentic AI and Large Language Models (LLMs) to drive next-generation intelligent automation and customer servicin...

Posted 1 hour ago

AI Match Score
Apply

5.0 - 10.0 years

15 - 30 Lacs

hyderabad, pune, bengaluru

Hybrid

Responsibilities A day in the life of an InfoscionAs part of the Infosys delivery team, your primary role would be to provide best fit architectural solutions for one or more projects. You would also provide technology consultation and assist in defining scope and sizing of work You would implement solutions, create technology differentiation and leverage partner technologies. Additionally, you would participate in competency development with the objective of ensuring the best-fit and high quality technical solutions. You would be a key contributor in creating thought leadership within the area of technology specialization and in compliance with guidelines, policies and norms of Infosys.If y...

Posted 1 hour ago

AI Match Score
Apply

5.0 - 10.0 years

15 - 30 Lacs

hyderabad, pune, bengaluru

Hybrid

Job description Hiring for Spark Developer Mandatory Skills: Spark, Python, SQL Responsibilities A day in the life of an Infoscion As part of the Infosys consulting team, your primary role would be to actively aid the consulting team in different phases of the project including problem definition, effort estimation, diagnosis, solution generation and design and deployment You will explore the alternatives to the recommended solutions based on research that includes literature surveys, information available in public domains, vendor evaluation information, etc. and build POCs You will create requirement specifications from the business needs, define the to-be-processes and detailed functional...

Posted 1 hour ago

AI Match Score
Apply

5.0 - 10.0 years

15 - 30 Lacs

hyderabad, chennai, bengaluru

Hybrid

Job description Hiring for Pyspark Develper Mandatory Skills: Pyspark, Python, Big data, ETL Responsibilities A day in the life of an Infoscion As part of the Infosys consulting team, your primary role would be to actively aid the consulting team in different phases of the project including problem definition, effort estimation, diagnosis, solution generation and design and deployment You will explore the alternatives to the recommended solutions based on research that includes literature surveys, information available in public domains, vendor evaluation information, etc. and build POCs You will create requirement specifications from the business needs, define the to-be-processes and detail...

Posted 1 hour ago

AI Match Score
Apply

5.0 - 10.0 years

4 - 4 Lacs

hyderabad, pune, bengaluru

Hybrid

Job description Hiring for Hadoop Developer Mandatory Skills: Hadoop, bigdata, spark Responsibilities A day in the life of an Infoscion As part of the Infosys consulting team, your primary role would be to actively aid the consulting team in different phases of the project including problem definition, effort estimation, diagnosis, solution generation and design and deployment You will explore the alternatives to the recommended solutions based on research that includes literature surveys, information available in public domains, vendor evaluation information, etc. and build POCs You will create requirement specifications from the business needs, define the to-be-processes and detailed funct...

Posted 1 hour ago

AI Match Score
Apply

8.0 - 13.0 years

15 - 30 Lacs

hyderabad, pune, bengaluru

Hybrid

Job description Hiring for Azure Databrick Engineer Mandatory Skills: Databrick, Data lake, Azure Responsibilities A day in the life of an Infoscion As part of the Infosys consulting team, your primary role would be to actively aid the consulting team in different phases of the project including problem definition, effort estimation, diagnosis, solution generation and design and deployment You will explore the alternatives to the recommended solutions based on research that includes literature surveys, information available in public domains, vendor evaluation information, etc. and build POCs You will create requirement specifications from the business needs, define the to-be-processes and d...

Posted 2 hours ago

AI Match Score
Apply

5.0 - 10.0 years

15 - 30 Lacs

hyderabad, pune, bengaluru

Hybrid

Job description Hiring for Azure Databrick Engineer Mandatory Skills: Databrick, Data lake, Azure Responsibilities A day in the life of an Infoscion As part of the Infosys consulting team, your primary role would be to actively aid the consulting team in different phases of the project including problem definition, effort estimation, diagnosis, solution generation and design and deployment You will explore the alternatives to the recommended solutions based on research that includes literature surveys, information available in public domains, vendor evaluation information, etc. and build POCs You will create requirement specifications from the business needs, define the to-be-processes and d...

Posted 2 hours ago

AI Match Score
Apply

4.0 - 6.0 years

0 Lacs

hyderabad, telangana, india

On-site

Join Amgen’s Mission of Serving Patients At Amgen, if you feel like you’re part of something bigger, it’s because you are. Our shared mission—to serve patients living with serious illnesses—drives all that we do. Since 1980, we’ve helped pioneer the world of biotech in our fight against the world’s toughest diseases. With our focus on four therapeutic areas –Oncology, Inflammation, General Medicine, and Rare Disease– we reach millions of patients each year. As a member of the Amgen team, you’ll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller happier lives. Our award-winning culture is collabo...

Posted 2 hours ago

AI Match Score
Apply

0 years

6 - 8 Lacs

bengaluru

On-site

5+yrs of hands on experience in Hadoop Scala Pyspark Hive Spark hive Impala SQL with Hadoop eco system Experience in design and development of data engineering platforms .Guide developers and others on project limitations and capabilities, performance requirements and interfaces .Good understanding of Data integration, Data Quality and data architecture Location Bengaluru Job Function TECHNOLOGY Role Engineer Job Id 387777 Desired Skills Big Data | Spark Desired Candidate Profile Qualifications : BACHELOR OF ENGINEERING

Posted 2 hours ago

AI Match Score
Apply

6.0 - 9.0 years

7 - 9 Lacs

bengaluru

On-site

ROLES & RESPONSIBILITIES Key Responsibilities 1. Data Quality Development & Monitoring Design and implement automated data quality rules and validation checks using Databricks (Delta Lake) and PySpark . Build and operationalize data quality workflows in Ataccama ONE / Ataccama Studio . Perform data profiling, anomaly detection, and reconciliation across systems and data sources. Establish thresholds, KPIs, and alerts for data quality metrics. 2. Root Cause Analysis & Issue Management Investigate data anomalies and quality incidents using SQL, Python, and Ataccama diagnostics. Collaborate with data engineers and business analysts to identify and remediate root causes. Document recurring data ...

Posted 2 hours ago

AI Match Score
Apply

0 years

4 - 5 Lacs

bengaluru

On-site

Key Responsibilities: A day in the life of an Infoscion As part of the Infosys delivery team your primary role would be to interface with the client for quality assurance issue resolution and ensuring high customer satisfaction You will understand requirements create and review designs validate the architecture and ensure high levels of service offerings to clients in the technology domain You will participate in project estimation provide inputs for solution delivery conduct technical risk planning perform code reviews and unit test plan reviews You will lead and guide your teams towards developing optimized high quality code deliverables continual knowledge management and adherence to the ...

Posted 2 hours ago

AI Match Score
Apply

2.0 - 5.0 years

10 Lacs

chennai

On-site

Discover your future at Citi Working at Citi is far more than just a job. A career with us means joining a team of more than 230,000 dedicated people from around the globe. At Citi, you’ll have the opportunity to grow your career, give back to your community and make a real impact. Job Overview At Citi we’re not just building technology, we’re building the future of banking. Encompassing a broad range of specialties, roles, and cultures, our teams are creating innovations used across the globe. Citi is constantly growing and progressing through our technology, with laser focused on evolving the ways of doing things. As one of the world’s most global banks we’re changing how the world does bu...

Posted 2 hours ago

AI Match Score
Apply

2.0 - 5.0 years

10 - 20 Lacs

pune, gurugram, bengaluru

Hybrid

Salary: 15 to 25 LPA Exp: 4 to 7 years Location: Gurgaon/Pune/Bengalore Notice: Immediate to 30 days..!! Job Profile: Experienced Data Engineer with a strong foundation in designing, building, and maintaining scalable data pipelines and architectures. Skilled in transforming raw data into clean, structured formats for analytics and business intelligence. Proficient in modern data tools and technologies such as SQL, T-SQL, Python, Databricks, and cloud platforms (Azure). Adept at data wrangling, modeling, ETL/ELT development, and ensuring data quality, integrity, and security. Collaborative team player with a track record of enabling data-driven decision-making across business units. As a Dat...

Posted 2 hours ago

AI Match Score
Apply

4.0 - 7.0 years

15 - 27 Lacs

pune, gurugram, bengaluru

Hybrid

Salary: 15 to 25 LPA Exp: 4 to 7 years Location: Gurgaon/Pune/Bengalore Notice: Immediate to 30 days..!! Job Profile: Experienced Data Engineer with a strong foundation in designing, building, and maintaining scalable data pipelines and architectures. Skilled in transforming raw data into clean, structured formats for analytics and business intelligence. Proficient in modern data tools and technologies such as SQL, T-SQL, Python, Databricks, and cloud platforms (Azure). Adept at data wrangling, modeling, ETL/ELT development, and ensuring data quality, integrity, and security. Collaborative team player with a track record of enabling data-driven decision-making across business units. As a Dat...

Posted 2 hours ago

AI Match Score
Apply

5.0 - 8.0 years

12 - 22 Lacs

kolkata, pune, bengaluru

Work from Office

TCS is inviting applications!!! Role: Azure Data Engineer EXP: 5 - 8 YEARS LOCATION: Pune / Bangalore/ Bhubaneshwar/ Kolkata **Immediate - 60 days Notice period** Job Description Strong knowledge of Extraction Transformation and Loading (ETL) processes using frameworks like Azure Data Factory or Synapse or Databricks ; establishing cloud connectivity between different systems like ADLS, ADF, Synapse, Databricks etc. Candidates must possess hands on Power BI skills. Candidates must have good understanding of Informatica. Design and develop ETL processes based on functional and non-functional requirements in python / pyspark within Azure platform. A minimum of 5 years' experience with large SQ...

Posted 3 hours ago

AI Match Score
Apply

8.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Job Requisition ID # 25WD91419 Position Overview We are seeking a highly experienced Principal Engineer to lead the design, development, and evolution of our Batch Processing platform , which powers Autodesk’s Analytics Data Platform (ADP). This role requires deep technical expertise in distributed data systems, large-scale pipeline orchestration, and hands-on leadership in shaping next-generation data platform capabilities. You will partner closely with Engineering Managers, Architects, Product teams, and Partner Engineering to modernize our data lakehouse architecture, deliver highly reliable data pipelines, and establish technical excellence across ingestion, processing, and governance. R...

Posted 3 hours ago

AI Match Score
Apply

9.0 - 12.0 years

0 Lacs

hyderabad, telangana, india

On-site

We are seeking a highly skilled and experienced Lead Palantir Data Software Engineer to join our dynamic team and lead transformative data projects within the Insurance domain. This role is ideal for professionals who excel at addressing complex big data challenges, leveraging cutting-edge technologies like Palantir Foundry, and building scalable analytics solutions to empower critical business decision-making. Responsibilities Design and manage complex data pipelines and analytics solutions to address business challenges in the Insurance domain Develop scalable workflows using Python and PySpark to handle large datasets efficiently Optimize querying and data manipulation through expertise i...

Posted 3 hours ago

AI Match Score
Apply

Exploring PySpark Jobs in India

PySpark, a powerful data processing framework built on top of Apache Spark and Python, is in high demand in the job market in India. With the increasing need for big data processing and analysis, companies are actively seeking professionals with PySpark skills to join their teams. If you are a job seeker looking to excel in the field of big data and analytics, exploring PySpark jobs in India could be a great career move.

Top Hiring Locations in India

Here are 5 major cities in India where companies are actively hiring for PySpark roles: 1. Bangalore 2. Pune 3. Hyderabad 4. Mumbai 5. Delhi

Average Salary Range

The estimated salary range for PySpark professionals in India varies based on experience levels. Entry-level positions can expect to earn around INR 6-8 lakhs per annum, while experienced professionals can earn upwards of INR 15 lakhs per annum.

Career Path

In the field of PySpark, a typical career progression may look like this: 1. Junior Developer 2. Data Engineer 3. Senior Developer 4. Tech Lead 5. Data Architect

Related Skills

In addition to PySpark, professionals in this field are often expected to have or develop skills in: - Python programming - Apache Spark - Big data technologies (Hadoop, Hive, etc.) - SQL - Data visualization tools (Tableau, Power BI)

Interview Questions

Here are 25 interview questions you may encounter when applying for PySpark roles:

  • Explain what PySpark is and its main features (basic)
  • What are the advantages of using PySpark over other big data processing frameworks? (medium)
  • How do you handle missing or null values in PySpark? (medium)
  • What is RDD in PySpark? (basic)
  • What is a DataFrame in PySpark and how is it different from an RDD? (medium)
  • How can you optimize performance in PySpark jobs? (advanced)
  • Explain the difference between map and flatMap transformations in PySpark (basic)
  • What is the role of a SparkContext in PySpark? (basic)
  • How do you handle schema inference in PySpark? (medium)
  • What is a SparkSession in PySpark? (basic)
  • How do you join DataFrames in PySpark? (medium)
  • Explain the concept of partitioning in PySpark (medium)
  • What is a UDF in PySpark? (medium)
  • How do you cache DataFrames in PySpark for optimization? (medium)
  • Explain the concept of lazy evaluation in PySpark (medium)
  • How do you handle skewed data in PySpark? (advanced)
  • What is checkpointing in PySpark and how does it help in fault tolerance? (advanced)
  • How do you tune the performance of a PySpark application? (advanced)
  • Explain the use of Accumulators in PySpark (advanced)
  • How do you handle broadcast variables in PySpark? (advanced)
  • What are the different data sources supported by PySpark? (medium)
  • How can you run PySpark on a cluster? (medium)
  • What is the purpose of the PySpark MLlib library? (medium)
  • How do you handle serialization and deserialization in PySpark? (advanced)
  • What are the best practices for deploying PySpark applications in production? (advanced)

Closing Remark

As you explore PySpark jobs in India, remember to prepare thoroughly for interviews and showcase your expertise confidently. With the right skills and knowledge, you can excel in this field and advance your career in the world of big data and analytics. Good luck!

cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies