Job Description
Job Summary
Synechron is seeking a skilled PySpark Data Engineer to design, develop, and optimize data processing solutions leveraging modern big data technologies. In this role, you will lead efforts to build scalable data pipelines, support data integration initiatives, and work closely with cross-functional teams to enable data-driven decision-making. Your expertise will contribute to enhancing business insights and operational efficiency, positioning Synechron as a pioneer in adopting emerging data technologies.

Software Requirements

Required Software Skills:
- PySpark (Apache Spark with Python), with experience developing data pipelines
- Apache Spark ecosystem knowledge
- Python programming (version 3.7 or higher)
- SQL and relational database management systems (e.g., PostgreSQL, MySQL)
- Cloud platforms (preferably AWS or Azure)
- Version control: Git
- Data workflow orchestration tools such as Apache Airflow
- Data management tools: SQL Developer or equivalent

Preferred Software Skills:
- Experience with Hadoop ecosystem components
- Knowledge of containerization (Docker, Kubernetes)
- Familiarity with data lake and data warehouse solutions (e.g., AWS S3, Redshift, Snowflake)
- Monitoring and logging tools (e.g., Prometheus, Grafana)

Overall Responsibilities
- Lead the design and implementation of large-scale data processing solutions using PySpark and related technologies
- Collaborate with data scientists, analysts, and business teams to understand data requirements and deliver scalable pipelines
- Mentor junior team members on best practices in data engineering and emerging technologies
- Evaluate new tools and methodologies to optimize data workflows and improve data quality
- Ensure data solutions are robust, scalable, and aligned with organizational data governance policies
- Stay informed on industry trends and technological advancements in big data and analytics
- Support production environment stability and performance tuning of data pipelines
- Drive innovative approaches to extracting value from large and complex datasets

Technical Skills (By Category)

Programming Languages:
- Required: Python, with a minimum of 2 years of PySpark experience
- Preferred: Scala (for Spark), SQL, Bash scripting

Databases/Data Management:
- Relational databases (PostgreSQL, MySQL)
- Distributed storage solutions (HDFS; cloud object storage such as S3 or Azure Blob Storage)
- Data warehousing platforms (Snowflake, Redshift preferred)

Cloud Technologies:
- Required: Experience deploying and managing data solutions on AWS or Azure
- Preferred: Knowledge of cloud-native services such as Amazon EMR, Azure Data Factory, or Azure Data Lake

Frameworks and Libraries:
- Apache Spark (PySpark)
- Airflow or similar orchestration tools
- Data processing frameworks (Kafka, Spark Streaming preferred)

Development Tools and Methodologies:
- Version control with Git
- Agile management tools: Jira, Confluence
- Continuous integration/deployment pipelines (Jenkins, GitLab CI)

Security Protocols:
- Understanding of data security, access controls, and GDPR compliance in cloud environments

Experience Requirements
- Minimum of 5 years in data engineering, with hands-on PySpark experience
- Proven track record of developing, deploying, and maintaining scalable data pipelines
- Experience working with data lakes, data warehouses, and cloud data services
- Demonstrated leadership in projects involving big data technologies
- Experience mentoring junior team members and collaborating across teams
- Prior experience in the financial, healthcare, or retail sectors is beneficial but not mandatory

Day-to-Day Activities
- Develop, optimize, and deploy big data pipelines using PySpark and related tools (an illustrative sketch follows this list)
- Collaborate with data analysts, data scientists, and business teams to define data requirements
- Conduct code reviews, troubleshoot pipeline issues, and optimize performance
- Mentor junior team members on best practices and emerging technologies
- Design solutions for data ingestion, transformation, and storage
- Evaluate new tools and frameworks for continuous improvement
- Maintain documentation, monitor system health, and ensure security compliance
- Participate in sprint planning, daily stand-ups, and project retrospectives to align priorities
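For illustration only, the sketch below shows the kind of PySpark pipeline work these day-to-day activities describe: reading raw data, applying transformations, and writing a curated result. The bucket paths, column names, and aggregation are hypothetical examples, not an actual Synechron pipeline.

# Illustrative sketch; all paths and column names are hypothetical.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("example_orders_pipeline").getOrCreate()

# Ingest: read raw data from a (hypothetical) data lake location
orders = spark.read.parquet("s3://example-bucket/raw/orders/")

# Transform: keep completed orders, derive a date column, aggregate daily revenue
daily_revenue = (
    orders
    .filter(F.col("status") == "COMPLETED")
    .withColumn("order_date", F.to_date("order_timestamp"))
    .groupBy("order_date")
    .agg(F.sum("amount").alias("total_revenue"))
)

# Load: write the curated output back to the lake, partitioned by date
(daily_revenue.write
    .mode("overwrite")
    .partitionBy("order_date")
    .parquet("s3://example-bucket/curated/daily_revenue/"))

spark.stop()

In practice, a job like this would typically be scheduled and monitored through an orchestration tool such as Apache Airflow, as listed in the software requirements above.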
Qualifications
- Bachelor's or Master's degree in Computer Science, Information Technology, or a related discipline
- Relevant industry certifications (e.g., AWS Data Analytics, GCP Professional Data Engineer) preferred
- Proven experience working with PySpark and big data ecosystems
- Strong understanding of the software development lifecycle and data governance standards
- Commitment to continuous learning and professional development in data engineering technologies

Professional Competencies
- Analytical mindset and problem-solving acumen for complex data challenges
- Effective leadership and team management skills
- Excellent communication skills tailored to technical and non-technical audiences
- Adaptability in fast-evolving technological landscapes
- Strong organizational skills to prioritize tasks and manage multiple projects
- Innovation-driven, with a passion for leveraging emerging data technologies