Lead Data Engineer

Experience: 8.0 years

Salary: 0.0 Lacs P.A.

Location: Pune, Maharashtra, India

Posted: 6 days ago | Platform: LinkedIn


Skills Required

Data, Python, Apache Spark, Airflow, Design, Processing, Analytics, Learning, PySpark, Code, ETL, Orchestration, Sensors, Hadoop, AWS, Databricks, Integrity, Security, Governance, Pipeline, Resolve, Support, Architecture, Scalability, Engineering, Development, NumPy, Writing, Workflow, Scheduling, GCP, Azure, Redshift, SQL, NoSQL, Git, Docker, Linux, Kafka, Compliance, Communication, Agile

Work Mode

On-site

Job Type

Full Time

Job Description

Job Summary:

We are looking for a Senior Data Engineer with deep expertise in Python, Apache Spark, and Apache Airflow to design, build, and optimize scalable data pipelines and processing frameworks. You will play a key role in managing large-scale data workflows, ensuring data quality, performance, and timely delivery for analytics and machine learning platforms.

Key Responsibilities:

- Design, develop, and maintain data pipelines using Apache Spark (PySpark) and Airflow for batch and near real-time processing.
- Write efficient, modular, and reusable Python code for ETL jobs, data validation, and transformation tasks.
- Implement robust data orchestration workflows using Apache Airflow (DAGs, sensors, hooks, etc.).
- Work with big data technologies on distributed platforms (e.g., Hadoop, AWS EMR, Databricks).
- Ensure data integrity, security, and governance across all stages of the pipeline.
- Monitor and optimize pipeline performance; resolve bottlenecks and failures proactively.
- Collaborate with data scientists, analysts, and other engineers to support data needs.
- Document architecture, processes, and code to support maintainability and scalability.
- Participate in code reviews, architecture discussions, and production deployments.
- Mentor junior engineers and provide guidance on best practices.

Required Skills:

- 8+ years of experience in data engineering or backend development roles.
- Strong proficiency in Python, including data manipulation (Pandas, NumPy) and writing scalable code.
- Hands-on experience with Apache Spark (preferably PySpark) for large-scale data processing.
- Extensive experience with Apache Airflow for workflow orchestration and scheduling.
- Deep understanding of ETL/ELT patterns, data quality, lineage, and data modeling.
- Familiarity with cloud platforms (AWS, GCP, or Azure) and related services (S3, BigQuery, Redshift, etc.).
- Solid experience with SQL, NoSQL, and file formats such as Parquet, ORC, and Avro.
- Proficiency with CI/CD pipelines, Git, Docker, and Linux-based development environments.

Preferred Qualifications:

- Experience with data lakehouse architectures (e.g., Delta Lake, Iceberg).
- Exposure to real-time streaming technologies (e.g., Kafka, Flink, Spark Streaming).
- Background in machine learning pipelines and MLOps tools (optional).
- Knowledge of data governance frameworks and compliance standards.

Soft Skills:

- Strong problem-solving and communication skills.
- Ability to work independently and lead complex projects.
- Experience working in agile and cross-functional teams.
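The responsibilities above emphasize modular Python code for ETL with built-in data validation. As a rough illustration of what that looks like in practice, here is a minimal, stdlib-only sketch of a validate-then-transform step; the schema (fields `user_id`, `amount`, `ts`) and function names are hypothetical, and a production version would typically run inside a PySpark job or an Airflow task.

```python
from datetime import datetime

def validate_record(record):
    """Return True if a raw record has the fields the transform needs
    (hypothetical schema: user_id, amount, ISO-8601 timestamp ts)."""
    required = {"user_id", "amount", "ts"}
    if not required.issubset(record):
        return False
    try:
        float(record["amount"])
        datetime.fromisoformat(record["ts"])
    except (TypeError, ValueError):
        return False
    return True

def transform(records):
    """Keep valid records, normalize types, and derive a date field."""
    out = []
    for r in records:
        if not validate_record(r):
            continue  # a real pipeline might route these to a dead-letter sink
        out.append({
            "user_id": str(r["user_id"]),
            "amount": round(float(r["amount"]), 2),
            "date": r["ts"][:10],  # ISO timestamp -> calendar date
        })
    return out

raw = [
    {"user_id": 1, "amount": "19.991", "ts": "2024-05-01T10:00:00"},
    {"user_id": 2, "amount": "oops", "ts": "2024-05-01T11:00:00"},  # dropped
]
clean = transform(raw)
```

Keeping validation separate from transformation, as sketched here, is one way to meet the posting's call for reusable code: the same validator can back an ETL job, a unit test, or an Airflow data-quality sensor.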

Shivsys Inc.
