Location : Mumbai Experience : 0-6months Technologies / Skills: Advanced SQL, Python and associated libraries like Pandas, Numpy etc., Pyspark , Shell scripting, Data Modelling, Big data, Hadoop, Hive, ETL pipelines. Responsibilities : Proven success in communicating with users, other technical teams, and senior management to collect requirements, describe data modeling decisions and develop data engineering strategy. Ability to work with business owners to define key business requirements and convert to user stories with required technical specifications. Communicate results and business impacts of insight initiatives to key stakeholders to collaboratively solve business problems. Working closely with the overall Enterprise Data & Analytics Architect and Engineering practice leads to ensure adherence with the best practices and design principles. Assures quality, security and compliance requirements are met for supported area. Design and create fault-tolerance data pipelines running on cluster Excellent communication skills with the ability to influence client business and IT teams Should have design data engineering solutions end to end. Ability to come up with scalable and modular solutions Required Qualification: 0-6months of hands-on experience Designing and developing Data Pipelines for Data Ingestion or Transformation using Python (PySpark)/Spark SQL in AWS cloud Experience in design and development of data pipelines and processing of data at scale. Advanced experience in writing and optimizing efficient SQL queries with Python and Hive handling Large Data Sets in Big-Data Environments Experience in debugging, tunning and optimizing PySpark data pipelines Should have implemented concepts and have good knowledge of Pyspark data frames, joins, caching, memory management, partitioning, parallelism etc. Understanding of Spark UI, Event Timelines, DAG, Spark config parameters, in order to tune the long running data pipelines. Experience working in Agile implementations Experience with building data pipelinesin streaming and batch mode. Experience with Git and CI/CD pipelines to deploy cloud applications Good knowledge of designing Hive tables with partitioning for performance. Desired Qualification: Experience in data modelling Hands on creating workflows on any Scheduling Tool like Autosys, CA Workload Automation Proficiency in using SDKsfor interacting with native AWS services Strong understanding of concepts of ETL, ELT and data modeling.

More Jobs at Go Digital Technology Consulting

Senior Data Engineer

Mumbai

3 - 8 yrs

INR 5.0 - 10.0 Lacs P.A.

Sr Full Stack Developer

Mumbai

5 - 9 yrs

INR 7.0 - 11.0 Lacs P.A.

Sr Software Engineer

Mumbai

5 - 7 yrs

INR 7.0 - 9.0 Lacs P.A.

Funtional Data Analyst

Mumbai

4 - 8 yrs

INR 5.0 - 8.0 Lacs P.A.

Senior Software Engineer

Mumbai

3 - 8 yrs

INR 6.0 - 16.0 Lacs P.A.

Go Digital Technology Consulting

www.godigitaltechconsulting.com

Technology Consulting

Tech City

50 Employees

28 Jobs

Key People

John Doe

CEO
Jane Smith

CTO

Login to

Please Verify Your Phone or Email

Confirm Action

Search

Profile

Bookmarks

Associate Data Engineer

Experience & Salary

Skills Required

Work Mode

Job Type

Job Description

More Jobs at Go Digital Technology Consulting

RecommendedJobs for You

Associate Data Engineer

Associate Data Engineer

Associate Data Engineer

Associate Data Engineer

Associate Data Engineer

Associate Data Engineer

Associate Data Engineer

Associate Data Engineer

Associate Data Engineer

Associate Data Engineer

Login to

Please Verify Your Phone or Email

Confirm Action

Contact Us

Contact Us

Search

Profile

Bookmarks

Personal Settings

Associate Data Engineer

Experience & Salary

Skills Required

Work Mode

Job Type

Job Description

More Jobs at Go Digital Technology Consulting