The Data Engineer exercises judgment when following general instructions and works with minimal instruction to support the integration and automation of data solutions. This role focuses on data massaging, reconciliation, and analysis, resolving routine to semi-routine issues. Responsibilities include creating optimized SQL queries, managing data pipelines, and collaborating with cross-functional teams to ensure data accuracy and availability.
What will you do:
-
Write optimized and scalable complex SQL queries
-
Automate data processing tasks using Python, focusing on cleaning and merging datasets.
-
Manage data pipelines, including scheduling, monitoring, and debugging workflows.
-
Collaborate with data engineers and IT teams to maintain data accessibility for stakeholders.
-
Assist in developing automated tests to ensure the accuracy and integrity of data.
-
Participate in version control and CI/CD processes for deploying and testing pipeline changes across environments.
-
Work cross-functionally with analysts, engineers, and operations.
-
Data stewardship including: data governance, data compliance, data transformation, data cleanliness, data validation, data audit/maintenance.
-
Writing complex, highly-optimized SQL queries across large datasets, involved in SQL Query tuning and provided tuning recommendations
-
Experienced in Data Analytics, hands-on experience of various Python libraries such as NumPy and Pandas
-
Python development experience to massage, clean data and automate data extract and loads
-
Expertise to convert raw data to processed data by merging, finding outliers, errors, trends, missing values and distributions in the data
-
Expertise in Creating, Debugging, Scheduling and Monitoring jobs using Airflow, resolve performance tuning related issues and queries
-
Foster collaboration among Data engineers, IT & other business groups to ensure data is accessible to FP&A team
-
Scheduled a regular hot backup process and involved in the backup activities
-
Strong analytical and problem-solving skills with ability to represent complex algorithms in software
-
Develop automated unit tests, end-to-end tests, and integration tests to assist in quality assurance (QA) procedures
What will you bring:
-
Bachelor's or Master's degree in Computer Science, IT, Engineering or equivalent
-
5+ years of experience as a Data Engineer, BI Engineer, Systems Analyst in a company with large, complex data sources
-
Working knowledge of DBT, Snowflake, Fivetran, Git and SQL or Python programming skills for data querying, cleaning, and presentation
-
Build highly available, reliable and secured API solutions, experience working with REST API design and Implementation
-
Working knowledge of relational databases (PostgreSQL, MSSQL, etc.), experience with AWS services including S3, Redshift, EMR and RDS
-
Ability to manage multiple projects at the same time in a fast-paced team environment, across time zones, and with different cultures, while maintaining ability to work as part of a team
-
The candidate must have good troubleshooting skills and be able to think through issues and problems in a logical manner and planning knowledge would be an added advantage
-
Detail-oriented and enthusiastic who is also focused and diligent on delivering results