Experience: 7 years
Salary: 8 Lacs
Posted: 1 week ago
Platform:
On-site
Part Time
Role Proficiency:
This role requires proficiency in developing data pipelines, including coding and testing for ingesting, wrangling, transforming, and joining data from various sources. The ideal candidate should be adept in ETL tools like Informatica, Glue, Databricks, and DataProc, with strong coding skills in Python, PySpark, and SQL. This position demands independence and proficiency across various data domains. Expertise in data warehousing solutions such as Snowflake, BigQuery, Lakehouse, and Delta Lake is essential, including the ability to calculate processing costs and address performance issues. A solid understanding of DevOps and infrastructure needs is also required.
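For illustration only, a minimal PySpark sketch of the kind of ingest/wrangle/join pipeline work described above; the paths, column names, and output location are hypothetical and would differ per project:

# Hypothetical example: ingest raw files, clean them, join, and publish an aggregate
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("orders_pipeline").getOrCreate()

# Ingest: read raw orders and customer reference data (placeholder paths)
orders = spark.read.option("header", True).csv("/raw/orders/")
customers = spark.read.parquet("/raw/customers/")

# Wrangle/transform: cast types, drop bad rows
orders_clean = (
    orders
    .withColumn("order_ts", F.to_timestamp("order_ts"))
    .withColumn("amount", F.col("amount").cast("double"))
    .filter(F.col("amount").isNotNull())
)

# Join and aggregate for downstream consumption
daily_revenue = (
    orders_clean.join(customers, "customer_id", "left")
    .groupBy(F.to_date("order_ts").alias("order_date"), "customer_segment")
    .agg(F.sum("amount").alias("revenue"))
)

daily_revenue.write.mode("overwrite").parquet("/curated/daily_revenue/")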
Outcomes:
Measures of Outcomes:
Outputs Expected:
Code:
Documentation:
Configure:
Test:
Domain Relevance:
Manage Project:
Manage Defects:
Estimate:
Manage Knowledge:
Release:
Design:
Interface with Customer:
Manage Team:
Certifications:
Skill Examples:
Knowledge Examples:
Additional Comments:
We are seeking an experienced and driven Senior Data Engineer with 8+ years of hands-on experience in designing and building robust, scalable data pipelines. This role requires deep expertise in PySpark, SQL, and cloud data platforms such as Azure Databricks, with additional exposure to AWS or GCP environments. The ideal candidate will also be well-versed in dimensional modeling techniques (e.g., Kimball/star schema) and best practices for ETL/ELT pipeline development. You will collaborate with data architects, analysts, and business stakeholders to deliver high-quality data solutions that drive insights and business value.

Key Responsibilities:
- Design, build, and optimize scalable ETL/ELT pipelines using PySpark and SQL
- Develop data workflows on Azure Databricks and integrate across cloud platforms (AWS or GCP)
- Implement and maintain data models using Kimball/star schema or similar dimensional modeling approaches (see the sketch after the skills lists)
- Ensure data quality, consistency, and performance across large datasets
- Collaborate with cross-functional teams to understand business requirements and translate them into scalable data solutions
- Contribute to data architecture and platform decisions in a cloud-native environment
- Participate in code reviews, documentation, and Agile team ceremonies

Must-Have Skills:
- 8+ years of experience in Data Engineering or a related field
- Strong hands-on experience with PySpark for data transformation and processing
- Advanced SQL skills for querying and managing large-scale datasets
- Proven experience with Azure Databricks for big data development
- Familiarity with cloud platforms like AWS and/or GCP
- Solid understanding of dimensional data modeling using Kimball/star schema
- Experience building and maintaining ETL/ELT data pipelines in production environments

Nice-to-Have Skills:
- Experience with orchestration tools (e.g., Apache Airflow, Azure Data Factory)
- Exposure to data governance, cataloging, and lineage tools
- Familiarity with Delta Lake and Lakehouse architectures
- Proficiency in Python beyond PySpark (e.g., for utilities, API integration)
- Background in business domains such as finance, e-commerce, or retail analytics
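As an illustrative sketch of the dimensional-modeling and Delta Lake points above (not this team's actual code), the following builds a Kimball-style fact table in PySpark on Databricks; table and column names are hypothetical and surrogate-key handling is simplified:

# Hypothetical example: populate a star-schema fact table and write it as Delta
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

stg_sales = spark.table("staging.sales")          # raw transactional grain
dim_customer = spark.table("dw.dim_customer")     # conformed customer dimension
dim_date = spark.table("dw.dim_date")

fact_sales = (
    stg_sales
    .join(dim_customer, "customer_id")            # resolve dimension surrogate keys
    .join(dim_date, stg_sales["sale_date"] == dim_date["calendar_date"])
    .select(
        dim_date["date_key"],
        dim_customer["customer_key"],
        stg_sales["product_id"],
        stg_sales["quantity"],
        stg_sales["net_amount"],
    )
)

# On Databricks, Delta is the default managed table format
fact_sales.write.format("delta").mode("append").saveAsTable("dw.fact_sales")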
PySpark, SQL, Azure Databricks, ELT
UST Global
Thiruvananthapuram
8.0 - 8.0 Lacs P.A.
Trivandrum, Kerala, India
Salary: Not disclosed