Posted:3 hours ago|
Platform:
On-site
Full Time
Primary Skills: Python, Pyspark, Glue, Redshift, Lambda, DMS, RDS
,Cloud Formation and other AWS serverless ; Strong exp in SQL
Detailed JD
Seeking a developer who has good Experience in Athena, Python code, Glue,
Lambda, DMS , RDS, Redshift Cloud Formation and other AWS serverless
resources.
Can optimize data models for performance and efficiency. Able to write SQL
queries to support data analysis and reporting Design, implement, and
maintain the data architecture for all AWS data services.
Work with stakeholders to identify business needs and requirements for data-
related projects Design and implement ETL processes to load data into the
data warehouse
Good Experience in Athena, Python code, Glue, Lambda, DMS, RDS,
Redshift, Cloud Formation and other AWS serverless resources
About the Role: We are seeking a highly skilled and experienced Data Engineer to join our
team, focusing on building and maintaining robust data pipelines and infrastructure for our U.S.
healthcare payer operations. You will be responsible for the ingestion, transformation, and
availability of large, complex healthcare datasets, enabling our data scientists and ML engineers
to develop impactful solutions. This role requires a strong foundation in data warehousing,
ETL/ELT processes, and cloud-based data platforms within a highly regulated environment.
Responsibilities:
• Design, develop, and optimize scalable data pipelines (ETL/ELT) to ingest, transform, and load
diverse healthcare data from various sources into our data warehouse/lake.
• Build and maintain robust data models (e.g., star schema, snowflake schema) to support analytics,
reporting, and machine learning initiatives.
• Write highly optimized SQL queries and Python scripts for data manipulation, cleansing,
validation, and transformation.
• Ensure data quality, integrity, and reliability across all data pipelines.
• Collaborate with data scientists, ML engineers, and business stakeholders to understand data
requirements and translate them into efficient data solutions.
• Implement and manage data governance policies, security measures, and access controls for
sensitive healthcare data (e.g., HIPAA compliance).
• Monitor data pipeline performance, troubleshoot issues, and implement solutions for continuous
improvement.
• Automate data workflows using orchestration tools and integrate with CI/CD pipelines.
• Contribute to the selection, evaluation, and implementation of new data technologies and tools.
• Document data pipelines, data models, and data flow processes thoroughly.
Required Technical Skills:
• Programming Language: Strong proficiency in Python for data processing and automation.
• Databases: Expert-level SQL skills for complex querying, data manipulation, and optimization.
Experience with relational and NoSQL databases.
• Data Pipelines: Proven experience designing and implementing ETL/ELT processes.
• Cloud Platforms: Extensive hands-on experience with data services on AWS or Azure (e.g.,
AWS S3, Redshift, Glue, EMR, Athena, Azure Data Lake, Azure Synapse Analytics, Azure Data
Factory).
• Containerization: Familiarity with Docker for deploying data-related applications.
• Version Control: Proficient with Git and GitHub.
• Testing: Experience with Python unittest framework and Pytest for data pipeline testing.
• CI/CD: Understanding of CI/CD concepts and experience integrating data pipeline
deployments with AWS CodePipeline or alternatives.
• Data Modeling: Strong knowledge of data warehousing concepts and data modeling techniques.
10decoders
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python Nowchennai, tamil nadu, india
Salary: Not disclosed
hyderābād
7.5 - 8.5 Lacs P.A.
chennai, tamil nadu, india
Salary: Not disclosed
kolkata, west bengal, india
Salary: Not disclosed
bhubaneswar, odisha, india
Salary: Not disclosed
cuttack, odisha, india
Salary: Not disclosed
raipur, chhattisgarh, india
Salary: Not disclosed
guwahati, assam, india
Salary: Not disclosed
ranchi, jharkhand, india
Salary: Not disclosed
amritsar, punjab, india
Salary: Not disclosed