This position is posted by Jobgether on behalf of a partner company. We are currently looking for an
Azure Data Engineer (5 to 7 yrs) (Python / PySpark / SQL / Databricks)
in
India
.We are seeking an experienced Data Engineer to join a dynamic team focused on delivering scalable and high-quality healthcare data solutions. The ideal candidate will work closely with product teams, software engineers, and clinical leaders to design, enhance, and maintain Databricks pipelines and Azure-based data platforms. This role offers the opportunity to work with diverse healthcare datasets, implement best practices for data engineering, and contribute to actionable insights that drive decision-making. You will collaborate with stakeholders to ensure data quality, create unified data models, and support machine learning and AI initiatives, all within a flexible and collaborative remote environment.
Accountabilities:
- Design, develop, and enhance Databricks pipelines to support business-critical healthcare data
- Collaborate with business stakeholders to gather requirements and deliver accurate and intuitive data products
- Implement and maintain data models and metadata structures to support machine learning and analytics
- Monitor, troubleshoot, and resolve data quality issues, ensuring reliable and trusted datasets
- Extract, transform, and integrate data from heterogeneous sources, including claims, EHR, ADT, and FHIR datasets
- Support platform integration efforts, including acquisitions and cloud-based data migrations
- Maintain documentation, adhere to coding standards, and participate in code reviews to ensure best practices
Requirements
- 5-7 years of experience in data engineering, preferably in the healthcare domain
- Strong hands-on experience with Python, PySpark, SQL, and Databricks
- Proven expertise in Azure cloud services and cloud-based data platforms
- Experience developing scalable data pipelines and implementing data governance, quality, and security practices
- Familiarity with normalized, dimensional, star schema, and snowflake data models
- Experience serving datasets to BI tools and software engineering applications
- Strong problem-solving skills, attention to detail, and effective communication abilities
- Preferred: Databricks or Microsoft Azure certifications; experience with machine learning or AI data pipelines
Benefits
- Flexible remote working options and working hours
- Stock options and other performance incentives
- Supportive, collaborative, and innovative work environment
- Opportunities for mentorship, learning, and professional growth
- Benefits exceeding statutory requirements
- Exposure to cutting-edge healthcare data analytics and AI-driven solutions
Jobgether is a Talent Matching Platform
that partners with companies worldwide to efficiently connect top talent with the right opportunities through
AI-driven job matching.
When you apply, your profile goes through our
AI-powered screening process
designed to identify top talent efficiently and fairly.🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.📊 It compares your profile to the job's core requirements and past success factors to determine your match score.🎯 Based on this analysis, we automatically shortlist the 3 candidates with the highest match to the role.🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.The process is
transparent, skills-based, and free of bias
, focusing solely on your fit for the role.Once the shortlist is completed, it is shared directly with the company that owns the job opening. The final decision and next steps (such as interviews or assessments) are then managed by their internal hiring team.
Thank you for your interest!