Lead Data Engineer

4 - 7 years

15 - 20 Lacs

Posted:1 day ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

    About the Role

    We are looking for a hands on Data Engineer to design, build, and operate robust data pipelines and platforms on Snowflake with Azure. You will use

    strong SQL

    ,

    Python/PySpark

    ,

    ADF pipelines

    , and modern data modeling practices to ingest data from diverse data sources, and enable AI/ML use cases via

    VectorDB indexing and embeddings

    . The role emphasizes reliability, performance, cost efficiency, and secure data operations in line with our enterprise platforms and standards.

    Key Responsibilities

    Design build data pipelines

    on Snowflake and Azure (ADF, PySpark) to ingest data from REST APIs, files, and databases into curated zones.

    Model data

    optimized for analytics, reporting, and downstream applications.

    Develop embeddings VectorDB indices

    to power semantic search/retrieval (e. g. , generating embeddings and indexing into enterprise approved vector stores; integrate with pipeline orchestration).

    Own performance cost optimization

    in Snowflake (SQL tuning, partitioning, caching, clustering, compute sizing).

    Implement CI/CD and DevOps

    practices (Git branching, automated deploys for ADF/Snowflake).

    Harden reliability

    (monitoring, alerting, retry logic, SLA tracking) and

    security/compliance

    (RBAC, secrets management, data governance, data lineage).

    Collaborate with stakeholders

    (product, analytics, and platform teams) to translate requirements into technical design and deliver incremental value.

    Must Have

    Qualifications

    4-7 years

    total experience in data engineering in large scale enterprise systems

    Snowflake

    : Min 3 years of experience in Snowflake with exposure to warehouse configuration, schema design, performance tuning; stored procedures/tasks; loading strategies. Exposure to Snowflake Cortex AI.

    SQL/Python/PySpark

    : Design and implement scalable data processing solutions using SQL, Python, and distributed compute frameworks, including unit/integration tests.

    Azure ADF

    : ADLS Gen2, ADF pipelines/activities, triggers, parameterization; monitoring troubleshooting.

    Data modeling

    : Apply data modeling techniques, including medallion architecture (Bronze/Silver/Gold).

    API ingestion

    : designing resilient ingestion of REST/JSON, pagination, auth, rate limit handling.
     

    VectorDB embeddings

    : Experience generating embeddings and building vector indices for retrieval augmented scenariosExposure to building knowledge graphs and Gremlin or Cypher graph query languages on CosmosDB/Neo4j

    Version control CI/CD

    : Git, pull requests, automated deployment pipelines. Maintain a results-oriented mindset with strong analytical and problem-solving skills.

    Good to Have

    Experience in Healthcare IndustryPrior experience working on data migration projects.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Providence Global Center logo
Providence Global Center

Business Services, Technology

Seattle

RecommendedJobs for You

pune, maharashtra, india

gurugram, haryana, india

hyderabad, telangana, india