Data Engineer

5 years

0 Lacs

Posted:11 hours ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Data Engineer – NiFi / Cloudera / Iceberg / Snowflake / Databricks


Overview


We are seeking a Data Engineer with strong Apache NiFi expertise to design and implement pipelines that move and transform data from Cloudera (HDFS/Hive/Impala) into Apache Iceberg tables, with downstream integration into Snowflake and Databricks. The ideal candidate will have hands-on experience with modern data lakehouse architectures and will play a critical role in enabling scalable, governed, and high-performance data platforms.


Key Responsibilities:


  • Data Ingestion & Pipeline Development
  • Design, configure, and maintain NiFi data flows to extract, transform, and load data from Cloudera into Iceberg tables.
  • Implement streaming and batch ingestion pipelines with NiFi processors and custom scripting where needed.
  • Optimize NiFi workflows for scalability, reliability, and monitoring.
  • Data Lakehouse Enablement
  • Build and manage Apache Iceberg-based datasets for structured, semi-structured, and unstructured data.
  • Ensure schema evolution, partitioning, and metadata management in Iceberg.
  • Develop integration flows from Iceberg to Snowflake and Databricks for analytics, ML, and reporting use cases.
  • Integration & Orchestration
  • Work with Snowflake to ingest curated data from Iceberg for enterprise reporting and commercial insights.
  • Collaborate with Databricks teams to enable advanced analytics and machine learning use cases.
  • Integrate NiFi pipelines with orchestration tools (Airflow, Oozie, or AWS/Azure/GCP schedulers).
  • Performance, Security & Governance
  • Tune NiFi flows and Snowflake/Databricks ingestion for performance and cost optimization.
  • Implement role-based security and ensure compliance (HIPAA, GDPR, SOX if applicable).
  • Work with governance teams to enable lineage, metadata tracking, and auditability.


Qualifications:


  • Bachelor’s degree in Computer Science, Information Systems, or related field.
  • 5+ years of data engineering experience, with at least 2+ years working with Apache NiFi.
  • Strong experience with Cloudera ecosystem (HDFS, Hive, Impala, Spark).
  • Hands-on expertise with Apache Iceberg (schema evolution, time travel, partitioning, compaction).
  • Working knowledge of Snowflake and Databricks integration patterns.
  • Proficiency in SQL and one programming language (Python, Java, or Scala).
  • Understanding of data lakehouse architectures and ETL/ELT best practices.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Mastek logo
Mastek

Information Technology and Services

Mumbai

RecommendedJobs for You

bengaluru, karnataka, india