Data Architect with data lake implementation

Techmantra Gulf

6 - 11 years

25 - 37 Lacs

Bengaluru

Posted: 3 months ago

Skills Required

Delta Lake, data lake, data architecture, data governance, Apache Flink, cloud, Iceberg, DataOps, Apache Hudi

Work Mode

Work from Office

Job Type

Full Time

Job Description

Skills Required:

Familiarity with data processing engines such as Apache Spark, Flink, or other big data tools.
Design, develop, and implement robust data lake architectures on cloud platforms (AWS/Azure).
Implement streaming and batch data pipelines using Apache Hudi, Apache Hive, and cloud-native services like AWS Glue, Azure Data Lake, etc.
Architect and optimize ingestion, compaction, partitioning, and indexing strategies in Apache Hudi.
Develop scalable data transformation and ETL frameworks using Python, Spark, and Flink.
Work closely with DataOps/DevOps to build CI/CD pipelines and monitoring tools for data lake platforms.
Ensure data governance, schema evolution handling, lineage tracking, and compliance.
Sound knowledge of Hive, Parquet/ORC formats, and the trade-offs among Delta Lake, Hudi, and Iceberg.
Strong understanding of schema evolution, data versioning, and ACID guarantees in data lakes.
Collaborate with analytics and BI teams to deliver clean, reliable, and timely datasets.
Troubleshoot performance bottlenecks in big data processing workloads and pipelines.
Experience with data governance tools and practices, including data cataloging, data lineage, and metadata management.
Strong understanding of data integration and movement between different storage systems (databases, data lakes, data warehouses).
Strong understanding of API integration for data ingestion, including RESTful services and streaming data.
Experience in data migration strategies, tools, and frameworks for moving data from legacy systems (on-premises) to cloud-based solutions.
Proficiency with data warehousing solutions (e.g., Google BigQuery, Snowflake).
Expertise in data modeling tools and techniques (e.g., SAP Datasphere, Sparx Enterprise Architect).
Strong knowledge of SQL and NoSQL databases (e.g., MongoDB, Cassandra).
Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud).

Nice To Have

Experience with Apache Iceberg, Delta Lake
Familiarity with Kinesis, Kafka, or any streaming platform
Exposure to dbt, Airflow, or Dagster
Experience in data cataloging, data governance tools, and column-level lineage tracking
