Data Architect

5 - 10 years

0 Lacs

Posted: 3 weeks ago | Platform: Foundit

Work Mode

On-site

Job Type

Full Time

Job Description

Location

Experience

Role

About the Role:

As a Big Data Engineer, you will play a critical role in integrating multiple data sources, designing scalable data workflows, and collaborating with data architects, scientists, and analysts to develop innovative solutions. You will work with rapidly evolving technologies to achieve strategic business goals.

Must-Have Skills:

  • 4+ years of mandatory experience with Big Data.
  • 4+ years of mandatory experience with Apache Spark.
  • Proficiency in Apache Spark, Hive on Tez, and Hadoop ecosystem components.
  • Strong coding skills in Python and PySpark.
  • Experience building reusable components or frameworks using Spark.
  • Expertise in data ingestion from multiple sources using APIs, HDFS, and NiFi.
  • Solid experience working with structured, unstructured, and semi-structured data formats (Text, JSON, Avro, Parquet, ORC, etc.).
  • Experience with UNIX Bash scripting and databases such as PostgreSQL, MySQL, and Oracle.
  • Ability to design, develop, and evolve fault-tolerant distributed systems.
  • Strong SQL skills, with expertise in Hive, Impala, MongoDB, and other NoSQL databases.
  • Hands-on experience with Git and CI/CD tools.
  • Experience with streaming data technologies (Kafka, Spark Streaming, Apache Flink, etc.).
  • Proficiency with HDFS or similar data lake technologies.
  • Excellent problem-solving skills; you will be evaluated through coding rounds.

Key Responsibilities:

  • Manage existing or new Apache HDFS clusters, including commissioning and decommissioning of name nodes, data nodes, and edge nodes.
  • Work closely with data architects and analysts to design technical solutions.
  • Integrate and ingest data from multiple source systems into big data environments.
  • Develop end-to-end data transformations and workflows, ensuring logging and recovery mechanisms.
  • Troubleshoot Spark job failures.
  • Design and implement batch, real-time, and near-real-time data pipelines.
  • Optimize Big Data transformations using Apache Spark, Hive, and Tez.
  • Work with Data Science teams to enhance actionable insights.
  • Ensure seamless data integration and transformation across multiple systems.