Posted:4 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description


Location: Mumbai

Department: Data Science & Analytics

Reports To: Head – Data Science & Analytics


About the Role

We are looking for a Data Engineer to design, build, and optimize scalable data pipelines and platforms powering Square Yards’ advanced analytics, AI-driven products, and real estate data intelligence services. You will work closely with data scientists, product teams, and cloud engineers to ensure seamless data flow, high availability, and performance across our platforms.


Key Responsibilities

● Design, develop, and maintain ETL/ELT pipelines to ingest, transform, and store large-scale structured and unstructured data.

● Manage and optimize real-time streaming pipelines using Apache Kafka and batch processing with Apache Spark .

● Build scalable data models and ensure data quality, governance, and reliability across multiple sources (transactional, behavioral, geospatial, property datasets).

● Work with Cassandra and Elasticsearch for high-volume data storage, retrieval, and indexing.

● Deploy and manage cloud-native data pipelines on Google Cloud Platform (BigQuery, Dataflow, Pub/Sub, etc.) .

● Partner with data science and AI teams to productionize predictive models, recommendation engines, NLP/LLM-powered solutions, and RAG pipelines .

● Monitor, troubleshoot, and optimize system performance and data workflows.

● Automate workflows to reduce manual intervention and improve efficiency.


Required Skills & Experience

● Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.

● 3–6 years of experience in data engineering or a related field (flexible based on role level).

● Strong programming skills in Java and Python .

● Hands-on experience with Apache Spark, Kafka, Cassandra, and Elasticsearch .

● Proficiency with cloud data services (preferably GCP) .

● Strong understanding of data modeling, ETL design, and distributed systems .

● Familiarity with API integrations and real-time data services .

● Knowledge of geospatial data processing and visualization tools is a plus.

● Experience supporting machine learning and AI pipelines in production is an advantage.


Preferred Qualifications

● Exposure to LLM, NLP, and recommendation systems .

● Experience with data governance, lineage, and observability tools .

● Strong problem-solving skills and ability to work in fast-paced environments.

● Excellent collaboration and communication skills. What We Offer

● Opportunity to work on cutting-edge AI & data products in real estate.

● Collaborative environment with cross-functional teams (AI, engineering, product).

● Growth opportunities and continuous learning in big data, cloud, and AI.

Mock Interview

Practice Video Interview with JobPe AI

Start Java Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Java Skills

Practice Java coding challenges to boost your skills

Start Practicing Java Now

RecommendedJobs for You

sholinganallur, tamil nadu, india

gurugram, haryana, india

bengaluru east, karnataka, india

noida, uttar pradesh, india

gurugram, haryana, india