Home
Jobs

PySpark / Reltio

7 - 10 years

7 - 10 Lacs

Posted:1 week ago| Platform: Foundit logo

Apply

Skills Required

Work Mode

On-site

Job Type

Full Time

Job Description

Develop and maintain data pipelines using PySpark in distributed computing environments (e.g., AWS EMR, Databricks). Integrate and synchronize data between enterprise systems and the Reltio MDM platform. Design and implement data transformation, cleansing, and enrichment processes. Collaborate with data architects, business analysts, and Reltio solution architects to ensure high-quality data modeling. Work on API-based integration between Reltio and upstream/downstream applications. Optimize PySpark jobs for performance and cost-efficiency. Ensure data quality, integrity, and governance throughout the pipeline. Troubleshoot and resolve data and performance issues in existing workflows. Required Skills & Qualifications: 7+ years of experience in PySpark development and distributed data processing. Strong understanding of Apache Spark, DataFrames, and Spark SQL. Experience with Reltio MDM, including entity modeling, survivorship rules, match & merge configuration. Proficiency in working with REST APIs and JSON data formats. Experience with cloud platforms like AWS and data services (e.g., S3, Lambda, step function) Good knowledge of data warehousing concepts, ETL workflows, and data modeling. Familiarity with CI/CD practices and version control tools like Git. Strong problem-solving and communication skills.

Mock Interview

Practice Video Interview with JobPe AI

Start Reltio Mdm Interview Now
Virtusa
Virtusa

Information Technology and Services

Southborough

20,000+ Employees

2933 Jobs

    Key People

  • Kris Canekeratne

    Chairman and CEO
  • Sanjay Singh

    President and COO

RecommendedJobs for You

Chennai, Tamil Nadu, India

Chennai, Tamil Nadu, India

Mysore, Karnataka, India

Bengaluru / Bangalore, Karnataka, India

Bengaluru / Bangalore, Karnataka, India