Home
Jobs

Data Platform and Data SRE - Senior Architect

10 - 15 years

20 - 25 Lacs

Posted:4 hours ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

We are seeking an experienced Data Platform Reliability Engineer to lead our efforts in designing, implementing, and maintaining highly reliable data infrastructure. The ideal candidate will bring extensive expertise in building enterprise-grade data platforms with a focus on reliability engineering, governance, and SLA/SLO design. This role will be instrumental in developing advanced monitoring solutions, including LLM-powered systems, to ensure the integrity and availability of our critical data assets.
Platform Architecture and Design
  • Design and architect scalable, fault-tolerant data platforms leveraging modern technologies like Snowflake, Databricks, and cloud-native services
  • Establish architectural patterns that ensure high availability and resiliency across data systems
  • Develop technical roadmaps for platform evolution with reliability as a core principle
Reliability Engineering
  • Implement comprehensive SLA/SLO frameworks for data services
  • Design and execute chaos engineering experiments to identify and address potential failure modes
  • Create automated recovery mechanisms for critical data pipelines and services
  • Establish incident management processes and runbooks
Monitoring and Observability
  • Develop advanced monitoring solutions, including LLM-powered anomaly detection
  • Design comprehensive observability strategies across the data ecosystem
  • Implement proactive alerting systems to identify issues before they impact users
  • Create dashboards and visualization tools for reliability metrics
Data Quality and Governance
  • Establish data quality monitoring processes and tools
  • Implement data lineage tracking mechanisms
  • Develop automated validation protocols for data integrity
  • Collaborate with data governance teams to ensure compliance with policies
Innovation and Improvement
  • Research and implement AI/ML approaches to improve platform reliability
  • Lead continuous improvement initiatives for data infrastructure
  • Mentor team members on reliability engineering best practices
  • Stay current with emerging technologies and reliability patterns in the data platform space
Qualifications
  • 10+ years of experience in data platform engineering or related fields
  • Proven expertise with enterprise data platforms (Snowflake, Databricks, etc.)
  • Strong background in reliability engineering, SRE practices, or similar disciplines
  • Experience implementing data quality monitoring frameworks
  • Knowledge of AI/ML applications for system monitoring and reliability
  • Excellent communication skills and ability to translate technical concepts to diverse stakeholders

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Tavant Technologies
Tavant Technologies

Information Technology & Services

Irvine

RecommendedJobs for You