Performance Architect

10 - 15 years

15 - 30 Lacs

Posted: 1 day ago | Platform: Naukri

Work Mode

Hybrid

Job Type

Full Time

Job Description

Position Title:

Performance Architect

Location:

Role based in India, reporting to senior management, working closely with U.S. infrastructure hosted on AWS

Company Overview:

Applied Research Works, Inc., a wholly owned subsidiary of Vatica Health, is a forward-thinking healthcare SaaS organization committed to operationalizing value-based care at scale. Our platform runs on a secure, HITRUST CSF r2 certified AWS architecture and processes large-scale customer data ingestion across multiple transactional data pipelines.

Position Summary:

We are seeking a technically exceptional Performance Architect who will own monitoring, performance tracking, and rapid root-cause analysis across all data flows and system transactions in the SaaS platform.

This role is mission-critical to ensuring system reliability, fast incident diagnosis, and continuous performance optimization.

The ideal candidate will combine architectural thinking with hands-on expertise in log analysis, NFS-based data landing zones, SFTP-driven data pipelines, Python/SQL execution flows, and real-time transaction monitoring.

The Performance Architect will work cross-functionally and closely with:

  • Development managers and engineering teams
  • Release & QA teams
  • Infrastructure and cloud operations teams
  • Site reliability and data pipeline owners

This role requires a leader who can detect anomalies early, diagnose performance bottlenecks quickly, conduct deep root-cause analysis, and document actionable RCA summaries with urgency and precision.

Key Responsibilities

  • Act as the primary owner of end-to-end performance architecture, monitoring, and diagnostics across all data ingestion and transactional workflows.
  • Monitor performance of customer data flows entering via SFTP, through NFS landing zones and Python + SQL processing pipelines, to web transactions (PHP + Node.js portal).
  • Establish and maintain performance benchmarks, anomaly detection rules, and observability standards for different data pipeline paths and transaction types.
  • Perform root cause analysis for performance issues, including:
      • File format or landing zone delays
      • NFS mount or I/O bottlenecks
      • Python job execution latency
      • SQL query performance issues
      • Web transaction slowdowns or failures
      • Release-induced performance regressions
  • Translate technical findings into actionable engineering tasks and drive resolution via documented insights and escalation.
  • Develop and maintain root-cause summaries, SOPs, performance dashboards, and knowledge articles for recurring performance patterns.
  • Collaborate closely with infra teams to monitor storage behavior, IOPS utilization, w_await patterns, throughput saturation, and transaction tracing.
  • Participate in incident triage calls, lead investigations, and escalate issues with the right level of urgency while maintaining cross-team confidence.
  • Own and maintain the performance monitoring stack, including tools such as Kibana, Grafana, Datadog, Data Flow Trace dashboards, and internal observability panels.
  • Ensure RCA completeness, accuracy, reproducibility, and speed.
  • Mentor junior SRE/infra engineers on performance debugging best practices, log interpretation, SQL diagnostics, and pipeline tracing.

Qualifications

  • Bachelor's or Master's degree in Computer Science, Computer Engineering, Systems Architecture, or a related field.
  • 10+ years of experience in performance architecture, SRE, cloud observability, distributed transaction tracing, or large-scale pipeline monitoring roles.
  • Deep expertise in:
      • SQL performance diagnostics (MySQL, Redshift, or OLAP flows)
      • Python performance profiling
      • Unix/Linux commands for investigation
      • SFTP/NFS-based ingestion flows
      • Transaction tracing and log analysis
  • Experience working with globally distributed engineering and infrastructure teams across time zones.
  • Excellent written and verbal communication skills, with the ability to present complex technical RCAs clearly to both technical and non-technical stakeholders.

Preferred Skills

  • Strong leadership and ownership mindset for root cause analysis, documentation, escalation, and architectural influence.
  • Experience with:
      • Ticketing systems (Redmine, Jira, Freshdesk, Zendesk, etc.)
      • Monitoring/observability tools (Kibana, Grafana, Datadog, Datadog APM, Splunk, internal dashboards)
      • Large-scale I/O performance behavior on AWS (EBS, NFS mounts, throughput tuning, IOPS tracking, w_await investigation, etc.)
  • Understanding of data privacy, security, and compliance standards for handling healthcare SaaS data (HIPAA, HITRUST, etc.)
  • Comfort with escalating critical performance incidents rapidly, with clarity and urgency.
  • Experience mentoring SRE/infra teams and driving team-wide performance improvements.

Why this role matters

This is a high-visibility leadership role with impact on:

  • System reliability
  • Data pipeline throughput and latency
  • Release stability
  • Customer transaction performance
  • Faster RCA and incident resolution cycles
