Principal Site Reliability Engineer

10 - 12 years

Posted: 1 week ago | Platform: Foundit

Work Mode

On-site

Job Type

Full Time

Job Description

Join a globally recognized financial organization and advance your career by contributing to revolutionary projects. You've discovered the perfect environment to have a major impact.

As a Principal Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking division, you will leverage your advanced expertise to identify new opportunities for influencing critical incident management and enhancing the end-to-end software development lifecycle for the firm. Your role will involve managing, designing, and implementing infrastructure components and essential services to boost reliability and ensure operational efficiency within the Card Site Reliability Engineering function. You will be part of a globally distributed team dedicated to maintaining production stability, automation, reliability, and observability. We seek solution-oriented, commercially minded, and customer-focused team members who excel in an agile environment and are eager to contribute to building innovative solutions from the ground up within a diverse and inclusive team.

Job responsibilities

  • Identifies and solves problems of high complexity.
  • Works with development teams throughout the Software Development Life Cycle to ensure sustainable software releases.
  • Leads medium to large projects by bringing together the right perspectives, identifying roadblocks, and integrating feedback from team members and subject matter experts at the firm.
  • Manages complex business challenges with elegant, efficient solutions, harnessing the power of code and cloud infrastructure to configure, maintain, monitor, and optimize applications, driving continuous improvement and scalability.
  • Participates in support responsibilities for coverage of critical applications. Sees problems as opportunities to improve.
  • Architect and implement observability platforms and tools for proactive detection and continuous improvement.
  • Lead the design and development of core observability services, including metrics pipelines and log aggregation.
  • Leverage modern technologies such as OpenTelemetry and AI/ML for anomaly detection and automated insights.
  • Collaborate with engineering and SRE teams to define service-level objectives (SLOs) and error budgets.
  • Provide technical leadership and mentorship to engineering teams, ensuring best practices in system design.
  • Champion observability as a first-class concern in the software development lifecycle.

Required qualifications, capabilities, and skills

  • Formal training or certification in Site Reliability Engineering concepts and 10+ years of applied experience.
  • Fluent in at least one programming language, such as Python or Java (Spring Boot).
  • Experience with cloud-native (AWS) instrumentation and streaming data platforms.
  • Proficient with continuous integration, continuous delivery, and infrastructure-as-code tools such as Jenkins, GitLab, or Terraform.
  • Proficient with containers and container orchestration (Docker, ECS, Kubernetes).
  • Experience with troubleshooting common networking technologies and issues.
  • Ability to determine how systems relate to one another and build automation to improve reliability.
  • Experience with translating research, analysis, and tests into business recommendations.
  • Ability to balance and be accountable for the work of multiple architects and designers.
  • Understands and leads partnerships across job functions to develop efficient systems.
  • Engages team members and expresses complex ideas with an appropriate level of detail, while providing constructive feedback.

Preferred qualifications, capabilities, and skills

  • Influence technology and policy decisions while fostering commitment and confidence in team members.
  • Develop effective solutions and analyze competitive positions by considering market trends.
  • Support the introduction of innovative methods and communicate clearly to persuade audiences.
  • Demonstrate concern for, and meet the needs of, both internal and external customers.
