Software Engineer II, Reliability Engineering , ITC

2 - 4 years

0 Lacs

Posted:3 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Site Reliability Engineer II

India Technology Center

Who You’ll Work With

You will be a part of a team of talented Site Reliability Engineers focused on delivering reliabile and observable software used by millions of athletes* around the world. You will be a part of the Resilience Engineering organization which includes Reliability Engineering, Live Site Support Engineering, Peak Event Management, Insights & Efficiency Engineering, and Enterprise Systems Engineering.While a variety of engagement methods exist, SREs are primarily embedded with product delivery teams across Global Technology. These teams span all of Nike’s most critical digital properties: Nike.com, Nike App, SNKRS, brick & mortar retail, wholesale platforms, and supply chain technologies.

Who We Are Looking For

The ideal candidate will have a strong software engineering background, a demonstrated ability to influence and partner, and show a passion for learning and mentoring. This engineer will have a track record of delivering reliable and observable digital experiences through the application of concepts from Site Reliability Engineering, DevOps, and other relevant disciplines.
  • Bachelor’s degree in Computer Science, Information Systems, or other relevant subject areas
  • 2-4 years of professional experience in software engineering
  • Understanding of how to deliver large scale software with modern reliability and resilience concepts (multi-region, multi-cloud, active/active, canary deploys, synthetic testing, containers, etc.)
  • Hands-on experience building, deploying, and operating software using modern cloud-based distributed system techniques and micro-service architecture patterns. AWS experience preferred
  • Experience with modern observability tooling, processes, and mindset – Splunk, SignalFx, New Relic, CatchPoint, etc. Bonus points for experience with Open Source observability stacks. Extra bonus points for experience with AI Ops, AI/ML
  • Strong design and development experience with Java
  • Proficient with JavaScript on frontend (React, Angular, etc.) and backend (Node.js) components
  • Experience in other modern enterprise languages (functional or other – Scala, Python, Golang, etc.) is preferred
  • Basic understanding of DNS, Networking, Virtualization, Linux
  • Expertise in designing/building/supporting scalable cloud-based Micro Services
  • Experience with Docker and/or Serverless patterns
  • Experience with at least one No-SQL database like DynamoDb, Cassandra, etc.
  • Good understanding of RESTful APIs
  • Strong communication skills (written and verbal). They must be able to clearly articulate issues and their impact(s)
  • Highly confident and capable in reporting and communicating high value metrics to leadership. Deep understanding of the business landscape and how site reliability influences our consumers

What You’ll Work On

As a Site Reliability Engineer, you will be focused on maximum availability, observability, reliability, security, and performance for Nike Digital Experiences. SREs perform deep problem analysis, detect infrastructure or code defects, define, report, and create observability processes for Key Performance Indicators (KPIs), and work with product delivery teams to provide long-term solutions to production issues.
  • Observing, diagnosing, and quickly resolving production issues with precision to minimize service interruptions
  • Developing and implementing real-time monitoring solutions that deliver essential insights into system health and key performance indicators
  • Communicating technical issues and their business impacts clearly, ensuring alignment across teams and effective response strategies
  • Reporting high-value metrics and insights to leadership, demonstrating the impact of site reliability on consumer experience and overall business objectives
  • Managing IT service processes such as Incident, Problem, Change, and Knowledge Management to maintain service quality and reliability
  • Collaborating closely with both business and technical teams to analyze system performance, troubleshoot consumer-reported issues, and proactively optimize system efficiency
  • Leading initiatives to enhance application reliability for high-demand consumer web and mobile platforms, ensuring consistent performance
  • Leveraging negotiation and influence to foster alignment and drive collaborative solutions across multiple teams
  • Promoting a culture of growth by coaching, mentoring, and sharing knowledge, supporting continuous improvement and resilience across the team

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Java Skills

Practice Java coding challenges to boost your skills

Start Practicing Java Now

RecommendedJobs for You