Associate Vice President - SRE, Digital Business

8 years

0 Lacs

Posted: 2 days ago | Platform: LinkedIn

Work Mode: On-site

Job Type: Full Time

Job Description

Job Title: Associate Vice President - SRE, Digital Business

Location: Mumbai, Bengaluru


About Us

Sony LIV is a leading OTT platform revolutionizing the way audiences consume entertainment.

With millions of users across the globe, our mission is to deliver seamless, high-quality, and reliable streaming experiences. We are looking for a Principal Site Reliability Engineer (SRE) to join our team and take ownership of ensuring the availability, scalability, and performance of our critical systems.


Job Summary

As a Principal SRE, you will be responsible for designing, building, and maintaining reliable and scalable infrastructure to support our OTT platform.

You bring a developer's mindset, coupled with extensive SRE experience, and a passion for reliability and performance.

You'll ensure smooth system operations, take ownership of application and infrastructure reliability, and have a strong support mindset to tackle critical incidents, even during off-hours.


We're seeking a candidate with 8+ years of experience, a deep understanding of observability, and the ability to lead reliability initiatives across systems and teams.


Key Responsibilities

  • Full System Ownership: Take complete responsibility for the availability, reliability, and performance of systems, including both application and infrastructure layers.
  • Development & SRE Mindset: Leverage your experience as a developer and SRE to build tools, automation, and systems that improve reliability and operational efficiency.
  • Incident Management: Respond to and resolve critical system issues promptly. This includes being available for on-call support and handling emergencies outside business hours, late nights included.
  • Infrastructure Management: Design, deploy, and manage infrastructure solutions using containers (Docker/Kubernetes), networks, and CDNs to ensure scalability and performance.
  • Observability: Drive best practices in observability, including metrics, logging, and tracing, to enhance system monitoring and proactive issue resolution. Implement and maintain observability tools like Prometheus, Grafana, ELK stack, or DataDog.
  • Reliability and Performance: Proactively identify areas for improvement in system reliability, performance, and scalability, and define strategies and best practices to address them.
  • Collaboration and Communication: Work closely with cross-functional teams, including development, QA, and support, to align goals and improve operational excellence. Communicate effectively across teams and stakeholders.
  • CI/CD and Automation: Build and enhance CI/CD pipelines to improve deployment reliability and efficiency. Automate repetitive tasks and processes wherever possible.
  • Continuous Improvement: Stay up to date with the latest technologies and best practices in DevOps, SRE, and cloud computing, and apply them to improve existing systems and processes.


Required Skills and Experience

  • Experience: 10+ years of experience in software development, DevOps, and SRE roles.
  • Development Experience: Strong experience as a software developer with expertise in building scalable, distributed systems.
  • SRE/DevOps Experience: Hands-on experience managing production systems, ensuring uptime, and improving system reliability.


Technical Proficiency:

  • Strong experience with containers (Docker, Kubernetes).
  • In-depth understanding of networking concepts and CDNs (e.g., Akamai, CloudFront).
  • Proficiency in infrastructure-as-code (IaC) tools like Terraform or CloudFormation.
  • Expertise in cloud platforms such as AWS, GCP, or Azure.
  • Observability Expertise: Proven experience in implementing and maintaining robust observability solutions, including monitoring, alerting, metrics, and tracing.
  • Incident Handling: Proven ability to handle critical incidents, perform root cause analysis, and implement permanent fixes.
  • Automation: Strong scripting/programming skills in Python, Go, or similar languages.
  • Reliability Focus: Demonstrated passion for system reliability, scalability, and performance optimization.
  • Soft Skills: Excellent communication, collaboration, and leadership skills. Ability to explain technical details to non-technical stakeholders.
  • On-Call Readiness: Willingness to participate in a 24x7 on-call rotation and support critical systems during off-hours.


Preferred Qualifications

  • Experience in OTT or video streaming platforms.
  • Understanding of video delivery workflows, encoding, and adaptive bitrate streaming technologies.
  • Experience working with hybrid or multi-cloud environments (on-premises and multiple clouds).
  • Certifications in cloud platforms (AWS Certified Solutions Architect, Google Professional Cloud Architect, etc.).


Why join us?

Sony Pictures Networks is home to some of India’s leading entertainment channels such as SET, SAB, MAX, PAL, PIX, Sony BBC Earth, Yay!, Sony Marathi, Sony SIX, Sony TEN, Sony TEN1, SONY Ten2, SONY TEN3, SONY TEN4, to name a few! Our foray into the OTT space with one of the most promising streaming platforms, Sony LIV, brings us one step closer to being a progressive, digitally led content powerhouse.

Our independent production venture, Studio Next, has already made its mark with original content and IPs for TV and Digital Media. But our quest to Go Beyond doesn’t end there. Neither does our search to find people who can take us there. We focus on creating an inclusive and equitable workplace where we celebrate diversity with our Bring Your Own Self philosophy, and we are recognised as a Great Place to Work.

  • Great Place to Work Institute: Ranked as one of the Great Places to Work for 5 years.
  • Working Mother & Avtar Best Companies for Women in India study: Included in the Hall of Fame and ranked amongst the 100 Best Companies for Women in India.
  • ET Human Capital Awards 2021: Winner across multiple categories.
  • Brandon Hall Group HCM Excellence Award: Outstanding Learning Practices.

The biggest award, of course, is the thrill our employees feel when they can Tell Stories Beyond the Ordinary.
