Assistant Manager - Cloud Operations

4 years

0 Lacs

Posted:2 weeks ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

ROLE

Assistant Manager - Cloud Operations


REPORTING TO

General Manager - Cloud Infra & Architecture


ROLE/REQUIREMENT SUMMARY

  • Strong knowledge on AWS Cloud, Kubernetes Orchestration and Docker
  • Experience with deploying and maintaining GPU based production environments
  • Experience in automating and managing large-scale infrastructure
  • Knowledge and experience with Continuous Integration/Deployment best practices and servers (Jenkins, AWS Code Pipeline, Code Deploy and Code Commit)
  • Working experience with version control systems (Git, GitHub, Bitbucket)
  • Experience with Artifacts management
  • Working with AWS, GCP, Azure
  • Experience in configuring and automating monitoring tools like Grafana, New Relic, Kibana
  • Knowledge and experience with secure infrastructure
  • Hands on experience on managing Level 1, 2 ,3 Ticket/Order Triage Specialist, Incident, Problem, Service Request and Escalations
  • Experience working in a 24x7 production support model with excellent troubleshooting skills
  • Strong experience in working with ticketing tools like Freshservice, Onedesk, JIRA
  • Demonstrable experience on Collaboration with multiple teams, partners and vendors in order to uncover, analyse and resolve platform issues
  • Good understanding on flows of different platforms like Android, IOS, web and TV platforms
  • Ability to work under pressure and tight schedules while handling any incident or issue
  • Passion for learning the latest interactive technologies and techniques.



KEY DUTIES AND RESPONSIBILITIES

  • Ensure 99.9% Uptime of OTT applications and services
  • Work closely with Product Engineering, Technology, IT and Partners
  • Close work relations with the technical teams like project managers, R&D team leaders, Solution Architects, other partners, UAT and Service providers
  • Ensure, Enable, Manage and Track 24x7 Monitoring & Support with zero downtime and Zero issues
  • Ensure Issue triaging and fixes as per the defined SLA with RCA
  • No/Zero pending L1/L2/L3 issues
  • Track and Manage DevOps tickets for Tataplay OTT Apps and Internal Products
  • Present Weekly/Monthly Alerts, Issues Report and Tracker to stakeholders
  • Ensure No escalations
  • Create/Manage Operation Manual
  • Setup SOPs and Practices
  • Enable periodic security audits to ensure no security issues in infra, platform and services
  • Periodic Audit of IAM roles, Resources, access and report
  • Design, generate and interpret operational reports related to system health status, capacity management and system performance management
  • Ensure Team KT is done 100% for any planned migration
  • Cloud Cost for analysis and approval for various project deployments in Tata Play
  • Take ownership over the CI/CD processes
  • Build internal automated modelling tools to minimize operation effort
  • Provide new ideas or take initiative to control OpEx cost
  • Manage, Track, Report and deliver Platform and Cost Optimization Backlog
  • Work on different ideas to optimize Monitoring, Alerting, Service/Process Automations, Debugging and Triaging
  • Establish Cross Function Collaboration to identify and resolve platform issues
  • Develop and maintain effective working relationships with team members, client and stakeholders
  • Own the overall responsibility for Cloud Infra, DevOps and OTT Services
  • Build, Lead and guide a cross-company team of DevOps Engineers


TECHNICAL COMPETENCIES

  • Operational Excellence
  • Debugging and Problem-Solving Skills
  • Quality Control & Assurance
  • Audit & Control
  • Cost Management and Control


EDUCATION

BE/B.Tech/MCA and AWS Certification (Preferred)


PREFERRED EXPERIENCE

*Candidate should have grown from OTT / Ecommerce / B2C with large scale Production Infra and Workload

*Minimum 4 Years of Experience as a DevOps Engineer or Cloud Operation Engineer

*Solid communication skills (oral, written, and presentation) and strong interpersonal skills, along with the ability to work as a team member with minimum supervision

*Should be a good team player & able to work independently under pressure.

*Ability to prioritize and differentiate between priority and severity incidents

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You