High Touch Operations Manager (HTOM) – DC + AI Framework

0 years

0 Lacs

Posted:2 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Location:

Navi Mumbai

Role Overview

We are seeking an experienced

High Touch Operations Manager (HTOM)

with strong knowledge of

Data Center (DC) environments and AI/ML frameworks

. The role requires expertise in managing GPU-based infrastructures, optimizing AI workloads, and ensuring operational efficiency across networking, compute, and application layers. The HTOM will act as a key customer interface, driving proactive support, incident management, and collaboration with Cisco’s engineering and support teams.

Key Responsibilities

  • Oversee and manage GPU-based server infrastructures and optimize AI/ML workloads.
  • Maintain and manage existing systems, including hardware and software updates.
  • Identify and document performance bottlenecks with actionable improvement reports.
  • Troubleshoot and resolve complex issues spanning networking, compute, and AI applications.
  • Present daily/weekly updates and performance reports to customers and internal management.
  • Participate in weekly case reviews with client support and engineering teams.
  • Prepare incident summaries and RCA reports for catastrophic events and present them to customers.
  • Collaborate with application teams, business units, and cross-functional technical domains.
  • Stay updated with industry advancements and cutting-edge DC/AI technologies.

Required Skills & Competencies

  • Strong problem-solving skills to expedite resolution of critical issues.
  • Ability to minimize operational inefficiencies by reducing redundant efforts.
  • Skilled at improving IT staff productivity, efficiency, and proficiency.
  • Risk management expertise to mitigate vulnerabilities in network and infrastructure operations.
  • Strong customer focus, communication, and presentation skills.
  • Ability to work independently with minimal supervision.
  • Excellent teamwork and collaboration skills across multiple stakeholders.
  • Logical and detail-oriented approach to problem solving and incident management.
Skills: operations,server infrastructure,customer,ml,framework,management,incident handling,gpgpu,dc,collaboration

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Neev logo
Neev

Digital Agency / Technology Solutions

N/A

RecommendedJobs for You