Manager / Senior Manager - Cloud Infrastructure & Operations

5 years

0 Lacs

Posted: 3 days ago | Platform: LinkedIn

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview:


Principal Accountabilities & Responsibilities

• Design, deploy, and manage secure, scalable cloud infrastructure (primarily AWS; GCP exposure is a plus).

• Oversee core IT infrastructure, including networking, server provisioning, storage, and backup solutions.

• Ensure adherence to institutional compliance standards and security best practices; conduct root cause analysis for infrastructure incidents.

• Administer and optimize containerized environments using Docker and Kubernetes (EKS preferred).

• Design and implement automated backup and disaster recovery strategies across cloud and on-premises environments, ensuring data resilience and compliance with RTO/RPO objectives.

• Lead the response to downtime events and develop proactive strategies to minimize system outages, shorten recovery time, and maintain high availability across critical services.

• Monitor and analyse cloud workloads to ensure high performance and cost efficiency.

• Manage cloud infrastructure budgets and pricing strategies, ensuring optimal resource allocation and cost control.

• Expertise in site reliability engineering (SRE) and security may also be required.

• Serve as the single point of contact (SPOC) for vendor and license management (e.g., Zoom, Microsoft 365).

• Guide and mentor junior team members.

• Lead digital transformation initiatives to modernize IT infrastructure in alignment with academic and operational goals.


Skill and Ability Requirements

• Minimum of 5 years of progressively responsible experience in cloud infrastructure and operations.

• Strong proficiency in Amazon Web Services (AWS), including EC2, S3, IAM, CloudWatch, and Lambda.

• In-depth understanding of networking concepts such as VPC, DNS, load balancing, and VPN.

• Expertise in containerization with Docker and orchestration using Kubernetes (EKS preferred).

• Experience with cloud security tools, including AWS Security Hub and CloudTrail.

• Hands-on experience with monitoring tools, such as Prometheus, Grafana, and Datadog.

• Strong experience in cloud cost optimization, pricing analysis, and budget management.

• Proficiency with CI/CD tools and pipelines (e.g., Jenkins, GitHub Actions).

• Experience integrating CI/CD for infrastructure-as-code (IaC) and AI/ML workflows.

• Experience deploying AI models and solutions in cloud environments (e.g., AWS SageMaker, Azure ML, GCP Vertex AI).

• Proven experience in Linux system administration and shell scripting.

• Demonstrated ability to manage vendor relationships and license agreements (e.g., Zoom, Microsoft 365).

• Familiarity with Google Cloud Platform (GCP) is a plus.

• Excellent problem-solving skills and the ability to lead incident response and root cause analysis.


Qualification & Experience

• Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.

• AWS certifications are highly desirable.

• Minimum of 5 years of progressively responsible experience in cloud infrastructure and operations.
