Job Summary
We are seeking an experienced and hands-on Cloud Infrastructure & Operations Manager to lead a team of 15 engineers responsible for managing the infrastructure layer of multi-tenant, cloud-hosted ERP products.The role covers platform reliability, product upgrades, cloud security, incident and preventive maintenance, disaster recovery, and compliance audits.This position also acts as a stage-gate for all production deployments, ensuring release readiness, rollback capability, and platform stability.
Principal Duties And Responsibilities
Cloud Infrastructure Oversight :
- Oversee provisioning, monitoring, and scaling of cloud environments (primarily Azure) for ERP products.
- Ensure optimal performance, cost control, and platform stability.
SaaS Product Operations
- Own product environment availability (Dev, UAT, Prod), plan platform upgrades, apply security patches, and manage certificates and access.
Incident Management
- Lead incident response for outages and degradation.
- Perform RCA, document learnings, and implement post-mortem action items.
Preventive Maintenance
- Define and execute regular health checks, patching schedules, environment cleanups, and alert tuning.
- Disaster Recovery Planning.
- Develop and test DR/BCP plans.
- Ensure business continuity across all cloud-hosted environments.
Security & Compliance
- Lead infrastructure-level compliance activities for SOC 2, ISO 27001, and secure deployment pipelines.
- Coordinate with infosec and audit teams.
- Production Deployment Stage-Gate.
- Review and approve all deployment tickets.
- Validate readiness, rollback strategy, and impact analysis before production cutover.
Team Leadership
- Lead, coach, and upskill a team of cloud and DevOps engineers.
- Foster a learning culture aligned with platform reliability and innovation.
Required Skills And Qualifications
- 10+ years of experience in Cloud Infrastructure / SaaS Operations.
- 3+ years managing teams in a cloud product environment (preferably multi-tenant SaaS).
- Strong hands-on knowledge of Azure (VMs, PaaS, Networking, Monitoring, Identity).
- Experience with ERP platforms (SAP Cloud, Infor, Oracle Cloud, or custom-built ERP solutions).
- Good grasp of DevOps practices, CI/CD pipelines, infrastructure as code (IaC).
- Familiarity with SOC 2, ISO 27001, and data privacy compliance.
- ITIL or SRE certification preferred.
Skills Matrix
- Cloud Platform Azure (App Services, VM, Networking, Storage, Defender)
- ERP Infra Multi-tenant ERP hosting, Cloud DB tuning, PaaS scaling .
- DevOps CI/CD (Azure DevOps, GitHub Actions), Automation .
- IaC Terraform / Bicep / ARM Templates
- Monitoring & Logging Azure Monitor, Application Insights, Log Analytics
- Incident Management ITIL, On-call Runbooks, RCA Writing
- Preventive Ops Scheduled health checks, capacity management
- Security & Access IAM, Azure AD, Role-based Access, Secret Rotation
- Disaster Recovery DR Drills, Geo-Redundancy, RTO/RPO
- Audit & Compliance SOC 2, ISO 27001, Risk Registers
(ref:hirist.tech)