Responsibilities:
- Design and implement cloud-based and hybrid (cloud plus on-premises) infrastructure solutions that include both virtualized compute and storage.
- Design and implement high availability and disaster recovery solutions that span the cloud and on-premises.
- Design and implement VPCs to deploy mission-critical production applications using AWS infrastructure services.
- Act as a Technical Lead for customer onboarding projects and work with customers to establish G2C connectivity.
- Install, configure, implement and support Windows/Linux servers, including management of user/group accounts and policies and integration with Entra ID and MEDS.
- Manage the patching regime of all systems.
- Manage the global hosting asset/inventory and perform lifecycle planning and execution.
- Manage monitoring systems such as Prometheus, Nagios or equivalent.
- Manage other hosting-related technologies such as proxy/web filtering, load balancers, WAF, and backup and replication solutions (e.g., DRBD, GlusterFS, NetApp ONTAP).
- Provide L3 support for the cloud team on Azure, storage, networking, and Linux platforms.
- Create and review monthly operations reports.
- Define SOPs and participate in audits and DR activities.
- Ensure all solutions comply with corporate security policies and standards.
- Gather design requirements from stakeholders (e.g., business and application groups) and translate them into functional and technical requirements.
- Automate repeatable operations activities to optimize delivery time.
- Perform capacity planning for customer environments on Managed Cloud and provide recommendations for improving availability and optimizing cost.
- Manage the lifecycle of all requests and incidents that arise, driving to the root cause of problems to prevent incidents from recurring.
- Participate in after-hours support (on call) and scheduled implementation activities as required.
Technical Skills/Experience:
- 6+ years of experience working with Azure infrastructure services (architecture, administration, operations).
- Strong working knowledge of Azure services: AKS; Storage Accounts; Load Balancers; Virtual Network; DNS; IAM; Vault; VPN; Private Link; ExpressRoute; WAF; Entra ID; Billing; Trusted Advisor; SSO; Monitor; Backup.
- Proficiency in Terraform, Python, Ruby, Bash, and PowerShell.
- Proficiency in Linux/Unix OS is a must.
- Good working knowledge of Networking concepts and tools: Routing, Switching, Firewall security, Proxy, Reverse Proxy, HAProxy, Nginx
- Experience using tools such as Git, Jenkins, Puppet, and Ansible.
- Experience with system hardening guidelines (e.g., CIS, NIST).
- Experience with Infrastructure Monitoring tools is required.
- Strong knowledge of the Kubernetes platform.
- Experience working with ticketing tools such as Salesforce, ServiceNow, or Jira is preferred.
- Requires minimal supervision and works well in a production operations team.
- Ability to work efficiently under pressure on non-routine and highly complex tasks.
- Team player with strong interpersonal, written and verbal communication skills.
- Ability to work in a multicultural team spread across the globe (Europe, India, NA)
- Must have a customer-first mindset.
- Azure Solutions Architect Expert certification is a plus.
- Knowledge of AWS Cloud is a plus.
- Education: Bachelor's or Master's degree in CS/CE or equivalent
Total Experience Expected: 6-8 years