IAAS Engineer
Experience: 6+
Mode of Hire : Fulltime
Job Summary:
We are seeking an experienced and versatile IAAS Engineer with strong expertise in Windows Server, VMware virtualization, Azure, AWS, and NetApp storage administration. The ideal candidate will also have hands-on experience with Linux OS management, disaster recovery tools, monitoring platforms such as LogicMonitor, Agile methodologies, and automation tools including Terraform and Ansible. This role involves designing, implementing, and maintaining resilient infrastructure environments, ensuring high availability, performance, and security, while contributing to automation, monitoring, and operational improvements through collaborative teamwork and industry best practices.
Key Responsibilities:
Infrastructure Administration
- Strong knowledge of enterprise infrastructure architecture, performance optimization, and high-availability solutions across hybrid cloud and on-premises environments.
- Install, configure, upgrade, and maintain Windows Server (2012, 2016, 2019, 2022) and Linux OS (RHEL/CentOS, Ubuntu) in production, test, and development landscapes.
- Perform day-to-day system administration, including provisioning virtual machines, adding/removing disk space, managing user accounts, and implementing security hardening.
- Administer VMware vSphere/ESXi and vCenter, including VM lifecycle management, performance tuning, host patching, and resource capacity planning.
- Manage Azure and AWS infrastructure resources, including virtual machines, storage, networking, backups, and security groups.
- Administer NetApp storage systems, including volume provisioning, LUN management, snapshots, replication, and storage performance monitoring.
- Perform OS and application patching across Windows and Linux servers, ensuring minimal downtime and adherence to security compliance.
- Implement, test, and maintain backup and disaster recovery solutions, including replication, failover testing, and DR drills.
- Automate infrastructure provisioning, configuration, and maintenance tasks using Terraform, Ansible, PowerShell, and Bash scripting.
- Monitor infrastructure health, performance, and alerts using LogicMonitor and other monitoring tools.
- Collaborate closely with application, database, and network teams to deliver optimal infrastructure solutions aligned with business needs.
- Maintain accurate documentation of infrastructure topology, configuration standards, operational procedures, and troubleshooting guides.
- Participate in capacity planning, infrastructure upgrades, and continuous improvement initiatives.
- Manage and track infrastructure tasks and projects using Jira , ensuring alignment with Agile sprint planning and delivery timelines.
- Actively engage in knowledge sharing, cross-functional project planning, and operational excellence initiatives.
Day-to-Day Activities:
- Provision and configure new virtual machines in VMware, Azure, and AWS.
- Add or extend disk space for servers and storage systems.
- Perform OS patching for Windows and Linux environments.
- Monitor system performance, capacity, and health status via LogicMonitor.
- Troubleshoot server, storage, and virtualization issues.
- Manage NetApp storage volumes, LUNs, and snapshots.
- Implement security hardening and compliance checks.
- Execute backup, restore, and disaster recovery tests.
- Maintain documentation of configurations and changes.
- Track and prioritize infrastructure work items in Jira, aligned with Agile sprints.
- Collaborate with cross-functional teams for infrastructure requests and projects.