Objectives/Purpose of the Job
- As part of the Technical team, support and manage the capability of the infrastructure systems hosted in the Data Centers to deliver cost-effective services that meet customers’ requirements.
- Incorporate a continuous improvement process into operations through constant monitoring, reporting, evaluation and improvement of operational metrics.
Key Responsibilities & Key Result Areas
- Provide infrastructure systems support to ensure smooth operations that conform to the agreed Service Level Agreement (SLA).
- Work closely with Service Delivery Manager to meet service delivery requirements and support him/her in meetings.
- Manage system changes through the established change request process & provide status reports to the relevant parties.
- Respond promptly to escalated incidents; investigate & provide temporary &/or permanent resolutions. Provide timely status updates to relevant parties.
- Conduct root cause analysis & implement proactive measures. Monitor the effectiveness of implemented measures.
- Monitor & measure the performance & availability of systems proactively; implement corrective actions identified to improve performance & availability.
- Monitor the agreed service level (e.g. service request, system availability), document & maintain the configuration of the systems; provide regular reporting to relevant parties.
- Plan & implement service continuity measures, i.e., backup/restore procedures & disaster recovery plan, to ensure continuous operation of the business.
- Provide systems-related technical advice to customers or the project team.
- Perform and manage routine preventive maintenance and operational activities such as service requests, report generation, incident response and patch management.
- Perform infrastructure monitoring and escalation as per standard operating procedures.
- Ensure the management of infrastructure systems adheres to established ITIL best practices, CIS hardening guidelines and methodologies where applicable.
- Attend to audit RFIs, responses, clarifications and meetings.
Technical Knowledge / Skill Sets / Competencies
- Server/Virtualization (RHEL, AIX and Solaris are
- Server OS: Windows Server 2003, 2008, 2012, 2016, 2019
- Scripting: Windows PowerShell, Unix shell scripting
- Automation skills: Ansible, Puppet etc. (preferred)
- Business Continuity/Disaster Recovery: BC/DR
- Cluster technologies – Microsoft cluster
- System Management Tools – SCCM, SCOM.
- Microsoft Services – AD, DNS, File server, DHCP, IIS etc.
- SSL cert management
- ITSM, such as ServiceNow
- Monitoring tools such as SolarWinds Orion, Nagios
- Networking and related technologies (e.g. load balancing / DNS / SSL / firewalls / NAT)
- Secure File Transfer Protocol (SFTP) Service
- Good documentation write-up skills
- OS hardening: understand and perform compliance checks (Mandatory)
Requirements
- Degree in Information Technology, Electrical / Electronic, Information Systems or equivalent discipline
- 6 to 8 years of relevant experience in Linux server administration
- Good Cyber Security mindset
- Good analytical and problem-solving skills
- Possess initiative, a positive working attitude and a customer-service orientation
- Independent, resourceful and goal-oriented
- Strong teamwork, communication and interpersonal skills
- Manage and mentor a team of 3-5 Level 1 Systems Engineers
- ITIL v4 Foundation Certification (Good to have)
- OS Certification: Microsoft, Hyper-V etc.
- Familiar with ISO 9001 and ISO 27001
- 24x7 on-call standby, including after office hours