Job Description
As part of the MQ Operations team, you will play a critical role in ensuring that every product release meets the high standards of quality our customers expect. In this role, you will maintain and optimize the performance of a diverse range of machines and platforms within our testing infrastructure. You’ll be responsible for ensuring the reliability, efficiency, and security of these systems, working closely with internal stakeholders to align testing with real-world needs.

This is an office-based role, offering the opportunity to collaborate directly with experienced team members who will support your integration into the team, department, and wider IBM community. You’ll thrive in a fast-paced, global environment, demonstrating strong personal organization, adaptability, and a proactive mindset. We’re looking for someone who communicates clearly and solves problems creatively.

Key Responsibilities
Infrastructure Operations:
Maintain and optimize the performance of multiple machines and platforms used in testing environments.
Monitoring & Reporting:
Implement tools and dashboards to monitor system health and performance metrics.
Process Improvement:
Identify inefficiencies and contribute to automation and process enhancements to improve productivity and reduce downtime.
Stakeholder Collaboration:
Work with internal teams to ensure testing infrastructure meets evolving requirements.

A successful individual for this role should possess:
Several years of experience in one or more programming languages or automation tools (e.g., Ansible, JavaScript, Node.js, Perl, Python).
Strong proficiency in machine administration and in developing tools and automations for system maintenance.
Familiarity with Kubernetes/OpenShift and container orchestration.
Strong analytical and problem-solving expertise, with the ability to investigate issues and apply fixes throughout the deployment landscape.
A good understanding of development tools such as Git, VS Code, and make.
Ability to take ownership of tasks, proactively driving them to completion.
Automation skills in testing, scripting (e.g., Bash), pipelines, and utilities.
Experience in monitoring, alerting, and dashboarding.

Required education
Bachelor's Degree

Preferred education
Master's Degree

Required technical and professional expertise
6-7+ years of experience in daily management and maintenance of infrastructure.
6-7+ years of experience in one or more programming languages or automation tools such as Ansible, JavaScript, Node.js, Perl, or Python.
Strong proficiency in infrastructure maintenance, such as performing system upgrades, applying software patches, managing user accounts and permissions, and ensuring infrastructure remains compliant with company policies and security standards.
Strong proficiency in automation and scripting for infrastructure maintenance.
Strong expertise in maintaining Windows servers and various flavours of Linux/Unix servers.
Excellent problem-solving, communication, and organizational skills.
Proven ability to learn quickly and adapt to new technologies.
Ability to take ownership of tasks, proactively driving them to completion.

Preferred technical and professional experience
Proficiency in scripting and automation tools (e.g., PowerShell, Bash).
Familiarity with Agile development methodologies and tools.
Knowledge of Terraform to provision and manage infrastructure.
Experience with IBM MQ.