Introduction
At IBM Software, we transform client challenges into solutions. Building the world’s leading AI-powered, cloud-native products that shape the future of business and society. Our legacy of innovation creates endless opportunities for IBMers to learn, grow, and make an impact on a global scale. Working in Software means joining a team fueled by curiosity and collaboration. You’ll work with diverse technologies, partners, and industries to design, develop, and deliver solutions that power digital transformation. With a culture that values innovation, growth, and continuous learning, IBM Software places you at the heart of IBM’s product and technology landscape. Here, you’ll have the tools and opportunities to advance your career while creating software that changes the world.Key Responsibilities
Your role and responsibilities
Infrastructure Automation
Design, implement, and maintain Infrastructure as Code (IaC) solutions using tools such as Terraform to automate Infra provisioning and management.CI/CD, Deployment & OrchestrationBuild & Manage CI/CD framework, platforms & pipelines (Jenkins, ArgoCD) for infrastructure and applications deployments, ensuring seamless and safe delivery processes across environments.Troubleshooting infrastructure level and product deployment level issuesDeploy, manage, and monitor containerized applications
using Docker
and Kubernetes
(preferably OpenShift).Ensure system reliability, scalability, and performance across distributed environments.Hybrid Cloud Management across hybrid cloud (e.g., AWS, Azure etc) ensuring consistency, scalability, and security.Monitoring & ObservabilityDevelop automation for monitoring setup, alerting, and self-healing/fault response, integrating with tools like Prometheus, Grafana etc.Documentation & Best PracticesCreate and maintain comprehensive documentation for automation workflows, infrastructure patterns, and runbooks; contribute to developing Cloud engineering best practices.Preferred Education
Bachelor's DegreeRequired Technical And Professional Expertise
6+ years of experience in infrastructure software engineering, systems engineering, or Reliability Engineering roles.Infrastructure as Code: Strong hands-on experience in Terraform, Pulumi, CRD etc.Proficiency in Coding languages (Go or Python).Experience with container management and orchestration (Docker, Kubernetes, Helm).Working knowledge of CI/CD pipelines, version control (Git), and related tooling.Observability as Code: Good knowledge with monitoring, logging, and alerting stacks.Solid understanding of networking, security, and troubleshooting.Strong troubleshooting skills and a passion for automation and process improvement.Excellent communication skills and ability to work in a collaborative team environment.Exposure to hybrid/multi-cloud architectures.Background in performance tuning, cost optimisation, or incident response automationPreferred Technical And Professional Experience
Knowledge of Python for scripting and automation. Experience working with public cloud platforms (IBM Cloud, AWS, Azure, or GCP). Experience with Ansible or other configuration management tools.