Job Description
Your Role and Responsibilities
We’re looking for a Site Reliability Engineer to join us at IBM. Site Reliability Engineers take a different approach when it comes to hosting and managing infrastructure, services, and applications. We approach this as a software problem instead of as an operations problem. Applying software engineering practices to hosting and managing enables our services to better adapt to all types of changes and failure scenarios. To accomplish this, we use the latest techniques, practices, and technology from the industry while hosting large-scale, mission-critical infrastructure. Some of the techniques, practices, and technologies we use are Docker, IaaS, PaaS, ChatOps, Continuous Delivery, Continuous Deployment, DevOps, and Immutable Infrastructure. We strive to keep learning and improving, and we work to share the knowledge we have learned throughout IBM.

Our project is a highly visible connectivity layer between IBM and the Cloud, used by many groups and service providers in IBM to connect their internal and cloud-based assets and users. We are building not just a highly automated build process with regression testing, monitoring, and deployment promotion, but also a service that customers can use within their own DevOps pipeline to deploy their cloud solutions.

This is an important technical role that will require participation in an evolving culture, designed to deliver software solutions from different teams into a continually available environment.
Ultimately, your work will decide if code drops are ready to deploy to production and help ensure that, if a deployment fails, it “fails small and recovers quickly”.

Job Duties:
Design and implement automated solutions for rolling out our teams’ applications from test to stage to production.
Extend the DevOps tools we have with custom-written models, to adapt them to specific team needs.
Drive requirements for our team's code base to make continuous deployment easier, and help implement those requirements.

A day in the life at IBM
Throughout the day, you will collaborate with your teammates and interact with our product owners.
Participate in and/or lead our lunch-and-learn sessions.
Take a break and have fun by participating with other IBMers in collaborative video games.
Take advantage of our exercise room, which includes cardio and weights.
Work in an open environment where creativity is welcome and encouraged.
Stay current with emerging trends in areas related to DevOps and Cloud.
Seek recognition by attaining such awards as “Extremely Smart Person” and “Meritorious Coding before Caffeine.”

Required education
Bachelor's Degree

Preferred education
Master's Degree

Required technical and professional expertise
At least 8 years’ experience in troubleshooting complex systems, including operating systems, networks, and application code.
At least 8 years’ software development experience in at least one modern language such as Python or Java.
At least 5 years’ experience with Automation, DevOps, Continuous Delivery, and Continuous Deployment technologies (e.g., Ansible, Docker, Jenkins, Kubernetes).
Demonstrated experience developing an enterprise system with automated deployment.
Knowledge of UNIX/Linux systems.

Preferred technical and professional experience
BS in computer science or equivalent experience.
Experience with SCM systems like Git.
Strong background in network management and diagnostics.
Basic security knowledge.
Database knowledge including SQL and NoSQL.
10 years’ experience with the Linux/UNIX operating systems.
4 years’ experience with Docker, building and deploying containers.
Working knowledge of and/or experience with Agile methodologies.