Role Description
Role Proficiency:Act under guidance of Lead II/Architect understands customer requirements and translate them into design of new DevOps (CI/CD) components. Capable of managing at least 1 Agile Team
Outcomes:
- Interprets the DevOps Tool/feature/component design to develop/support the same in accordance with specifications
- Adapts existing DevOps solutions and creates own DevOps solutions for new contexts
- Codes debugs tests documents and communicates DevOps development stages/status of DevOps develop/support issues
- Select appropriate technical options for development such as reusing improving or reconfiguration of existing components
- Optimises efficiency cost and quality of DevOps process tools and technology development
- Validates results with user representatives; integrates and commissions the overall solution
- Helps Engineers troubleshoot issues that are novel/complex and are not covered by SOPs
- Design install configure troubleshoot CI/CD pipelines and software
- Able to automate infrastructure provisioning on cloud/in-premises with the guidance of architects
- Provides guidance to DevOps Engineers so that they can support existing components
- Work with diverse teams with Agile methodologies
- Facilitate saving measures through automation
- Mentors A1 and A2 resources
- Involved in the Code Review of the team
Measures Of Outcomes:
- Quality of deliverables
- Error rate/completion rate at various stages of SDLC/PDLC
- # of components/reused
- # of domain/technology certification/ product certification obtained
- SLA for onboarding and supporting users and tickets
Outputs Expected:
Automated components :
- Deliver components that automat parts to install components/configure of software/tools in on premises and on cloud
- Deliver components that automate parts of the build/deploy for applications
Configured Components:
- Configure a CI/CD pipeline that can be used by application development/support teams
Scripts:
- Develop/Support scripts (like Powershell/Shell/Python scripts) that automate installation/configuration/build/deployment tasks
Onboard Users:
- Onboard and extend existing tools to new app dev/support teams
Mentoring:
- Mentor and provide guidance to peers
Stakeholder Management:
- Guide the team in preparing status updates keeping management updated about the status
Training/SOPs :
- Create Training plans/SOPs to help DevOps Engineers with DevOps activities and in onboarding users
Measure Process Efficiency/Effectiveness:
- Measure and pay attention to efficiency/effectiveness of current process and make changes to make them more efficiently and effectively
Stakeholder Management:
- Share the status report with higher stakeholder
Skill Examples:
- Experience in the design installation configuration and troubleshooting of CI/CD pipelines and software using Jenkins/Bamboo/Ansible/Puppet /Chef/PowerShell /Docker/Kubernetes
- Experience in Integrating with code quality/test analysis tools like Sonarqube/Cobertura/Clover
- Experience in Integrating build/deploy pipelines with test automation tools like Selenium/Junit/NUnit
- Experience in Scripting skills (Python/Linux/Shell/Perl/Groovy/PowerShell)
- Experience in Infrastructure automation skill (ansible/puppet/Chef/Powershell)
- Experience in repository Management/Migration Automation – GIT/BitBucket/GitHub/Clearcase
- Experience in build automation scripts – Maven/Ant
- Experience in Artefact repository management – Nexus/Artifactory
- Experience in Dashboard Management & Automation- ELK/Splunk
- Experience in configuration of cloud infrastructure (AWS/Azure/Google)
- Experience in Migration of applications from on-premises to cloud infrastructures
- Experience in Working on Azure DevOps/ARM (Azure Resource Manager)/DSC (Desired State Configuration)/Strong debugging skill in C#/C Sharp and Dotnet
- Setting and Managing Jira projects and Git/Bitbucket repositories
- Skilled in containerization tools like Docker/Kubernetes
Knowledge Examples:
- Knowledge of Installation/Config/Build/Deploy processes and tools
- Knowledge of IAAS - Cloud providers (AWS/Azure/Google etc.) and their tool sets
- Knowledge of the application development lifecycle
- Knowledge of Quality Assurance processes
- Knowledge of Quality Automation processes and tools
- Knowledge of multiple tool stacks not just one
- Knowledge of Build Branching/Merging
- Knowledge about containerization
- Knowledge on security policies and tools
- Knowledge of Agile methodologies
Additional Comments:
Automation Engineer Relevant Experience: 3 to 5 years of hands-on experience with Kubernetes and cloud-native automation, focusing on eliminating repetitive tasks through scripting, IaC, and self-healing mechanisms. Job Summary: The Automation Engineer will play a critical role in reducing operational toil within Kubernetes-based environments by designing, developing, and implementing automation solutions that streamline repetitive tasks and improve system reliability. This role involves close collaboration with SRE and platform engineering teams to build self-healing mechanisms, enhance observability, and integrate automation into CI/CD pipelines, ensuring faster, more resilient deployments and minimal manual intervention. Key Responsibilities:
- Toil Reduction & Automation
- Identify repetitive, manual operational tasks and design automation solutions to eliminate them.
- Develop scripts, tools, and pipelines to automate deployments, scaling, monitoring, and incident response.
- Kubernetes & Cloud Operations
- Manage and optimize Kubernetes clusters across multiple environments (dev, staging, production).
- Implement automated cluster lifecycle management (provisioning, upgrades, scaling).
- Reliability & Observability
- Build self-healing mechanisms for common failure scenarios.
- Enhance observability by automating metrics, logging, and ing integrations.
- CI/CD & Infrastructure as Code
- Implement and maintain CI/CD pipelines for application and infrastructure deployments.
- Use Infrastructure as Code (IaC) tools for consistent environment management.
- Collaboration & Best Practices
- Work closely with SREs, developers, and platform teams to improve reliability and reduce MTTR.
- Advocate for automation-first culture and SRE principles across teams. Required skills
- Automation & Scripting: Proficiency in Python or Bash for automation tasks.
- Kubernetes Expertise: Hands-on experience with Kubernetes (deployment, scaling, troubleshooting).; CKA/CKAD certification preferred
- Cloud Platforms: Experience with AWS
- CI/CD Tools: Jenkins, GitLab CI, or similar.
- IaC Tools: Terraform.
- Observability: Familiarity with Splunk.
- Version Control: Strong Git skills and experience with GitOps workflows.
- Problem-Solving: Ability to analyze operational pain points and design automation solutions.
Skills
Kubernetes,Cloud Platform,Python Scripting,Sre