About The Role
We are seeking an accomplished DevOps Lead with 12+ years of experience in cloud infrastructure, automation, blockchain, and CI/CD processes. The DevOps Lead will play a pivotal role in architecting scalable cloud environments, driving automation, ensuring secure deployments, and enabling efficient software delivery pipelines. The role involves working with AWS, Huawei Cloud, Kubernetes, Terraform, blockchain-based infrastructure, and modern DevOps toolchains while providing leadership, technical guidance, and client-facing communication.
Key Responsibilities
Leadership & Team Management
- Lead, mentor, and grow a team of DevOps engineers, setting technical direction and ensuring adherence to best practices.
- Facilitate collaboration across engineering, QA, security, and blockchain development teams.
- Act as the primary technical liaison with clients, managing expectations, requirements, and solution delivery.
Infrastructure Automation & Management
- Architect, implement, and manage infrastructure as code (IaC) using Terraform across multi-cloud environments.
- Standardize environments across AWS, DigitalOcean, and Huawei Cloud with a focus on scalability, reliability, and security.
- Manage provisioning, scaling, monitoring, and cost optimization of infrastructure resources.
CI/CD & Automation
- Build, maintain, and optimize CI/CD pipelines supporting multiple applications and microservices.
- Integrate automated testing, static code analysis, and security scans into the pipelines.
- Implement blue-green and canary deployments to support zero-downtime releases.
- Promote DevSecOps by embedding security policies into every phase of the delivery pipeline.
Containerization & Orchestration
- Deploy, manage, and monitor applications on Kubernetes clusters (EKS, CCE, or equivalent).
- Utilize Helm charts, Kustomize, and operators for environment consistency.
- Optimize container performance and manage networking, storage, and secrets.
Monitoring, Logging & Incident Response
- Implement and manage monitoring and alerting solutions (Prometheus, Grafana, ELK, CloudWatch, Loki).
- Define SLOs, SLIs, and SLAs for production systems.
- Lead incident response and root cause analysis, and implement preventative measures.
Governance, Security & Compliance
- Implement best practices for secrets management, key rotation, and role-based access control.
- Integrate vulnerability scanning and security audits into pipelines.
Required Skills & Qualifications
- 12+ years of experience in DevOps, with at least 5 years in a lead capacity.
- Proven expertise with Terraform and IaC across multiple environments.
- Strong hands-on experience with AWS and Huawei Cloud infrastructure services.
- Deep expertise in Kubernetes cluster administration, scaling, monitoring, and networking.
- Advanced experience designing CI/CD pipelines using Jenkins, GitHub Actions, GitLab CI, or similar.
- Solid background in automated deployments, configuration management (Ansible, Puppet, or Chef), and version control (Git).
- Strong scripting and automation skills (Python, Bash, Go, or similar).
- Proficiency with monitoring/observability tools (Prometheus, Grafana, ELK, CloudWatch, Datadog).
- Strong understanding of blockchain infrastructure, node operations, staking setups, and deployment automation.
- Knowledge of container security, network policies, and zero-trust principles.
- Excellent communication, client handling, and stakeholder management skills with proven ability to present complex DevOps concepts to non-technical audiences.
- Ability to design and maintain highly available, scalable, and fault-tolerant systems in production environments.
Skills: Amazon Web Services (AWS), Huawei Cloud, information security management systems, CI/CD, automation, and Linux/Unix