This role will involve managing and scaling Docker and Kubernetes infrastructure, designing and implementing cloud architectures, and leading containerization and infrastructure automation across various projects. You will work with a broader set of DevOps and CNCF tools, applying deep expertise in CI/CD, security, and infrastructure-as-code to support high-availability applications across diverse cloud environments.
What Youll Do:
- Design, implement, and manage Kubernetes clusters (EKS) across AWS environments, maintaining secure, scalable, and resilient solutions.
- Lead the development and automation of CI/CD pipelines using tools such as ArgoCD, Cilium, TeamCity, CodeBuild, CodeDeploy, and CodePipeline to streamline application deployment and configuration.
- Expertly manage cloud resources using Terraform, and develop reusable, version-controlled IaC modules, promoting modular, scalable infrastructure deployment.
- Strong understanding and experience with Helm charts, and CNCF applications such as Cilium, Karpenter, and Prometheus.
- Configure and optimize Kubernetes clusters, ensuring compliance with container security standards.
- Oversee Docker image creation, tagging, and management, including maintaining secure, efficient Docker repositories (ECR, JFrog).
- Utilize monitoring tools (Prometheus, Grafana, Splunk, Cloudwatch Container Insight) to ensure system performance, detect issues, and proactively address performance concerns.
- Act as an escalation point for critical issues, conduct root cause analyses and maintain SOPs and documentation for efficient incident response and knowledge sharing.
- Provide training, mentorship, and technical guidance to junior team members, fostering a culture of continuous learning within the CCoE team.
What Youll Bring:
- BE/B.Tech or higher in CS, IT or EE
- Hand-on experience of 7-8yrs in delivering container-based deployments using Docker and Kubernetes
- Good exposure in writing Helm Charts and configuring Kubernetes CI/CD pipelines
- Experience in writing manifest files for Deployment, Service, Pod, Daemon Sets, Persistent Volume (PV), Persistent Volume Claim (PVC), Storage, Namespaces
- Hand-on experience in delivering container-based deployments using Docker and Kubernetes
- Experience in Cloud, DevOps, and Linux and Experience in DevOps Tools like Git, Helm, Terraform, Docker, and Kubernetes
- Strong hands-on experience in Python, Yaml, or similar languages.
- Build and deploy Docker containers to break up monolithic apps into microservices, improving developer workflow
- Strong understanding of container security and relevant tool experience like Sysdig, CrowdStrike, etc
- Strong knowledge of container performance monitoring and scaling policies
- Deep Linux knowledge with an understanding of the container ecosystem
- Good Experience to develop images using Docker container Technology
- Should have good communication skills and a can-do attitude.