Key Responsibilities
- Design, build, and manage scalable cloud infrastructure on AWS (EC2, Lambda, S3, Route 53, Kinesis, SQS, SNS, IAM, VPC, NAT Gateways, Subnets, IP Routing).
- Provision and manage Kubernetes (EKS) clusters and containerized applications with Docker, and support deployments at scale.
- Build, optimize, and maintain CI/CD pipelines using Jenkins and GitHub Actions.
- Implement and manage Infrastructure as Code (IaC) using Terraform and CloudFormation.
- Design and implement Disaster Recovery (DR) strategies and highly available systems in AWS.
- Implement monitoring, logging, and alerting solutions with a strong focus on Grafana for observability, enabling effective troubleshooting and debugging.
- Collaborate with development teams to debug REST applications, troubleshoot complex issues, and improve reliability.
- Ensure application and infrastructure security following best practices.
- Work with databases such as PostgreSQL and Redshift, and manage RDBMS in cloud-native environments.
- Configure and maintain networking and security components including VPNs, DNS, load balancing, and firewalls.
- Support Agile/Scrum teams, participate in sprint planning, and collaborate across diverse cross-functional teams.
- Provide on-call support as part of a 24x7 DevOps rotation for critical production systems.
- Maintain and support tools within the Atlassian Suite (Jira, Confluence, Bitbucket).
Required Skills & Experience
- 7-10 years of proven DevOps or Cloud Engineering experience.
- Expertise in AWS provisioning and cloud-native architecture.
- Strong hands-on experience with Kubernetes (EKS) and containerization using Docker.
- Proficiency in Terraform, CloudFormation, and Infrastructure as Code.
- Solid experience in CI/CD pipelines with Jenkins and GitHub Actions.
- Strong working knowledge of Grafana for logging, monitoring, troubleshooting, and debugging.
- In-depth knowledge of application security and secure system design.
- Strong understanding of networking fundamentals: HTTP/HTTPS, SSL, SFTP, FTP, VPN, DNS.
- Proficiency with PostgreSQL and RDBMS administration.
- Knowledge of scripting languages such as Python for automation.
- Exceptional troubleshooting and debugging skills.
- Familiarity with Nginx (nice to have).
Educational Qualification
- Experience working in cross-functional teams and collaborating effectively with different stakeholders.
- Strong problem-solving and analytical skills.
- Excellent communication skills to document and present technical concepts clearly.
- Bachelor's or master's degree in computer science or a related field.