Cloud DevOps - Technical Lead / Technical Architect
We are looking for a highly self-motivated individual with Cloud DevOps as a Technical Lead / Technical Architect.
Required Skills & Experience:
-
10–15 years of strong experience in DevOps, cloud infrastructure, security, and automation roles.
-
Expertise in one or more cloud platforms (AWS, GCP, Azure) with deep hands-on experience.
-
In-depth understanding of IAM systems, VPC architecture, network segmentation, peering, firewalls, load balancers, VPNs, and DNS.
-
Strong programming/scripting skills (Python, Bash, Go, or similar).
-
Proven experience leading infrastructure modernization initiatives and platform migration efforts.
-
Practical experience with containers (Docker, Podman), orchestration (Kubernetes, ECS, AKS, EKS).
-
Strong knowledge of CI/CD tools (e.g., GitHub Actions, GitLab, Jenkins, ArgoCD, Spinnaker).
-
Clear understanding of infrastructure cost models and experience optimizing cloud usage and spend.
-
Experience in defining and improving CI/CD pipelines.
-
Knowledge of Python scripting and automation.
-
Understanding of cloud identity and access management.
-
Familiarity with compute runtimes, including native compute, virtual machines, and containers.
-
Configuration and management of databases such as Oracle, Cloud SQL, and Cloud Spanner.
-
Proficiency in troubleshooting and debugging complex issues.
-
Design, Architecture and Implementation experience of dynamic scaling strategies for containerized fullstack native/AI applications.
-
Knowledge on MLOps and LLMOps.
Preferred Qualifications:
-
Cloud certifications: AWS Solutions Architect Professional, GCP Professional DevOps Engineer, Azure Expert Engineer, etc.
-
Familiarity with Zero Trust Network Access (ZTNA) principles and modern identity governance.
-
Experience with service mesh architectures (Istio, Linkerd), API gateways, and internal developer platforms (IDPs).
-
Exposure to SRE practices (SLIs, SLOs, error budgets) and modern reliability engineering.
-
Understanding on the SDLC.
-
Understanding on the Agile methodologies.
-
Communication with customer and producing the Daily status report.
-
Should have good oral and written communication.
-
Should be a good team player.
-
Should be proactive and adaptive.
Key Responsibilities:
Technical Architecture & Engineering Excellence:
-
Architect and manage multi-cloud infrastructure (preferably AWS, GCP, or Azure) with focus on availability, scalability, and security.
-
Design and enforce infrastructure-as-code (IaC) standards using Terraform, CloudFormation, or Pulumi across environments.
-
Lead implementation of DevSecOps best practices including vulnerability scanning, secrets management, policy-as-code (OPA/Conftest), and secure CI/CD flows.
-
Design and optimize VPC, Subnet, Routing, VPN, Peering, and hybrid network topologies ensuring zero-trust principles.
-
Manage IAM roles, policies, service accounts, federated access and enforce least-privilege models using automation.
-
Enable infrastructure observability using tools such as Prometheus, Grafana, ELK, Datadog, or Cloud-native monitoring stacks.
-
Lead Kubernetes and container strategy including security hardening, deployment automation, cost optimization, and multi-cluster governance.
Leadership & Team Building:
-
Lead and mentor a team of DevOps engineers, infrastructure specialists, and cloud platform engineers.
-
Drive technical reviews, root cause analysis (RCA), and retrospectives — ensuring continuous improvement in operations.
-
Define DevOps standards, enforce code quality, and set clear KPIs for infrastructure performance and uptime.
-
Collaborate closely with developers, architects, product managers, and security teams to drive platform scalability and reliability.
-
Communicate effectively with senior management and stakeholders — presenting technical roadmaps, risks, and progress.
Security, Compliance, and Governance:
-
Implement secure infrastructure aligned with ISO 27001, SOC2, NIST, or industry-specific standards.
-
Automate governance using tools like Sentinel, OPA/Gatekeeper, and integrate security into the CI/CD lifecycle.
-
Lead incident response processes for infrastructure-level security or availability incidents.