We're Hiring: Kubernetes SME Location: Hybrid (with flexibility to commute to Chennai if required) Duration: 6+ months contract (extendable based on performance) Senior Cloud-Native & AIOps Specialist – Manage Kubernetes across on-prem, Azure & AWS, implement AIOps automation for monitoring & issue resolution. Strong coding & cloud-native expertise required. Key Responsibilities:- Kubernetes Deployment & Operations: - Architect and deploy production-grade Kubernetes clusters on on-premise datacenters, AKS, and EKS. - Configure multi-cluster and hybrid environments with secure interconnectivity. - Implement namespace, resource, and workload management (Deployments, StatefulSets, DaemonSets). - Manage networking, ingress controllers, and service mesh (Istio/Linkerd).- Integrate persistent storage and backup/restore strategies. - Implement scaling strategies (HPA, VPA, cluster autoscaler) and performance tuning. AIOps & Observability: - Implement AIOps platforms (Moogsoft, BigPanda, Dynatrace, Datadog) integrated with Kubernetes environments. - Automate incident detection, root cause analysis, and remediation workflows. - Correlate metrics, logs, and traces from multiple sources for proactive issue prevention. - Build real-time dashboards and predictive analytics for IT operations. - Drive self-healing infrastructure initiatives. Automation & Infrastructure as Code: - Use Terraform/Pulumi for provisioning Kubernetes clusters and cloud/on-prem resources. - Deploy applications via Helm and GitOps workflows (ArgoCD, FluxCD). - Automate operational runbooks using Python, Go, and Bash scripting. Security & Compliance: - Apply RBAC, Pod Security Policies, and secrets management. - Perform image scanning (Trivy, Aqua, Prisma) and enforce policies (OPA/Gatekeeper, Kyverno). - Ensure compliance with CIS Benchmarks, NIST, and industry regulations. Required Skills: 1. Core Kubernetes Skills - Kubernetes cluster setup (on-prem, AKS, EKS) - Networking & ingress configuration - Persistent storage management - Service mesh implementation - Scaling & performance optimisation 2. Cloud & On-Prem Skills: - Azure & AWS Kubernetes services - Private cloud/on-prem cluster deployments - Hybrid connectivity (VPN, Direct Connect, ExpressRoute) 3. AIOps Skills: - Experience with AIOps tools (Moogsoft, BigPanda, Dynatrace, Datadog) - Predictive maintenance & incident automation 4. Automation Skills: - Terraform, Pulumi, Ansible - GitOps (ArgoCD, FluxCD) - CI/CD pipelines (Azure DevOps, Jenkins, GitHub Actions) 5. Programming & Scripting: Python, Go, Bash Preferred Skills: CKA, CKAD, or Cloud Architect certifications Experience in air-gapped / highly secure environments, Multi-cluster federation management MLOps integration for AIOps use cases Job Type: Contractual / Temporary Contract length: 6 months Work Location: In person