Posted:1 month ago| Platform: SimplyHired logo

Apply

Work Mode

On-site

Job Description

We're Hiring: Kubernetes SME

Location: Hybrid (with flexibility to commute to Chennai if required)

Duration: 6+ months contract (extendable based on performance)

Senior Cloud-Native & AIOps Specialist – Manage Kubernetes across on-prem, Azure & AWS, implement AIOps automation for monitoring & issue resolution. Strong coding & cloud-native expertise required.

Key Responsibilities:-

Kubernetes Deployment & Operations:

- Architect and deploy production-grade Kubernetes clusters on on-premise datacenters, AKS, and EKS.

- Configure multi-cluster and hybrid environments with secure interconnectivity.

- Implement namespace, resource, and workload management (Deployments, StatefulSets, DaemonSets).

- Manage networking, ingress controllers, and service mesh (Istio/Linkerd).- Integrate persistent storage and backup/restore strategies.

- Implement scaling strategies (HPA, VPA, cluster autoscaler) and performance tuning.

AIOps & Observability:

- Implement AIOps platforms (Moogsoft, BigPanda, Dynatrace, Datadog) integrated with Kubernetes environments.

- Automate incident detection, root cause analysis, and remediation workflows.

- Correlate metrics, logs, and traces from multiple sources for proactive issue prevention.

- Build real-time dashboards and predictive analytics for IT operations.

- Drive self-healing infrastructure initiatives.

Automation & Infrastructure as Code:

- Use Terraform/Pulumi for provisioning Kubernetes clusters and cloud/on-prem resources.

- Deploy applications via Helm and GitOps workflows (ArgoCD, FluxCD).

- Automate operational runbooks using Python, Go, and Bash scripting.

Security & Compliance:

- Apply RBAC, Pod Security Policies, and secrets management.

- Perform image scanning (Trivy, Aqua, Prisma) and enforce policies (OPA/Gatekeeper, Kyverno).

- Ensure compliance with CIS Benchmarks, NIST, and industry regulations.

Required Skills:

1. Core Kubernetes Skills

- Kubernetes cluster setup (on-prem, AKS, EKS)

- Networking & ingress configuration

- Persistent storage management

- Service mesh implementation

- Scaling & performance optimisation

2. Cloud & On-Prem Skills:

- Azure & AWS Kubernetes services

- Private cloud/on-prem cluster deployments

- Hybrid connectivity (VPN, Direct Connect, ExpressRoute)

3. AIOps Skills:

- Experience with AIOps tools (Moogsoft, BigPanda, Dynatrace, Datadog)

- Predictive maintenance & incident automation

4. Automation Skills:

- Terraform, Pulumi, Ansible

- GitOps (ArgoCD, FluxCD)

- CI/CD pipelines (Azure DevOps, Jenkins, GitHub Actions)

5. Programming & Scripting: Python, Go, Bash

Preferred Skills:

CKA, CKAD, or Cloud Architect certifications

Experience in air-gapped / highly secure environments,

Multi-cluster federation management

MLOps integration for AIOps use cases

Job Type: Contractual / Temporary
Contract length: 6 months

Work Location: In person

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You