Posted:2 days ago|
Platform:
On-site
Part Time
Role Proficiency:
Act under guidance of DevOps; leading more than 1 Agile team.
Outcomes:
Measures of Outcomes:
Outputs Expected:
Automated components :
Configured components:
Scripts:
Training/SOPs :
Measure Process Efficiency/Effectiveness:
Operations:
Skill Examples:
Knowledge Examples:
• 5+ years of experience as an SRE, DevOps Engineer, or similar role. • Proficiency in scripting and automation (Bash, Python, Go, etc.). • Strong experience with containerization and orchestration (Docker, Kubernetes, Helm). • Solid understanding of Linux systems administration and networking fundamentals. • Experience with cloud platforms (AWS, Azure, or GCP). • Experience with IaC tools like Terraform or CloudFormation. • Familiarity with GitOps and modern deployment practices. • Hands-on experience with observability tools (e.g., Prometheus, Grafana, Datadog). • Strong troubleshooting and incident response skills. Preferred: • Experience in a high-traffic, microservices-based architecture. • Exposure to service meshes (Istio, Linkerd). • Certifications (AWS Certified DevOps Engineer, CKA, etc.) • Experience with security automation and compliance (e.g., SOC2, ISO27001). Soft Skills: • Strong communication and collaboration abilities. • Ability to thrive in a fast-paced, agile environment. • Analytical mindset and proactive approach to problem-solving. • A passion for automation, performance, and system design. Design, build, and maintain reliable, scalable, and secure cloud-based infrastructure (AWS, Azure, or GCP). • Develop and improve observability using monitoring, ing, logging, and tracing tools (e.g., Prometheus, Grafana, ELK, Datadog, etc.). • Automate repetitive tasks and infrastructure using Infrastructure-as-Code (Terraform, CloudFormation, Pulumi). • Create and maintain CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, ArgoCD, etc.) to support fast and safe delivery. • Lead incident response, root cause analysis, and postmortems to ensure high uptime and rapid recovery. • Optimize system performance, reliability, and cost-effectiveness through proactive monitoring and tuning. • Collaborate with software engineering teams to define SLAs/SLOs and improve service reliability. • Implement and maintain security best practices across environments (e.g., secrets management, IAM, firewalls, etc.). • Maintain disaster recovery plans, backups, and high-availability strategies.
Kubernetes,Cloud Platform,Python Scripting,Sre
UST Global
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python Nowthiruvananthapuram
5.5 - 8.0 Lacs P.A.
trivandrum, kerala, india
Salary: Not disclosed
ahmedabad
5.5 - 6.7 Lacs P.A.
ahmedabad, gujarat, india
Salary: Not disclosed
Kochi, Kerala, India
Salary: Not disclosed
karnataka
Salary: Not disclosed
Bengaluru
5.0 - 5.5 Lacs P.A.
Bengaluru
5.1 - 8.94 Lacs P.A.
Bengaluru
5.55 - 8.94 Lacs P.A.
Trivandrum, Kerala, India
Salary: Not disclosed