Posted:1 day ago|
Platform:
Work from Office
Full Time
We are seeking a motivated and detail-oriented 5+ exp SRE TechOps Engineer to join our growing operations team. In this role, you will be the first line of defence, responsible for monitoring our production environments, responding to alerts, and performing initial troubleshooting of our cloud infrastructure. The ideal candidate will have a strong foundation in cloud technologies, a passion for automation, and expert-level knowledge of the ELK stack for monitoring and analysis. This is a fantastic opportunity to grow your skills in a modern, cloud-native environment.
System Monitoring: Proactively monitor the health and performance of our applications and infrastructure using Azure Monitor and the ELK stack. Incident Response: Serve as the initial responder for all production alerts, following established runbooks and escalation procedures. Triage and Troubleshooting: Perform initial investigation and triage of incidents, gathering logs and data to identify the root cause. Issue Escalation: Escalate unresolved and complex issues to the L1/L2 engineering teams, providing detailed ticket information in Jira. Automation: Assist in the maintenance and improvement of our CI/CD pipelines using GitHub Actions. Infrastructure Support: Provide basic support for our Kubernetes and Terraform-managed infrastructure. Documentation: Contribute to the creation and maintenance of runbooks and other operational documentation. Collaboration: Work closely with development and other operations teams to ensure the stability and reliability of our services.
Required Qualifications:
ELK Stack Expertise: Proven, expert-level experience with the ELK (Elasticsearch, Logstash, Kibana) stack, including creating dashboards, setting up alerts, and writing complex queries for log analysis and troubleshooting. Cloud Experience: Hands-on experience with Microsoft Azure, including Azure Monitor, virtual machines, and networking basics. CI/CD Familiarity: Understanding of Continuous Integration and Continuous Deployment (CI/CD) principles, with some experience using tools like GitHub Actions. Containerization and IaC: Good knowledge of Kubernetes and Infrastructure as Code (IaC) concepts, preferably with some exposure to Terraform. Ticketing Systems: Proficiency in using Jira for incident tracking and management. Problem-Solving Skills: Strong analytical and troubleshooting skills with the ability to remain calm and effective under pressure. Communication: Excellent verbal and written communication skills.
Insightek Global Consulting
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
trivandrum, kerala, india
Salary: Not disclosed
hyderābād
5.3 - 9.3 Lacs P.A.
hyderābād
5.475 - 10.0 Lacs P.A.
mumbai, indore, pune
14.0 - 24.0 Lacs P.A.
bengaluru
25.0 - 35.0 Lacs P.A.
5.0 - 12.0 Lacs P.A.
bengaluru
25.0 - 35.0 Lacs P.A.
chennai, tamil nadu, india
Experience: Not specified
Salary: Not disclosed
india
Salary: Not disclosed
hyderabad, telangana, india
Salary: Not disclosed