Jobs
Interviews
1 Job openings at SBS-Global - Enabling Business Transformation
Infrastructure Support Engineer

maharashtra

9 - 13 years

INR 0.00016 - 0.00023 Lacs P.A.

On-site

Full Time

You will play a crucial role in overseeing the operations of shipped products and services, adhering to the agreed-upon Eyes on glass/Follow the sun engagement models. This involves closely monitoring product/service operations against key performance indicators established by the business and promptly taking necessary actions in response to any identified deviations. Furthermore, you will collaborate with the Service Reliability Engineering (SRE) team and client stakeholders to define and document appropriate responses to various incident scenarios, creating detailed runbooks for reference. To streamline day-to-day operations and enhance the team's overall efficiency, you will focus on automating operations using cutting-edge technology stacks tailored to the task at hand. As the primary responder to incidents in production or other high-value environments, you will execute predefined responses outlined in runbooks or based on your expert judgment of the situation. Your responsibilities will also involve initiating communication with support teams across all service functions, coordinating incident response activities, and working closely with tech leads, SRE leads, and development teams to resolve issues effectively. In addition to handling immediate incident responses, you will be tasked with preparing thorough incident root cause analysis (RCA) and postmortem reports. These reports will not only explain the analyses conducted but also outline preventive measures to mitigate similar incidents in the future. By collaborating with SRE, development teams, or working independently, you will ensure clear communication and proactive steps are taken to prevent future incidents, all while driving service/product reliability enhancements through infrastructure and observability configuration code. Qualifications: - 9-12 years of relevant experience - Compensation Package: 10lpa - 25lpa Technical Skills: - Proficiency in CI/CD tools like Jenkins, CircleCI, or Gitlab for deployment execution - Knowledge of Infrastructure as Code (IAC) tech stacks such as Terraform, Ansible, ARM, or Cloudformation for infrastructure provisioning and management - Experience with observability tools for logging, monitoring, tracing, and alerting (e.g., Datadog, Prometheus, Grafana, ELK, EFK, Splunk) - Hands-on experience supporting at least one public cloud platform (AWS, Azure, GCP) - Familiarity with container ecosystem tech stacks for workload management (e.g., Docker, Kubernetes, Openshift) - Understanding of system performance tuning, scaling, highly available systems, disaster recovery solutions, and common networking setup and security practices - Proficiency in operating Linux OS, managing backend storage solutions (SQL, NoSQL databases), caching solutions, and networking configuration and security Professional Skills: - Strong communication and articulation skills, proficient in English - Ability to collaborate effectively with cross-functional teams - Capacity to work under pressure during production incidents with composure - Strong analytical, deductive, and reasoning skills - Drive and ownership to deliver work efficiently without being constrained by role boundaries - Availability for rotation- and need-based 24x7 team participation,

cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Job Titles Overview