Posted:2 weeks ago|
Platform:
On-site
Full Time
About Us Exotel is one of Asia's largest customer communication platforms. We are on a mission to move enterprise customer communication to the cloud. In 2020, we powered over 4 billion calls and connected over 320 million people. We work with some of the most innovative companies such as Ola, Swiggy, Zerodha, Whitehat Jr, Practo, Flipkart, GoJek, etc. We also power customer communication for some of the top banks in the country. Join us on this journey to improve how companies look at customer communication. Read our growth story here . SRO @ Exot el The SRO (Site Reliability Operations) team manages the setup/expansions of Exotel’s production Infrastructure in managed data centres ( DCs) at multiple locations. The SRO team also makes sure that our DCs are up and running all the ti me.Infrastructure includes Linux services, Linux cloud servers, Linux bare-metal servers, Network devices, internet leased lines, telephone lines, telephony hardware e tc.This team provides 24/7 coverage and support and is responsible for monitoring, reporting, troubleshooting, resolving, and escalating any Production infrastructure-related issues. This includes incidents where a Network infrastructure or a Carrier may experience issues. It also involves identifying, troubleshooting, and resolving issues with systems and applications reported through monitoring systems or trouble ticke ts.We as a team love to increase the efficiency and speed of execution by constantly automating the regular activiti es. What are we looking for? Design & Manage complex & large-scale Data Center infrastructures. (e.g. Servers/Network/Security/vendors/software upgrades, patches, hotfixes ) per business requir ement.Drive automation strategies and deployment processes following SDLC pro cessesAutomate systems administration-related solutions for various project and operational needsMonitor and react to security-related incidents as necessary and involve required stakeholders for short-term and long-term solu tions.Lead & drive root cause analysis efforts across multiple infrastructure layers( OS/ Networ k/App)Provide on-call and out-of-hours support for business-critical ser vices.Troubleshoot issues in detail whenever there is a failure with any component - Server/Monitoring/Service related issues following a solid data-driven approach while arriving at the hypothesis. Drive & implement short-term and long-term solu tions.Administer monitoring services such as Grafana, Nagios, Prometheus and custom-s criptsExplore and implement the latest technologies to improve the stability, security, efficiency, and scalability of the envir onmentDrive initiatives to reduce TAT, and MTTR for existing processes and pra cticesPerform benchmarking exercises for different system comp onentsDrive initiatives to improve the stability, security, efficiency, and scalability of the envir onmentMentor juniors in th e team What will you do? Must-haves [Must Have] 7+ years of strong hands-on working knowledge of RHEL/CentOS/Ubuntu in an enterprise environment & good understanding of the design and configuration of UNIX/Lin ux systems.[Must Have] Hands-on experience with Orchestration/Configuration Management tools (e.g. Ansible, Chef, or Puppet) and CI/CD tools li ke Jenkins.[Must Have] 7+ years of experience in supporting and managing a large number of complex multi-server, multi-vendor, multi-technology infra structures.[Must Have] 7+ years of experience in leading projects from technical design through t o delivery.[Must Have] Hands-on experience with one or more scripting languages (e.g. Ba sh, Python)[Must Have] Strong in Computer Science fundamentals and strong exploratory skills for exploring new-age t echnologies[Must Have] Exposure to a few of the following: Logging (Rsyslog), Monitoring frameworks (Prometheus, Nagios), Linux Security, Databases - MySQL/SQL[Must Have] A "SRE" mindset. You own what you will set up &a mp; manage. Good-to-haves 4+ years of hands-on experience in setting up and managing physical data centr e environmentsHave experience working in AWS services, VPC, EC2, S3, ELB, RDS, IAM, CloudFront Lambda,a nd Cloudwatch. Show more Show less
Exotel
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
My Connections Exotel
Bengaluru, Karnataka, India
Salary: Not disclosed
Bengaluru, Karnataka, India
Salary: Not disclosed