Sr. Site Reliability Engineer

4 - 8 years

0 Lacs

Posted:2 days ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As an experienced Systems Administrator with a solid background in Linux, infrastructure management, and incident response, you will be responsible for monitoring, troubleshooting, and ensuring the reliability of systems in virtualized and cloud-based environments. You will collaborate with the operations team to manage escalations and oversee incident management. Additionally, your role will involve implementing strategies to enhance daily operations, focusing on system stability, security, and scalability. Real-time monitoring of system performance and capacity, addressing alerts, and optimizing systems will be crucial aspects of your responsibilities. You will lead troubleshooting efforts, coordinate responses to network and system issues, and oversee the setup and maintenance of server, application, and network equipment. Ensuring effective outage notification and escalation for prompt resolution, mentoring team members on technical skills and troubleshooting methods, and maintaining up-to-date documentation of processes and procedures in the WIKI will also be part of your role. Key Skills: - Minimum 4 years of experience in Linux system administration. Technical Skills: - Datacenter technologies and cloud platforms such as AWS/GCP. - Application deployment using tools like Git and StackStorm. - Strong troubleshooting skills across networks and systems, familiarity with network protocols (TCP/IP, UDP, ICMP), and tools like TCPdump. - Advanced diagnostic skills in network performance and system capacity monitoring. - Proficiency in Linux command-line and system administration. Soft Skills: - Analytical skills with the ability to interpret and act on data. - Effective prioritization and escalation of issues. - Adaptability to shift work and capacity for multitasking in high-pressure scenarios. - Excellent leadership, communication, and interpersonal skills. Qualifications: - Bachelors degree in Computer Science, Engineering (BE/B.Tech), MCA, or M.Sc (IT). Must-Have: - Basic experience with Configuration Management tools like Ansible, SaltStack, or StackStorm. - Basic experience with CI/CD tools such as Jenkins. - Experience with monitoring tools like Nagios, Sensu, or Zabbix. - Basic experience with Log Analytics tools like Splunk, Elasticsearch, Sumo Logic, Prometheus, or Grafana. - Knowledge of Virtualization technologies such as VMware, KVM, or similar. - Strong fundamentals in Linux, troubleshooting, and networking. - Knowledge of Containerization technologies like Kubernetes, Rancher, or similar. Good to Have: - Experience with Cloud Providers like AWS or GCP. - Advanced knowledge of Networking concepts including BGP, F5 Load Balancer, and switching protocols. - Relevant certifications such as RHCSA, CCNA, or equivalent.,

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You