Home
Jobs

Site Reliability Engineer

4 - 6 years

6 - 10 Lacs

Posted:3 months ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

You demonstrate passion for achieving the highest level of uptime, emphasizing scalability and high-performance. You have the zeal to enhance our systems observability ensuring that we have the necessary insights and tools to monitor, troubleshoot, and optimize our applications and infrastructure. - Expertise in debugging and root causing issues with an instinct to automate repetitive tasks. - Enhance System Observability: You will be implementing and maintaining robust observability solutions which provides real-time insights into the performance and health of our systems to proactively identify and address potential issues before they impact the users. - Troubleshooting and Root Cause Analysis: Utilize your expertise to investigate and resolve incidents quickly during crisis situations, performing root cause analysis to prevent recurrence - Automation: Leverage your coding skills to create tools and automating runbooks to improve efficiency. - Documentation: Documenting and managing Runbooks and best practices to ensure knowledge sharing and team efficiency. - Communication: Strong interpersonal skills and ability to work effectively across multiple business and technical teams. Minimum Qualifications At least 4 years of prior demonstrated experience in Site Reliability Engineering, or an Infrastructure-focused role. Proficient in at-least one programming or scripting languages like Java, Python, Golang, Perl, Ruby, , etc., for developing tools in Observability, ETL, etc. First hand experience in debugging, performance tuning and root cause analysis of Applications. Support of internet-facing production services and distributed systems via deployments, onCall and Incident Management. Proficiency in implementing and coordinating telemetry using monitoring and observability tools like Splunk, Grafana, and Prometheus, or similar. Experience in solving and resolving issues in Kubernetes from both an operating system and application perspective. Strong understanding of database principles and working knowledge in distributed storage and infrastructural solutions such as Oracle, Cassandra, SOLR, and Kafka Good command on Linux, Networking concepts (TLS/SSL, DNS, Load Balancers, etc.,) and troubleshooting skills in large scale environments Deep understanding of basic security concepts and protocols - authentication, authorization, signing, encryption, SSL/TLS, SSH/SFTP, PKI, X509 certificates and PGP. Excellent knowledge of ITIL terminology for incident and problem management Bachelor of Science in Computer Science or other related discipline. Preferred Qualifications Building and operating container orchestrating systems like Kubernetes or EKS. Experience with container management and micro-services architectures such as Docker in cloud and on-premises infrastructure. Hands-on experience in Java programming. Good experience in performance tuning databases. Understanding of Git, CI/CD, Release Engineering and DevOps. Strong Load Balancing (Nginx, Envoy, NetScaler) experience is a huge plus. Familiarity with Modern web services architectures, cloud platforms such as AWS, GCP, Azure and distributed storage systems (ScaleIO, Amazon S3). Understanding of security standards, policies, and cryptography. Track record of excellent interpersonal, analytical, and communication skills.

Mock Interview

Practice Video Interview with JobPe AI

Start Git Interview Now

My Connections Apple

Download Chrome Extension (See your connection in the Apple )

chrome image
Download Now
Apple
Apple

Computers and Electronics Manufacturing

Cupertino California

10001 Employees

238 Jobs

    Key People

  • Tim Cook

    CEO
  • Luca Maestri

    CFO

RecommendedJobs for You

Hyderabad, Chennai, Bengaluru

Bengaluru / Bangalore, Karnataka, India

Noida, Uttar Pradesh, India

Noida, Uttar Pradesh, India

Noida, Uttar Pradesh, India