Sr. Engineer - Reliability

9 - 14 years

40 - 65 Lacs

Posted:3 weeks ago| Platform: Naukri logo

Apply

Work Mode

Hybrid

Job Type

Full Time

Job Description

CrowdStrike is looking to hire a Senior Engineer to the TechOps SRE team. Were looking for a deeply-technical, hands-on engineer, who loves to develop automation and tooling through software to ensure delivery of mission critical solutions and services for large-scale distributed systems.

What You’ll Do:

  • Expertise with Linux engineering and administration for thousands of bare metal servers and virtual machines
  • Responsible for troubleshooting server hardware issues
  • Responsible for all operational aspects of our platform - Availability, Latency, Throughput, Monitoring, Issue Response (analysis, remediation, deployment) and Capacity Planning with respect to Latency and Throughput
  • Work in a team of highly motivated engineers distributed across the globe
  • Use your passion for technology, automation, and tooling to ensure our platform operates 24x7
  • Obsess about learning, and champion the newest technologies & tricks with others, raising the technical IQ of the team. We don’t expect you to know all the technology we use but you will be able to get up to speed on new technology quickly
  • Have broad exposure to our entire architecture and become one of our experts in our overall process flow
  • Have an intrinsic drive to make things better
  • Bias towards small/medium development projects and the occasional larger projects
  • Have experience with modern monitoring and telemetry stacks (ELK, Prometheus, Grafana)
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning
  • Ability to lead incident analysis for incidents, champion incident response practices and assist in correlating incidents to systemic problems, and drive towards resolution.

What You’ll Need:

  • Bachelors degree and/or equivalent experience in Computer Science
  • 8+ years of experience in software engineering
  • 8+ years of experience in one or more of: C++, Java, Python, Go
  • Experience with storage technologies (Examples: SAN, NAS, NFS, Object Storage, FreeNAS, iSCSI)
  • Experience with Infrastructure technologies (Examples: Linux, Windows, VMware, Docker, Kubernetes, etc.)
  • Experience writing technical documentation
  • Configuration management experience with one or more tools such as Puppet, Chef, Ansible
  • Solid understanding of application design, including operational trade-offs of various designs
  • Analytical skills coupled with a strong sense of urgency, ownership, and drive
  • Ability to work with well in a diverse, team-focused environment with other SREs and Engineers
  • Ability to broadly communicate and present recommended conventions defined by the reliability team broadly

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Crowdstrike logo
Crowdstrike

Computer and Network Security

Remote

RecommendedJobs for You

mumbai suburban, navi mumbai, mumbai (all areas)