Site Reliability Engineering Professional

10 - 12 years

10 - 12 Lacs

Posted:2 days ago| Platform: Foundit logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

IBM's product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.

Your Role and Responsibilities

In this Site Reliability Engineer role, you will build and maintain an observability stack for IBM's Cloud Object Storage service using managed services as well as custom built services. This stack is used by Cloud Object Storage SREs and devs to understand the health of the service. Work duties and responsibilities include:

  • Design, setup, configure and implement the COS Monitoring System using technologies such as

    Elasticsearch, Logstash, Kibana, Kafka, Kafka Mirrors, Filebeat, Grafana and Sysdig

    .
  • Automate CICD tasks and infrastructure using

    Ansible, Terraform, Jenkins, and Travis

    .
  • Experience with microservices and distributed application architecture, such as

    containers and Kubernetes

    .
  • Experience with

    Linux administration

    and programming languages such as

    Java, Python and SQL

    .
  • Performance and configuration tuning to support the increasing load of data flowing into the COS Monitoring System.
  • Provide design recommendations and thought leadership to provide best-in-class observability as part of the COS Monitoring System.
  • Provide

    24x7 on-call customer support

    on a rotational basis.
  • Design and develop dashboards for metrics analysis.
  • Design, Develop and Configure an alerting solution for an end-to-end incident management and recovery process by integrating Sysdig with Pagerduty, Email and Slack.

Required Education

  • Bachelor's Degree

Required Technical and Professional Expertise

  • Ability and tenacity to solve increasingly complex technical issues through analysis and a variety of problem-solving techniques.
  • Working knowledge of

    Object-Oriented Python

    with demonstrable experience in applying these skills.
  • Working knowledge of

    Linux environments

    .
  • Experience working in an

    Agile-Scrum

    development environment.
  • Experience using tools such as

    Jira, GitHub

    and Logging and monitoring tools.
  • BS in CS, CE or similar field, plus

    10-12 years relevant work experience

    .

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
IBM logo
IBM

Information Technology

Armonk

RecommendedJobs for You