Senior Site Reliability Engineer

3 - 7 years

5 - 9 Lacs

Posted:1 day ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

About the Role: Position summary &

Main tasks/activities

(short and precise definition of the role and most important activities listed)
You would be playing a key role in ensuring the reliability, stability, scalability and security of our Logging & Monitoring cloud systems and infrastructure. You will be designing, implementing, and testing highly automated solutions to shape the technology platform fulfils our business and product vision, ultimately bring value to our customers with positive user experiences.
Key Responsibilities:
  • End-to-end responsibility, from development to production, in designing, deploying, operating, and continuously improving performance and fault-tolerance of large-scale multi-cloud solutions.
  • Ensure system security, data integrity, and high availability of the platform.
  • Establish and improve monitoring, logging, and alerting frameworks to detect and resolve issues promptly.
  • Keep up with technology trends and identify promising new solutions that meet our requirements.
  • Create technical support documentation and provide hands-on troubleshooting and consulting to our customers.

About the Team

(Description of the team and context in which the role sits in)
Our Logging & Monitoring squad develops and operates state-of-the-art logging, monitoring and event management platforms to collect application behaviour information, detect / limit service disruption and provide the associated reporting capabilities. Our ambition is to help empower the developers, application and platform owners identify any growing risks, have a clear understanding of their SLAs, reduce the mean time to resolution and be ahead of the curve with regards to long term trends.

About you:

(education, linguistic ability, professional experience, leadership qualities, soft skills)
We are happy to meet you if you possess:
  • Hands on expertise in container orchestration systems such as

    Kubernetes

    running in a

    hybrid cloud environment such as Azure

    .
  • Experience in

    continuous integration/deployment

    , and system engineering experience in large-scale,

    distributed cloud solutions (like but not limited to Kafka, Elasticsearch, Otel, Observability)

    .
  • Experience programming in one or more of the following such as Go, Java, Python and in scripting languages (Shell or PowerShell).
  • Hands-on expertise in open-source application and infrastructure monitoring tools, e.g., ELK and/or TICK stack,

    Prometheus (must have)

    and Grafana.
  • Passion for sharing knowledge, through interactive sessions as well as documentation.
  • Strong analytical and problem-solving skills, as well as the ability to focus on details without losing track of the bigger picture.
  • Excellent oral and written English skills, additional language skills are a plus.

About Swiss Re


If you are an experienced professional returning to the workforce after a career break, we encourage you to apply for open positions that match your skills and experience.

Keywords:

Reference Code:

135085

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Swiss Re logo
Swiss Re

Insurance and Reinsurance

Zürich

RecommendedJobs for You