Sr. SDE - 3 - DevOps

0 years

0 Lacs

Posted: 1 week ago | Platform: LinkedIn

Work Mode

On-site

Job Type

Full Time

Job Description

We're looking for a passionate Sr. Software Engineer (SDE-3) - DevOps to build and scale robust infrastructure and deployment pipelines. You'll solve complex system challenges, automate workflows, optimize performance, and ensure high availability across our services. This role requires close collaboration with engineering teams to drive reliability, security, and scalability in everything we build.

Responsibilities

  • Implement and maintain robust monitoring solutions using tools such as Prometheus, Grafana, ELK, and New Relic.
  • Configure alerting mechanisms to ensure proactive identification and resolution of potential issues.
  • Create and maintain Ansible playbooks for automation tasks.
  • Enforce configuration consistency and compliance using configuration management tools.
  • Administer and troubleshoot Linux-based systems.
  • Troubleshoot problems across a wide array of services and functional areas.
  • Oversee the monitoring and stability of applications hosted on EKS (Elastic Kubernetes Service).
  • Work closely with development teams to optimize application performance.
  • Prepare detailed reports on infrastructure resource usage.
  • Identify means to optimize infrastructure utilization and reduce costs.
  • Demonstrate expertise in managing and optimizing infrastructure on AWS, GCP, and Azure.
  • Collaborate with cross-functional teams to ensure seamless integration with cloud services.
  • Create documentation outlining the setup, configuration, and maintenance procedures for each monitoring tool.
  • Develop and document incident response plans to address system outages or performance degradation promptly.
  • Maintain an incident response playbook for reference during critical situations.
  • Implement and document incident reporting procedures, including the creation of incident tickets, categorization, and prioritization.
  • Lead incident management efforts, ensuring timely resolution and post-incident reviews for continuous improvement.

Requirements

  • Hands-on experience with monitoring tools like New Relic, Prometheus, Grafana, ELK, or Datadog (New Relic preferred).
  • Hands-on experience with Incident Management tools like Opsgenie and PagerDuty.
  • Install, customize, support, and enhance system monitoring infrastructure.
  • Integrate monitoring and incident management tools with the infrastructure.
  • Support the day-to-day operation of our monitoring functions.
  • Work with teams to design end-to-end monitoring of critical APIs and related workloads.
  • Hands-on experience with Cloud platforms such as AWS/GCP or private cloud environments.
  • Strong experience in container technologies (Docker/Kubernetes) and containerizing applications.
  • Strong grasp of monitoring concepts, with experience in the ELK stack, Prometheus, and Grafana.
  • Strong knowledge of Linux (Ubuntu, CentOS, and RHEL).
  • System troubleshooting and problem solving across platforms and application domains.
  • Proficiency in any programming or scripting language, such as Shell Script, Python, or Ruby.
  • Experience with infrastructure-as-code (e.g., Terraform).
  • Experience with continuous integration, unit testing, and integration testing.
  • Experience with RDBMS and NoSQL databases such as PostgreSQL and MongoDB.
  • Lead incident response efforts, providing timely resolution of system outages and performance issues.
  • Ability to work independently with minimal direction; self-starter/self-motivated.
  • Fintech experience is an advantage.
This job was posted by Vibhuti Juneja from Pluang.
