Lead Site Reliability Engineer

3 - 8 years

13 - 18 Lacs

Posted: 3 days ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

Software engineering is the systematic application of engineering to the design, development, implementation, testing and maintenance of software. Roles in this function cover all primary development activity across technology functions, ensuring that we deliver high-quality code for our applications, products and services, understand customer needs, and develop product roadmaps.

These roles include, but are not limited to, analysis, design, coding, engineering, testing, debugging, standards, methods, tools analysis, documentation, research and development, maintenance, new development, operations and delivery. As with every role in the company, each position is required to build quality into every output. This also includes evaluating new tools, techniques and strategies; automating common tasks; and building common utilities to drive organizational efficiency, with a passion for technology and solutions, and influencing thought and leadership on future capabilities and opportunities to apply technology in new and innovative ways. Work is generally self-directed and not prescribed.

Primary Responsibilities:

  • Manage Azure Cloud Infrastructure and build resilient, self-scaling systems
  • Implement solutions to continuously improve operational reliability of the cloud infrastructure
  • Take responsibility for the availability, performance, monitoring and infrastructure provisioning of the Platform, which comprises Cloud infrastructure and On Prem technologies
  • Closely partner with Engineering and Technical Support teams to drive resolution of critical issues
  • Publish and implement operational standards for all Cloud infrastructure and services
  • Work towards reducing Operations toil by automating repeatable tasks
  • Mentor and develop other team members in the SRE subject area
  • Perform application deployments using CI/CD tools, code repositories, code scanning, artifact repositories, compliance scanning, packaging, deployment, and configuration management
  • Build Operations Dashboards leveraging tools like Dynatrace, Splunk or Grafana
  • Handle incident, change and problem management
  • Help with provisioning of Infrastructure using Terraform
  • Enhance Platform Observability Dashboards
  • Partner closely with Development Teams and help address Platform-related roadblocks
  • Conduct post-mortems after production issues
  • React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring for production systems
  • Work with Docker, Kubernetes, Azure cloud, Prometheus, Grafana, Java, Python and many other modern SaaS technologies
  • Participate in projects involving people of many different disciplines: Engineering, Cloud, Networking, CI/CD, Project management, Monitoring, alerting etc.
  • Stay informed of new technologies and innovate
  • Work with less structured, more complex issues
  • Serve as a resource to others
  • Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so.

Required Qualifications:

  • Bachelor's or advanced degree in a related technical field
  • 3+ years of IT experience
  • 3+ years of DevOps experience
  • 2+ years of experience with Infrastructure as Code (Terraform/Ansible/Chef/Puppet)
  • 2+ years of experience with Docker and Container Orchestration (Kubernetes/OpenShift)
  • 2+ years of experience with DevOps and CI/CD tools such as Git and Jenkins
  • 2+ years of experience with Kafka support
  • 2+ years of experience with monitoring tools and technologies (Splunk, Dynatrace, New Relic)

Preferred Qualifications:

  • Infrastructure Engineering Experience
  • Cloud Experience (Azure/AWS/GCP)
  • Automation experience
  • Good knowledge of SRE principles
  • Hands-on scripting with one or more of: YAML, JSON, PowerShell, Bash or Python

Optum

Hospitals and Health Care

Eden Prairie MN
