Lead Site Reliability Engineer

5 - 9 years

0 Lacs

Posted: 1 day ago | Platform: Shine

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview:

As a Lead Site Reliability Engineer at UKG, you play a crucial role in enhancing, hardening, and supporting service delivery processes by developing software solutions. Your responsibilities include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering, and auto remediation. A passion for learning and staying current with technology trends is essential, along with a dedication to innovation and to ensuring a flawless customer experience. You will bring an "automate everything" mindset to deploy services rapidly, consistently, and with high availability.

Key Responsibilities:

- Engage in and improve the lifecycle of services from conception to end-of-life, including system design consulting and capacity planning
- Define and implement standards and best practices related to system architecture, service delivery, metrics, and automation of operational tasks
- Support services, product, and engineering teams by providing common tooling and frameworks for increased availability and improved incident response
- Enhance system performance, application delivery, and efficiency through automation, process refinement, postmortem reviews, and configuration analysis
- Collaborate closely with engineering professionals to deliver reliable services
- Increase operational efficiency, effectiveness, and service quality by treating operational challenges as software engineering problems
- Guide junior team members and champion Site Reliability Engineering
- Actively participate in incident response, including on-call responsibilities
- Partner with stakeholders to influence and drive the best possible technical and business outcomes

Qualifications Required:

- Engineering degree, or related technical discipline, or equivalent work experience
- Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java)
- Knowledge of cloud-based applications and containerization technologies
- Understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing
- Working experience with industry-standard tools like Terraform and Ansible
- Fundamentals in 2 of the following areas: Computer Science, Cloud Architecture, Security, or Network Design
- Minimum 5 years of hands-on experience in Engineering or Cloud
- Minimum 5 years' experience with public cloud platforms (e.g., GCP, AWS, Azure)
- Minimum 3 years' experience in configuration and maintenance of applications and/or systems infrastructure for large-scale customer-facing companies
- Experience with distributed system design and architecture

(Note: The company details were not provided in the job description.)

UKG

Human Resources Software

Lowell
