Home
Jobs

7 - 12 years

20 - 30 Lacs

Posted:3 months ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Job Description: Role Title: Lead Site Reliability Engineer Position Description: Historically, the role of IT has been to provide a reliable ecosystem to run the business, drive efficiencies and reduce costs. These areas remain integral, however, driven by the quickening pace of innovation, IT must evolve, proactively partnering with the business to enable new digital business models that power new types of customer engagement. At Elanco, our engineer roles bring adaptive set of skills covering Software-as-a-Service (SaaS), Commercial-of-the-Shelf (CotS) and/or Custom Developed applications. The role is part of our software engineering team established to deliver Engineering expertise to business facing products and services. As an Engineer you will be deployed into a multi-disciplined product team applying your software engineering talent to Elancos biggest opportunities. To be successful in an engineering role in Elanco requires a highly motivated individual, with an innovative mindset and a willingness to drive tangible outcomes. The individual must be able to articulate complex technical topics and collaborate with the internal engineering organisation to improve engineering across the enterprise. The Role We are seeking a skilled and motivated engineer, passionate about improving application reliability across our enterprise. As part of our Platform Engineering organization, you will join a product team focused on a suite of capabilities designed to enhance all aspects of our engineering portfolio. In this role, you will be primarily accountable for configuring and operating our observability toolset. You will also lead the charge across the enterprise, driving the transition from reactive to proactive application support. This is a fantastic opportunity to join a growing engineering team with the scope to partner across our entire enterprise of products. Your contributions will help ensure that everything we deliver to our customers come with top-notch reliability as standard. Typical responsibilities: Help define Elancos approach to reliability of applications partnering with our product manager for our portfolio health products. Collaborate with stakeholders such as product and platform owners, to define service level objectives (SLOs), and service-level indicators (SLIs) for system operations focused on the critical features of the customers journey and experience. Assist and coach product teams implementation of telemetry against SLIs/SLOs to ensure adequate traceability is in place. Track and manage reliability performance against agreed SLOs, in partnership with product teams or other stakeholders, and ensure systems continue to meet SLOs over time. Ensure key stakeholders, product owners, and platform owners are informed of reliability concerns and their potential impact to the customers experience. Provide expert knowledge on reliability approaches, to ensure our organization achieves its goals and roadmap for reliability. Champion reliability being treated as a feature in products and platforms and promote the concept across all phases of the software development life cycle. Create dashboards and reports to communicate key metrics, to product teams and key stakeholders. Beyond observability engage in initiatives across the product line including cost, security, and adoption helping the team drive to a health portfolio throughout an applications lifecycle. Participate problem management activities, including post-mortem incident analysis, and provision of technical insight, documented findings, outcomes and recommendations as part of a root cause analysis to troubleshoot priority incidents. Implement automation to reduce probability and/or impact of problems recurring and target self-healing through automation of reoccurring incidents. For critical applications, utilize practices such as chaos engineering and performance engineering to test in preproduction environments. This includes disaster recovery (DR) testing, performance testing, and tabletop planning exercises. Participate and exert influence in organizational learning initiatives such as communities of practice to share knowledge and foster a continuous learning and improvement mindset. Support architects working on new solutions, including analyzing requirements, supporting technical architecture activities, prototyping, designing and developing reusable infrastructure artifacts, testing, implementing, and preparing for ongoing support. Train and mentor junior and engineers to ensure SRE best practices evolve and scale successfully in the organization Partner with the product manager of portfolio health to build out golden paths, education and services to package the capability in a consumable way on our developer portal. Be a product team champion extending into product teams helping to deliver foundational platform engineering capabilities where applicable. Partner with compliance teams to ensure the data we bring into observability platforms meets privacy and compliance standards Maintain consistent standards and set out a taxonomy of telemetry to enable future opportunities including leveraging of AI capability. Basic Qualifications: Experience in some of the following areas essential. 10-15 years of hands-on engineering experience. 5 years experience in Platform Engineering, SRE or similar role 5-10 years of experience working with modern application architecture methodologies (Service Orientated Architecture, API-Centric Design, Twelve-Factor App, FAIR, etc.). 5 + years of experience working with Cloud Native design patterns, with a preference towards Microsoft Azure / Google Cloud. 5 + years of experience designing and delivering digital solutions following a product-mindset and a variety of delivery methodologies (e.g. Agile, CCPM, etc.). 5 + years of experience working within a DevSecOps” culture, including modern software development practices, covering Continuous Integration and Continuous Delivery (CI/CD), Test-Driven Development (TDD), etc. Experience with enterprise observability platforms. E.g Datadog, New Relic Experience with monitoring 3rd party and SaaS applications. Experience establishing standards around MELT (Metrics, Events, Logging and Tracing and implementing at an enterprise level. Experience with Open Telemetry advantageous. Experience supporting digital platforms, including Integrations, Release Management, Regression Testing, Integrations, Data Obfuscation, etc. Experience scaling an “API-Ecosystem”, designing, and implementing “API-First” integration patterns. Experience working with authentication and authorisation protocols/patterns. Experience defining and implementing large-scale, transformative digital solutions. Demonstrated influence and communication skills across all levels of IT and third parties. Experience working in complex, diverse landscapes (business, technology, regulatory, partners, providers, geographies, etc.). Strong organizational and communications skills with multiple examples of being able to convey complex technical topics, that resulted in a definitive direction. Education Requirements: Bachelor’s degree in information technology.

Mock Interview

Practice Video Interview with JobPe AI

Start Cloud Interview Now

My Connections Elanco Innovation And Alliance Centre

Download Chrome Extension (See your connection in the Elanco Innovation And Alliance Centre )

chrome image
Download Now
Elanco Innovation And Alliance Centre
Elanco Innovation And Alliance Centre

Veterinary Pharmaceuticals

Greenfield

Approximately 5,000 Employees

10 Jobs

    Key People

  • Jeff Simmons

    President and CEO
  • Holly T. Cregger

    Chief Financial Officer

RecommendedJobs for You

Hyderabad, Chennai, Bengaluru

Bengaluru / Bangalore, Karnataka, India

Noida, Uttar Pradesh, India

Noida, Uttar Pradesh, India

Noida, Uttar Pradesh, India