Principal Site Reliability Engineer

8 - 18 years

0 Lacs

Posted:2 days ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Site Reliability Engineer/Cloud Engineer (SRE) at Amgen, you will play a crucial role in optimizing performance, standardizing processes, and automating critical infrastructure and systems to ensure reliability, scalability, and cost-effectiveness. Working towards operational excellence through automation, incident response, and proactive performance tuning, you will collaborate closely with cross-functional teams to establish best practices for service availability, efficiency, and cost control. Your responsibilities will include: - Ensuring the reliability, scalability, and performance of Amgen's infrastructure, platforms, and applications by proactively identifying and resolving performance bottlenecks and implementing long-term fixes. - Driving the adoption of automation and Infrastructure as Code (IaC) to streamline operations, minimize manual interventions, and enhance scalability. - Establishing standardized operational processes, tools, and frameworks across Amgen's technology stack to ensure consistency, maintainability, and best-in-class reliability practices. - Implementing and maintaining comprehensive monitoring, alerting, and logging systems to detect issues early and ensure rapid incident response. - Partnering with software engineering and IT teams to integrate reliability, performance optimization, and cost-saving strategies throughout the development lifecycle. - Executing capacity planning processes to support future growth, performance, and cost management, and maintaining disaster recovery strategies to ensure system reliability. Basic qualifications required for this role include a Master's degree with 8 to 10 years of experience, a Bachelor's degree with 10 to 14 years of experience, or a Diploma with 14 to 18 years of experience in IT infrastructure, Site Reliability Engineering, or related fields. Must-have skills for this position: - Extensive experience with AWS Cloud Services - Proficiency in CI/CD (Jenkins/Gitlab), Observability, IAC, Gitops, etc. - Experience with containerization (Docker) and orchestration tools (Kubernetes) - Strong hands-on experience in SRE tasks and automation using Python/Scripting language - Well-versed with FinOps, Infra-Ops, & Platform Operations - Ability to learn new technologies quickly, strong problem-solving and analytical skills, excellent communication, and teamwork skills - Leadership skills to guide a team of 4 to 5 on technical blockers Good-to-have skills include knowledge of cloud-native technologies, strategies for cost optimization in multi-cloud environments, familiarity with distributed systems, databases, and large-scale system architectures, and a Bachelor's degree in computer science and engineering preferred. Soft skills required for this role are the ability to foster a collaborative and innovative work environment, strong problem-solving abilities, attention to detail, and a high degree of initiative and self-motivation.,

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You