We are looking for a Senior Site Reliability Engineer, to join our Shared Capabilities, Service Reliability and Operations group. We provide innovative team collaboration and an opportunity to build, operate and support scalable and reliable services that underpin Thomson Reuters products.
About the Role
- Be a Professional SRE:
Implement
site reliability engineering and DevOps best practices.Feed
non-functional requirements into the product backlog, such as, but not limited to, high availability, scalability, self-healing, observability, continuous delivery, security - Build and maintain monitoring for all aspects of infrastructure, micro-services and the platform and implement Alerting mechanism using cloud native solutions
- Automate IaC and CICD and promote best practices for our CI/CD processes
- Provide primary operational support and engineering for multiple large, distributed platforms
- Act as the go to person for any production issue. Troubleshoot and monitor until successful mitigation, communicate effectively,
postmortem
and implementation of the learnings. - Focus on Continuous improvement and technical standards - drive improvements in productivity, monitoring, tooling and set industry best practices.
- Act as the go to person for any production issue. Communicate effectively, manage mitigation, remediation,
postmortem
and implementation of the learnings. - On-call Rotation: Participate in on-call/shift rotations (L2).
- When on-call, you are expected to drive the troubleshooting and mitigation activities while working on incident
- Maintain end-to-end security ensuring that we meet best practices standards
- Keep up-to-date with emerging cloud technology trends, especially around DevOps, Service Reliability and Security.
- Adopt pan-TR operation principles to ensure consistency and efficiency
- Documenting tribal knowledge. Constant upkeep of documentation and runbooks can ensure that teams get the information they need right when they need it
- Be collaborative:
- Extreme collaboration within our teams - Canada, US, Mexico, and India
you're a fit for the role of Senior Site Reliability Engineer if you:
- Bachelors degree in Computer Science or related field - a must
- 6-10 years of experience as a SRE
- Minimum of 3 years of experience as DevOps engineer and/or Cloud engineer with
hands on
experience in AWS or Azure cloud technologies - Highly skilled in Unix/Linux and knowledge (exposure to RHEL)
- Proven experience in building and operating
PRODUCTION
cloud native infrastructure, applications and services on AWS or Azure - Must have experience with Version Control and CI/CD (Git / CodePipeline or Azure DevOps )
- Must have experience with writing Infrastructure as Code (IaC) (Terraform, CloudFormation, or Azure Resource Manager )
- Must have scripting and programming experience (Bash, PowerShell, or Python)
- Experience or knowledge of Distributed logging: ELK, DataDog, CloudWatch, or Azure Monitor
What s in it For You
Join us to inform the way forward with the latest AI solutions and address real-world challenges in legal, tax, compliance, and news. Backed by our commitment to continuous learning and market-leading benefits, you'll be prepared to grow, lead, and thrive in an AI-enabled future. This includes:
Industry-Leading Benefits:
We offer comprehensive benefit plans to include flexible vacation, two company-wide Mental Health Days off, access to the Headspace app, retirement savings, tuition reimbursement, employee incentive programs, and resources for mental, physical, and financial wellbeing.Flexibility Work-Life Balance:
Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, whether caring for family, giving back to the community, or finding time to refresh and reset. This builds upon our flexible work arrangements, including work from anywhere for up to 8 weeks per year, and hybrid model, empowering employees to achieve a better work-life balance.Career Development and Growth:
By fostering a culture of continuous learning and skill development, we prepare our talent to tackle tomorrow s challenges and deliver real-world solutions. Our skills-first approach ensures you have the tools and knowledge to grow, lead, and thrive in an AI-enabled future.Culture:
Globally recognized and award-winning reputation for inclusion, innovation, and customer-focus. Our eleven business resource groups nurture our culture of belonging across the diverse backgrounds and experiences represented across our global footprint.Hybrid Work Model:
We ve adopted a flexible hybrid working environment (2-3 days a week in the office depending on the role) for our office-based roles while delivering a seamless experience that is digitally and physically connected.Social Impact:
Make an impact in your community with our Social Impact Institute. We offer employees two paid volunteer days off annually and opportunities to get involved with pro-bono consulting projects and Environmental, Social, and Governance (ESG) initiatives.