We are looking for a Site Reliability Engineer to join our team and help us to:
- Provide best-in-class production support for Data Science engineering, application development, and production line-of-business teams
- Design, engineer, and implement critical infrastructure to support Data Science platforms such as Alteryx, Dataiku, and Azure Machine Learning
- Act as a Problem Manager to troubleshoot and resolve complex technical incidents, with an eye on developing sustainable automation
- Collaborate with cross-functional teams such as engineering, hardware, platform services, cloud, and operations
- Monitor and analyze MongoDB performance, with a focus on automation and reliability
- Ensure compliance with operational, risk, and change management guidelines
- Provide occasional weekend or after-hours support
- Develop and document best practices

Your team
You will be working in the Data Platform SRE team in Pune or Hyderabad, focusing on reliability, operations, and efficiency in collaboration with our Global Data Science Service team. We enable solutions for numerous lines of business by implementing modern data science initiatives across the Wealth Management, Investment Banking, and Corporate Center divisions, including Risk and Finance, HR, and other Technology Service Teams. We offer flexibility in the workplace and equal opportunities for all team members.

Your expertise
- Hands-on experience with UNIX/Windows administration, ideally 5+ years
- Alteryx administration experience is desirable
- Knowledge of analytics products such as Dataiku, Alteryx, Azure Synapse, and Databricks is a plus
- Ability to solve complex issues with solution-design thinking
- DevOps experience with GitLab CI/CD, plus scripting skills in PowerShell or another programming language
- Track record of influencing IT stakeholders and business partners
- A confident communicator who can explain technology to business stakeholders