Design and scale observability solutions (monitoring, logging, tracing), optimize alerting and incident response, automate with scripting, and collaborate with teams to ensure system reliability and performance.
Responsibilities: * Ensure high availability through proactive monitoring & incident response. * Design, implement & optimize SLO-driven systems on AWS using Kubernetes, Docker & Linux.
Responsibilities: * Collaborate with cross-functional teams on project delivery. * Develop frontend & backend using React.js, Node.js, Python, JavaScript, HTML/CSS. * Ensure code quality through testing and documentation.