Job Description
Position: Senior SRE Manager
Work Location: Bangalore, India (Hybrid)

About Symphony Technology Group (STG)

STG is a Silicon Valley (California) based private equity firm with a long and successful track record of transforming high-potential software and software-enabled services companies and insights-oriented companies into definitive market leaders. The firm brings expertise, flexibility, and resources to build strategic value and unlock the potential of innovative companies. Partnering to build customer-centric, market-winning portfolio companies, STG creates sustainable foundations for growth that bring value to all existing and future stakeholders. The firm is dedicated to transforming and building outstanding technology companies in partnership with world-class management teams. STG's portfolio has $11 billion of assets under management (as of March 2024). STG's expansive portfolio has consisted of more than 30 global companies.

STG Labs is the incubation center for many of STG's portfolio companies, building their engineering, professional services, and support delivery teams in India. STG Labs offers an entrepreneurial start-up environment for software and AI engineers, data scientists and analysts, and project and product managers, and provides a unique opportunity to work directly for a software or technology company. Based in Bangalore, STG Labs supports hybrid working.

In India, our competitive employment package includes health insurance, life insurance, accident coverage, a liberal leave policy, and many more benefits. We pride ourselves on providing great employee programs that are centered on supporting the health, wellness, and ongoing training and development of our people within a flexible work environment. We are an equal opportunity employer and make hiring decisions based on experience, skills, aptitude, and a can-do approach.
https://stg.com

Job Description:
15+ years of experience in software engineering, system administration, or a related technical field, with a focus on reliability engineering.
Education: Bachelor's or Master's degree in Computer Science, Engineering, or a related field.

What is the role
As an SRE Lead, you would be a professional capable of providing strategic direction, technical expertise, and leadership to ensure the ongoing success and reliability of the organization's offerings.

Key Responsibilities
- Provide expert guidance and leadership in designing, building, and maintaining highly available, scalable, and reliable SaaS infrastructure.
- Architect resilient systems and solutions that meet stringent SLAs and support the company's growth objectives.
- Lead efforts to ensure the reliability and uptime of our product, driving proactive monitoring, alerting, and incident response practices.
- Develop and implement strategies for fault tolerance, disaster recovery, and capacity planning.
- Conduct thorough post-incident reviews and root cause analyses to identify areas for improvement and prevent recurrence.
- Drive automation initiatives to streamline operational workflows, reduce manual effort, and improve efficiency.
- Champion DevOps best practices, promoting infrastructure as code, CI/CD pipelines, and other automation tools and methodologies.
- Support and partner with other teams on improving our observability systems to monitor site stability and performance.
- Continuously learn and explore new tools, techniques, and methodologies to drive innovation and enhance the DevOps platform.
- Work closely with development teams to optimize application performance and efficiency.
- Implement tools and techniques to measure and improve service latency, throughput, and resource utilization.
- Identify and implement cost-saving measures to ensure cloud infrastructure spending is optimized.
- Proactively identify and address security vulnerabilities in the cloud environment.
- Collaborate closely with engineering, product management, CISO, and other teams to align on reliability goals, prioritize projects, and drive cross-functional initiatives.
- Communicate effectively with stakeholders to provide visibility into reliability initiatives, progress, and challenges.
- Foster a culture of collaboration and knowledge sharing across the organization.
- Maintain documentation of processes, configurations, and technical guidelines.
- Take responsibility for people leadership of the DevOps team: hiring, managing performance objectives, providing regular feedback, handling performance assessment, managing career growth, mentoring, employee engagement, etc.

Key skills:
- 15+ years of DevOps experience with software configuration management and release engineering.
- 8+ years of production experience with AWS, including but not limited to EKS, EC2, IAM, ECS, Lambda, S3, RDS, and CloudWatch.
- Good understanding of virtualization and container technologies (Docker, EKS/AKS/Kubernetes, etc.).
- Strong proficiency in designing and implementing cloud solutions on AWS, including cloud technology implementations and management platforms.
- Experience with tools such as Git, Bitbucket, Jenkins, and other similar tooling.
- Experience with automation (Python, Bash, PowerShell, etc.).
- Experience working on Python and/or Node.js and/or Java based applications.
- Strong cloud networking skills (VCNs, subnets, etc.).
- Experience with database administration on both relational (MSql) and NoSQL (MongoDB) databases.
- Exposure to deploying canaries for testing and conducting security checks in CI/CD pipelines.
- Proficiency in configuration management tools like Ansible and infrastructure-as-code frameworks such as Terraform and CloudFormation.
- Experience with logging, monitoring, and alerting tools such as Sumo Logic, Datadog, CloudWatch, Uptrends, PagerDuty, etc. for proactive system monitoring and troubleshooting.
- Understanding of the architectural implications of meeting industry standards such as ISO 27001, HIPAA, NIST frameworks, and GDPR is a plus.
- Experience in implementing cloud security concepts (SaaS, PaaS, IaaS), network and application security, and/or data protection.
- Experience hiring, mentoring, and leading DevSecOps, SRE, and Platform engineers.
- A passion for technology and a willingness to explore and adopt emerging technologies for practical business purposes.
- Experience working in a fast-paced, agile environment while providing consistent application lifecycle management.