Posted: 2 days ago
On-site
Full Time
Experience: 19-22 Years

Work Location: Kolkata (1st preference) / Mumbai / Pune / Chennai / Hyderabad / Bangalore / Delhi / Noida / Coimbatore

Job Description: The Sr. SRE Architect will play a pivotal role in consulting on SRE-related solutions across domains, designing and implementing observable, scalable, reliable, and resilient systems and applications that ensure the highest levels of availability and performance. This role requires a consulting mindset and a deep understanding of software engineering, system architecture, and operations, along with a passion for automating repetitive tasks with GenAI tools and scripts.

Key Responsibilities
· SRE Consulting: SRE design and architecture solutioning, capability building, and customer interactions on SRE.
· System Design and Architecture: Lead the design and architecture of scalable and reliable systems that meet the needs of our growing user base and business requirements.
· Automation and Tooling: Develop and maintain automation tools and frameworks that streamline operations and improve system reliability.
· Monitoring and Observability: Implement and enhance monitoring, logging, and alerting systems to ensure proactive detection and resolution of issues.
· Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle current and future demands.
· Incident Management: Lead incident response efforts, perform root cause analysis, and implement corrective actions to prevent recurrence.
· Collaboration and Mentorship: Work closely with software engineers, DevOps, and other stakeholders to promote best practices in reliability engineering, and mentor junior team members.
· Continuous Improvement: Identify areas for improvement in existing systems and processes, and drive initiatives to enhance system reliability and performance.
Skillset
· Experience: 16-20 years of overall experience, including at least 10 years in site reliability engineering, DevOps, or a related field, with a proven track record of designing and implementing reliable systems at scale.

Technical Skills
· Strong programming skills in languages such as Python, Go, or Java/.NET.
· In-depth knowledge of cloud platforms (AWS, GCP, Azure) and container orchestration (Kubernetes, Docker).
· Experience with infrastructure as code (Terraform, Ansible, Puppet).
· Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk, AppDynamics, Dynatrace, ELK stack).
· Solid understanding of networking, security, and system performance tuning.

Soft Skills
· Strong problem-solving and analytical skills.
· Excellent communication and collaboration abilities.
· Ability to work in a fast-paced environment and manage multiple priorities.
· Passion for continuous learning and staying up to date with industry trends and technologies.

Preferred Skillset
· Experience with chaos engineering and resilience testing.
· Familiarity with service mesh architectures (Istio, Linkerd).
· Certifications in cloud platforms (Azure Certified Architect, AWS Certified Architect, Google Cloud Professional Architect, etc.).
LTIMindtree
Kolkata, West Bengal, India
Salary: Not disclosed