Observability SME

4 - 8 years

0 Lacs

Posted:3 days ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As an Observability SME, you play a crucial role that demands a combination of technical expertise, leadership skills, and a deep understanding of observability practices. Your primary responsibility is to lead a team of skilled engineers in the development and maintenance of cutting-edge observability solutions. These solutions enable comprehensive monitoring and analysis of systems and applications. Your role involves working closely with application teams, enhancing the observability platform (Dynatrace SAS) by integrating metrics, errors, logs, and traces from various sources to derive predictive intelligence from the data. Collaboration is key in your role. You will collaborate with cross-functional teams to understand their value chain and to evaluate and adopt emerging technologies and best practices that enhance system observability. Defining and implementing comprehensive monitoring and alerting strategies for complex distributed systems is a crucial aspect of your role. You will work with tools teams and application teams to establish and enforce best practices for logging, tracing, and monitoring. Selecting and implementing observability tools and technologies that align with the organization's goals and requirements is another significant responsibility. Staying updated with industry trends and advancements in observability ensures that our systems leverage the latest innovations. Identifying and addressing performance bottlenecks and inefficiencies in collaboration with development and operations teams is essential for enhancing system reliability and responsiveness. In incident response and troubleshooting, you will collaborate with incident response teams to diagnose and resolve production issues related to observability. Developing and maintaining incident response playbooks streamlines troubleshooting processes. As a mentor to your team, fostering a culture of continuous learning is crucial. Providing mentorship and training to enhance team members" technical skills and knowledge contributes to the team's overall growth. To excel in this role, you should have a strong background in distributed systems, cloud technologies, and proficiency in tools like Dynatrace, Prometheus, Grafana, ELK stack, or similar. In-depth knowledge of distributed systems, microservices architecture, and cloud platforms is essential. Exceptional communication skills are crucial for effectively conveying complex technical concepts to both technical and non-technical stakeholders. Expertise in scripting and programming languages (e.g., Python, Go, Java) is required. Experience with containerization and orchestration technologies (Docker, Kubernetes) is considered a plus. Proficiency in monitoring tools, incident management, and other relevant technologies is expected. Strong communication skills to collaborate with diverse teams and convey technical information to non-technical stakeholders are essential. A problem-solving mindset with the ability to make sound decisions under pressure is highly valued in this role.,

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You