Job
Description
You will be working as a Monitoring Team Lead for a Data Pipeline L1 team, overseeing the daily operations to ensure the health and stability of data pipelines, and managing incident response. Your role will involve leading the team, monitoring performance, and escalating issues as needed. As a Team Leader, you will guide and mentor the L1 monitoring team to ensure proficiency in data pipeline monitoring, troubleshooting, and escalation procedures. You will manage team performance, distribute tasks effectively, and resolve conflicts. Acting as a point of contact for the team, you will represent them to stakeholders and advocate for their needs. Your responsibilities will also include developing team strengths and promoting a positive work environment. In terms of Data Pipeline Monitoring, you will continuously monitor data pipelines for performance, availability, and data quality issues. Utilizing monitoring tools, you will detect and analyze alerts related to data pipelines to ensure data freshness, completeness, accuracy, consistency, and validity. For Incident Management, you are required to detect, log, categorize, and track incidents within the ticketing system. Any unresolved issues should be escalated to L2/L3 teams based on predefined SLAs and severity. You will also coordinate with other teams to resolve incidents quickly and efficiently while ensuring proper communication and updates to relevant stakeholders throughout the incident lifecycle. Managing Service Level Agreements (SLAs) related to data pipeline monitoring and incident response will be essential. You will monitor and ensure that the team meets or exceeds established SLAs. Process Improvement is another key aspect where you will identify opportunities to enhance monitoring processes, automation, and efficiency. Implementing best practices for data pipeline monitoring and incident management and conducting regular reviews of service performance are part of your responsibilities. Your role will also involve providing technical expertise to the team, staying updated on industry best practices and new technologies related to data pipelines and monitoring. Maintaining and updating documentation related to data pipeline monitoring processes, procedures, and escalation paths is crucial. Accurate shift handovers to the next shift, with updates on ongoing issues, will also be expected. Qualifications: - Proven experience in data pipeline monitoring and incident management. - Strong understanding of data pipeline concepts, including ingestion, transformation, and storage. - Experience with monitoring tools and technologies. - Excellent communication, interpersonal, and leadership skills. - Ability to work independently and as part of a team in a fast-paced environment. - Experience with cloud services (AWS, Azure, or GCP) is a plus. - Knowledge of data governance principles and practices is beneficial. Skills to be evaluated on: - Data Operation/Operations Team Lead. Mandatory Skills: - Data Operation, Operations Team Lead. Desirable Skills: - Lead Operations, data operations, operations management, team management.,