Job
Description
Role Overview: You will be responsible for designing, building, and optimizing GenAI-enabled data pipelines, collaborating closely with data scientists, ML engineers, and platform teams to operationalize AI-driven reasoning, contextual knowledge retrieval, and graph-based inference within DAZN's global data platform. This role offers an opportunity to work at the intersection of data engineering, AI agents, and knowledge graphs, directly impacting streaming quality, content intelligence, discovery, and fan engagement. Key Responsibilities: - Build and maintain pipelines integrating LLMs, LangChain, LangGraph, and other orchestration frameworks. - Develop autonomous and human-in-the-loop agent workflows for data retrieval, transformation, validation, and enrichment. - Implement graph-based reasoning pipelines to support complex querying, relationship mapping, and semantic search. - Design, optimize, and scale data ingestion and transformation pipelines using Python, SQL, and cloud-native technologies. - Work with streaming and structured/unstructured datasets for analytics, personalization services, and operational dashboards. - Automate workflows, build reusable components, and enhance data observability, lineage, and quality frameworks. - Integrate LLMs (OpenAI, AWS Bedrock, Gemini, etc.) into DAZN data systems via retrieval-augmented generation (RAG), prompt engineering, and agent-based reasoning. - Build evaluation, monitoring, and guardrail mechanisms for safe and reliable GenAI execution. - Implement proactive data validation, anomaly detection, and quality monitoring using agent-based or rule-based systems. - Ensure consistency, accuracy, and reliability of data assets consumed across DAZN product and analytics teams. - Collaborate with data scientists, ML engineers, backend teams, and analytics stakeholders to translate requirements into robust data solutions. - Advocate for best practices in GenAI engineering, agent systems, and data platform modernization. Qualification Required: - Approximately 4 years of total IT experience, including about 1 year of hands-on experience building GenAI/Agentic systems. - Strong programming skills in Python with experience in LangChain/LangGraph, prompt engineering, LLM tool calling, and graph-based reasoning or vector-search tools. - Solid SQL expertise for building and optimizing transformations. - Experience building and deploying scalable data pipelines using cloud services, with a preference for AWS. - Practical experience building multi-agent workflows, autonomous agents, retrieval-augmented pipelines, and human-in-the-loop systems. - Familiarity with vector databases, embeddings, and semantic retrieval. - Understanding of agent routing, memory, and tool orchestration patterns. Additional Details of the Company: DAZN is redefining how the world consumes sports by delivering low-latency streaming, immersive fan experiences, and intelligent, data-driven personalization to millions of subscribers globally. The Hyderabad Tech Hub is a fast-scaling engineering center that focuses on building data, AI, and platform capabilities for DAZN's global sports ecosystem. As DAZN expands its GenAI and autonomous systems footprint, foundational capabilities around LLM-driven workflows, graph-based intelligence, and multi-agent orchestration are being developed. Working at DAZN Hyderabad offers the opportunity to work on cutting-edge GenAI and data platform initiatives at a massive global scale, solving real-world challenges across content intelligence, personalization, discovery, and streaming analytics, while collaborating with high-performing teams across various countries.,