Posted: 5 days ago | Platform: Shine

Work Mode

On-site

Job Type

Full Time

Job Description

As the Lead Data Infrastructure Engineer at CBG, you will be responsible for developing and managing a robust data infrastructure and pipelines across the network. Your role is crucial in enabling the delivery of data as a product, ensuring high-quality, accessible, and timely data for advanced analytics, reporting, and decision-making across all operational and business functions.

Your key responsibilities will include designing, implementing, and managing scalable data architecture for collecting, storing, and processing large volumes of data from CBG plant systems. You will own the cloud/on-prem data lake, data warehouse, and structured databases supporting real-time and batch processing. Additionally, you will develop and maintain robust, automated data pipelines using modern ETL/ELT tools, ensuring reliability, efficiency, and monitoring of all data flows from source to destination systems.

Moreover, you will implement processes to ensure data accuracy, consistency, completeness, and freshness, working closely with Data Governance and Compliance teams to define standards, validation rules, and audit trails. Collaborating with data scientists, business analysts, application teams, and plant operations will be essential to understand and prioritize data requirements and enable self-service data access through APIs, secure dashboards, and curated datasets.

You will also be responsible for maintaining a data catalogue and lineage tracking system to improve data discoverability and reusability across the organization. Providing documentation and training on data schema, usage, and access policies will be part of your role. Ensuring data is stored and accessed securely, following best practices in encryption, role-based access, and regulatory compliance is a key aspect of your responsibilities.

The ideal candidate for this role should have a B.E. / B.Tech degree with expertise in SQL, Python, DevOps, and distributed data technologies such as Spark and Kafka. Experience with cloud platforms like AWS, Azure, or GCP, and associated data services is necessary. A strong understanding of CI/CD for data pipelines and MLOps integration is required. Familiarity with industrial data sources like OPC-UA, MQTT, and SCADA systems will be highly desirable. Excellent leadership, documentation, and stakeholder communication skills are essential for success in this role.

Reliance Industries Limited

Conglomerate (Petrochemicals, Refining, Telecommunications, Retail)

Mumbai
