Founding Data Engineer (ETL & Web Integration)

4 years

0 Lacs

Posted:2 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Contractual

Job Description

Job Title:


About Us


The Role


What You'll Do:


  • Build Resilient Scrapers & Integrations:

    Design, build, and maintain robust pipelines to extract data from a wide variety of internet sources, including complex websites (with dynamic content, rate limits, and CAPTCHAs) and public APIs.
  • Master Data Transformation:

    Architect and implement complex ETL/ELT processes to clean, normalize, deduplicate, and structure raw data into a highly organized analytical database.
  • Design the Core Data Model:

    Own the design and implementation of our core database schemas, optimizing for storage, cost, and query performance for AI workloads.
  • Create the AI Feature Layer:

    Engineer and productionize a reusable feature layer that serves as the "single source of truth" for our AI Engineer, enabling rapid model iteration.
  • Guarantee Data Quality:

    Implement and automate a rigorous data quality and governance framework. You are the gatekeeper of data integrity.
  • Collaborate with AI:

    Work as a critical partner to our Founding AI Engineer, understanding their data needs and ensuring the data you provide is perfectly suited for training and inference.


What We're Looking For:


  • 4+ years of pure, hands-on data engineering experience.
  • Expert-level web scraping experience

    with tools like Scrapy, Selenium, Puppeteer, and/or Beautiful Soup. You must have a playbook for handling anti-scraping measures.
  • Deep expertise in Python and advanced SQL.
  • Proven experience building and orchestrating data pipelines using tools like Airflow, Prefect, or Dagster.
  • Strong, practical skills in data modeling for analytics and machine learning.
  • A fanaticism for data quality, documentation, and reproducibility.
  • Familiarity with containerization (Docker) and CI/CD principles.
  • Nice to have:

    Experience with B2B data, MLOps basics, and cloud platforms (Azure/AWS/Supabase).


Why Join GoDiverse?


  • Pure Engineering Focus:

    This is a role for a builder, not a manager. Spend 100% of your time on challenging technical problems.
  • Build the Foundation:

    Own the most critical asset of our AI company—its data. Your work is the bedrock of our entire platform.
  • Foundational Role:

    This is a key hire with significant influence on our product and culture, offering scope for leadership and potential equity as we scale.


How to Apply


To apply, please complete our short application form by clicking the "Apply" button.


Please note: Applications submitted through any other channel will not be considered. This process helps us review every application fairly and efficiently.


Apply Link - https://tally.so/r/EkkKXN

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You