Lakehouse Engineer

10 years

0 Lacs

Posted:13 hours ago| Platform: Linkedin logo

Apply

Work Mode

Remote

Job Type

Full Time

Job Description

Granica is redefining how enterprises prepare and optimize data at the most fundamental layer of the AI stack—where raw information becomes usable intelligence. Our technology operates deep in the data infrastructure layer, making data efficient, secure, and ready for scale.We eliminate the hidden inefficiencies in modern data platforms—slashing storage and compute costs, accelerating pipelines, and boosting platform efficiency. The result: 60%+ lower storage costs, up to 60% lower compute spend, 3× faster data processing, and 20% overall efficiency gains.

Why It Matters

Massive data should fuel innovation, not drain budgets. We remove the bottlenecks holding AI and analytics back—making data lighter, faster, and smarter so teams can ship breakthroughs, not babysit storage and compute bills.

Who We Are

  • World renowned researchers in compression, information theory, and data systems
  • Elite engineers from Google, Pure Storage, Cohesity and top cloud teams
  • Enterprise sellers who turn ROI into seven‑figure wins.

Powered by World-Class Investors & Customers

$65M+ raised from NEA, Bain Capital, A* Capital, and operators behind Okta, Eventbrite, Tesla, and Databricks. Our platform already processes hundreds of petabytes for industry leaders

Our Mission:

We’re building the default data substrate for AI, and a generational company built to endure.

🏄What You’ll Do

  • Partner closely with customers to understand their technical environment, data challenges, and integration needs.
  • Design and implement robust, scalable data pipelines from scratch using PySpark and Python, often in mission-critical environments.
  • Configure and integrate modern data lakehouse and warehouse technologies including Apache Iceberg, Apache Hive, Delta Lake, Snowflake, and Databricks.
  • Act as a trusted technical advisor—guiding customers through solution architecture, deployment, and troubleshooting.
  • Contribute to internal tooling and automation to improve deployment velocity and system reliability.
  • Collaborate with Granica’s engineering and product teams to influence roadmap decisions based on real-world customer use cases.
  • Be an ambassador of the Granica product, both internally and externally.

💻Must-Have Qualifications

What We're Looking For

  • 5–10 years of hands-on experience in software engineering, data engineering, or infrastructure roles.
  • Strong proficiency in Python and PySpark, with the ability to write clean, efficient, and scalable code.
  • Proven experience building data pipelines from scratch, including ingestion, transformation, and optimization.
  • Deep understanding and hands-on experience with:
    • Apache Iceberg
    • Apache Hive
    • Apache Delta Lake
    • Snowflake
    • Databricks
  • Experience working with large-scale data systems and distributed computing architectures.
  • Ability to thrive in fast-paced, ambiguous environments typical of early-stage startups.
  • Excellent problem-solving, communication, and customer-facing skills.

✨Nice-to-Haves

  • Experience with Kubernetes, Terraform, or cloud-native infrastructure (AWS/GCP/Azure).
  • Familiarity with security and privacy best practices in data processing pipelines.
  • Prior experience in customer-facing technical roles (solutions engineer, customer engineer, etc.) is a strong plus.

Why Granica? 💛

  • Work hands-on with petabyte-scale datasets, design performant systems and compression algorithms that matter
  • Tackle meaningful problems that push the boundaries of what’s possible in data infrastructure and AI
  • Work with top-tier engineering talent from companies like Google, Tesla, and Palantir and bleeding-edge data technologies.
  • Own and lead critical projects with high visibility and impact.
  • Flexible remote work culture with a globally distributed team.
  • Outcome-driven culture: Low ego, high trust, customer-obsessed. We scaled to multimillion-dollar ARR without a dedicated sales team—just product pull and ROI.
  • Generous benefits: Unlimited PTO, flexible hybrid setup, competitive compensation, full health coverage
  • Backed by top-tier VCs with strong runway and bold ambitions.

Benefits

  • Highly competitive compensation with uncapped commissions and meaningful equity
  • Immigration sponsorship and counseling
  • Premium health, dental, and vision coverage
  • Flexible remote work and unlimited PTO
  • Quarterly recharge days and annual team off-sites
  • Budget for learning, development, and conferences
Granica celebrates diversity and is committed to creating an inclusive environment for all employees.
We do not discriminate on the basis of race, religion, color, gender expression or identity, sexual orientation, national origin, citizenship, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Mock Interview

Practice Video Interview with JobPe AI

Start PySpark Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You