Senior AI/Big Data Engineer

0 years

0 Lacs

Posted:23 hours ago| Platform: Foundit logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

  • Design and implement

    large-scale, fault-tolerant data pipelines on OCI, using services like

    OCI Data Integration

    ,

    OCI Data Flow (Apache Spark)

    ,

    Object Storage

    , and

    Autonomous Database

    .
  • Build and manage

    streaming data architectures

    using tools such as

    OCI GoldenGate

    ,

    Apache Kafka

    , and

    Spark/Flink Streaming

    .
  • Enforce standards and automation across the

    entire data lifecycle

    , including schema evolution, dataset migration, and deprecation strategies.
  • Improve platform

    resilience, data quality, and observability

    with advanced monitoring, alerting, and automated data governance.
  • Serve as a

    technical leader

    , mentoring junior engineers, reviewing designs and code, and promoting engineering best practices.
  • Collaborate cross-functionally with ML engineers, platform teams, and data scientists to integrate data services with AI/ML workloads.
  • Partner in

    AI pipeline enablement

    , ensuring Lakehouse services efficiently support model training, feature engineering, and real-time inference.

Minimum Qualifications

  • Bachelor's or Master's degree in

    Computer Science

    ,

    Engineering

    , or related technical field.
  • 4-6 year's experience designing and building

    cloud-based data pipelines

    and

    distributed systems

    .
  • Proficiency in at least one core language:

    Python

    ,

    Java

    , or

    Scala

    .
  • Familiar with lakehouse formats (Iceberg, Delta, Hudi), file formats (Parquet, ORC, Avro), and streaming platforms (Kafka, Kinesis).
  • Strong understanding of distributed systems fundamentals:

    partitioning

    ,

    replication

    ,

    idempotency

    ,

    consensus protocols

    .

Engineering & Infrastructure

  • 5+ years building

    distributed systems

    or

    production-grade data platforms

    in the cloud.
  • Strong coding proficiency in

    Python

    ,

    Java

    , or

    Scala

    , with an emphasis on performance and reliability.
  • Expertise in

    SQL and PLSQL

    , data modeling, and query optimization.
  • Proven experience with

    cloud-native architectures

    -especially

    OCI

    , AWS, Azure, or GCP.

Lakehouse & Streaming Mastery

  • Deep knowledge of

    modern lakehouse/table formats

    :

    Apache Iceberg

    ,

    Delta Lake

    , or

    Apache Hudi

    .
  • Production experience with

    big data compute engines

    :

    Spark

    ,

    Flink

    , or

    Trino

    .
  • Skilled in

    real-time streaming

    and event-driven architectures using

    Kafka

    ,

    Flink

    ,

    GoldenGate

    , or

    Streaming

    .
  • Experience managing

    data lakes

    , catalogs, and metadata governance in large-scale environments.

AI/ML Integration

  • Hands-on experience enabling

    ML pipelines

    : from data ingestion to model training and deployment.
  • Familiarity with

    ML frameworks

    (e.g.,

    PyTorch

    ,

    XGBoost

    ,

    scikit-learn

    ).
  • Understanding of

    modern ML architectures

    : including

    RAG

    ,

    prompt chaining

    , and

    agent-based workflows

    .
  • Awareness of

    MLOps practices

    , including model versioning, feature stores, and integration with AI pipelines.

DevOps & Operational Excellence

  • Deep understanding of

    CI/CD

    , infrastructure-as-code (IaC), and release automation using tools like

    Terraform

    ,

    GitHub Actions

    , or

    CloudFormation

    .
  • Experience with

    Docker

    ,

    Kubernetes

    , and

    cloud-native container orchestration

    .
  • Strong focus on

    testing, documentation

    , and system observability (Prometheus, Grafana, ELK stack).
  • Comfortable with

    cost/performance tuning

    , incident response, and data security standards (IAM, encryption, auditing).

Preferred Qualifications

  • Experience with

    Oracle's cloud-native tools

    :

    OCI Data Integration

    ,

    Data Flow

    ,

    Autonomous Database

    ,

    GoldenGate

    ,

    OCI Streaming

    .
  • Experience with

    query engines

    like

    Trino

    or

    Presto

    , and tools like

    dbt

    or

    Apache Airflow

    .
  • Familiarity with

    data cataloging

    ,

    RBAC/ABAC

    , and enterprise data governance frameworks.
  • Exposure to

    vector databases

    and

    LLM tooling

    (embeddings, vector search, prompt orchestration).
  • Solid understanding of

    data warehouse design principles

    , star/snowflake schemas, and ETL optimization.

Soft Skills & Team Expectations

  • Proven ability to lead technical initiatives

    independently end-to-end

    .
  • Comfortable working in

    cross-functional teams

    and mentoring junior engineers.
  • Excellent

    problem-solving skills

    , design thinking, and attention to operational excellence.
  • Passion for

    learning emerging data and AI technologies

    and sharing knowledge across teams.

Career Level - IC3

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Oracle logo
Oracle

Information Technology

Redwood City

RecommendedJobs for You

noida, uttar pradesh, india