AI Platform Engineer / ML Infrastructure Engineer

10 years

0 Lacs

Posted:2 weeks ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

US/India Deep-Tech Startup | Chennai & San Jose

(Full-time • On-site / Hybrid)


About Us

San Jose, CA


Our work sits at the intersection of:

  • real-time data systems,
  • safety-critical engineering,
  • applied machine learning, and
  • modern information retrieval (vector search, graph reasoning, RAG).

We operate in a domain where reliability, interpretability, and intelligent automation truly matter.

If you enjoy working on technically challenging problems that have real-world impact, you’ll feel at home here.


Role Overview


end-to-end AI and data platforms

This role blends modern ML Ops, data engineering, knowledge retrieval, and systems design.

If you are comfortable thinking across multiple layers of the stack and enjoy solving open-ended problems, we’d love to speak with you.


What You’ll Work OnData & Retrieval Systems

  • Build and optimize pipelines using

    vector search

    , embeddings, and metadata-aware retrieval.
  • Experiment with

    graph / knowledge-based systems

    (e.g., RDF stores, property graphs, knowledge graphs).
  • Help design hybrid retrieval workflows (vector + graph + rules).

ML & Inference Infrastructure

  • Set up and manage

    ML pipelines

    , model serving endpoints, and inference layers.
  • Work on streaming/batch data flows across AWS/GCP.
  • Build observability and monitoring for data quality, drift, and safety.

Cloud & Infra Automation

  • Deploy cloud resources using

    AWS + Terraform

    (EC2, S3, RDS/Neptune, networking, IAM).
  • Manage containerized services (Docker + optional Kubernetes experience).
  • Ensure scalable, secure, and reproducible infrastructure.

Systems-Level Problem Solving

  • Work cross-discipline to design architectures that integrate real-time data, ML models, graph semantics, and domain constraints.
  • Bring strong reasoning on system trade-offs, reliability, safety, and performance.

What We’re Looking For

some

Core Skills

  • Experience with

    vector databases

    (pgvector, Pinecone, Weaviate, Milvus) or strong understanding of semantic search.
  • Experience with

    graph databases

    or

    knowledge representation

    (Neo4j, Neptune, RDF, SPARQL, property graphs).
  • Solid grounding in

    Python

    , data pipelines, and model-serving frameworks.
  • Hands-on experience with

    AWS services

    (EC2, S3, IAM, networking).
  • Practical understanding of

    Terraform

    or infra-as-code.

Bonus Skills (not required but appreciated)

  • Experience building or maintaining RAG pipelines.
  • Understanding of ML observability and safety.
  • Exposure to healthcare or high-reliability systems.
  • Interest in temporal reasoning, anomaly detection, or multimodal data.

Who You Are

  • You think like a systems engineer—holistic, curious, rigorous.
  • You enjoy technical depth more than buzzwords.
  • You are comfortable exploring ambiguous problems.
  • You want to work on ML infrastructure that actually matters in the real world.
  • You appreciate startups where engineers have autonomy and ownership.

Location

Chennai preferred, hybrid options available.


Qualifications & Experience

Required Qualifications

strong foundational experience

  • Solid hands-on experience with

    Python

    and at least one ML/DS workflow.
  • Working knowledge of

    cloud services

    (AWS preferred; GCP/Azure okay).
  • Experience setting up or maintaining

    data pipelines

    (batch or streaming).
  • Exposure to

    vector search

    , embeddings, or semantic retrieval systems.
  • Good understanding of

    databases

    — SQL, NoSQL, or graph.
  • Experience deploying or managing infra using

    Terraform

    or other IaC frameworks.
  • Ability to design and reason about

    systems

    (latency, scaling, reliability, security).
  • Strong debugging and problem-solving mindset.

Preferred Experience (Good to Have, Not Mandatory)

These are not hard requirements — but if you have them, we’d love to hear about it:

  • Experience with

    graph databases

    (Neo4j, Neptune, RDF, SPARQL, ArangoDB).
  • Familiarity with

    RAG pipelines

    , retrieval frameworks, or knowledge graphs.
  • Hands-on with

    Docker

    , containerized ML workloads, or microservices.
  • Exposure to

    ML Ops tools

    (SageMaker, Vertex, MLFlow, Kubeflow, BentoML).
  • Understanding of

    vector DBs

    like Pinecone, Weaviate, Milvus, or pgvector.
  • Knowledge of

    data modeling

    , schema design, or graph-based reasoning.
  • Experience building systems that require

    safety, accuracy, and auditability

    .
  • Interest in healthcare, IoT, or real-time monitoring systems.

Experience Range

meaningful track record

4–10 years of experience


Why Join Us

You will work on deeply meaningful problems at the frontier of AI, safety, and intelligent automation.

platform engineering

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You