Senior CV/LLM Engineer

5 years

Posted: 1 week ago | Platform: LinkedIn

Work Mode

On-site

Job Type

Full Time

Job Description

About Doaz

  • Doaz is a hyper-growth startup on a mission to turn fragmented industrial knowledge into instant, actionable insight. We build LLM- and Vision-AI solutions for construction, heavy industry, and finance leaders who must transform terabytes of drawings, specifications, and regulations into real-time decisions.
  • We’re expanding our GeoAI programs (including joint work with POSCO E&C) and launching drawing-change detection services that automatically compare plan versions, detect deltas, and explain design impacts.

Why You’ll Love Working Here

• True 0-to-1 ownership — Ship models that land in production sites within weeks.

• Global impact, lean crew — 30 teammates across KR🇰🇷/PK🇵🇰/IN🇮🇳; no bureaucracy, only builders.

• Tech freedom — YOLO or RT-DETR? Gemma-VL, Qwen-VL, or LLaVA? You choose, we fund.

Role Overview

We’re hiring a Senior Computer Vision & Multimodal LLM Engineer (GeoAI & Drawing Change Analysis).

You’ll lead end-to-end development of a version-aware drawing-diff engine (PDF/DWG raster & vector), symbol/text extraction, and change-impact narratives powered by RAG/LLM. Expect fast cycles from prototype → service: detection models, OCR/layout understanding, retrieval, and explainable outputs that engineers can trust.


Key Responsibilities

Drawing Change Analysis (CV)

  • Build a robust diff pipeline for architectural/structural/MEP drawings: rasterization, layer parsing, vector geometry ops, and semantic change clustering.
  • Train/fine-tune detectors and segmenters (e.g., YOLOv8/RT-DETR/Detectron2/SAM) for symbols (columns, openings, sleeves), title blocks, and revision clouds; achieve production-grade mAP/F1.
  • Implement geometry-aware post-processing (IoU/topology checks, snapping, graph connectivity) to reduce false positives.
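
To give a flavor of the geometry-aware post-processing described above, here is a minimal sketch of an IoU-based comparison between two drawing revisions. It assumes detections are axis-aligned boxes in (x1, y1, x2, y2) form and uses a simple greedy matching; the names `iou` and `diff_boxes` and the 0.5 threshold are illustrative assumptions, not Doaz's actual pipeline.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), assumed axis-aligned


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def diff_boxes(old: List[Box], new: List[Box],
               threshold: float = 0.5) -> Tuple[List[Box], List[Box]]:
    """Greedy one-to-one matching between revisions.

    Returns (removed, added): boxes in `old` with no match in `new`,
    and boxes in `new` left unmatched.
    """
    unmatched_new = list(new)
    removed: List[Box] = []
    for o in old:
        # Pick the remaining new box that overlaps this old box the most.
        best = max(unmatched_new, key=lambda n: iou(o, n), default=None)
        if best is not None and iou(o, best) >= threshold:
            unmatched_new.remove(best)  # treated as the same symbol
        else:
            removed.append(o)
    return removed, unmatched_new


old_rev = [(0, 0, 10, 10), (20, 20, 30, 30)]
new_rev = [(1, 1, 11, 11), (50, 50, 60, 60)]
removed, added = diff_boxes(old_rev, new_rev)
```

In this toy example the slightly shifted first box is treated as unchanged, while the second old box is reported as removed and the new box at (50, 50, 60, 60) as added. A production system would likely add topology and snapping checks on top of this kind of overlap test.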

Document & Layout Understanding

  • Engineer OCR + layout models (PaddleOCR/Tesseract + DocFormer/LayoutLMv3/Donut) to read legends, notes, schedules, and BOM tables; normalize to structured JSON.
  • Build version-aware entity tracking (IDs, gridlines, BH IDs, coordinates) across revisions.

GeoAI & LLM/RAG

  • Design retrieval over drawings/specs (BM25 + vector) with reranking; ground LLM answers in evidence with citations and clickable locations.
  • Generate change-impact summaries (e.g., slab shear reinforcement, opening proximity to columns) with rules + LLM verification; measure factual precision.
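
As a rough illustration of combining BM25 and vector retrieval before reranking, the sketch below merges two ranked lists with reciprocal rank fusion (RRF). RRF is one common fusion choice picked here as an assumption; the posting does not name a specific method, and the document IDs and the constant `k = 60` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List


def reciprocal_rank_fusion(rankings: List[List[str]], k: int = 60) -> List[str]:
    """Fuse several ranked lists of document IDs.

    Each document scores sum(1 / (k + rank)) over the lists it appears in;
    a higher fused score means a better final position.
    """
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Sort by descending score, breaking ties by ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


bm25_hits = ["spec_12", "dwg_A3", "dwg_B1"]    # hypothetical lexical results
vector_hits = ["dwg_A3", "dwg_C7", "spec_12"]  # hypothetical embedding results
fused = reciprocal_rank_fusion([bm25_hits, vector_hits])
```

Documents that appear high in both lists (here `dwg_A3` and `spec_12`) rise to the top, and the fused list can then be passed to a reranker and cited as evidence in the LLM answer.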

Productization & DevOps

  • Ship FastAPI/gRPC microservices, batch & streaming workers (Ray/Celery), GPU inference (Triton/TensorRT), and observability (Prometheus/Grafana).
  • Own evaluation: dataset curation, data labeling guidelines, ablation and A/B tests, and regression suites.

Collaboration

  • Work closely with domain SMEs (geotech/structural) to encode rules (KDS/KBC, internal standards) and prioritize what matters to the field.

Minimum Qualifications

  • 5+ years of production Python (3.x) building ML-heavy backends; strong PyTorch.
  • 3+ years in computer vision for detection/segmentation/OCR or document AI at scale.
  • Hands-on with multimodal LLM/RAG (LangChain/LlamaIndex), vector DBs (Pinecone/Weaviate/FAISS), and rerankers.
  • Proven experience parsing engineering drawings or complex PDFs (vector/raster), including geometry and layout reasoning.
  • Solid MLOps: reproducible training, CI/CD, model packaging, monitoring; cloud on AWS/GCP.
  • Fluent written & spoken English (Korean a plus).

Preferred Extras

  • GPU orchestration (Kubernetes/Ray/Slurm), high-performance inference (ONNX/TensorRT).
  • Experience with VLMs (Gemma-VL, Qwen-VL, LLaVA), CLIP, or doc-layout models.
  • Open-source contributions, papers, or strong public demos in CV/doc AI/RAG.
  • Full-stack chops (TypeScript/Next.js/React) for quick operator tools and review UIs.

Compensation & Benefits

  • Competitive, market-leading base salary of around 20 lakh per year.
  • Performance-based annual bonus (up to 20%).
  • Cloud credits and AI tools support.


Hiring Process (≈ 2–3 weeks)

  • Quick intro call (15 min, mutual fit).
  • 48-hour take-home: Drawing Diff + Evidence-Grounded Summary (provide code + short README; clarity > polish).
  • Deep-dive tech interview: architecture, modeling choices, evaluation, and scaling plan.
  • Culture & vision chat with Founder/CEO.
  • Offer — if all green, written offer within 24 h.


How to Apply

Email doaz@doaz.ai with subject [CV/LLM Engineer – Your Name] and include:

  • Résumé/CV with measurable outcomes (metrics, latency, cost, accuracy).
  • Current or recent salary.
  • GitHub and/or live demos of CV/doc-AI/RAG work (links preferred).
  • A one-page diagram of your “Drawing Revision → Detection → Evidence → LLM Narrative” pipeline, noting models, retrieval, and evaluation metrics.


Employment type: Full-time

Ready to turn messy drawings and specs into instant, trusted intelligence?

Let’s build the future together at Doaz.

