Job description
About Doaz
- Doaz is a hyper-growth startup on a mission to turn fragmented industrial knowledge into instant, actionable insight. We build LLM- and Vision-AI solutions for construction, heavy industry, and finance leaders who must transform terabytes of drawings, specifications, and regulations into real-time decisions.
- We’re expanding our GeoAI programs (including joint work with POSCO E&C) and launching drawing-change detection services that automatically compare plan versions, detect deltas, and explain design impacts.
Why You’ll Love Working Here
• True 0-to-1 ownership — Ship models that land in production sites within weeks.
• Global impact, lean crew — 30 teammates across KR🇰🇷/PK🇵🇰/IN🇮🇳; no bureaucracy, only builders.
• Tech freedom — YOLO or RT-DETR? Gemma-VL, Qwen-VL, or LLaVA? You choose, we fund.
Role Overview
We’re hiring a Senior Computer Vision & Multimodal LLM Engineer (GeoAI & Drawing Change Analysis).
You’ll lead end-to-end development of a version-aware drawing-diff engine (PDF/DWG raster & vector), symbol/text extraction, and change-impact narratives powered by RAG/LLM. Expect fast cycles from prototype → service: detection models, OCR/layout understanding, retrieval, and explainable outputs that engineers can trust.
Key Responsibilities
Drawing Change Analysis (CV)
- Build a robust diff pipeline for architectural/structural/MEP drawings: rasterization, layer parsing, vector geometry ops, and semantic change clustering.
- Train/finetune detectors & segmenters (e.g., YOLOv8/RT-DETR/Detectron2/SAM) for symbols (columns, openings, sleeves), title blocks, and revision clouds; achieve production-grade mAP/F1.
- Implement geometry-aware post-processing (IoU/topology checks, snapping, graph connectivity) to reduce false positives.
Document & Layout Understanding
- Engineer OCR + layout models (PaddleOCR/Tesseract + DocFormer/LayoutLMv3/Donut) to read legends, notes, schedules, and BOM tables; normalize to structured JSON.
- Build version-aware entity tracking (IDs, gridlines, BH IDs, coordinates) across revisions.
GeoAI & LLM/RAG
- Design retrieval over drawings/specs (BM25 + vector) with reranking; ground LLM answers in evidence with citations and clickable locations.
- Generate change-impact summaries (e.g., slab shear reinforcement, opening proximity to columns) with rules + LLM verification; measure factual precision.
Productization & DevOps
- Ship FastAPI/gRPC microservices, batch & streaming workers (Ray/Celery), GPU inference (Triton/TensorRT), and observability (Prometheus/Grafana).
- Own evaluation: dataset curation, data labeling guidelines, ablation/A-B tests, and regression suites.
Collaboration
- Work closely with domain SMEs (geotech/structural) to encode rules (KDS/KBC, internal standards) and prioritize what matters to the field.
Minimum Qualifications
- 5+ years of production Python (3.x) building ML-heavy backends; strong PyTorch.
- 3+ years in computer vision for detection/segmentation/OCR or document AI at scale.
- Hands-on with multimodal LLM/RAG (LangChain/LlamaIndex), vector DBs (Pinecone/Weaviate/FAISS), and rerankers.
- Proven experience parsing engineering drawings or complex PDFs (vector/raster), including geometry and layout reasoning.
- Solid MLOps: reproducible training, CI/CD, model packaging, monitoring; cloud on AWS/GCP.
- Fluent written & spoken English (Korean a plus).
Preferred Extras
- GPU orchestration (Kubernetes/Ray/Slurm), high-performance inference (ONNX/TensorRT).
- Experience with VLMs (Gemma-VL, Qwen-VL, LLaVA), CLIP, or doc-layout models.
- Open-source contributions, papers, or strong public demos in CV/doc AI/RAG.
- Full-stack chops (TypeScript/Next.js/React) for quick operator tools and review UIs.
Compensation & Benefits
- Competitive base salary (market-leading) , around 20 lakh (yearly)
- Performance-based annual bonus (up to 20%).
- cloud credits, and AI tools support.
Hiring Process (≈ 2–3 weeks)
- Quick intro call (15 min, mutual fit).
- 48-hour take-home: Drawing Diff + Evidence-Grounded Summary (provide code + short README; clarity > polish).
- Deep-dive tech interview: architecture, modeling choices, evaluation, and scaling plan.
- Culture & vision chat with Founder/CEO.
- Offer — if all green, written offer within 24 h.
How to Apply
Email doaz@doaz.ai with subject [CV/LLM Engineer – Your Name] and include:
- Résumé/CV with measurable outcomes (metrics, latency, cost, accuracy).
- Current or recent salary.
- GitHub and/or live demos of CV/doc-AI/RAG work (links preferred).
- A one-page diagram of your “Drawing Revision → Detection → Evidence → LLM Narrative” pipeline, noting models, retrieval, and evaluation metrics.
Employment type: Full-time
Ready to turn messy drawings and specs into instant, trusted intelligence?
Let’s build the future together at Doaz.