Oindrila Ray

1 Job openings at Oindrila Ray
Artificial Intelligence Engineer kolkata 3 - 5 years INR 3.0 - 8.0 Lacs P.A. Remote Full Time

We are looking for a skilled AI Engineer / Data Scientist with hands-on experience in OCR pipelines and Large Language Model (LLM) fine-tuning . The ideal candidate will work on developing, fine-tuning, and optimizing Vision-Language Models (VLMs) to extract structured information from scanned or image-based documents. You will collaborate closely with data scientists and backend engineers to build scalable, accurate, and production-ready AI pipelines . Key Responsibilities Fine-tune and adapt multimodal LLMs (e.g., Qwen-VL, LLaVA, or similar) for domain-specific document understanding. Design prompt templates and instruction sets to improve structured JSON output. Perform incremental and cross-dataset fine-tuning for robust generalization. Implement evaluation metrics and create validation datasets to track performance. Optimize inference using quantization, LoRA adapters, and frameworks such as Ray Serve, vLLM, or Unsloth. Collaborate with backend teams to integrate models into production systems . Develop monitoring tools to log model confidence, token usage, and latency metrics. Required Skills & Experience Strong programming skills in Python , with experience in PyTorch and Hugging Face Transformers . Expertise in OCR tools like PaddleOCR, Tesseract, or EasyOCR . Hands-on experience in fine-tuning and serving LLMs / VLMs (e.g., Qwen, LLaVA, Mistral, Vicuna). Solid understanding of LoRA / QLoRA / PEFT for efficient model training. Experience with structured data extraction , prompt engineering , and JSON schema generation . Familiarity with Unsloth, Ray Serve, or vLLM for scalable inference. Proficiency with Docker, CUDA, and NVIDIA GPU environments . Strong grasp of tokenization, attention mechanisms, and quantization (4-bit / 8-bit) . Nice to Have Experience with LLM model serving and orchestration . Exposure to CI/CD pipelines and deployment on AWS, Azure, or GCP . Knowledge of PDF parsing tools (Camelot, PyMuPDF, etc.). Prior work in document intelligence or AI-based invoice/data extraction systems . What We Offer Opportunity to work on cutting-edge Vision-Language AI models . Hands-on involvement in production-grade OCR + LLM pipelines . Competitive compensation and a flexible work culture . Collaborative environment that fosters AI innovation in enterprise automation.