Job
Description
As a Machine Learning Engineer at our company, you will play a crucial role in solving complex problems and assisting stakeholders in making data-driven decisions by leveraging quantitative methods, particularly machine learning. Your primary responsibilities will involve synthesizing a large volume of information and extracting crucial signals from data in a programmatic manner. Here's what you can expect in this role: - Architect and train CNN/ViT models for tasks such as classification, detection, segmentation, and OCR. - Build and optimize RNN/LSTM/GRU models for sequence learning, speech, or time-series forecasting. - Research and implement transformer-based architectures that bridge vision and language tasks. - Create scalable pipelines for data ingestion, annotation, augmentation, and synthetic data generation. - Design and implement multi-agent workflows using frameworks like LangChain, LangGraph, CrewAI, or similar tools. - Develop role hierarchies, state graphs, and integrations to enable autonomous vision + language workflows. - Fine-tune open-weight LLMs using methods like LoRA/QLoRA, PEFT, or RLHF. - Develop RAG pipelines that integrate vector databases such as FAISS, Weaviate, pgvector. - Develop reproducible training workflows with PyTorch/TensorFlow and experiment tracking using tools like W&B, MLflow. - Deploy models using TorchServe, Triton, or KServe on cloud AI stacks like AWS Sagemaker, GCP Vertex, Kubernetes. - Develop robust APIs/micro-services using FastAPI, gRPC, and ensure CI/CD, monitoring, and automated retraining. You are an ideal candidate for this role if you possess: - B.S./M.S. in Computer Science, Electrical Engineering, Applied Math, or a related discipline. - 5+ years of experience building deep learning systems with CNNs and RNNs in a production environment. - Strong Python skills and proficiency in Git workflows. - Proven track record of delivering computer vision pipelines for tasks such as OCR, classification, and detection. - Hands-on experience with LLM fine-tuning and multimodal AI. - Experience with containerization (Docker) and deployment on cloud AI platforms. - Knowledge of distributed training, GPU acceleration, and inference optimization. Preferred qualifications include: - Research experience in transformer architectures like ViTs and hybrid CNN-RNN-Transformer models. - Previous work in sequence modeling for speech or time-series data. - Contributions to open-source deep learning frameworks or vision/sequence datasets. - Experience with edge AI deployment and hardware optimization. If you join our team, you will be expected to uphold our company values of Integrity, Accountability, Inclusion, Innovation, and Teamwork. The job location is in Bangalore, Karnataka, India, with a travel requirement of 10% to 25%. The Requisition ID for this position is 142959. Please note that the workplace type is an External Careers Page in the Information Technology field.,