We are seeking a Data Scientist (AI/ML Ops) to take a central role in shaping how AI models are operationalized and brought to life at scale. In this role, you will design, implement, and optimize advanced data pipelines, model evaluation frameworks, and monitoring processes that ensure AI systems are trained and deployed with accuracy, efficiency, and compliance. Youll apply your expertise in data science and ML operationalization to create automated feedback loops, establish robust evaluation metrics, and drive continuous improvement across the AI lifecycle. This is a high-impact opportunity to work on frontier AI initiatives with global reach, partnering directly with leading Big Tech enterprises. You will collaborate closely with Data Engineers, AI Researchers, IT, and Product Teams to translate complex business challenges into data-driven solutions, while influencing best practices in Responsible AI and MLOps. Beyond building pipelines and models, you’ll help define the standards for how safe, scalable, and trustworthy AI systems are delivered worldwide. Role & responsibilities Design data preprocessing pipelines for multi-modal datasets (text, audio, video, images). Engineer features and transformations for model training, fine-tuning, and evaluation. Apply data augmentation, synthetic data generation, and bias mitigation techniques. Develop and evaluate ML models for classification, ranking, clustering, and anomaly detection tailored to operational use cases. Expertise in power analysis, significance testing, and advanced methods such as bootstrapping and Bayesian inference. Create offline/online evaluation metrics (accuracy, F1, precision/recall, latency, fairness metrics) to track model performance. Support RLHF pipelines by integrating human feedback into model optimization loops. Collaborate with engineering teams to integrate models into production pipelines. Design automated monitoring, retraining triggers, and continuous improvement loops to maintain model quality in dynamic environments. Build dashboards and reporting frameworks for real-time monitoring of model and data health. Define labeling strategies, taxonomy structures, and annotation guidelines for supervised learning. Partner with annotation platforms to ensure accuracy, throughput, and SLA compliance. Perform statistical quality checks (inter-annotator agreement, sampling-based audits). Partner with clients, product teams, and operations leads to translate business problems into data/ML solutions. Work with Legal/Privacy teams to ensure datasets meet regulatory and ethical AI standards. Contribute to internal best practices, reusable frameworks, and knowledge bases for AI/ML Operations. Preferred candidate profile Masters or Ph.D. in Computer Science, Data Science, Applied Mathematics, or related fields. 6+ years of experience in ML/AI data science with a focus on operational deployment. Track record of delivering ML models into production environments at scale. Programming: Python (NumPy, Pandas, Scikit-learn, PyTorch/TensorFlow), SQL. ML Techniques: Supervised/unsupervised learning, NLP, CV, RLHF. MLOps Tools: MLflow, Kubeflow, SageMaker, Vertex AI, or similar. Data Handling: Spark, Dataflow, or equivalent big data frameworks. Visualization & Reporting: Tableau, Looker, or similar BI tools. Experience with Generative AI model evaluation (LLMs, diffusion models). Knowledge of Responsible AI frameworks (bias/fairness metrics, explainability). Exposure to Trust & Safety operations or high-volume data environments.