Posted:1 week ago| Platform:
On-site
Full Time
Key Responsibilities Provide strategic leadership for the AI Alignment division, encompassing Trust and Safety, Interpretability for Pricing of information. Develop and implement comprehensive strategies for AI alignment, including safety measures, interpretability techniques, and robust red teaming protocols. Drive the integration of advanced safety and interpretability techniques such as RLHF, DPO, PPO, LIME, and SHAP across our AI development pipeline. Collaborate with product and research teams to define and implement safety and interpretability aspects that ensure our AI models deliver helpful, honest, and transparent outputs. Lead cross-functional initiatives to integrate safety measures and interpretability throughout the AI development lifecycle. Represent the company in industry forums, conferences, and regulatory discussions related to AI alignment and ethics. Manage resource allocation, budgeting, and strategic planning for the AI Alignment division. Mentor and develop team members, fostering a collaborative and innovative research environment. Liaise with executive leadership to communicate progress, challenges, and strategic recommendations for AI alignment efforts Required Qualifications: Master’s or Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, or a related field. Strong knowledge of machine learning, deep learning, and reinforcement learning techniques. Experience with frameworks such as TensorFlow, PyTorch, or JAX. Proficiency in programming languages such as Python, C++, or Java. Experience in handling large-scale datasets and distributed computing. Strong problem-solving skills and ability to work independently and in a team environment. Excellent written and verbal communication skills. Show more Show less
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Greater Bengaluru Area
0.0 - 0.0 Lacs P.A.
Pune, Maharashtra, India
0.0 - 0.0 Lacs P.A.
Gurugram, Haryana, India
0.0 - 0.0 Lacs P.A.