Data Scientist - Junior/Mid-Senior
At SpicyChat, we’re on a mission to build the best uncensored roleplaying agent in the world, and we’re looking for a passionate Data Scientist to join our team. Whether you’re early in your data science career or growing into a mid-senior role, this is a unique opportunity to work hands-on with state-of-the-art LLMs in a fast-paced, supportive environment.

Role Overview

We’re looking for a Data Scientist (Junior to Mid-Senior level) who will support our LLM projects across the full data pipeline, from building clean datasets and dashboards to fine-tuning models and supporting cross-functional collaboration. You’ll work closely with ML engineers, product teams, and data annotation teams to bring AI solutions to life.

What You’ll Be Doing

ETL and Data Pipeline Development: Design and implement data extraction, transformation, and loading (ETL) pipelines. Work with structured and unstructured data from various sources.
Data Preparation: Clean, label, and organize datasets for training and evaluating LLMs. Collaborate with annotation teams to ensure high data quality.
Model Fine-Tuning & Evaluation: Support the fine-tuning of LLMs for specific use cases. Assist in model evaluation, prompt engineering, and error analysis.
Dashboarding & Reporting: Create and maintain internal dashboards to track data quality, model performance, and annotation progress. Automate reporting workflows to help stakeholders stay informed.
Team Coordination & Collaboration: Communicate effectively with ML engineers, product managers, and data annotators. Ensure that data science deliverables align with product and business goals.
Research & Learning: Stay current with developments in LLMs, fine-tuning techniques, and the AI ecosystem. Share insights with the team and suggest improvements based on new findings.

Qualifications

Required:
1–4 years of experience in a data science, ML, or analytics role.
Proficiency in Python and data science libraries (Pandas, NumPy, scikit-learn).
Experience with SQL and data visualization tools (e.g., Streamlit, Dash, Tableau, or similar).
Familiarity with machine learning workflows and working with large datasets.
Strong communication and organizational skills.

Bonus Points For:
Experience fine-tuning or evaluating large language models (e.g., OpenAI, Hugging Face, LLaMA, Mistral).
Knowledge of prompt engineering or generative AI techniques.
Exposure to tools like Weights & Biases, Airflow, or cloud platforms (AWS, GCP, Azure).
Previous work with cross-functional or remote teams.

Why Join NextDay AI?

🌍 Remote-first: Work from anywhere in the world.
⏰ Flexible hours: Create a schedule that fits your life.
🌴 Unlimited leave: Take the time you need to rest and recharge.
🚀 Hands-on with LLMs: Get practical experience with cutting-edge AI systems.
🤝 Collaborative culture: Join a supportive, ambitious team working on real-world impact.
🌟 Mission-driven: A chance to be part of an exciting mission and an amazing team.

Ready to join us in creating the ultimate uncensored roleplaying agent? Send us your resume along with some details on your coolest projects. We’re excited to see what you’ve been working on!