Posted:4 days ago| Platform:
On-site
Full Time
Job Description: Machine Learning / Data Science Engineer/Data Scientists Location: Pune Experience Required: 3–6 years Type: Full-Time Education: BTech / MTech / MSc / PhD in Computer Science, Data Science, Applied Mathematics, Statistics, or a related field. About Anervea.ai Anervea.ai is building a next-generation intelligence stack for the pharmaceutical industry. Our products help commercial, clinical, and medical affairs teams make smarter decisions—faster. From predicting the success of clinical trials and decoding competitor movement, to surfacing real-time KOL signals and automating HCP engagement, our platform powers strategic decision-making at scale. We’re not a services firm—we’re a product-first, AI-native company solving real problems using applied machine learning, generative AI, and life sciences data. Our clients include major US and EU pharma companies, and our team is a mix of engineers, researchers, and life science domain experts. We’re looking for ML engineers and data scientists who are passionate about learning, driven to build usable solutions , and ready to push boundaries. Role Overview As an ML / Data Science Engineer at Anervea, you’ll work on designing, training, deploying, and maintaining machine learning models across multiple products. You’ll build models that predict clinical trial outcomes, extract insights from structured and unstructured healthcare data, and support real-time scoring for sales or market access use cases. You’ll collaborate closely with AI engineers, backend developers, and product owners to translate data into product features that are explainable, reliable, and impactful. Key Responsibilities Develop and optimize predictive models using algorithms such as XGBoost, Random Forest, Logistic Regression, and ensemble methods Engineer features from real-world healthcare data (clinical trials, treatment adoption, medical events, digital behavior) Analyze datasets from sources like ClinicalTrials.gov, PubMed, Komodo, Apollo.io, and internal survey pipelines Build end-to-end ML pipelines for inference and batch scoring Collaborate with AI engineers to integrate LLM-generated features with traditional models Ensure explainability and robustness of models using SHAP, LIME, or custom logic Validate models against real-world outcomes and client feedback Prepare clean, structured datasets using SQL and Pandas Communicate insights clearly to product, business, and domain teams Document all processes, assumptions, and model outputs thoroughly Technical Skills Required Strong programming skills in Python (NumPy, Pandas, scikit-learn, XGBoost, LightGBM) Experience with statistical modeling and classification algorithms Solid understanding of feature engineering , model evaluation, and validation techniques Exposure to real-world healthcare, trial, or patient data (strong bonus) Comfortable working with unstructured data and data cleaning techniques Knowledge of SQL and NoSQL databases Familiarity with ML lifecycle tools (MLflow, Airflow, or similar) Bonus: experience working alongside LLMs or incorporating generative features into ML Bonus: knowledge of NLP preprocessing, embeddings, or vector similarity methods Personal Attributes Strong analytical and problem-solving mindset Ability to convert abstract questions into measurable models Attention to detail and high standards for model quality Willingness to learn life sciences concepts relevant to each use case Clear communicator who can simplify complexity for product and business teams Independent learner who actively follows new trends in ML and data science Reliable, accountable, and driven by outcomes—not just code Bonus Qualities Experience building models for healthcare, pharma, or biotech Published work or open-source contributions in data science Strong business intuition on how to turn models into product decisions Show more Show less
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Salary: Not disclosed
Experience: Not specified
Salary: Not disclosed
Experience: Not specified
Salary: Not disclosed
Experience: Not specified
Salary: Not disclosed
Salary: Not disclosed
Salary: Not disclosed
Hyderabad, Telangana, India
Experience: Not specified
Salary: Not disclosed
Experience: Not specified
Salary: Not disclosed
Greater Hyderabad Area
Salary: Not disclosed
Salary: Not disclosed