Description
About Norstella:At Norstella, our mission is simple: to help our clients bring life-saving therapies to market quicker—and help patients in need. Founded in 2022, but with history going back to 1939, Norstella unites best-in-class brands to help clients navigate the complexities at each step of the drug development life cycle —and get the right treatments to the right patients at the right time.Each Organization (Citeline, Evaluate, MMIT, Panalgo, The Dedham Group) Delivers Must-have Answers For Critical Strategic And Commercial Decision-making. Together, Via Our Market-leading Brands, We Help Our Clients
- Citeline – accelerate the drug development cycle
- Evaluate – bring the right drugs to market
- MMIT – identify barrier to patient access
- Panalgo – turn data into insight faster
- The Dedham Group – think strategically for specialty therapeutics
By combining the efforts of each organization under Norstella, we can offer an even wider breadth of expertise, cutting-edge data solutions and expert advisory services alongside advanced technologies such as real-world data, machine learning and predictive analytics. As one of the largest global pharma intelligence solution providers, Norstella has a footprint across the globe with teams of experts delivering world class solutions in the USA, UK, The Netherlands, Japan, China and India.The Role: NLP Data Scientist, AI & Life SciencesWe are seeking a skilled NLP Data Scientist with a focus on cutting-edge Language Models to join our AI & Life Sciences Solutions team. Your expertise in processing and understanding natural language, paired with your experience in Electronic Health Records (EHR) and clinical data analysis, will be crucial in driving our data science initiatives. You will be instrumental in developing rich, multimodal real-world datasets that will accelerate RWD-driven drug development within the pharmaceutical industry.
Responsibilities
- Lead the application of advanced NLP and Large Language Models (LLMs), including state-of-the-art open-source models (e.g., Llama3, Mixtral, Gemma) and other foundational models, to extract and interpret complex, unstructured medical data from diverse sources such as EHRs, clinical notes, and laboratory reports.
- Architect and deploy innovative and scalable NLP solutions that leverage the latest in deep learning to solve complex healthcare challenges, working closely with clinical scientists and data scientists.
- Design and implement robust data pipelines for cleaning, preprocessing, and validating unstructured data, ensuring the accuracy and reliability of all extracted insights.
- Develop and optimize prompt engineering strategies for fine-tuning LLMs and enhancing their performance on specialized clinical tasks.
- Translate complex findings into clear, actionable insights for both technical and non-technical stakeholders, driving data-informed decisions across the organization.
Qualifications
- Advanced Degree: Master's or Ph.D. in Computer Science, Data Science, Computational Linguistics, Computational Biology, Physics, or a related analytical field.
- Clinical Data Expertise: Proven experience (3+ years) in handling and interpreting Electronic Health Records (EHRs) and clinical laboratory data.
- Advanced NLP & Generative AI: Deep experience (3+ years) with modern NLP techniques like semantic search, knowledge graph construction, and few-shot learning.
- LLM Proficiency: Practical, hands-on experience (2+ years) with fine-tuning, prompt engineering, and inference optimization for LLMs.
- Technical Stack: Expert proficiency in Python and SQL, with strong experience using Hugging Face Transformers, PyTorch, and/or TensorFlow. Experience in a cloud environment, specifically AWS, with large-scale data systems.
- MLOps & Workflow Automation: Familiarity with modern MLOps practices (e.g., Git) and a proven track record of developing automated, scalable workflows.
- Analytical Prowess: A strong analytical mindset with excellent problem-solving skills and a detail-oriented approach to data.
- Communication: Exceptional verbal and written communication skills with the ability to articulate complex technical findings to a diverse audience.
Preferred Qualifications
- Healthcare Compliance: Experience managing Protected Health Information (PHI) and a working knowledge of healthcare data privacy laws such as HIPAA.
- Medical Terminologies: Familiarity with standard healthcare codes and terminologies, including ICD-10, CPT, LOINC, and SNOMED CT.
- Advanced Retrieval Systems: Practical experience with Retrieval-Augmented Generation (RAG) systems and vector databases for managing and querying large volumes of unstructured medical documents.
Location: Remote- India
Our Guiding Principles For Success At Norstella
01: Bold, Passionate, and Mission-First02: Integrity, Truth, and Reality03: Kindness, Empathy, and Grace04: Resilience, Mettle, and Perseverance05: Humility, Gratitude, and Learning
Benefits
- Health Insurance
- Provident Fund
- Reimbursement of Certification Expenses
- Gratuity
- 24x7 Health Desk
Norstella is an equal opportunities employer and does not discriminate on the grounds of gender, sexual orientation, marital or civil partner status, pregnancy or maternity, gender reassignment, race, color, nationality, ethnic or national origin, religion or belief, disability or age. Our ethos is to respect and value people’s differences, to help everyone achieve more at work as well as in their personal lives so that they feel proud of the part they play in our success. We believe that all decisions about people at work should be based on the individual’s abilities, skills, performance and behavior and our business requirements. Norstella operates a zero tolerance policy to any form of discrimination, abuse or harassment.
Sometimes the best opportunities are hidden by self-doubt. We disqualify ourselves before we have the opportunity to be considered. Regardless of where you came from, how you identify, or the path that led you here- you are welcome. If you read this job description and feel passion and excitement, we’re just as excited about you.
All legitimate roles with Norstella will be posted on Norstella’s job board which is located at norstella.com/careers. If a role is not posted on this job board, a candidate should assume the role is not a legitimate role with Norstella. Norstella is not responsible for an application that may be submitted by or through a third-party and candidates should proceed with extreme caution if a third-party approaches them about an open role with Norstella. Norstella will never ask for anything of value or any type of payment during or as part of any recruitment, interview, or pre-hire onboarding process. If you are aware of or have reason to believe a job posting purportedly for a role with Norstella is fraudulent or otherwise not authorized by Norstella, please contact the Company using the following email address: ApplicationHelp@norstella.com.