General Summary:
Are you a data scientist with experience in natural language processing, generative AI, machine learning, and advanced analytics, and looking to apply your expertise in a collaborative and intellectually stimulating environment? Are you passionate about the future of healthcare?
Our enterprise data science team is looking for a Boston- or London-based data scientist with experience in using generative AI methods to solve complex business problems and a passion for addressing business needs through data analysis and solutions. You will work on a highly collaborative team of data scientists, engineers, and strategists to deliver analytical insights and data products that drive value and impact for our highest priority business needs. You’ll work side-by-side with internal partners to apply advanced analytics and generative AI to complex, high-value business problems and digital solutions to deliver measurable decisions and outcomes that contribute meaningfully to our business and patients.
Key Duties And Responsibilities:
- Collaborate with a team of data scientists, engineers, and strategists and our cross-functional partners to conceptualize, deploy, and evaluate custom data science solutions for business problems using generative AI, NLP, machine learning, and other advanced analytical methods
- Design and execute impact measurement plans (e.g. A/B tests) to quantify the value of these solutions
- Lead quantitative assessment and insight generation for enterprise document ecosystem
- Develop advanced insights related to enterprise adoption of generative AI tools such as custom agents
- Build and deliver compelling data visualizations and outputs to communicate findings to technical collaborators, non-technical audiences, and business leaders
- Participate in the broader data science community to stay current with methodology, software, and data development and availability
- Bring an entrepreneurial and ethical mindset, openness, transparency, and collegiality to your work
Minimum Qualifications:
- Bachelor’s, Master’s, or PhD degree in a computational or quantitative discipline, including but not limited to data science, statistics, computer science, computational linguistics, biomedical informatics, neuroscience, physics, epidemiology, health economics
- 5+ years of experience developing data science solutions in an industry or academic context, with 2+ years of experience integrating generative AI methods into document solutions (e.g. custom document generation, summarization, QC)
- Expertise in programming languages (e.g. Python, R, SQL, JavaScript), version control, and other data science related tools (e.g. Snowflake, Databricks, Azure/Foundry, dbt)
- Expertise in working with natural language data and building text-based products, using both classic and state of the art NLP techniques (e.g. text mining, word embeddings, transformer-based models)
- Experience with LLM prompt engineering and familiarity with LLM-based workflows/architectures such as retrieval-augmented generation, multi-agent architectures
- Experience with statistical/analytical methodologies and machine learning algorithms (e.g. classification, regression, clustering, feature selection/engineering, deep learning, time-series analysis, network analysis, hypothesis testing)
- Exceptional communication skills and ability to present findings to non-technical audiences
- Experience in effective data visualization approaches and a keen eye for detail in the visual communication of findings
- Demonstrated history of adherence to highest standards of data ethics
Preferred Qualifications:
- 4+ years of industry GenAI data science experience
- Familiarity with LLMOps, including deployment, monitoring, and maintenance of GenAI data solutions
- Prior experience with using advanced analytics and/or developing advanced data visualizations in business settings
- Familiarity with data product UX/UI design and testing
- Prior exposure to clinical data, real-world data (EMR, claims), manufacturing or supply chain data, or life sciences-related research data
- Knowledge of the biopharma or healthcare industry