Publications

See Google Scholar for the most up-to-date publications.

2026

Extracting Social Determinants of Health From Electronic Health Records: Development and Comparison of Rule-Based and Large Language Model Methods

Bo Wang, Dia Kabir, Cheryl R. Clark, Karmel W. Choi, Jordan W. Smoller

JMIR Medical Informatics

Multi-Criteria Validation of LLM-Inferred Depression Severity from Outpatient Psychiatry Notes

Mihael Cudic, William Meyerson, Bo Wang, Qingqing Yin, Pratik N Khadse, Taylor Burke, Chris J Kennedy, Jordan W Smoller

medRxiv

2025

Prediction of early-onset bipolar using electronic health records

Bo Wang, Yi-Han Sheu, Hyunjoon Lee, Robert G. Mealer, Victor M. Castro, Jordan W. Smoller

Journal of Child Psychology and Psychiatry

Sexual Trauma, Polygenic Scores, and Mental Health Diagnoses and Outcomes

Allison M. Lake, Yu Zhou, Bo Wang, Ky'Era V. Actkins, Yingzhe Zhang, John P. Shelley, Anindita Rajamani, Michael Steigman, Chris J. Kennedy, Jordan W. Smoller, Karmel W. Choi, Nikhil K. Khankari, Lea K. Davis

JAMA Psychiatry

Development and validation of electronic health record-based ascertainment of obsessive-compulsive disorder cases and controls

Bo Wang, Tyne W Miller-Fleming, Dongmei Yu, Donald Hucks, Emily Gantz, Rebecca Johnston, Angela Maxwell-Horn, Nancy Cox, James Sutcliffe, Carol A Mathews, others

medRxiv

2024

A longitudinal multi-modal dataset for dementia monitoring and diagnosis

Dimitris Gkoumas, Bo Wang, Adam Tsakalidis, Maria Wolters, Matthew Purver, Arkaitz Zubiaga, Maria Liakata

Language Resources and Evaluation

Subtle variation in sepsis-III definitions markedly influences predictive performance within and across methods

Samuel N. Cohen, James Foster, Peter Foster, Hang Lou, Terry Lyons, Sam Morley, James Morrill, Hao Ni, Edward Palmer, Bo Wang, Yue Wu, Lingyi Yang, Weixin Yang

Scientific Reports

2022

Template-based abstractive microblog opinion summarization

Iman Munire Bilal, Bo Wang, Adam Tsakalidis, Dong Nguyen, Rob Procter, Maria Liakata

Transactions of the Association for Computational Linguistics (TACL)

BIGBIO: a framework for data-centric biomedical natural language processing

Jason Alan Fries, Leon Weber, Natasha Seelam, Gabriel Altay, Debajyoti Datta, Ruisi Su, Samuele Garda, Sunny MS Kang, Stella Biderman, Matthias Samwald, Stephen H. Bach, Wojciech Kusa, Samuel Cahyawijaya, Fabio Barth, Simon Ott, Mario Sänger, Bo Wang, Alison Callahan, Daniel León Perinan, Théo Gigant, Patrick Haller, Jenny Chim, Jose Posada, John Giorgi, Karthik Rangasai Sivaraman, Marc Pàmies, Marianna Nezhurina, Robert Martin, Moritz Freidank, Nathan Dahlberg, Shubhanshu Mishra, Shamik Bose, Nicholas Broad, Yanis Labrak, Shlok S Deshmukh, Sid Kiblawi, Ayush Singh, Minh Chien Vu, Trishala Neeraj, Jonas Golde, Albert Villanova Moral, Benjamin Beilharz

The 36th Conference on Neural Information Processing Systems (NeurIPS)

Dataset debt in biomedical language modeling

Jason Fries, Natasha Seelam, Gabriel Altay, Leon Weber, Myungsun Kang, Debajyoti Datta, Ruisi Su, Samuele Garda, Bo Wang, Simon Ott, others

BigScience – Workshop on Challenges & Perspectives in Creating Large Language Models

2021

Natural Language Processing markers in first episode psychosis and people at clinical high-risk

Sarah E Morgan, Kelly Diederen, Petra E Vértes, Samantha HY Ip, Bo Wang, Bethany Thompson, Arsime Demjaha, Andrea De Micheli, Dominic Oliver, Maria Liakata, others

Translational psychiatry

Evaluation of thematic coherence in microblogs

Iman Munire Bilal, Bo Wang, Maria Liakata, Rob Procter, Adam Tsakalidis

The 59th Annual Meeting of the Association for Computational Linguistics (ACL)

Modelling paralinguistic properties in conversational speech to detect bipolar disorder and borderline personality disorder

Bo Wang, Yue Wu, Nemanja Vaci, Maria Liakata, Terry Lyons, Kate EA Saunders

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)