%0 Journal Article
%@ 2564-1891
%I JMIR Publications
%V 4
%N 
%P e53424
%T Application of a Language Model Tool for COVID-19 Vaccine Adverse Event Monitoring Using Web and Social Media Content: Algorithm Development and Validation Study
%A Daluwatte,Chathuri
%A Khromava,Alena
%A Chen,Yuning
%A Serradell,Laurence
%A Chabanon,Anne-Laure
%A Chan-Ou-Teung,Anthony
%A Molony,Cliona
%A Juhaeri,Juhaeri
%+ Epidemiology and Benefit-Risk Department, Sanofi, 1755 Steeles Ave West, Toronto, ON, M2R 3T4, Canada, 1 4166672753, alena.khromava@sanofi.com
%K adverse event
%K COVID-19
%K detection
%K large language model
%K mass vaccination
%K natural language processing
%K pharmacovigilance
%K safety
%K social media
%K vaccine
%D 2024
%7 20.12.2024
%9 Original Paper
%J JMIR Infodemiology
%G English
%X Background: Spontaneous pharmacovigilance reporting systems are the main data source for signal detection for vaccines. However, there is a large time lag between the occurrence of an adverse event (AE) and the availability for analysis. With global mass COVID-19 vaccination campaigns, social media, and web content, there is an opportunity for real-time, faster monitoring of AEs potentially related to COVID-19 vaccine use. Our work aims to detect AEs from social media to augment those from spontaneous reporting systems. Objective: This study aims to monitor AEs shared in social media and online support groups using medical context-aware natural language processing language models. Methods: We developed a language model–based web app to analyze social media, patient blogs, and forums (from 190 countries in 61 languages) around COVID-19 vaccine–related keywords. Following machine translation to English, lay language safety terms (ie, AEs) were observed using the PubmedBERT-based named-entity recognition model (precision=0.76 and recall=0.82) and mapped to Medical Dictionary for Regulatory Activities (MedDRA) terms using knowledge graphs (MedDRA terminology is an internationally used set of terms relating to medical conditions, medicines, and medical devices that are developed and registered under the auspices of the International Council for Harmonization of Technical Requirements for Pharmaceuticals for Human Use). Weekly and cumulative aggregated AE counts, proportions, and ratios were displayed via visual analytics, such as word clouds. Results: Most AEs were identified in 2021, with fewer in 2022. AEs observed using the web app were consistent with AEs communicated by health authorities shortly before or within the same period. Conclusions: Monitoring the web and social media provides opportunities to observe AEs that may be related to the use of COVID-19 vaccines. The presented analysis demonstrates the ability to use web content and social media as a data source that could contribute to the early observation of AEs and enhance postmarketing surveillance. It could help to adjust signal detection strategies and communication with external stakeholders, contributing to increased confidence in vaccine safety monitoring. 
%M 39705077
%R 10.2196/53424
%U https://infodemiology.jmir.org/2024/1/e53424
%U https://doi.org/10.2196/53424
%U http://www.ncbi.nlm.nih.gov/pubmed/39705077