A novel natural language processing pipeline (NLP) may be able to help expedite the time-consuming process of gleaning this information from summaries.
Researchers have developed a rule-based natural language processing pipeline (NLP) that can automatically extract the temporal information of a patient’s first seizure onset from epilepsy monitoring unit (EMU) discharge summaries.
Although early onset of seizure activity can be a risk factor for sudden unexpected death in epilepsy (SUDEP), this information is often documented as clinical narratives in EMU summaries and manually extracting it is time consuming and labor intensive, the researchers explain in AMIA Joint Summits on Translational Science.
Around 65 million people worldwide live with epilepsy, including 3.4 million in the United States. SUDEP is the most common cause of dying from seizures in these patients.
To better elucidate the mechanism of SUDEP, the National Institute for Neurological Disorders and Stroke funded the Center for SUDEP Research (CSR), which had enrolled 2739 patients from 7 sites as of March 2022.
“Around 10% (313 out of 3128) of the CSR EMU discharge summaries do not have first seizure onset date explicitly documented using a standard date format. Instead, such temporal information is embedded in clinical narratives,” the authors said.
To speed up extracting first seizure onset information from these summaries, the investigators developed their NLP by randomly picking 300 discharge summaries to form the development set. Based on this, they constructed a collection of 4 rules to “extract temporal expressions related to the first seizure onset from clinical free text.”
They then applied these constructed rules on another 200 unseen summaries to evaluate the efficacy of the approach and compared results with manual evaluations of the summaries carried out by a domain expert.
Researchers found their extraction pipeline had a precision of 0.75, recall of 0.651, and an F1-score of 0.697.
“This is an encouraging initial result, which will allow us to gain insights into potentially better-performing approaches,” they wrote.
Machine learning and deep learning have been used for temporal relation extraction, while hybrid approaches have combined rule-based and machine learning components.
In the current analysis, the investigators applied their approach on 2 test sets. Although their method generally had a good performance, it did differ between these sets. They hypothesized this could be due in part to “the difference between composition of the 2 sets in terms of the types of first seizure onset temporal expressions.”
During an analysis of failure cases, they found the rule-based approach could not identify correct seizure onset temporal expressions when the model did not recognize certain temporal expressions written in the summaries. In other cases, it did not recognize seizure-related events.
The authors also note their rules can only be applied within sentences, and they add an interesting future direction of the work could be to extract first seizure onset by using information spread across multiple sentences.
In the current study, researchers only focused on the extraction of first seizure onset date. However, other temporal factors, like seizure duration and frequency, could be important when assessing SUDEP risk. In the future, they hope to perform a comprehensive study to extract all these temporal expressions from clinical narratives. They also plan to investigate automatically constructing patient seizure timelines based on discharge summaries and other data.
“To the best of our knowledge, this is the first instance where NLP has been applied to extract first seizure onset–related temporal expressions from clinical narratives,” the researchers concluded. “This work is also a first step towards automatically constructing patient timelines for seizure-related events from patient discharge summaries.”
Reference
Tao S, Abeysinghe R, De La Esperanza BT, Lhatoo S, Zhang G-Q, Cui L. Extracting temporal expressions of first seizure onset from epilepsy patient discharge summaries. AMIA Jt Summits Transl Sci Proc. Published online June 16, 2023.
POSITIVE Trial Update on Patients With Breast Cancer Attempting Pregnancy
December 8th 2023At last year’s San Antonio Breast Cancer Symposium, POSITIVE trial primary outcome data were presented on breast cancer–free interval, with women who paused endocrine therapy to attempt pregnancy having a similarly small rate of recurrence vs external controls from the SOFT and TEXT trials: 8.9% vs 9.2%.
Read More
Emily Goldberg Shares Insights as a Genetic Counselor for Breast Cancer Risk Screening
October 30th 2023On this episode of Managed Care Cast, Emily Goldberg, MS, CGC, a genetic counselor at JScreen, breaks down how genetic screening for breast cancer works and why it is so important to increase awareness and education around these screening tools available to patients who may be at risk for cancer.
Listen
Examining Telehealth Uptake to Increase Equitable Care Access
January 26th 2023To mark the publication of The American Journal of Managed Care®’s 12th annual health IT issue, on this episode of Managed Care Cast, we speak with Christopher M. Whaley, PhD, health care economist at the RAND Corporation, who focuses on health economics issues, including the influence of the COVID-19 pandemic on health care delivery.
Listen
What We’re Reading: ChatGPT in Health Care; Menthol Ban Meeting; Health System Cyberattack Impact
November 30th 2023ChatGPT is changing health care while also raising questions about artificial intelligence's promises and limitations; 24 Biden officials met with the National Organization of Black Law Enforcement Executives to discuss the FDA’s proposal to ban menthol cigarettes; a recent hospital cyberattack shows how the vulnerability of health care systems can put patients at risk.
Read More
Provisional CDC data show that US babies gained roughly a year in life expectancy in 2022; study findings indicate that the COVID-19 pandemic significantly raised the risk of preterm birth for expectant California mothers; New York Attorney General Letitia James said about 4 million New Yorkers were affected by a data breach of the medical transcription company Perry Johnson & Associates.
Read More