Clinical Context-Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation

doi:10.2196/19810

J Med Internet Res. 2020 Oct 23;22(10):e19810.


Muhammad Afzal^#¹,², Fakhare Alam^#², Khalid Mahmood Malik², Ghaus M Malik³

Affiliations

¹ Department of Software, Sejong University, Seoul, Republic of Korea.
² Department of Computer Science & Engineering, School of Engineering and Computer Science, Oakland University, Rochester, MI, United States.
³ Department of Neurosurgery, Henry Ford Hospital, Detroit, MI, United States.

^# Contributed equally.

PMID: 33095174
PMCID: PMC7647812
DOI: 10.2196/19810


Muhammad Afzal et al. J Med Internet Res. 2020.


Abstract

Background: Automatic text summarization (ATS) enables users to retrieve meaningful evidence from large biomedical repositories to support complex clinical decisions. Deep neural and recurrent networks outperform traditional machine-learning techniques in areas of natural language processing and computer vision; however, they have yet to be explored in the ATS domain, particularly for medical text summarization.

Objective: Traditional approaches in ATS for biomedical text suffer from fundamental issues such as an inability to capture clinical context and quality of evidence, and a lack of purpose-driven selection of passages for the summary. We aimed to circumvent these limitations by achieving precise, succinct, and coherent information extraction from credible published biomedical resources, and by constructing a simplified summary containing the most informative content that can offer a review particular to clinical needs.

Methods: In our proposed approach, we introduce a novel framework, termed Biomed-Summarizer, that provides quality-aware Patient/Problem, Intervention, Comparison, and Outcome (PICO)-based intelligent and context-enabled summarization of biomedical text. Biomed-Summarizer integrates the prognosis quality recognition model with a clinical context-aware model to locate text sequences in the body of a biomedical article for use in the final summary. First, we developed a deep neural network binary classifier for quality recognition to acquire scientifically sound studies and filter out others. Second, we developed a bidirectional long short-term memory recurrent neural network as a clinical context-aware classifier, which was trained on semantically enriched features generated using a word-embedding tokenizer for identification of meaningful sentences representing PICO text sequences. Third, we calculated the similarity between query and PICO text sequences using Jaccard similarity with semantic enrichments, where the semantic enrichments are obtained using medical ontologies. Last, we generated a representative summary from the high-scoring PICO sequences aggregated by study type, publication credibility, and freshness score.
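The third step, Jaccard similarity with semantic enrichment, can be illustrated with a minimal sketch. This is not the authors' implementation: the `SYNONYMS` table is a hypothetical stand-in for a medical ontology lookup, and the tokenizer, function names, and example sentences are assumptions made only for illustration.

```python
import re

# Hypothetical synonym table standing in for a medical ontology lookup
# (assumption for illustration; the paper's ontologies are not reproduced here).
SYNONYMS = {
    "brain": {"intracranial", "cerebral"},
    "intracranial": {"brain", "cerebral"},
}


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def enrich(tokens, synonyms):
    """Add every known synonym of each token to the token set."""
    enriched = set(tokens)
    for token in tokens:
        enriched |= synonyms.get(token, set())
    return enriched


def jaccard(a, b):
    """Jaccard similarity |A ∩ B| / |A ∪ B|; defined as 0.0 for two empty sets."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def enriched_jaccard(query, sentence, synonyms):
    """Jaccard similarity computed on synonym-enriched token sets."""
    return jaccard(
        enrich(tokenize(query), synonyms),
        enrich(tokenize(sentence), synonyms),
    )


query = "clipping for brain aneurysm"
sentence = "surgical clipping of intracranial aneurysm"

baseline = jaccard(tokenize(query), tokenize(sentence))   # 2 shared / 7 total tokens
enriched = enriched_jaccard(query, sentence, SYNONYMS)    # 5 shared / 8 total tokens
print(f"baseline={baseline:.3f} enriched={enriched:.3f}")
```

In this toy example, enrichment raises the score because "brain" and "intracranial" are treated as related terms, which is the kind of effect the semantic enrichment step is intended to capture.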

Results: Evaluation of the prognosis quality recognition model using a large dataset of biomedical literature related to intracranial aneurysm showed an accuracy of 95.41% (2562/2686) in terms of recognizing quality articles. The clinical context-aware multiclass classifier outperformed the traditional machine-learning algorithms, including support vector machine, gradient boosted tree, linear regression, K-nearest neighbor, and naïve Bayes, by achieving 93% (16127/17341) accuracy for classifying five categories: aim, population, intervention, results, and outcome. The semantic similarity algorithm achieved a significant Pearson correlation coefficient of 0.61 (0-1 scale) on the well-known BIOSSES dataset (100 sentence pairs) after semantic enrichment, representing an improvement of 8.9% over baseline Jaccard similarity. Finally, we found a highly positive correlation among the evaluations performed by three domain experts concerning different metrics, suggesting that the automated summarization is satisfactory.
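The Pearson correlation coefficient used to compare algorithm scores with human similarity judgments can be sketched as follows. This is a plain-Python illustration of the standard formula, not the authors' evaluation code; the function name and the toy score lists are assumptions and are not BIOSSES data.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (unnormalized covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Square roots of summed squared deviations (unnormalized std devs).
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sd_x * sd_y)


# Toy values only: hypothetical algorithm scores and human ratings.
algorithm_scores = [0.10, 0.35, 0.40, 0.80]
human_ratings = [0.5, 1.5, 2.0, 3.5]
print(round(pearson(algorithm_scores, human_ratings), 3))
```

A coefficient near 1 means the algorithm ranks and spaces sentence pairs much as the human annotators do; a value near 0 means there is little linear agreement.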

Conclusions: The proposed method, Biomed-Summarizer, achieved high accuracy in ATS, enabling seamless curation of research evidence from the biomedical literature for use in clinical decision-making.

Keywords: automatic text summarization; biomedical informatics; brain aneurysm; deep neural network; semantic similarity; word embedding.

©Muhammad Afzal, Fakhare Alam, Khalid Mahmood Malik, Ghaus M Malik. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 23.10.2020.


Conflict of interest statement

Conflicts of Interest: None declared.

Figures

**Figure 1**
Proposed Biomed-Summarizer architecture with four major components: data preprocessing, quality recognition, context identification, and summary construction. PQR: prognosis quality recognition; CCA: clinical context-aware; PICO: Population/Problem, Intervention, Comparison, Outcome.

**Figure 2**
Process steps of proposed prognosis quality recognition (PQR) model training and testing.

**Figure 3**
Clinical context–aware (CCA) classifier trained on 250-dimension feature vectors, with 100 nodes at the embedding layer, 100 memory units in the long short-term memory (LSTM) hidden layer, and 5 classification nodes.

**Figure 4**
Step-by-step scenario of query execution, retrieval of documents, quality checking, clinical context-aware (CCA) classification, semantic similarity, ranking, and summary creation. A: Aim; P: Population/Patients/Problem; I: Intervention; R: Results; O: Outcome; PICO: Patient/Problem, Intervention, Comparison, Outcome.


Cited by

The McMaster Health Information Research Unit: Over a Quarter-Century of Health Informatics Supporting Evidence-Based Medicine.
Lokker C, McKibbon KA, Afzal M, Navarro T, Linkins LA, Haynes RB, Iorio A. J Med Internet Res. 2024 Jul 31;26:e58764. doi: 10.2196/58764. PMID: 39083765. Free PMC article.
Initial Development of an Automated Platform for Assessing Trainee Performance on Case Presentations.
King AJ, Kahn JM, Brant EB, Cooper GF, Mowery DL. ATS Sch. 2022 Sep 23;3(4):548-560. doi: 10.34197/ats-scholar.2022-0010OC. eCollection 2022 Dec. PMID: 36726701. Free PMC article.
A systematic review of automatic text summarization for biomedical literature and EHRs.
Wang M, Wang M, Yu F, Yang Y, Walker J, Mostafa J. J Am Med Inform Assoc. 2021 Sep 18;28(10):2287-2297. doi: 10.1093/jamia/ocab143. PMID: 34338801. Free PMC article.
Increasing Women's Knowledge about HPV Using BERT Text Summarization: An Online Randomized Study.
Bitar H, Babour A, Nafa F, Alzamzami O, Alismail S. Int J Environ Res Public Health. 2022 Jul 1;19(13):8100. doi: 10.3390/ijerph19138100. PMID: 35805761. Free PMC article. Clinical Trial.
Comparing generative and extractive approaches to information extraction from abstracts describing randomized clinical trials.
Witte C, Schmidt DM, Cimiano P. J Biomed Semantics. 2024 Apr 23;15(1):3. doi: 10.1186/s13326-024-00305-2. PMID: 38654304. Free PMC article.


