Semi-automated screening of biomedical citations for systematic reviews

Byron C Wallace et al. BMC Bioinformatics. 2010 Jan 26;11:55.
doi: 10.1186/1471-2105-11-55.

Abstract

Background: Systematic reviews address a specific clinical question by assessing and analyzing the pertinent literature in an unbiased manner. Citation screening is a time-consuming and critical step in systematic reviews. Typically, reviewers must evaluate thousands of citations to identify articles eligible for a given review. We explore the application of machine learning techniques to semi-automate citation screening, thereby reducing the reviewers' workload.

Results: We present a novel online classification strategy for citation screening to automatically discriminate "relevant" from "irrelevant" citations. We use an ensemble of Support Vector Machines (SVMs) built over different feature-spaces (e.g., abstract and title text), and trained interactively by the reviewer(s). Semi-automating the citation screening process is difficult because any such strategy must identify all citations eligible for the systematic review. This requirement is made harder still due to class imbalance; there are far fewer "relevant" than "irrelevant" citations for any given systematic review. To address these challenges we employ a custom active-learning strategy developed specifically for imbalanced datasets. Further, we introduce a novel undersampling technique. We provide experimental results over three real-world systematic review datasets, and demonstrate that our algorithm is able to reduce the number of citations that must be screened manually by nearly half in two of these, and by around 40% in the third, without excluding any of the citations eligible for the systematic review.

Conclusions: We have developed a semi-automated citation screening algorithm for systematic reviews that has the potential to substantially reduce the number of citations reviewers have to manually screen, without compromising the quality and comprehensiveness of the review.
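
To make the Results paragraph above more concrete, the following is a minimal Python sketch (not the authors' implementation) of an ensemble of linear SVMs built over separate feature spaces, combined with a cautious aggregation rule that excludes a citation only when every ensemble member predicts "irrelevant". It assumes scikit-learn; the class name CautiousSVMEnsemble and the two feature spaces shown are illustrative, class_weight="balanced" is a crude stand-in for the paper's undersampling, and the active-learning component is omitted here.

    # Minimal sketch (not the authors' code): one linear SVM per feature space,
    # combined with a cautious rule that excludes a citation only when every
    # member predicts "irrelevant" (0).
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.svm import LinearSVC

    class CautiousSVMEnsemble:
        def __init__(self, feature_spaces=("title", "abstract")):
            self.vectorizers = {f: TfidfVectorizer() for f in feature_spaces}
            self.models = {f: LinearSVC(class_weight="balanced")
                           for f in feature_spaces}

        def fit(self, citations, labels):
            # citations: list of dicts such as {"title": ..., "abstract": ...}
            # labels: 1 = relevant, 0 = irrelevant
            for f, vec in self.vectorizers.items():
                X = vec.fit_transform([c[f] for c in citations])
                self.models[f].fit(X, labels)
            return self

        def predict(self, citations):
            votes = [self.models[f].predict(vec.transform([c[f] for c in citations]))
                     for f, vec in self.vectorizers.items()]
            # Keep (1) unless every feature-space model votes to exclude (0).
            return [0 if all(v[i] == 0 for v in votes) else 1
                    for i in range(len(citations))]
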


Figures

Figure 1
Shown on the left and right, respectively, are the typical approach and our modified approach, which includes semi-automated abstract screening (see text for details). In the modified approach the reviewers train and use a classification model to exclude completely "irrelevant" citations ("Level 1a"). They trust the model's exclusions and review only the citations suggested by the classification model.
Figure 2
An article is broken down into its component parts (title, abstract text, and keywords), and these are in turn represented as either bag-of-words or bag-of-UMLS-biomedical-concepts vectors.
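
As a rough illustration of the representation described in this caption, the sketch below maps each field of a citation to its own bag-of-words vector and treats UMLS concepts as a bag of pre-extracted concept identifiers. It assumes scikit-learn's CountVectorizer; the example citation, field names, and concept IDs are hypothetical, and the concept extraction step itself (e.g. via a tool such as MetaMap) is not shown.

    # Sketch: one bag-of-words space per textual field, plus a bag of
    # (assumed pre-extracted) UMLS concept identifiers.
    from sklearn.feature_extraction.text import CountVectorizer

    citation = {
        "title": "Proton beam therapy for ocular melanoma",          # hypothetical
        "abstract": "We review outcomes of proton beam therapy ...",  # hypothetical
        "keywords": "proton therapy; melanoma",                       # hypothetical
        "umls_concepts": ["C0033479", "C0025202"],                    # hypothetical IDs
    }

    # One bag-of-words vectorizer per textual feature space.
    vectorizers = {field: CountVectorizer()
                   for field in ("title", "abstract", "keywords")}
    vectors = {field: vec.fit_transform([citation[field]])
               for field, vec in vectorizers.items()}

    # The UMLS feature space is a bag of concept identifiers; the identity
    # analyzer treats the pre-extracted list itself as the tokens.
    concept_vectorizer = CountVectorizer(analyzer=lambda doc: doc)
    concept_vector = concept_vectorizer.fit_transform([citation["umls_concepts"]])

In practice each vectorizer would be fit on the full citation corpus rather than a single record; a single citation is used here only to keep the example short.
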
Figure 3
Construction of confusion matrices for the semi-automated abstract screening strategy. The leftmost matrix represents citations that are labeled by the reviewer while training the classification model. The middle matrix displays the predictions of the trained model over the remaining unlabeled set of citations U. The rightmost matrix shows the corresponding cross-tabulation at the end of "Level 1a" (see Figure 1). The quantities in this figure are used in the definitions of Yield and Burden, the chosen evaluation metrics (see Equations 1 and 2). Superscripts T and U refer to model training and to applying the model to as-yet unlabeled citations, respectively. tp^{T|U}: "true positives", tn^{T|U}: "true negatives", fp^{T|U}: "false positives", fn^{T|U}: "false negatives". We assume that reviewers will never erroneously exclude a citation that is eligible for the systematic review, i.e. fn^T = 0.
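
Equations 1 and 2 themselves are not reproduced on this page; the following LaTeX sketch gives one plausible formulation consistent with the quantities named above, under the assumption that Yield measures the fraction of all eligible citations identified (either during training or by the model) and Burden measures the fraction of the N total citations that a human must screen:

    % Plausible reconstruction, not the published equations.
    % N = total number of citations; superscript T = labeled during training,
    % superscript U = model predictions over the unlabeled pool.
    \mathrm{Yield} = \frac{tp^{T} + tp^{U}}{tp^{T} + tp^{U} + fn^{U}},
    \qquad
    \mathrm{Burden} = \frac{tp^{T} + tn^{T} + fp^{T} + tp^{U} + fp^{U}}{N}
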
Figure 4
Yield (blue) and Burden (red) curves for four learning strategies over the proton beam dataset as a function of the size of the training set. The thick lines are averages over 10 runs; thin lines denote individual runs. Clockwise from the upper left, the strategies shown are: random sampling, SIMPLE, PAL, and PAL with aggressive undersampling. It is desirable to achieve maximum Yield while minimizing Burden. The upper-right corner (100% Yield and 100% Burden) corresponds to the fully manual approach to citation screening. Every point where Yield (the blue line) is at 1.0 and Burden (the red line) is less than 1.0 is thus an improvement. Note that Burden curves are U-shaped because classifiers trained on very small training sets tend to classify the majority of the unlabeled citations as "relevant" (due to our undersampling and cautious aggregation technique), and all citations classified as "relevant" must subsequently be screened by a human. When the training set is very large, the reviewers manually screen the majority of the citations during training.
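
For readers unfamiliar with the training-set-size axis in these plots, the sketch below shows a generic pool-based active-learning loop with uncertainty sampling and aggressive undersampling of the majority ("irrelevant") class. It is an illustrative stand-in rather than the PAL algorithm evaluated in the figure; ask_reviewer_to_label is a hypothetical labeling oracle, scikit-learn is assumed, and the random seed set is assumed to contain citations from both classes.

    # Generic pool-based active-learning loop (illustrative, not PAL).
    import random
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.svm import LinearSVC

    def active_learning_loop(pool_texts, ask_reviewer_to_label,
                             seed_size=50, batch_size=25, rounds=20):
        vectorizer = TfidfVectorizer()
        X = vectorizer.fit_transform(pool_texts)
        labeled = random.sample(range(len(pool_texts)), seed_size)
        labels = {i: ask_reviewer_to_label(i) for i in labeled}
        model = None

        for _ in range(rounds):
            # Aggressive undersampling: keep every "relevant" example and an
            # equally sized random subset of the "irrelevant" ones.
            pos = [i for i in labeled if labels[i] == 1]
            neg = [i for i in labeled if labels[i] == 0]
            train = pos + random.sample(neg, min(len(neg), len(pos)))
            model = LinearSVC(class_weight="balanced").fit(
                X[train], [labels[i] for i in train])

            unlabeled = [i for i in range(len(pool_texts)) if i not in labels]
            if not unlabeled:
                break
            # Uncertainty sampling: query the citations closest to the boundary.
            margins = abs(model.decision_function(X[unlabeled]))
            for j in margins.argsort()[:batch_size]:
                i = unlabeled[j]
                labels[i] = ask_reviewer_to_label(i)
                labeled.append(i)
        return model, vectorizer, labels

With a very small labeled set the undersampled classifier keeps most of the pool as "relevant" (high Burden at the left of the curves); as the training set grows, the reviewer has labeled most citations by hand, which drives Burden back up on the right.
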
Figure 5
Results over the COPD dataset.
Figure 6
Results over the micro-nutrients dataset.

