Automatic MeSH term assignment and quality assessment
- PMID: 11825203
- PMCID: PMC2243528
Abstract
For computational purposes, documents or other objects are most often represented by a collection of individual attributes that may be strings or numbers. Such attributes are often called features, and success in solving a given problem can depend critically on the nature of the features selected to represent documents. Feature selection has received considerable attention in the machine learning literature. In the area of document retrieval, we refer to feature selection as indexing. Indexing has not traditionally been evaluated by the same methods used in machine learning feature selection. Here we show how indexing quality may be evaluated in a machine learning setting and apply this methodology to results of the Indexing Initiative at the National Library of Medicine.