<!doctype html public "-//W3C//DTD HTML 4.01 Transitional//EN" >
<html lang="en">
<head>
<!--***********************change issue date and title below**********************-->
<title>Automated Indexing Implemented for Meeting Abstracts. NLM Technical Bulletin. Sep-Oct 2002</title>
<meta name="DC.Subject.IssueNum" content="328" />
<meta name="DC.Subject.IssueCover" content="/pubs/techbull/so02/so02_issue_cover.html" />
<meta name="DC.Subject.Keyword" content="Abstract" />
<meta name="DC.Subject.Keyword" content="Collections" />
<meta name="DC.Subject.Keyword" content="Data Creation and Maintenance System" />
<meta name="DC.Subject.Keyword" content="ELHILL" />
<meta name="DC.Subject.Keyword" content="Full Text" />
<meta name="DC.Subject.Keyword" content="Health Services Research" />
<meta name="DC.Subject.Keyword" content="Indexing" />
<meta name="DC.Subject.Keyword" content="Medical Literature Analysis and Retrieval System" />
<meta name="DC.Subject.Keyword" content="Medical Subject Headings" />
<meta name="DC.Subject.Keyword" content="Medical Text Indexer" />
<meta name="DC.Subject.Keyword" content="Meeting Abstracts" />
<meta name="DC.Subject.Keyword" content="MeSH Terms" />
<meta name="DC.Subject.Keyword" content="National Library of Medicine" />
<meta name="DC.Subject.Keyword" content="NLM Gateway" />
<meta name="DC.Subject.Keyword" content="Space Life Sciences" />
<meta name="DC.Subject.Keyword" content="Subheading" />
<meta name="DC.Subject.Keyword" content="Substance Name" />
</head>
<body link="#476B47" alink="#476B47" vlink="#476B47" text="#000000" bgcolor="ffffff">
<style type="text/css">
#skip, .skip, .skipnavigation {
position:absolute;
left:0px;
top:-500px;
width:1px;
height:1px;
overflow:hidden;
}
</style>
<div class="skipnavigation"><a title="Skip the navigation on this page" href="#skipnav" class="skipnavigation">Skip Navigation Bar</a></div>
<center>
<a id="skipnav" name="skipnav"></a>
<table border="0" width="550">
<tr><td width="550" align="center" colspan="3">
<img src="/pubs/techbull/new_tb_graphics/header_final.gif" border="0" alt="NLM Technical Bulletin Header"/>
<br />
<img src="/pubs/techbull/new_tb_graphics/toc_328.gif" border="0" alt="Article Navigation Bar" usemap="#subheader_issues"/>
<map name="subheader_issues" id="subheader_issues">
<area shape="RECT" coords="30,20,165,40" href="/pubs/techbull/so02/so02_issue_cover.html" alt="Table of Contents">
<area shape="RECT" coords="215,20,265,40" href="/pubs/techbull/tb.html" alt="NLM Technical Bulletin Home Page">
<area shape="RECT" coords="310,20,395,40" href="/pubs/techbull/back_issues.html" alt="Back Issues">
</map>
</td></tr>
<!--************************indicate date posted below**********************************-->
<!--************************add dates corrected to subsequent lines with [corrected] tag***************************-->
<tr><td width="30">&nbsp;</td><td width="470"><font size="2"><strong>September 19, 2002</strong> [posted]</font><br /></td><td width="30">&nbsp;</td></tr>
<!--************************indicate title of article below*********************-->
<tr><td width="30">&nbsp;</td><td width="470"><font size="5"><strong>Automated Indexing Implemented for Meeting Abstracts</strong></font><br /></td><td width="30">&nbsp;</td></tr>
<tr><td width="30">&nbsp;</td><td width="470">
<!--
<img src="/pubs/techbull/new_tb_graphics/black_pixel.gif" width="450" height="1"/>
-->
<br />
<p>
<!--************************change graphic to match first letter of article*************************-->
<img src="/pubs/techbull/new_tb_graphics/t.gif" border="0" align="left" alt="drop cap letter for t"/>
he National Library of Medicine (NLM) has had a long-standing interest in automated indexing and
has conducted research through its project, the <a href="http://ii.nlm.nih.gov">Indexing Initiative</a>.
The objective of the Indexing Initiative is to investigate methods whereby automated indexing algorithms
partially or completely substitute for traditional indexing. NLM saw an opportunity to apply this
research to meeting abstracts because, as a result of NLM's reinvention initiatives, they are now grouped
in the <a href="http://gateway.nlm.nih.gov">NLM Gateway</a> and are available there in full text.
</p>
<p>
Historically, NLM has collected and provided selected meeting
abstracts in electronic format as part of former specialty databases.
During reinvention, meeting abstracts were converted and added to the NLM Gateway,
which now contains approximately 63,000 meeting abstracts in the areas of HIV/AIDS,
Health Services Research, and Space Life Sciences.
</p>
<p>
Prior to the retirement of the ELHILL<sup><font size="-2">&#174;</font></sup> mainframe
retrieval system and the legacy data creation and maintenance systems for the specialty
databases, indexers assigned Medical Subject Headings (MeSH<sup><font size="-2">&#174;</font></sup>) terms to meeting abstracts.
Meeting abstracts were last updated with the 2001 MeSH vocabulary in the fall of 2000.
New meeting abstracts added to the Gateway since that time have not been indexed.
</p>
<p>
The meeting abstracts on HIV/AIDS were divided into two Gateway collections: AIDS Meetings (with MeSH) and
AIDS Meetings (not indexed with MeSH terms). The abstracts in this latter collection were added
to the Gateway during the period November 2000 through July 2002. The Health Services Research collection contained
both MeSH-indexed and non-indexed meeting abstracts. The Space Life Sciences collection contained
indexed abstracts from professional meetings through 2000.
</p>
<p>
A sample of meeting abstracts was processed through the Medical Text Indexer (MTI)
automated indexing system, produced by the Indexing Initiative. MTI applies several
methods of discovering MeSH headings for titles and abstracts and combines them into an
ordered list of recommended indexing terms. NLM staff reviewed the MTI output for the
sample, and MTI algorithms were adjusted based on their feedback. This iterative
cycle of output review and algorithm adjustment greatly improved the quality of the
recommended indexing terms.
</p>
<p>
NLM decided to use MTI to index meeting abstracts available through the Gateway. Going forward, MeSH terms will be
added whenever a meeting abstract is introduced into the Gateway. The terms applied will be updated each
year in accordance with annual MeSH changes. The terms are stored and displayed in the Keyword field.
There are no MeSH/subheading combinations, and MTI does not indicate major MeSH topics.
</p>
<p>
For the new version of the Gateway released September 19, 2002, all of the existing meeting
abstracts in the NLM Gateway were processed by MTI. Automatically generated MeSH terms were
added to the Keyword field, and all previously existing MeSH terms and substance names were removed. Gateway search
translations were adjusted to take advantage of the new Keywords. The AIDS meeting abstracts are now in
a single collection, AIDS Meetings, because they now share consistent subject analysis
generated by MTI.
</p>
<br /><br />
<p>
<!--************************indicate article author and section below*******************************-->
<strong>By Andrea Demsey</strong><br />
<strong>MEDLARS Management Section</strong><br />
<strong>and</strong><br />
<strong>Sonya Shooshan</strong><br />
<strong>Lister Hill National Center for Biomedical Communications</strong><br />
</p>
<!--
<img src="/pubs/techbull/new_tb_graphics/black_pixel.gif" width="450" height="1"/>
-->
<!--************************indicate indexing terms below**********************-->
<!--
<p><strong>Indexing Terms</strong></p>
<p>Meeting Abstracts, Automated Indexing</p>
<p>Automated Indexing, Meeting Abstracts</p>
<p>NLM Gateway, Automated Indexing, Meeting Abstracts</p>
<p>Indexing Initiative</p>
<p>Medical Text Indexer</p>
<p>Indexing, Automated, Meeting Abstracts</p>
<p>Abstracts, Meeting</p>
<p>Conference Proceedings Abstracts</p>
-->
<!--************************end indexing terms*********************************-->
<img src="/pubs/techbull/new_tb_graphics/black_pixel.gif" width="450" height="1" alt="black line separating article from citation"/>
<p>
<!--************************change citation information below*****************************-->
<em>Demsey A, Shooshan S. Automated Indexing Implemented for Meeting Abstracts. NLM Tech Bull. 2002 Sep-Oct;(328):e2.</em>
<!--************************end citation information**************************************-->
</p>
</td><td width="30">&nbsp;</td></tr>
</table>
<br />
<br />
<img src="/pubs/techbull/new_tb_graphics/footer_328.gif" border="0" alt="Article Navigation Bar" usemap="#footer_final"/>
<map name="footer_final" id="footer_final">
<area shape="RECT" coords="265,5,315,30" href="/pubs/techbull/tb.html" alt="NLM Technical Bulletin Home Page">
<area shape="RECT" coords="330,5,420,30" href="/pubs/techbull/back_issues.html" alt="Back Issues">
<area shape="RECT" coords="435,5,480,30" href="/pubs/techbull/new_index.html" alt="Index">
<area shape="RECT" coords="5,5,90,20" href="/pubs/techbull/so02/so02_technote.html" alt="Previous Page">
<area shape="RECT" coords="445,6,504,20" href="/pubs/techbull/so02/so02_popline.html" alt="Next Article">
</map>
</center>
<!-- BEGIN NLM FOOTER -->
<center>
<font size="2" face="helvetica, arial"><a href="/nlmhome.html">U.S. National Library of Medicine</a>, 8600 Rockville Pike, Bethesda, MD 20894<br /><a href="http://www.nih.gov/">National Institutes of Health</a>, <a href="//www.hhs.gov/">Department of Health &amp; Human Services</a><br /><a href="/copyright.html">Copyright</a>, <a href="/privacy.html">Privacy</a>, <a href="/accessibility.html">Accessibility</a>, <a href="http://www.nih.gov/icd/od/foia/index.htm">Freedom of Information Act (FOIA)</a><br/><a href="https://www.hhs.gov/vulnerability-disclosure-policy/index.html">HHS Vulnerability Disclosure</a>
<br />
<!-- ******************MODIFY "LAST UPDATED" ******************* -->
Last updated: 19 April 2012
</font>
</center>
<!-- END NLM FOOTER -->
<!-- ******************MODIFY EXPDATE AND EMAIL BELOW****************** -->
<!-- EXPDATE="2015-03-20" -->
<!-- EMAIL="nlmtechbull@mail.nlm.nih.gov" -->
<script src="//assets.nlm.nih.gov/jquery/jquery-latest.min.js"></script><script src="/core/nlm-notifyExternal/1.0/nlm-notifyExternal.min.js"></script></body>
</html>