<html>
<head>
<title>Entrez Help Document: Summary Matrices</title>
<script LANGUAGE="JavaScript">
</script>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<!-- if you use the following meta tags, uncomment them.
<META NAME="keywords" CONTENT="insert your keywords for the search engine">
<META NAME="description" CONTENT="insert the description to be displayed by the search engine. Also searched by the search engine.">
-->
<link rel="stylesheet" href="http://www.ncbi.nlm.nih.gov/corehtml/ncbi.css">
</head>
<body bgcolor="#FFFFFF" text="#000000" link="#CC6600" vlink="#CC6600">
<!-- the header -->
<table border="0" width="600" cellspacing="0" cellpadding="0">
<tr>
<td width="140"><a href="http://www.ncbi.nlm.nih.gov"> <img src="http://www.ncbi.nlm.nih.gov/corehtml/left.GIF" width="130" height="45" border="0"></a></td>
<td width="360" class="head1" valign="BOTTOM"> <span class="H1">Entrez Help Document</span></td>
<td width="100" valign="BOTTOM"></td>
</tr>
</table>
<!-- the quicklinks bar -->
<table CLASS="TEXT" border="0" width="600" cellspacing="0" cellpadding="3" bgcolor="#000000">
<tr CLASS="TEXT" align="CENTER">
<td width="100"><a href="http://www.ncbi.nlm.nih.gov/PubMed/" class="BAR">PubMed</a></td>
<td width="100"><a href="http://www.ncbi.nlm.nih.gov/Entrez/" class="BAR">Entrez</a></td>
<td width="100"><a href="http://www.ncbi.nlm.nih.gov/BLAST/" class="BAR">BLAST</a></td>
<td width="100"><a href="http://www.ncbi.nlm.nih.gov/omim/" class="BAR">OMIM</a></td>
<td width="100"><a href="http://www.ncbi.nlm.nih.gov/Taxonomy/taxonomyhome.html" class="BAR">Taxonomy</a></td>
<td width="100"><a href="http://www.ncbi.nlm.nih.gov/Structure/" class="BAR">Structure</a></td>
</tr>
</table>
<!--start of 2nd table -->
<p>Last modified: July 18, 2000</p>
<table border="0" width="460" cellspacing="0" cellpadding="0">
<tr>
<p><b>Summary Matrices</b><p>
This document provides the following summary tables for the Entrez
Nucleotide, Protein, Genome, Structure, and Popset data domains:</p>
<blockquote>
<a href="#Limits_Available_by_Database">Limits Available by Database</a><br>
<a href="#Indexes_Available_by_Database">Search Fields Available by Database</a><br>
<a href="#Search_Fields_and_Qualifiers">Search Field Descriptions and Qualifiers</a><br>
<a href="#Display_Formats">Display Formats</a></p>
</blockquote>
The <a href="/corehtml/query/static/help/pmhelp.html">PubMed help document</a> contains separate information about
the <a href="/corehtml/query/static/help/pmhelp.html#Limits">Limits</a>, <a href="/corehtml/query/static/help/pmhelp.html#SearchFieldDescriptionsandTags">Search Fields</a>, and <a href="/corehtml/query/static/help/pmhelp.html#DisplayingDocuments">Display Formats</a> available for that database.</p>
<A HREF="helpdoc.html">Back</A> to the Entrez Help Document</p>
<table border width="460" cellspacing="0" cellpadding="5">
<caption align=top>
<h2><A NAME="Limits_Available_by_Database"></A>Limits Available by Database</h2></caption>
<tr>
<td colspan=6 rowspan=13></td>
<th colspan=6 align=center>Databases</th>
</tr>
<tr>
<th>Limits</th>
<th>Nucleotide</th>
<th>Protein</th>
<th>Genome</th>
<th>Structure</th>
<th>PopSet</th>
</tr>
<tr align=center>
<th><A HREF="#Indexes_Available_by_Database">Search Fields</A></th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Exclude ESTs</th>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Exclude STSs</th>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Exclude GSSs</th>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Exclude Working Draft</th>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Exclude Patents</th>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Molecule Type</th>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Gene Location</th>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Segmented Sequences</th>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Database Source</th>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Modification Date</th>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
</table>
<P>
<A HREF="helpdoc.html">Back</A> to the Entrez Help Document</p>
<table border width="460" cellspacing="0" cellpadding="5">
<caption align=top>
<h2><A NAME="Indexes_Available_by_Database"></A>Search Fields Available by Database</h2></caption>
<tr>
<td colspan=6 rowspan=28></td>
<th colspan=1 align=center></th>
<th colspan=5 align=center>Databases</th>
</tr>
<tr>
<th colspan=1 align=center><A HREF="#Search_Fields_and_Qualifiers">Search Field Descriptions and Qualifiers</A></th>
<th colspan=1 align=center>Nucleotide</th>
<th colspan=1 align=center>Protein</th>
<th colspan=1 align=center>Genome</th>
<th colspan=1 align=center>Structure</th>
<th colspan=1 align=center>PopSet</th>
</tr>
<tr align=center>
<th>Accession</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>All Fields</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Author Name</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>EC/RN Number</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Feature Key</th>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Filter</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Gene Name</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Issue</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Journal Name</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Keyword</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Modification Date</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Molecular Weight</th>
<td>No</td>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Organism</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Page Number</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Primary Accession</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Properties</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Protein Name</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Publication Date</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>SeqID String</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Sequence Length</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Substance Name</th>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>Yes</td>
<td>No</td>
</tr>
<tr align=center>
<th>Text Word</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr align=center>
<th>Title Word</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Uid</th>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr align=center>
<th>Volume</th>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
</table></p>
<A HREF="helpdoc.html">Back</A> to the Entrez Help Document</p>
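<p>The availability matrix above can also be expressed as a small lookup. The following is a minimal sketch in Python that transcribes the table as printed; the names <code>DATABASES</code>, <code>ALWAYS_AVAILABLE</code>, <code>UNAVAILABLE_IN</code>, <code>is_available</code>, and <code>databases_for</code> are hypothetical and are not part of any Entrez interface.</p>

```python
# Databases covered by the table, in column order.
DATABASES = ("Nucleotide", "Protein", "Genome", "Structure", "PopSet")

# Fields the table marks "Yes" for all five databases.
ALWAYS_AVAILABLE = {
    "Accession", "All Fields", "Author Name", "EC/RN Number", "Filter",
    "Issue", "Journal Name", "Modification Date", "Organism", "Page Number",
    "Publication Date", "Text Word", "Volume",
}

# Fields the table marks "No" for at least one database (transcribed as printed).
UNAVAILABLE_IN = {
    "Feature Key": {"Protein", "Structure"},
    "Gene Name": {"Structure"},
    "Keyword": {"Structure"},
    "Molecular Weight": {"Nucleotide", "Genome", "Structure", "PopSet"},
    "Primary Accession": {"Structure"},
    "Properties": {"Structure"},
    "Protein Name": {"Structure"},
    "SeqID String": {"Structure"},
    "Sequence Length": {"Structure", "PopSet"},
    "Substance Name": {"Genome", "PopSet"},
    "Title Word": {"Structure", "PopSet"},
    "Uid": set(DATABASES),
}


def is_available(field: str, database: str) -> bool:
    """Return True if the table lists the field as searchable in the database."""
    if database not in DATABASES:
        raise ValueError(f"unknown database: {database}")
    if field in ALWAYS_AVAILABLE:
        return True
    if field not in UNAVAILABLE_IN:
        raise ValueError(f"unknown search field: {field}")
    return database not in UNAVAILABLE_IN[field]


def databases_for(field: str) -> list:
    """List the databases, in column order, where the field is available."""
    return [db for db in DATABASES if is_available(field, db)]
```

<p>For example, <code>databases_for("Molecular Weight")</code> returns only the Protein database, matching the single "Yes" in that row.</p>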
<table border width="460" cellspacing="0" cellpadding="5">
<caption align=top>
<h2><A NAME="Search_Fields_and_Qualifiers"></A>Search Field Descriptions and Qualifiers</h2></caption>
<tr>
<td colspan=3 rowspan=26></td>
<th colspan=1>Search Field</th>
<th colspan=1>Definition</th>
<th colspan=1>Qualifier</th>
</tr>
<tr>
<td>Accession</td>
<td>Contains the unique accession number of the sequence or record, assigned
to the nucleotide, protein, structure, genome record, or PopSet by a sequence
database builder. The Structure database accession index contains the PDB IDs but
not the MMDB IDs.</td>
<td>[ACCN]</td>
</tr>
<tr>
<td>All Fields</td>
<td>Contains all terms from all searchable database fields in the database.</td>
<td>[ALL]</td>
</tr>
<tr>
<td>Author Name</td>
<td>Contains all authors from all references in the
database records. The format is the last name, a space, and then the first initial(s),
without punctuation (e.g., marley jf).</td>
<td>[AUTH]</td>
</tr>
<tr>
<td>EC/RN Number</td>
<td>Number assigned by the Enzyme Commission or Chemical Abstracts Service (CAS)
to designate a particular enzyme or chemical, respectively.</td>
<td>[ECNO]</td>
</tr>
<tr>
<td>Feature Key</td>
<td>Contains the biological features assigned or annotated to the
nucleotide sequences and defined in the DDBJ/EMBL/GenBank Feature Table
(http://www.ncbi.nlm.nih.gov/projects/collab/FT/index.html). Not available for
the Protein or Structure databases.</td>
<td>[FKEY]</td>
</tr>
<tr>
<td>Filter</td>
<td>Contains predetermined or filtered subsets of the various databases. These
subsets or filters are created by grouping records that are commonly linked
to other Entrez databases or within the same database.<P>
For example, the PopSet database Filter index includes PopSet all, PopSet medline, PopSet
nucleotide, and PopSet protein. The PopSet medline filter includes all PopSet records with
links to PubMed; the PopSet nucleotide filter includes all PopSet records with links to the
nucleotide database; and the PopSet protein filter includes all PopSet records with links to
the protein database. The PopSet all filter includes all PopSet records.<P>The Nucleotide database
Filter index contains many more filters because its records are linked
to numerous external resources. For more information, see <A HREF="helpdoc.html#Link_Out">Link Out</A>.</td>
<td>[FILT]</td>
</tr>
<tr>
<td>Gene Name</td>
<td>Contains the standard and common names of genes found in the database records. This field is not
available in the Structure database.</td>
<td>[GENE]</td>
</tr>
<tr>
<td>Issue</td>
<td>Contains the issue number of the journal in which the data were published.</td>
<td>[ISS]</td>
</tr>
<tr>
<td>Journal Name</td>
<td>Contains the name of the journal in which the data were published. Journal
names are indexed in the database in abbreviated form (e.g., J Biol Chem). Journals are also
indexed by their ISSNs. Browse the index if you do not know the ISSN or are not sure how a
particular journal name is abbreviated.</td>
<td>[JOUR]</td>
</tr>
<tr>
<td>Keyword</td>
<td>Contains special index terms from the
controlled vocabularies associated with the GenBank, EMBL, DDBJ,
SWISS-PROT, PIR, PRF, or PDB databases. Browse the Keyword indexes of the individual
databases to become familiar with these vocabularies. A Keyword index is not available
in the Structure database.</td>
<td>[KYWD]</td>
</tr>
<tr>
<td>Modification Date</td>
<td>Contains the date on which the most recent modification to the record was indexed in Entrez, in the format YYYY/MM/DD (e.g., 1999/08/05). A year alone (e.g., 1999) retrieves all records modified in that year; a year and month (e.g., 1999/03) retrieves all records modified in that month that are indexed in Entrez.</td>
<td>[MDAT]</td>
</tr>
<tr>
<td>Molecular Weight</td>
<td>Contains the molecular weight of a protein, in daltons (Da), calculated by the method described in the <a href="helpdoc.html#MolecularWeight">Searching by Molecular Weight</a> section of the Entrez help document. Note that the molecular weight must be entered as a fixed six-digit field, padded with leading zeros (not the letter O), e.g., 002002 [MOLWT].</td>
<td>[MOLWT]</td>
</tr>
<tr>
<td>Organism</td>
<td>Contains the scientific and common names for the organisms
associated with protein and nucleotide sequences. </td>
<td>[ORGN]</td>
</tr>
<tr>
<td>Page Number</td>
<td>Contains the number of the first journal page of the article in which
the data were published.</td>
<td>[PAGE]</td>
</tr>
<tr>
<td>Primary Accession</td>
<td>Contains the primary accession number of the sequence or record, assigned
to the nucleotide, protein, structure, genome record, or PopSet by a sequence
database builder. A Primary Accession index is not available
in the Structure database.</td>
<td>[PACC]</td>
</tr>
<tr>
<td>Properties</td>
<td>Contains properties of the nucleotide or protein sequence. For
example, the Nucleotide database's Properties index includes molecule types,
publication status, molecule locations, and GenBank divisions. A
Properties index is not available in the Structure database.</td>
<td>[PROP]</td>
</tr>
<tr>
<td>Protein Name</td>
<td>Contains the standard names of proteins found in database records. Common names
may not be indexed in this field, so it is best to also search the All Fields or Text Word fields.
A Protein Name index is not available in the Structure database.</td>
<td>[PROT]</td>
</tr>
<tr>
<td>Publication Date</td>
<td>Contains the date on which a record was released into
Entrez, in the format YYYY/MM/DD (e.g.,
1999/08/05). This is the date the entry first appeared in
GenBank, as indexed in Entrez. A year alone
(e.g., 1999) retrieves all records released in that year; a
year and month (e.g., 1999/03) retrieves all
records released into GenBank in that month.</td>
<td>[PDAT]</td>
</tr>
<tr>
<td>SeqID String</td>
<td>Contains the special string identifier, similar to a FASTA identifier, for a
given sequence. A SeqID String index is not available in the Structure database.</td>
<td>[SQID]</td>
</tr>
<tr>
<td>Sequence Length</td>
<td>Contains the total length of the sequence. Sequence Length indexes are not available in the Structure or PopSet databases.</td>
<td>[SLEN]</td>
</tr>
<tr>
<td>Substance Name</td>
<td>Contains the names of any chemicals associated with this record
from the CAS registry and the MEDLINE Name of Substance field.
Substance Name indexes are not available in the Genome or PopSet databases.</td>
<td>[SUBS]</td>
</tr>
<tr>
<td>Text Word</td>
<td>Contains all of the "free text" associated with a record.</td>
<td>[WORD]</td>
</tr>
<tr>
<td>Title Word</td>
<td>Includes only those words found in the definition line of a record. The definition
line summarizes the biology of the sequence and is carefully constructed by database staff.
A standard definition line will include the organism, product name, gene symbol,
molecule type and whether it is a partial or complete cds. Title Word indexes are not available in the Structure or PopSet databases.</td>
<td>[TITL]</td>
</tr>
<tr>
<td>Uid</td>
<td>Contains the MEDLINE unique identifier for records that contain published
references that are linked to PubMed. The Uid index is not browsable.</td>
<td>[UID]</td>
</tr>
<tr>
<td>Volume</td>
<td>Contains the volume number of the journal in which the data were published.</td>
<td>[VOL]</td>
</tr>
</table><P>
<A HREF="helpdoc.html">Back</A> to the Entrez Help Document</p>
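<p>The formatting rules in the table above (a bracketed qualifier after the term, six-digit zero-padded molecular weights, and YYYY/MM/DD dates that may be shortened to a year or a year and month) can be sketched as follows. This is a minimal illustration in Python, not an Entrez client: the function names are hypothetical, and the single space between the term and its qualifier simply follows the <code>002002 [MOLWT]</code> example in the table.</p>

```python
from typing import Optional


def qualify(term: str, qualifier: str) -> str:
    """Append a bracketed qualifier to a search term, e.g. 'marley jf [AUTH]'."""
    return f"{term} [{qualifier}]"


def molecular_weight_term(daltons: int) -> str:
    """Format a molecular weight as the fixed six-digit, zero-padded form."""
    if daltons not in range(1_000_000):
        raise ValueError("molecular weight must be between 0 and 999999")
    return qualify(f"{daltons:06d}", "MOLWT")


def date_term(qualifier: str, year: int, month: Optional[int] = None,
              day: Optional[int] = None) -> str:
    """Build a YYYY, YYYY/MM, or YYYY/MM/DD term for a date field."""
    if day is not None and month is None:
        raise ValueError("a day requires a month")
    parts = [f"{year:04d}"]
    if month is not None:
        parts.append(f"{month:02d}")
    if day is not None:
        parts.append(f"{day:02d}")
    return qualify("/".join(parts), qualifier)


if __name__ == "__main__":
    print(qualify("marley jf", "AUTH"))      # marley jf [AUTH]
    print(molecular_weight_term(2002))       # 002002 [MOLWT]
    print(date_term("MDAT", 1999, 8, 5))     # 1999/08/05 [MDAT]
    print(date_term("PDAT", 1999, 3))        # 1999/03 [PDAT]
```

<p>The helpers do not check that a month or day is a valid calendar value, and they do not check whether a qualifier is available in a given database.</p>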
<table border width="460" cellspacing="0" cellpadding="5">
<caption align=top>
<h2><A NAME="Display_Formats"></A>Display Formats</h2></caption>
<tr>
<td colspan=4 rowspan=20></td>
<th width="100" colspan=1>Display Format</th>
<th width="200" colspan=1>Description</th>
<th width="100" colspan=1>Databases Available</th>
<th width= "60" colspan=1>Link</th>
</tr>
<tr>
<td>Summary</td>
<td>Default display, hotlinked Accession number and brief description</td>
<td>All databases</td>
<td>None</td>
</tr>
<tr>
<td>Brief</td>
<td>Hotlinked Accession number and abbreviated description</td>
<td>All databases</td>
<td>None</td>
</tr>
<tr>
<td>GenBank/GenPept</td>
<td>Full report format</td>
<td>Nucleotide, Protein</td>
<td>None</td>
</tr>
<tr>
<td>ASN.1</td>
<td>Abstract Syntax Notation 1 form, the computer-readable form of the data</td>
<td>All databases</td>
<td>None</td>
</tr>
<tr>
<td>FASTA</td>
<td>The definition line and sequence characters</td>
<td>All databases</td>
<td>None</td>
</tr>
<tr>
<td>Nucleotide Neighbors</td>
<td>Retrieves all similar nucleotide sequences for all documents retrieved and displays in default format</td>
<td>Nucleotide</td>
<td>Related Sequences</td>
</tr>
<tr>
<td>Protein Neighbors</td>
<td>Retrieves all similar protein sequences for all documents retrieved and displays in default format</td>
<td>Protein</td>
<td>Related Sequences</td>
</tr>
<tr>
<td>Genome Neighbors</td>
<td>Retrieves all similar genome sequences for all documents retrieved and displays in default format</td>
<td>Genome</td>
<td>Related Sequences</td>
</tr>
<tr>
<td>Structure Neighbors</td>
<td>Retrieves all similar structures for all documents retrieved and displays in default format</td>
<td>Structure</td>
<td>Related Sequences</td>
</tr>
<tr>
<td>Provider Links</td>
<td>Retrieves all external links for all documents retrieved and displays in default format
- see <A HREF="helpdoc.html#Link_Out">Link Out</A> for more information</td>
<td>All databases</td>
<td>LinkOut</td>
</tr>
<tr>
<td>PubMed Links</td>
<td>Retrieves all MEDLINE links for all documents retrieved and displays in default format</td>
<td>All databases</td>
<td>PubMed</td>
</tr>
<tr>
<td>Nucleotide Links</td>
<td>Retrieves all Nucleotide links for all documents retrieved and displays in default format</td>
<td>All databases, except Nucleotide</td>
<td>Nucleotide</td>
</tr>
<tr>
<td>Protein Links</td>
<td>Retrieves all Protein links for all documents retrieved and displays in default format</td>
<td>All databases, except Protein</td>
<td>Protein</td>
</tr>
<tr>
<td>Genome Links</td>
<td>Retrieves all Genome links for all documents retrieved and displays in default format</td>
<td>Nucleotide, Protein, and Structure</td>
<td>Genomes</td>
</tr>
<tr>
<td>Structure Links</td>
<td>Retrieves all Structure links for all documents retrieved and displays in default format</td>
<td>Nucleotide, Protein, and Genome</td>
<td>Structure</td>
</tr>
<tr>
<td>PopSet Links</td>
<td>Retrieves all PopSet links for all documents retrieved and displays in default format</td>
<td>All databases</td>
<td>PopSet</td>
</tr>
<tr>
<td>Graphic Summary</td>
<td>The graphical view of the sequence accessible by selecting the hotlinked Accession numbers</td>
<td>Nucleotide, Protein, and Genome</td>
<td>None</td>
</tr>
<tr>
<td>Structure Summary</td>
<td>The Structure Summary accessible by selecting the hotlinked PDB numbers</td>
<td>Structure</td>
<td>None</td>
</tr>
<tr>
<td>PopSet Summary</td>
<td>The complete set of Accession Numbers comprising the PopSet accessible by selecting the hotlinked PopSet Accession Numbers</td>
<td>PopSet</td>
<td>None</td>
</tr>
</table><P>
<A HREF="helpdoc.html">Back</A> to the Entrez Help Document
</tr>
</table>
</body>
</html>