365 lines
12 KiB
HTML
365 lines
12 KiB
HTML
<html>
|
|
<head>
|
|
<title>Concise Protein BLAST Help Document</title>
|
|
<meta name="description" content="BLAST help document for concise protein database">
|
|
<meta name="keywords" content="BLAST protein clusters help protein">
|
|
|
|
<link rel="stylesheet" type="text/css"
|
|
href="/coreweb/styles/ncbi_styles.css">
|
|
|
|
<style type="text/css">
|
|
|
|
A.HelpLayer:link {
|
|
text-decoration: none;
|
|
font-family: Verdana, Arial, Sans-serif;
|
|
font-size: 11px;
|
|
color:#6C7F9A;
|
|
}
|
|
|
|
A.HelpLayer:visited {
|
|
text-decoration: none;
|
|
font-family: Verdana, Arial, Sans-serif;
|
|
font-size: 11px;
|
|
color:#6C7F9A;
|
|
}
|
|
|
|
A.HelpLayer:active {
|
|
text-decoration: none;
|
|
font-family: Verdana, Arial, Sans-serif;
|
|
font-size: 11px;
|
|
color:#4E7531;
|
|
}
|
|
|
|
A.HelpLayer:hover {
|
|
text-decoration: none;
|
|
color:#4E7531;
|
|
font-family: Verdana, Arial, Sans-serif;
|
|
font-size: 11px;
|
|
}
|
|
|
|
A {
|
|
text-decoration: none;
|
|
}
|
|
|
|
.NormalRes {
|
|
font-family: Verdana, Arial, Helvetica, sans-serif;
|
|
font-size: 11px;
|
|
font-style: normal;
|
|
font-variant: normal;
|
|
color: #666666;
|
|
text-decoration: none
|
|
}
|
|
|
|
A.NormalRes:link {
|
|
text-decoration: none;
|
|
font-family: Verdana, Arial, Sans-serif;
|
|
font-size: 11px;
|
|
font-weight: bold;
|
|
color: #336699;
|
|
}
|
|
|
|
A.NormalRes:active {
|
|
text-decoration: none;
|
|
font-family: Verdana, Arial, Sans-serif;
|
|
font-size: 11px;
|
|
font-weight: bold;
|
|
color: #336699;
|
|
}
|
|
|
|
A.NormalRes:visited {
|
|
text-decoration: none;
|
|
font-family: Verdana, Arial, Sans-serif;
|
|
font-size: 11px;
|
|
font-weight: bold;
|
|
color: #336699;
|
|
}
|
|
|
|
A.NormalRes:hover {
|
|
text-decoration: underline;
|
|
font-family: Verdana, Arial, Sans-serif;
|
|
font-size: 11px;
|
|
color:#656A0D;
|
|
font-weight: bold;
|
|
}
|
|
|
|
A.Database_links:link {
|
|
font-family: Verdana, Arial, Helvetica, sans-serif;
|
|
font-size: 11px;
|
|
font-weight: normal;
|
|
color: #666666;
|
|
font-variant: normal;
|
|
color: #666666;
|
|
text-decoration: none
|
|
}
|
|
|
|
A.Database_links:active {
|
|
font-family: Verdana, Arial, Helvetica, sans-serif;
|
|
font-size: 11px;
|
|
font-weight: normal;
|
|
color: #666666;
|
|
font-variant: normal;
|
|
color: #666666;
|
|
text-decoration: none
|
|
}
|
|
|
|
A.Database_links:visited {
|
|
font-family: Verdana, Arial, Helvetica, sans-serif;
|
|
font-size: 11px;
|
|
font-weight: normal;
|
|
color: #666666;
|
|
font-variant: normal;
|
|
color: #666666;
|
|
text-decoration: none
|
|
}
|
|
|
|
A.Database_links:hover {
|
|
font-family: Verdana, Arial, Helvetica, sans-serif;
|
|
font-size: 11px;
|
|
font-weight: normal;
|
|
color: #336699;
|
|
font-variant: normal;
|
|
text-decoration: underline;
|
|
}
|
|
|
|
.Database_links {
|
|
font-family: Verdana, Arial, Helvetica, sans-serif;
|
|
font-size: 11px;
|
|
font-weight: normal;
|
|
text-decoration: none;
|
|
color: #666666;
|
|
}
|
|
|
|
.MessageBox {
|
|
font-family: Arial, Helvetica, sans-serif;
|
|
font-size: 11px;
|
|
font-weight: normal;
|
|
text-decoration: none;
|
|
color: #1D5136;
|
|
}
|
|
|
|
.MessageBox2 {
|
|
font-family: Arial, Helvetica, sans-serif;
|
|
font-size: 11px;
|
|
font-weight: normal;
|
|
text-decoration: none;
|
|
color: #1D3551;
|
|
}
|
|
|
|
.menutitle {
|
|
color:white;
|
|
font-family: Verdana, Arial, Sans-serif;
|
|
font-size: 10px;
|
|
font: bold;
|
|
}
|
|
|
|
.regular {
|
|
font-family: Verdana;
|
|
font-size: 10pt;
|
|
line-height: 15pt;
|
|
}
|
|
|
|
.regular8 {
|
|
font-family: Verdana;
|
|
font-size: 8pt;
|
|
line-height: 15pt;
|
|
}
|
|
|
|
.regular12 {
|
|
font-family: Verdana;
|
|
font-size: 12pt;
|
|
line-height: 15pt;
|
|
}
|
|
|
|
</style>
|
|
|
|
<SCRIPT LANGUAGE="JavaScript1.2"
|
|
SRC="/coreweb/javascript/imagemouseover.js">
|
|
</SCRIPT>
|
|
|
|
</head>
|
|
|
|
<body
|
|
background="/coreweb/template1/pix/bg_main3.gif"
|
|
topmargin="20" marginheight="20">
|
|
|
|
<table width="97%" border="0" cellspacing="0" cellpadding="0"
|
|
align="center" ID="Table1">
|
|
<tr>
|
|
<td>
|
|
<table width="100%" border="0" cellspacing="0" cellpadding="0"
|
|
align="center" ID="Table2">
|
|
<tr>
|
|
<td width="100%" align="left" valign="middle" nowrap background="/coreweb/template1/pix/top_bg_white.gif"><img src="/coreweb/template1/pix/pixel.gif" width="550" height="1"
|
|
alt="" border="0"><br>
|
|
<img src="ProkBLAST_logo.gif" alt="" border="0"></td>
|
|
|
|
</tr>
|
|
</table>
|
|
|
|
<table width="100%" border="0" cellspacing="0" cellpadding="0" align="center" ID="Table4">
|
|
<tr>
|
|
<td><img src="/coreweb/template1/pix/top3_ulm_no_a.gif" width="145"
|
|
height="16" alt="" border="0" usemap="#unlmenu" name="unl_menu_pix"></td>
|
|
<td background="/coreweb/template1/pix/top3_mainmenu_mid_bg.gif"><img src="/coreweb/template1/pix/top3_mainmenu_left.gif" width="3" height="16" alt="" border="0"></td>
|
|
<td width="100%" valign="middle" nowrap background="/coreweb/template1/pix/top3_mainmenu_mid_bg.gif"><table border="0" cellspacing="0" cellpadding="0" width="100%" ID="Table5">
|
|
<td width=80><a href="/entrez/query.fcgi?db=PubMed" class="BAR">PubMed</a></td>
|
|
<td width=80><a href="/Entrez/" class="BAR">Entrez</a></td>
|
|
<td width=80><a href="/BLAST/" class="BAR">BLAST</a></td>
|
|
<td width=80 NOWRAP><a href="/entrez/query.fcgi?db=proteinclusters" class="BAR">Protein Clusters</a></td>
|
|
<td width=80><a href="/entrez/query.fcgi?db=Books" class="BAR">Books</a></td>
|
|
<td width=80><a href="/Taxonomy/Browser/wwwtax.cgi" class="BAR">TaxBrowser</a></td>
|
|
<td width=120 NOWRAP><a href="/entrez/query.fcgi?db=Structure" class="BAR">Entrez Structure</a></td>
|
|
|
|
</table>
|
|
</td>
|
|
<td background="/coreweb/template1/pix/top3_mainmenu_mid_bg.gif"
|
|
align="right"><img src="/coreweb/template1/pix/top3_mainmenu_right.gif"
|
|
width="5" height="16" alt="" border="0"></td>
|
|
</tr>
|
|
|
|
</table>
|
|
<table width="100%" border="0" cellspacing="0" cellpadding="0" align="center" ID="Table6">
|
|
<tr>
|
|
<td><img src="/coreweb/template1/pix/top4_ulm_left.gif" width="145"
|
|
height="4" alt="" border="0"></td>
|
|
<td width="100%" background="/coreweb/template1/pix/top4_mid_bg.gif"><img src="/coreweb/template1/pix/pixel.gif" width="1" height="1" alt="" border="0"></td>
|
|
<td align="right" background="/coreweb/template1/pix/top4_mid_bg.gif"><img src="/coreweb/template1/pix/top4_ulm_right.gif" width="5" height="4" alt="" border="0"></td>
|
|
</tr>
|
|
</table>
|
|
|
|
<!-- overview -->
|
|
<h2>Overview</h2><a href="#top"></a>
|
|
<table border=0 width=100% cellspacing=0 cellpadding=1 bgcolor=#FFFFFF>
|
|
<tr><td>
|
|
<p>This database consists of all proteins from complete genomes that are used in the protein cluster database (ProtClustDB).
|
|
We have precalculated clusters of similar proteins at the genus-level and
|
|
one representative is chosen from each cluster in order to reduce the dataset. The result is reduced
|
|
search times through the elimination
|
|
of redundant proteins while providing a broader taxonomic view.
|
|
</td></tr>
|
|
<!-- end overview -->
|
|
|
|
<!-- links to main topic areas in this help page -->
|
|
<tr><td><ul>
|
|
<li><a href="#background">About this database</a></li>
|
|
<li><a href="#query">Query Page</a></li>
|
|
<li><a href="#results">Results Page</a></li>
|
|
<li><a href="#bhelp">BLAST Help</a></li>
|
|
</ul></td></tr></table>
|
|
<!-- end of links to locations with the help page -->
|
|
|
|
<a href="/genomes/prokhits.cgi">Back to Concise Protein BLAST</a>.
|
|
|
|
<!-- body of page -->
|
|
<table border=0 width=100% cellspacing=0 cellpadding=1 bgcolor=#FFFFFF>
|
|
<tr><td>
|
|
<a name="background"></a><h2>About this database</h2>
|
|
<p>
|
|
The vast increase in genomic sequences has led to a flood of data to the protein databases as well. Many strain-specific genomes
|
|
are now being sequenced (for example Streptococcal genomes). The result
|
|
can be an overwhelming amount of data to look through when executing BLAST similarity searches. In order to help
|
|
alleviate both the processing of the data and to present a broader taxonomic view, the concise protein
|
|
database was constructed.
|
|
<p>
|
|
All proteins from complete genomes are compared by BLAST (all against all). Protein clusters
|
|
are constructed from the results such that all top blast hits are combined into a single cluster (each protein
|
|
in a given cluster must have all other proteins in the cluster as top BLAST hits).
|
|
<p>
|
|
Clusters may span a large taxonomic branch (kingdom) or may reside at a specific node (family, genus,
|
|
species, etc.). Clusters may consist of many proteins, or be comprised of only two proteins. From this entire set of clusters, genus-specific clusters
|
|
are used for this database. From the proteins at
|
|
the genus-level, one (randomly selected) is chosen as a representative for the Concise Microbial Protein BLAST database and will be found in BLAST queries. The other
|
|
proteins in the cluster are automatically linked to this representative and will also be found in the search results, although without the BLAST score and E-value as
|
|
they are not specifically examined. All proteins that do not belong
|
|
to the genus-level clusters are also added to the database for completeness.
|
|
<p>
|
|
The result will be faster processing times and reduced load on the database. The broader taxonomic view will help eliminate some
|
|
of the redundancy that is found when many proteins of closely related organisms are found in BLAST results.
|
|
|
|
</p>
|
|
<p>
|
|
<a href="#top">Return to top</a>
|
|
</p>
|
|
|
|
<a Name="query"></a><h2>Query Page</h2>
|
|
<p>
|
|
Queries can be either protein or nucleotide using blastp and blastx programs, respectively. Accessions, GIs, or sequences in FASTA
|
|
format can be entered in the query box.
|
|
</p>
|
|
<p>
|
|
Default parameters are set below the query box. The expect threshold is set low, which will help
|
|
reduce BLAST results. Information on each parameter is available by clicking on each name.
|
|
</p>
|
|
|
|
<a href="#top">Return to top</a>
|
|
|
|
<a name="results"></a><h2>Results Page</h2>
|
|
<p>
|
|
The results page is not the one typically returned for BLAST results although a link is provided
|
|
to view the results in standard format.
|
|
</p>
|
|
<p>
|
|
The query is shown, along with the length, and the number of hits for total proteins, and the proteins represented by the genus-level clusters.
|
|
</p>
|
|
<p>
|
|
Results are returned in a collapsed table format. Genus level clusters are represented with a plus (+) sign
|
|
at each level, which can be expanded. The table is sortable by organism name and by BLAST score. As only one protein from a genus-level cluster is chosen,
|
|
there will be no BLAST score nor E-value for the other proteins in a cluster as they are not searched when a query is submitted.
|
|
</p>
|
|
<p>
|
|
The table shows the organism name, protein name, accession, length, locus_tag, BLink, BL2seq, score, and E-value.
|
|
Links to taxonomy (organism name), protein (protein name, accession), and gene (locus_tag) entrez databases are available
|
|
for each protein. The BLink shows precomputed BLAST results for that specific protein, while the BL2seq link
|
|
runs a comparison of the query and that specific protein in the table using BLAST.
|
|
</p>
|
|
|
|
<a href="#top">Return to top</a>
|
|
|
|
|
|
<a name="bhelp"></a><h2>BLAST Help</h2>
|
|
<p>
|
|
Help and information on BLAST are available from the <a href="/BLAST/">main BLAST page</a>.
|
|
</p>
|
|
<p>
|
|
Microbial genomes can be searched <a href="/sutils/genom_table.cgi">here</a>.
|
|
</p>
|
|
|
|
|
|
<a href="#top">Return to top</a>
|
|
|
|
|
|
</td></tr></table>
|
|
|
|
<map name="unlmenu">
|
|
<area
|
|
alt="NCBI Home"
|
|
coords="2,0,39,15"
|
|
href="/"
|
|
onMouseOver="changpics(unl_menu_pix, unl_menu_home_a)"
|
|
onMouseOut="changpics(unl_menu_pix, unl_menu_noa)"
|
|
shape="RECT">
|
|
|
|
<area
|
|
alt="NCBI Search"
|
|
coords="40,0,91,15"
|
|
href="/entrez/query.fcgi?db=ncbisearch"
|
|
onMouseOver="changpics(unl_menu_pix, unl_menu_search_a)"
|
|
onMouseOut="changpics(unl_menu_pix, unl_menu_noa)"
|
|
shape="RECT">
|
|
|
|
<area
|
|
alt="NCBI SiteMap"
|
|
coords="92,0,143,15"
|
|
href="/Sitemap/"
|
|
onMouseOver="changpics(unl_menu_pix, unl_menu_sitemap_a)"
|
|
onMouseOut="changpics(unl_menu_pix, unl_menu_noa)"
|
|
shape="RECT">
|
|
</map>
|
|
|
|
<h5>Last modified Sept 6, 2006</h5></html>
|
|
|
|
</body>
|
|
</html>
|
|
|
|
|
|
|