198 lines
No EOL
12 KiB
HTML
198 lines
No EOL
12 KiB
HTML
<!doctype html public "-//W3C//DTD HTML 4.01 Transitional//EN" >
|
|
<html lang="en">
|
|
<head>
|
|
<!--***********************change issue date and title below**********************-->
|
|
<title>PubChem: An Entrez Database of Small Molecules. NLM Technical Bulletin. 2005 Jan-Feb</title>
|
|
<meta name="DC.Subject.IssueNum" content="342" />
|
|
<meta name="DC.Subject.IssueCover" content="/pubs/techbull/jf05/jf05_issue_cover.html" />
|
|
|
|
<meta name="DC.Subject.Keyword" content="Abstract" />
|
|
<meta name="DC.Subject.Keyword" content="ChemIDplus" />
|
|
<meta name="DC.Subject.Keyword" content="Chemistry" />
|
|
<meta name="DC.Subject.Keyword" content="Editor's Note" />
|
|
<meta name="DC.Subject.Keyword" content="Entrez" />
|
|
<meta name="DC.Subject.Keyword" content="Medical Subject Headings" />
|
|
<meta name="DC.Subject.Keyword" content="National Cancer Institute" />
|
|
<meta name="DC.Subject.Keyword" content="National Center for Biotechnology Information" />
|
|
<meta name="DC.Subject.Keyword" content="National Institutes of Health" />
|
|
<meta name="DC.Subject.Keyword" content="PubChem" />
|
|
<meta name="DC.Subject.Keyword" content="PubMed" />
|
|
<meta name="DC.Subject.Keyword" content="Substance Name" />
|
|
</head>
|
|
|
|
<body link="#476B47" alink="#476B47" vlink="#476B47" text="#000000" bgcolor="ffffff"><noscript><iframe src="//www.googletagmanager.com/ns.html?id=GTM-MT6MLL" height="0" width="0" style="display:none;visibility:hidden" title="googletagmanager"></iframe></noscript><script>(function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start': new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='//www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-MT6MLL');</script>
|
|
<style type="text/css">
|
|
#skip, .skip, .skipnavigation {
|
|
position:absolute;
|
|
left:0px;
|
|
top:-500px;
|
|
width:1px;
|
|
height:1px;
|
|
overflow:hidden;
|
|
}
|
|
</style>
|
|
<div class="skipnavigation"><a title="Skip the navigation on this page" href="#skipnav" class="skipnavigation">Skip Navigation Bar</a></div>
|
|
|
|
|
|
|
|
<center>
|
|
<a id="skipnav" name="skipnav"></a>
|
|
<table border="0" width="550">
|
|
|
|
<tr><td width="550" align="center" colspan="3">
|
|
<img src="/pubs/techbull/new_tb_graphics/header_final.gif" border="0" alt="NLM Technical Bulletin Header"/>
|
|
<br />
|
|
<img src="/pubs/techbull/new_tb_graphics/toc_342.jpg" border="0" alt="Article Navigation Bar" usemap="#subheader_issues"/>
|
|
|
|
<map name="subheader_issues" id="subheader_issues">
|
|
<area shape="RECT" coords="30,20,165,40" href="/pubs/techbull/jf05/jf05_issue_cover.html" alt="Table of Contents">
|
|
<area shape="RECT" coords="215,20,265,40" href="/pubs/techbull/tb.html" alt="NLM Technical Bulletin Home Page">
|
|
<area shape="RECT" coords="310,20,395,40" href="/pubs/techbull/back_issues.html" alt="Back Issues">
|
|
|
|
</map>
|
|
</td></tr>
|
|
|
|
|
|
<!--************************indicate date posted below**********************************-->
|
|
<!--************************add dates corrected to subsequent lines with [corrected] tag***************************-->
|
|
|
|
<tr><td width="30"> </td><td width="470"><font size="2"><strong>January 3, 2005</strong> [posted]</font><br /></td><td width="30"> </td></tr>
|
|
|
|
|
|
<tr><td width="30"> </td><td width="470">
|
|
|
|
<!--************************indicate title of article below*********************-->
|
|
<tr><td width="30"> </td><td width="470"><font size="5"><strong>PubChem: An Entrez Database of Small Molecules </strong></font><br /></td><td width="30"> </td></tr>
|
|
|
|
<tr><td width="30"> </td><td width="470">
|
|
|
|
<br />
|
|
<!--************************change graphic to match first letter of article*************************-->
|
|
<!--insert editor's note-->
|
|
<em>[Editor's Note: This article is a <a href="http://www.ncbi.nlm.nih.gov/Web/Newsltr/SummerFall04/pubchem.html">reprint</a> from the Summer/Fall 2004 issue of the <a href="http://www.ncbi.nlm.nih.gov/About/newsletter.html">NCBI News</a>, an online newsletter from the National Center for Biotechnology Information (NCBI).] </em>
|
|
<!--end editor's note-->
|
|
|
|
<p>
|
|
<img src="/pubs/techbull/new_tb_graphics/t.gif" border="0" align="left" alt="drop cap letter for t"/>
|
|
he NCBI has released three new Entrez databases that link small organic molecules to bioactivity assays, PubMed abstracts, and protein sequences and structures. The new databases compose the <a href="http://pubchem.ncbi.nlm.nih.gov">PubChem project</a> at NCBI, a part of the <a href="https://commonfund.nih.gov/">NIH Roadmap Initiative</a>. They are PubChem Substance, PubChem Compound, and PubChem Bioassay.
|
|
</p>
|
|
|
|
<p>PubChem Substance contains over 800,000 chemical samples imported from 14 public sources including <a href="http://chem.sis.nlm.nih.gov/chemidplus/chemidlite.jsp">ChemIDplus</a>, the <a href="http://dtp.nci.nih.gov/">Developmental Therapeutics Program</a> at the National Cancer Institute (NCI), <a href="http://www.genome.jp/kegg/">KEGG</a>, <a href="http://www.ncbi.nih.gov/Structure/MMDB/mmdb.shtml">NCBI Molecular Modeling Database</a> (MMDB), and the <a href="http://webbook.nist.gov/chemistry/">NIST Chemistry WebBook</a>. Chemical entities in PubChem Substance records that have known structures are validated, converted to a standardized form, and imported into PubChem Compound. This standardizing allows NCBI to compute chemical parameters and similarity relationships between compounds. The compounds are grouped into levels of chemical similarity from most general to most specific: same bonding connectivity and any tautomer; same bonding connectivity; same stereochemistry; same isotopes; and same stereochemistry and isotopes. PubChem Compound also indexes these chemicals using 34 fields, many of which represent computed chemical properties such as the number of chiral centers, the number of hydrogen bond donors/acceptors, molecular formula and weight, total formal charge, and octanol-water partition coefficients (XlogP). These groups are provided as Entrez links that allow similar compounds to be retrieved quickly. The third database, PubChem Bioassay, currently includes 173 bioactivity studies from the Developmental Therapeutics Program at NCI, and each of these studies is linked to records in PubChem Substance. The PubChem Bioassay interface allows users to view substances that meet certain activity and/or chemical criteria, and the matching records can either be viewed in PubChem Substance or downloaded in several formats.</p>
|
|
|
|
|
|
<p>As part of the Entrez system, the three PubChem databases are linked to several related Entrez databases, including <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi">PubMed</a>, <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein">Protein</a>, and <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Structure">Structure</a>. PubMed links are derived either from citations provided by submitters or by matching substance names to the MeSH medical thesaurus, which often provide extensive information about the biological activity of a substance. The Protein and Structure links reveal proteins known to interact with a compound and protein structures that contain the compound as a bound ligand. The reverse links also provide new functionalities. Now ligands within structures can be identified instantly by the link to PubChem Compound, as can chemicals described in PubMed abstracts.</p>
|
|
|
|
|
|
<p>Consider Gleevec, a potent tyrosine kinase inhibitor used to treat leukemia. In PubChem Substance, the query "gleevec" retrieves one record for Imatinib meslylate from ChemIDplus. Clicking on the SID (substance ID) number or the thumbnail structure loads a Substance Summary showing a view of the structure, other information including chemical properties and synonyms, and inks to PubChem Substance, PubChem Compound, PubMed, and records of identical compounds. This record contains both Imatinib meslylate and methanesulfonic acid; a link to identical compounds leads to substances that also contain the acid. In this case, one additional substance is found that was not retrieved by the query "gleevec", showing how similarity neighboring is able to overcome differing nomenclatures. As part of the standardizing process, substances that have multiple components give rise to several records in PubChem Compound to allow more powerful searching for similar compounds. In the present case, if the Compound Displayed pulldown menu is changed from Standardized to Component1, a different Compound record is shown that contains Imatinib mesylate without the acid, and this compound is linked to seven identical compounds, including itself (<a href="#fig1">Figure 1</a>). Clicking the link to the right of Same Connectivity loads these identical compounds into PubChem Compound, and then choosing Protein Structure from the Display pulldown menu and clicking Display reveals three crystal structures of tyrosine kinase domains containing bound Gleevec. Only one of these structures would have been found by the text query "gleevec" in Entrez Structure, illustrating the advantage of the precomputed chemical similarities provided by PubChem Compound.
|
|
|
|
|
|
|
|
|
|
</p>
|
|
<center>
|
|
<a name="fig1" id="fig1"> </a>
|
|
<img src="/pubs/techbull/jf05/graphics/pchem_fig1.gif" alt="figure 1: graphic" />
|
|
</center>
|
|
|
|
|
|
|
|
|
|
<p>PubChem Bioassay allows one to search for bioactivity. For instance, the query "leukemia AND lc50[tid description]" in PubChem Bioassay retrieves eight growth inhibition assays with measured LC50 values in various leukemia cell lines. Links are then provided to PubChem Substance and PubChem Compound for these chemicals so that they may be further explored.</p>
|
|
|
|
|
|
<p>Access PubChem at: <a href="http://pubchem.ncbi.nlm.nih.gov">http://pubchem.ncbi.nlm.nih.gov</a>.</p>
|
|
|
|
|
|
|
|
|
|
<p>
|
|
Questions should be directed to: <a href="mailto:info@ncbi.nlm.nih.gov">info@ncbi.nlm.nih.gov</a>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
<br /><br />
|
|
|
|
|
|
|
|
|
|
|
|
<!--************************indicate article author and section below*******************************-->
|
|
|
|
|
|
<p>
|
|
|
|
<strong>By Eric Sayers</strong><br />
|
|
<strong>National Center for Biotechnology Information</strong><br />
|
|
|
|
</p>
|
|
<!--************************indicate indexing terms below**********************-->
|
|
|
|
<!--<p><strong>Indexing Terms</strong></p>
|
|
<p>PubChem. New Entrez Database.</p>
|
|
<p>Entrez Databases. PubChem Released.</p>
|
|
|
|
-->
|
|
|
|
|
|
<!--************************end indexing terms*********************************-->
|
|
|
|
|
|
<img src="/pubs/techbull/new_tb_graphics/black_pixel.gif" width="450" height="1" alt="black line separting article from citation"/>
|
|
|
|
<p>
|
|
<!--************************change citation information below*****************************-->
|
|
<em>Sayers E. PubChem: An Entrez Database of Small Molecules. NLM Tech Bull. 2005 Jan-Feb;(342):e2.</em>
|
|
<!--************************end citation information********************************-->
|
|
</p>
|
|
|
|
</td><td width="30"> </td></tr>
|
|
</table>
|
|
<br />
|
|
|
|
|
|
<br />
|
|
|
|
<img src="/pubs/techbull/new_tb_graphics/footer_342.jpg" border="0" alt="Article Navigation Bar" usemap="#footer_final"/>
|
|
|
|
<map name="footer_final" id="footer_final">
|
|
<area shape="RECT" coords="265,5,315,30" href="/pubs/techbull/tb.html" alt="NLM Technical Bulletin Home Page">
|
|
<area shape="RECT" coords="330,5,420,30" href="/pubs/techbull/back_issues.html" alt="Back Issues">
|
|
<area shape="RECT" coords="435,5,480,30" href="/pubs/techbull/new_index.html" alt="Index">
|
|
<area shape="RECT" coords="5,5,90,20" href="/pubs/techbull/jf05/jf05_issue_cover.html" alt="Previous Page">
|
|
<area shape="RECT" coords="445,6,504,20" href="/pubs/techbull/jf05/jf05_myncbi.html" alt="Next Article">
|
|
|
|
|
|
</map>
|
|
</center>
|
|
|
|
<!-- BEGIN NLM FOOTER -->
|
|
<center>
|
|
<font size="2" face="helvetica, arial"><a href="/nlmhome.html">U.S.
|
|
National Library of Medicine</a>, 8600 Rockville Pike, Bethesda, MD 20894<br />
|
|
<a href="http://www.nih.gov/">National Institutes of Health</a>,
|
|
<a href="//www.hhs.gov/">Department of Health & Human Services</a>
|
|
<br /><a href="/copyright.html">Copyright</a>, <a href="/privacy.html">Privacy</a>,
|
|
<a href="/accessibility.html">Accessibility</a>,
|
|
<a href="http://www.nih.gov/icd/od/foia/index.htm">Freedom of Information Act (FOIA)</a><br/><a href="https://www.hhs.gov/vulnerability-disclosure-policy/index.html">HHS Vulnerability Disclosure</a>
|
|
<br />
|
|
|
|
</font>
|
|
</center>
|
|
|
|
|
|
<!-- END NLM FOOTER -->
|
|
<!-- ******************MODIFY EXPDATE AND EMAIL BELOW****************** -->
|
|
<!-- EXPDATE="2015-03-20" -->
|
|
<!-- EMAIL="nlmtechbull@mail.nlm.nih.gov" -->
|
|
|
|
|
|
|
|
|
|
<script src="//assets.nlm.nih.gov/jquery/jquery-latest.min.js"></script>
|
|
<script src="/core/nlm-notifyExternal/1.0/nlm-notifyExternal.min.js"></script>
|
|
</body>
|
|
</html> |