539 lines
29 KiB
HTML
539 lines
29 KiB
HTML
<!doctype html public "-//IETF//DTD HTML//EN">
|
|
<html>
|
|
|
|
<head>
|
|
<title>August 1997</title>
|
|
<meta name="GENERATOR" content="Microsoft FrontPage 3.0">
|
|
<meta name="AUTHOR" content="The KEVRIC Company">
|
|
</head>
|
|
|
|
<body bgcolor="#FFFFFF" text="#000000" link="#DD0000" vlink="#FF0000">
|
|
|
|
<p><img src="/Gifs/newslogo.gif"> </p>
|
|
|
|
<p><a name="toc">August 1997</a></p>
|
|
|
|
<hr>
|
|
|
|
<p><a href="#PubMed">PubMed Launched</a></p>
|
|
|
|
<p><a href="#Using Sequin">Using Sequin</a></p>
|
|
|
|
<p><a href="#Neighbors">Structure Neighbors</a></p>
|
|
|
|
<p><a href="#ORF Finder">ORF Finder</a></p>
|
|
|
|
<p><a href="#Electronic PCR">Electronic PCR</a> </p>
|
|
|
|
<p><a href="#CGAP">CGAP Revolutionizes Research</a></p>
|
|
|
|
<p><a href="#faq">Frequently Asked Questions</a> </p>
|
|
|
|
<p><a href="#FTP">NCBI Data by FTP</a> </p>
|
|
|
|
<p><a href="#Pubs">Recent Publications</a></p>
|
|
|
|
<hr>
|
|
<font face="Times" size="6">
|
|
|
|
<h3 align="left"><a name="PubMed">Vice President Launches PubMed, Lauds Free MEDLINE
|
|
Access</font></a><font face="Times" size="7"></h3>
|
|
|
|
<p align="left"></font><font face="Times" size="3">MEDLINE...will henceforth be available
|
|
free to the American people." With those words, Vice President Al Gore inaugurated
|
|
the PubMed search system at a Capitol Hill press conference on June 26. PubMed, which
|
|
provides Web access to the National Library of Medicine's (NLM) database of the
|
|
biomedical journal literature, MEDLINE, was heralded by Senator Tom Harkin (IA) as
|
|
"...the model of a smart, creative government initiative." The Vice President
|
|
viewed free access to MEDLINE as consistent with the Clinton administration’s other
|
|
"empowerment" initiatives stating, "This development...may do more to
|
|
reform and improve the quality of health care in the United States than anything else
|
|
we’ve done in a long time."</p>
|
|
</font>
|
|
|
|
<p align="left"><strong><font face="Times" size="3">Searching PubMed</font></strong></p>
|
|
|
|
<p align="left"><font face="Times">PubMed grew out of NCBI’s Entrez project which,
|
|
since 1992, has offered a subset of MEDLINE records related to molecular biology. In
|
|
addition to encompassing all of MEDLINE and PreMEDLINE, PubMed retains Entrez’s
|
|
ability to use one article as a "seed" to find other similar articles. By
|
|
traversing the <b>See Related Articles’ </b>links, a user can find articles similar
|
|
in concept with speed and precision. PubMed expands upon Entrez by linking MEDLINE
|
|
articles to full-text Web sites maintained by publishers. Currently, 95 journals are
|
|
linked to PubMed, including Cell, Journal of Biological Chemistry, Journal of Cell
|
|
Biology, New England Journal of Medicine, and Science. Access to publishers’ Web
|
|
sites may require subscriptions or registration.</font></p>
|
|
|
|
<p align="left"><img src="aug97.fig1.gore.gif"
|
|
width="475" height="394" alt="vpgore.gif (192782 bytes)"></p>
|
|
<font face="Helvetica" size="1"><i>
|
|
|
|
<p><big>NCBI Director David Lipman (far left) coaches Vice President Gore (seated) as he<br>
|
|
searches PubMed. NIH Director Harold Varmus (center) and NLM Director Donald <br>
|
|
Lindberg (far right) look on.</i></font></big></p>
|
|
|
|
<p align="left"> </p>
|
|
<b>
|
|
|
|
<p align="left"></b><strong>PubMed Options</strong></p>
|
|
|
|
<p align="left"><font face="Times">PubMed offers the option to search MEDLINE or any of
|
|
NCBI’s molecular biology databases. Users can select from a variety of search fields,
|
|
including but not limited to: text words, author names, and journal titles. A MEDLINE
|
|
citation for which there is a corresponding online, full-text article will have a button
|
|
at the top of the abstract page that links to the publisher’s Web site. Additional
|
|
links point to other NCBI databases, including sequences, 3D structures or
|
|
OMIM. Advanced query options allow for the creation of more complex Boolean search
|
|
expressions, and a special clinical query page is optimized to perform searches for
|
|
studies relating to the etiology, diagnosis, prognosis, or treatment of human diseases.</font></p>
|
|
|
|
<p align="left"><font face="Times">PubMed is available from the NCBI World Wide Web home
|
|
page (<a href="http://www.ncbi.nlm.nih.gov">http://www.ncbi.nlm.nih.gov</a>). Comments and
|
|
questions about PubMed are welcome. Send e-mail to <a href="mailto:info@ncbi.nlm.nih.gov">info@ncbi.nlm.nih.gov</a>
|
|
or call (301) 496-2475. </font> </p>
|
|
|
|
<p align="left"><a href="#toc">Return to Table of Contents</a></p>
|
|
|
|
<hr>
|
|
<a name="Using Sequin"><font face="Times" size="6">
|
|
|
|
<h3 align="left">Using Sequin to Submit Sets of Related Sequences</font></a> </h3>
|
|
|
|
<p align="left"><font face="Times" size="3">Sequin is a program developed at the NCBI for
|
|
submitting DNA sequences to GenBank, EMBL, or DDBJ. Both Sequin and BankIt, NCBI’s
|
|
Web-based sequence submission tool, can be used to submit simple mRNA or genomic sequences
|
|
along with associated coding sequences. However, Sequin has been outfitted with a number
|
|
of advanced sequence analysis capabilities. Unlike other sequence submission tools, Sequin
|
|
can process sets of related sequences such as segmented sets and those generated by
|
|
phylogenetic, population, or mutation studies. </p>
|
|
|
|
<p align="left">Like other World Wide Web submission tools, Sequin can be used to annotate
|
|
single sequences. However, it is usually easiest to annotate related sequences when they
|
|
are part of a multiple sequence alignment. Sequin can import the individual sequences, as
|
|
well as the alignment itself, from alignments that have been saved in FASTA+GAPs, PHYLIP,
|
|
or NEXUS format. If the sequences are related, but not yet aligned, Sequin will generate
|
|
an alignment from a file of FASTA-formatted sequences. Each new sequence in the alignment
|
|
will receive its own accession number.</p>
|
|
|
|
<p align="left">After the alignment is generated, annotation features, such as a coding
|
|
sequence or rRNA, can be marked just once on a single master sequence. These features can
|
|
then be propagated from the master sequence to other sequences in the alignment. The
|
|
proper location of the feature will be calculated by Sequin for each sequence individually
|
|
after taking into account any gaps or insertions. The Entrez nucleotide database is now
|
|
accessible from Sequin, allowing sequences from GenBank to be directly downloaded into
|
|
Sequin from Entrez. If the GenBank sequence is related to the sequences in the alignment,
|
|
it can be brought into the alignment as the master sequence. Features can then be copied
|
|
from this master sequence onto the new sequences. The GenBank record will not receive a
|
|
new accession number, but rather serves only to facilitate annotation of the newly
|
|
submitted sequences.</p>
|
|
|
|
<p align="left">Further information about Sequin, including downloading instructions and
|
|
help documentation, are available from the Sequin Web page (<a
|
|
href="http://www.ncbi.nlm.nih.gov/Sequin">http://www.ncbi.nlm.nih.gov/Sequin</a>). </font></p>
|
|
|
|
<p><a href="#toc">Return to Table of Contents</a></p>
|
|
|
|
<hr>
|
|
<font face="Times" size="6">
|
|
|
|
<h3 align="left"><a name="Neighbors">Structure Neighbors in Web Entrez</font></a><font
|
|
face="Times" size="7"></h3>
|
|
|
|
<p align="left"></font><font face="Times" size="3">WWW Entrez now contains
|
|
"neighbors" for proteins in its 3D structure database. Structure neighbors are
|
|
other proteins that have a similar 3D structure or shape. As with the protein sequence
|
|
neighbors in Entrez, structure neighbors are most often homologs with similar biological
|
|
functions. However, since protein evolution conserves 3D structure to a greater extent
|
|
than sequence, a protein’s structure neighbors may include more distant relatives not
|
|
present among its sequence neighbors. These additional similarities may provide further
|
|
insight into a protein’s properties and biological function. By incorporating
|
|
structure neighbors in Entrez, these distant relationships, detectable only by 3D
|
|
structure comparison, are readily accessible to molecular biologists.</p>
|
|
|
|
<p align="left">An example is provided by the globular domain of chicken histone H5 (PDB
|
|
accession code 1HST). The sequence neighbors of histone H5 are all other histones from a
|
|
variety of eukaryotic species. But the structure neighbors of histone H5 are diverse and
|
|
include a number of DNA binding proteins from bacteria. One of these is the E. coli
|
|
catabolite gene activator protein, or CAP, in complex with DNA (PDB accession code 1CGP).
|
|
The structures are remarkably similar, with 46 amino acid residues of histone H5
|
|
superimposing to 1.7 angstroms residual, even though sequence identity is only 10%. These
|
|
proteins would appear to be homologs, and the protein-DNA structure of CAP suggests a
|
|
model for the interaction of histone H5 with DNA.</p>
|
|
|
|
<p align="left"><img src="histone.gif"
|
|
alt="c-histone.gif (19115 bytes)"></p>
|
|
</font><font face="Helvetica" size="1"><i>
|
|
|
|
<p><big>Chicken histone H5</i></font></big></p>
|
|
|
|
<p align="JUSTIFY"> </p>
|
|
<font face="Times" size="3">
|
|
|
|
<p align="left"><font face="Times" size="3"><font face="Times" size="3">Structure
|
|
neighbors may be accessed from the "Structure Summary" of a protein in WWW
|
|
Entrez’s 3D structure database. Neighbor lists are displayed when one clicks the
|
|
button <b>Protein 3D Structures</b> in the line reading "Protein 3D Structures
|
|
similar to <Chain A> computed by VAST." Here <b><Chain A></b> is a
|
|
pull-down menu that allows one to select the individual polypeptide chain, or compact
|
|
domain within that chain, for which structure neighbors are to be retrieved. Structure
|
|
neighbor lists have been computed by the VAST algorithm (Vector Alignment Search Tool<small><small><small><small><sup>1</sup></small></small></small></small>),
|
|
and are sorted according to a VAST similarity score. VAST compares the relative
|
|
orientations of helices and beta-strands in two protein domains, and if similarity is more
|
|
extensive than one would expect by chance, produces a detailed structure alignment by
|
|
comparison of atomic coordinates.</font></font></p>
|
|
|
|
<p align="left">WWW Entrez also supports visualization of protein structures by molecular
|
|
graphics. The Cn3D viewer may be started by the <b>View </b>button on the "Structure
|
|
Summary" page. The 3D superpositions of structure neighbors can be viewed using the <b>Kinemage
|
|
</b>button on neighbor-list pages (viewing programs such as Cn3D and Kinemage are helper
|
|
applications that may be downloaded to your computer by following hotlinks on the
|
|
Structure home page). 3D viewing is important for interpretation of structural
|
|
similarities. In the 1HST protein, for example, one may see that histone H5 contains
|
|
positively charged residues in the region that superimposes onto the DNA-binding interface
|
|
of CAP as does CAP itself. 3D viewing thus supports the inference that these proteins
|
|
"dock" with DNA in a similar manner. Possible functional similarities of
|
|
structure neighbors may also be explored, of course, by examining the MEDLINE citations
|
|
and sequence neighbors associated with each protein.</font></p>
|
|
|
|
<p align="left"><font face="Times">WWW Entrez’s structure neighbor service is the
|
|
work of NCBI researchers T. Madej, C. Hogue, J.-F. Gibrat, J. Spouge, H. Ohkawa and S.
|
|
Bryant. Improvements in the visualization of structure similarities are still in progress,
|
|
and comments and suggestions are welcome.</font></p>
|
|
|
|
<p align="left"><small><small><font size="2"><sup>1</sup></font></small></small><font
|
|
face="Times" size="2"> Gibrat J-F, Madej T, Bryant SH: Surprising similarities in
|
|
structure comparison. <i>Curr Opin Struct Biol</i> 1996, 6:377–85.</font><font
|
|
face="Times" size="3"> </font> </p>
|
|
|
|
<p align="left"><a href="#toc">Return to Table of Contents</a></p>
|
|
|
|
<hr>
|
|
<font face="Times" size="6">
|
|
|
|
<h3 align="left"><a name="ORF Finder">Hunting For Open Reading Frames With ORF Finder</font></a><font
|
|
face="Times" size="7"></h3>
|
|
|
|
<p align="left"></font><font face="Times" size="3">Searching for open reading frames is
|
|
possible with NCBI’s software tool, ORF Finder, accessible from the NCBI World Wide
|
|
Web home page (<a href="http://www.ncbi.nlm.nih.gov">http://www.ncbi.nlm.nih.gov</a>). ORF
|
|
Finder is a graphical analysis tool which finds all open reading frames in a user’s
|
|
sequence or in a sequence retrieved from a database. ORF Finder also provides easy access
|
|
to the BLAST search page and allows the deduced amino acid sequence to be compared against
|
|
additional amino acid sequence databases using the BLAST options.</p>
|
|
|
|
<p align="left">To use ORF Finder, enter the accession or GI number of the sequence of
|
|
interest, or enter your query sequence directly into the text box in FASTA format. ORF
|
|
Finder will identify all open reading frames using the standard genetic code or an
|
|
alternative one for translation. Users can limit the search for open reading frames to a
|
|
portion of the query sequence by specifying the positions (in base pairs) in the
|
|
"From" and "To" boxes. Press the <b>ORF Find</b> button to retrieve a
|
|
graphic display of ORFs and their location in the sequence in 6 reading frames. Users have
|
|
the option to change the minimum ORF length to 50 or 300 nucleotides (in base pairs) and <b>Redraw</b>
|
|
the query sequence. The<b> Six Frames</b> option features a graphic of all
|
|
start and stop codons. Select a particular ORF by clicking on it to see the amino acid
|
|
sequence with all alternative start codons. After selecting a particular ORF of interest,
|
|
click on the <b>Accept </b>button and have the option to view the ORF in various formats:
|
|
GenBank flat-file, FASTA nucleotide, or FASTA amino acid sequence. Selecting <b>View </b>retrieves
|
|
the full GenBank record with its annotated sequence information. </p>
|
|
|
|
<p align="left">For those scientists submitting sequence data, ORF Finder is also packaged
|
|
with the Sequin sequence submission software. ORF Finder can be used in conjunction with
|
|
Sequin’s Sequence Editor to annotate new coding regions on the record, perform basic
|
|
editing, and translate nucleotide sequences. The Sequin program can be downloaded from
|
|
NCBI’s FTP site accessible from the NCBI WWW home page. </font></p>
|
|
|
|
<p><a href="#toc">Return to Table of Contents</a></p>
|
|
|
|
<hr>
|
|
<font face="Times" size="6">
|
|
|
|
<h3 align="left"><a name="Electronic PCR">Mapping Unique Genome Sites by Electronic PCR</font></a><font
|
|
face="Times" size="7"></h3>
|
|
|
|
<p align="left"></font><font face="Times" size="3">It is possible to determine the gene
|
|
map location of a new sequence using NCBI’s software tool, Electronic PCR (e-PCR),
|
|
located on the NCBI World Wide Web home page. Electronic PCR simulates conventional PCR
|
|
methods for identifying sequence tagged sites (STSs) by searching for sites in a query
|
|
sequence which match the sequence and orientation of a set of primers. STSs are unique DNA
|
|
landmarks used in the construction of genetic and physical maps of the human genome. The
|
|
e-PCR tool searches for matches between a user’s query sequence and STS primer
|
|
sequences in the STS database (dbSTS). Researchers can use e-PCR to assign sequence
|
|
database records to map positions, test primer feasibility, and integrate and anchor
|
|
genetic maps and sequence data. </p>
|
|
|
|
<p align="left">To map sequences by e-PCR, enter the sequence of interest in FASTA format
|
|
into the text box or retrieve a sequence from GenBank using an accession or GI number. The
|
|
"Retrieve STS from" setting can be used to limit the search to STSs from
|
|
specific organisms. Press the <b>Submit Query</b> button to begin. The results list the
|
|
STSs found with their relevant identifiers, position of the primer binding sites within
|
|
the query sequence, chromosome number (if known), and the expected and observed size of
|
|
the amplicon. Hypertext links to GenBank and dbSTS records (linked to Entrez) are provided
|
|
for more detailed information. </p>
|
|
|
|
<p align="left">The number of STS results one can expect for a typical search depend on a
|
|
variety of factors, such as length of the query sequence and size of the STS database.
|
|
Results that are reported are unequivocal and more reliable than those identified using
|
|
the general-purpose database search tool BLAST. Chances of obtaining STS matches will
|
|
improve as the number of sequences in the STS database continues to increase dramatically.</p>
|
|
|
|
<p align="left">For more information
|
|
contact <a href="mailto:info@ncbi.nlm.nih.gov">info@ncbi.nlm.nih.gov</a>. </font></p>
|
|
|
|
<p align="left"><a href="#toc">Return to Table of Contents</a></p>
|
|
|
|
<hr>
|
|
<font face="Times" size="6">
|
|
|
|
<h3 align="left"><a name="CGAP">CGAP Revolutionizes Cancer Research</font></a><font
|
|
face="Times" size="3"></h3>
|
|
</font><font face="Times" size="7">
|
|
|
|
<p align="left"></font><font face="Times" size="3">The knowledge that genetic mutations
|
|
are central to the development of cancerous cells has prompted the National Cancer
|
|
Institute, in partnership with NCBI, and other government, academic and industry leaders,
|
|
to initiate the Cancer Genome Anatomy Project, or CGAP.</p>
|
|
|
|
<p align="left">CGAP merges state-of-the-art technologies in pathology, molecular biology
|
|
and bioinformatics, to catapult a new strategic attack on cancer. It is an unprecedented
|
|
assemblage of sequence information characterizing the genetic constitution of cells at
|
|
various stages: normal, precancerous, and tumor.</p>
|
|
|
|
<p align="left">Worldwide, participants in CGAP are collecting a variety of tissue samples
|
|
from cells at different stages; generating cDNA libraries from the tissue samples; and
|
|
sequencing the cDNA libraries. NCBI uses powerful sequence similarity searching tools,
|
|
such as BLAST, to make electronic comparisons between the libraries of a given tissue type
|
|
at different stages, and generate discrete lists of genetic candidates as causative
|
|
components of the carcinogenic process.</p>
|
|
|
|
<p align="left">CGAP has collected over 40,000 DNA sequences so far in its trek over the
|
|
next few years toward a complete index of genes expressed in tumors—referred to as
|
|
the Tumor Gene Index. Initially, this index will be compiled from five major cancers:
|
|
prostate, breast, lung, colon, and ovarian. NCBI will continue to map new index sequences
|
|
to the Human Genome, building upon NCBI’s unique collection of human gene sequences
|
|
(UniGene) used to construct the Human Transcript Map.</p>
|
|
|
|
<p align="left">The focal point of the CGAP project is its Web site, located at <a
|
|
href="http://www.ncbi.nlm.nih.gov/ncicgap">www.ncbi.nlm.nih.gov/ncicgap</a>. Managed and
|
|
supported by NCBI members Mark Boguski, Ken Katz, Greg Schuler, and Carolyn
|
|
Tolstoshev, the CGAP Web site is the central repository for all of the information
|
|
generated by the project. This includes tissues, libraries, sequences, and links to
|
|
additional value-added information, such as related DNA and protein sequences, genome
|
|
mapping data, and biomedical references. </p>
|
|
|
|
<p align="left">Ultimately, the resourceful use of information housed in the CGAP Web site
|
|
is expected to lead to innovative diagnostic, preventative, and curative technologies
|
|
which will forever alter the way scientists conduct cancer research. </font></p>
|
|
|
|
<p align="left"><font face="ZapfDingbats" size="1"> </font></p>
|
|
|
|
<p align="left"><img src="cgap.gif"
|
|
alt="cgap.gif (8800 bytes)"></p>
|
|
|
|
<p><a href="#toc">Return to Table of Contents</a></p>
|
|
|
|
<hr>
|
|
|
|
<h3><a name="faq">Frequently Asked Questions</a> </h3>
|
|
<font face="Times" size="3"><i>
|
|
|
|
<p align="left">I submitted a sequence using BankIt one month ago, however I have not yet
|
|
received a GenBank accession number. Why?</i></p>
|
|
|
|
<p align="left">Since BankIt submissions normally receive GenBank accession numbers within
|
|
24-48 hours, an error in the submission process most likely occurred. Submitting sequence
|
|
information by BankIt involves completing, reviewing, and submitting your BankIt file.
|
|
Once you have finished entering your data and information, and have reviewed it for
|
|
accuracy and completeness, switch the selection from "Modify Submission" to
|
|
"Submit to GenBank" on the final BankIt page and click on the BankIt button one
|
|
last time. Users will receive a return message indicating receipt of the submission and a
|
|
GenBank accession number soon thereafter.</p>
|
|
<i>
|
|
|
|
<p align="left">I have a list of interesting titles I retrieved using PubMed. How do I
|
|
obtain the full documents?</i></p>
|
|
|
|
<p align="left">The PubMed system provides access to bibliographic citations and
|
|
corresponding abstracts, but does not contain the full-text of articles. However, PubMed
|
|
offers links to a number of publishers who provide access to full-text journal articles
|
|
from their Web sites. PubMed displays the link at the top of the Display title/abstract
|
|
page when available. Currently the number of participating journals is small and the
|
|
journals may require a user to subscribe before being able to view the full-text. A list
|
|
of PubMed journals that offer full-text can be retrieved from the URL: <a
|
|
href="http://www.ncbi.nlm.nih.gov/PubMed/fulltext.html">http://www.ncbi.nlm.nih.gov/PubMed/fulltext</a>.
|
|
If the journal you are interested in is not on the list, contact the nearest library for
|
|
information about obtaining articles.</p>
|
|
<i>
|
|
|
|
<p align="left"><em>How can I import references from PubMed into Endnote?</i></em></p>
|
|
|
|
<p align="left">First choose the MEDLINE format on the "document summaries page"
|
|
and then display. Select the appropriate save format at the bottom of the display screen
|
|
and press the "Save" button.</p>
|
|
<i>
|
|
|
|
<p align="left"><em>I recently read an article that referenced a GenBank accession number,
|
|
but I can’t find the sequence record in the database. Why?</i></em></p>
|
|
|
|
<p align="left">Sometimes authors request that a sequence be held confidential until
|
|
publication. Once GenBank is informed that the sequence has been published, GenBank staff
|
|
will verify the publication and release the sequence. GenBank encourages users who are
|
|
unable to retrieve a record to send the accession number and complete citation in which it
|
|
appeared to <a href="mailto:update@ncbi.nlm.nih.gov">update@ncbi.nlm.nih.gov</a>. </p>
|
|
<i>
|
|
|
|
<p align="left"><em>How can I find out if a particular gene has been mapped?</i></em></p>
|
|
|
|
<p align="left">Conduct a GenBank search using common names for the gene. If the gene has
|
|
been sequenced, its sequence record can be retrieved from GenBank. The accession number
|
|
obtained from that record can be entered into the text search tool of the Human Transcript
|
|
Map (available from the NCBI home page). Please note that the number of transcripts mapped
|
|
in this study is estimated to represent one-fifth of the total number of genes in the
|
|
human genome so the odds are that a gene has not yet been mapped.</p>
|
|
<i>
|
|
|
|
<p align="left"><em>I am interested in performing non-redundant BLAST searches on some
|
|
sequences I have. Is there a way to do this for new GenBank entries on a regular basis?</i></em></p>
|
|
|
|
<p align="left">Perform one BLAST search using the nonredundant database, nr. After that,
|
|
you can use BLAST to search the month database only. The month database has all new
|
|
records from the last month and is obviously much smaller than nr. It was intended for
|
|
this kind of surveillance blasting.</font></p>
|
|
|
|
<p><a href="#toc">Return to Table of Contents</a></p>
|
|
|
|
<hr>
|
|
|
|
<h3><a name="FTP">NCBI Data by FTP</a> </h3>
|
|
|
|
<p align="left"><font face="Times">The NCBI FTP site contains a variety of directories
|
|
with publicly available databases and software. The available directories include
|
|
‘repository’, ‘genbank’, ‘entrez’, ‘toolbox’,
|
|
‘pub’, and ‘sequin’.</font></p>
|
|
|
|
<p align="left"><font face="Times">The <b>repository </b>directory makes a number of
|
|
molecular biology databases available to the scientific community. This directory includes
|
|
databases such as PIR 53.0, Swiss-Prot, CarbBank, AceDB, and FlyBase.</font></p>
|
|
|
|
<p align="left"><font face="Times">The <b>genbank </b>directory contains files with the
|
|
latest full release of GenBank, the daily cumulative updates, and the latest release
|
|
notes.</font></p>
|
|
|
|
<p align="left"><font face="Times">The <b>entrez </b>directory contains the client
|
|
software for Network Entrez.</font></p>
|
|
|
|
<p align="left"><font face="Times">The <b>toolbox </b>directory contains a set of software
|
|
and data exchange specifications that are used by NCBI to produce portable software, and
|
|
includes ASN.1 tools and specifications for molecular sequence data. </font></p>
|
|
|
|
<p align="left"><font face="Times">The <b>pub </b>directory offers public-domain software,
|
|
such as BLAST (sequence similarity search program). Client software for Network BLAST and
|
|
PowerBlast is also included in this directory.</font></p>
|
|
|
|
<p align="left"><font face="Times">The <b>sequin </b>directory contains the new Sequin
|
|
submission software for Mac, PC, and UNIX platforms.</font></p>
|
|
|
|
<p align="left"><font face="Times">Data in these directories can be transferred through
|
|
the Internet by using the Anonymous FTP program. To connect, type: <b>ftp
|
|
ncbi.nlm.nih.gov. </b>Enter <b>anonymous </b>as the login name, and enter your e-mail
|
|
address as the password. Then change to the appropriate directory. For example, change to
|
|
the repository directory (cd repository) to download specialized databases.</font></p>
|
|
|
|
<p><a href="#toc">Return to Table of Contents</a></p>
|
|
|
|
<hr>
|
|
|
|
<h3 align="left"><a name="Pubs">Selected Recent Publications by NCBI Staff</a> </h3>
|
|
|
|
<p align="left"><font face="Times"><strong>Altschul, SF, TL Madden, AA Schaffer, J Zhang,
|
|
Z Zhang, </strong>W Miller, and <b>DJ Lipman</b>. Gapped BLAST and PSI-BLAST: a new
|
|
generation of protein database search programs. <i>Nucleic Acids Res</i> 25:3389–402,
|
|
1997.</font></p>
|
|
|
|
<p align="left"><font face="Times"><strong>Baxevanis, AD</strong> and <b>D Landsman</b>.
|
|
Histones and histone fold sequences and structures: a database. <i>Nucleic Acids Res</i>
|
|
25:272–3, 1997.</font></p>
|
|
|
|
<p align="left"><font face="Times"><strong>Galperin, MY</strong>. Sequence analysis of an
|
|
exceptionally conserved operon suggests enzymes for a new link between histidine and
|
|
purine biosynthesis. <i>Mol Microbiol</i> 24:443–5, 1997.</font></p>
|
|
|
|
<p align="left"><font face="Times"><strong>Leipe, DD</strong>. Biodiversity, genomes and
|
|
DNA sequence databases. <i>Curr Opin Genet Dev</i> 6:686–91, 1996.</font></p>
|
|
|
|
<p align="left"><font face="Times"><strong>Makalowski, W</strong>.<b> </b>Mermaid: a
|
|
not-so-new family of human repetitive elements. <i>Hum Genet</i> 99:696–7, 1997.</font></p>
|
|
|
|
<p align="left"><font face="Times"><strong>Mushegian, AR</strong> and <b>EV Koonin</b>.
|
|
Sequence analysis of eukaryotic developmental proteins: ancient and novel domains.<i>
|
|
Genetics </i>144:817–28, 1996.</font></p>
|
|
|
|
<p align="left"><font face="Times"><strong>Neuwald, AF</strong>, DJ Liu, <b>DJ Lipman, </b>and
|
|
<b>CE Lawrence</b>. Extracting protein alignment models from the sequence database. <i>Nucleic
|
|
Acids Res</i> 25:1655–77, 1997.</font></p>
|
|
|
|
<p align="left"><font face="Times"><strong>Schuler, GD</strong>. Sequence mapping by
|
|
electronic PCR. <i>Genome Res</i> 7:541–50, 1997.</font></p>
|
|
|
|
<p align="left"><font face="Times"><strong>Schuler, GD, MS Boguski</strong>, EA Stewart,
|
|
LD Stein, G Gyapay, et al. A gene map of the human genome. <i>Science </i>27:540–6,
|
|
1996.</font></p>
|
|
|
|
<p align="left"><font face="Times"><strong>Wolfsberg, TG </strong>and <b>D Landsman</b>. A
|
|
comparison of expressed sequence tags (ESTs) to human genomic sequences. <i>Nucleic Acids
|
|
Res </i>25:1626–32, 1997.</font></p>
|
|
|
|
<p align="left"><font face="Times"><strong>Zhang, Z </strong>and <b>TL Madden</b>.
|
|
PowerBLAST: A new network BLAST application for interactive or automated sequence analysis
|
|
and annotation. <i>Genome Res</i> 7:649–56, 1997.</font></p>
|
|
|
|
<p><a href="#toc">Return to Table of Contents</a></p>
|
|
|
|
<hr>
|
|
|
|
<h4 align="left"><a name="Masthead">Masthead</a></h4>
|
|
|
|
<p align="left"><font face="Times"><em>NCBI News</em> is distributed two to three times a
|
|
year. We welcome communication from users of NCBI databases and software and invite
|
|
suggestions for articles in future issues. Send correspondence and suggestions to <i>NCBI
|
|
News</i> at the address below.</font></p>
|
|
|
|
<p align="left"><font face="Times">NCBI News<br>
|
|
National Library of Medicine<br>
|
|
Bldg. 38A, Room 8N-803<br>
|
|
8600 Rockville Pike<br>
|
|
Bethesda, MD 20894<br>
|
|
Phone: (301) 496-2475<br>
|
|
Fax: (301) 480-9241<br>
|
|
E-mail: <a href="mailto:info@ncbi.nlm.nih.gov">info@ncbi.nlm.nih.gov</font></a></p>
|
|
|
|
<p align="left"><font face="Times"><em>Editors</em><br>
|
|
Dennis Benson<br>
|
|
Barbara Rapp</font></p>
|
|
|
|
<p align="left"><font face="Times"><em>Design Consultant</em><br>
|
|
Troy M. Hill</font></p>
|
|
|
|
<p align="left"><font face="Times"><em>Photography</em><br>
|
|
Karlton Jackson</font></p>
|
|
|
|
<p align="left"><font face="Times"><em>Writing, Editing, Graphics, and Production</em><br>
|
|
Veronica Johnson<br>
|
|
Donna Roscoe</font></p>
|
|
|
|
<p align="left"><font face="Times">In 1988, Congress established the National Center for
|
|
Biotechnology Information as part of the National Library of Medicine; its charge is to
|
|
create information systems for molecular biology and genetics data, and to perform
|
|
research in computational molecular biology.</font></p>
|
|
|
|
<p align="left"><font face="Times">The contents of this newsletter may be reprinted
|
|
without permission. The mention of trade names, commercial products, or organizations does
|
|
not imply endorsement by NCBI, NIH, or the U.S. Government. </font></p>
|
|
|
|
<p align="left"><font face="Times">NIH Publication No. 97-3272<br>
|
|
ISSN 1060-8788</font></p>
|
|
|
|
<p><a href="#toc">Return to Table of Contents</a> </p>
|
|
|
|
<hr>
|
|
</body>
|
|
</html>
|