nih-gov/www.ncbi.nlm.nih.gov/WebSub/html/help/organism.html

119 lines
4.4 KiB
HTML

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<title>BankIt Submission Help: Organism Table</title>
<link rel="stylesheet" href="../../css/bankit.13.6.css" type="text/css">
<link rel="stylesheet" type="text/css" href="../../css/sp_3_74_ncbi_header.13.6.css">
<link rel="stylesheet" type="text/css" href="../../css/sp_1_82_layout.13.6.css">
</head>
<body class="help">
<header id="ncbi_header" class="ncbi-header" role="banner">
<div class="usa-grid">
<div class="usa-width-one-whole">
<div class="ncbi-header__logo">
<a href="https://www.ncbi.nlm.nih.gov/" class="logo" aria-label="NCBI Logo"
data-ga-action="click_image" data-ga-label="NIH NLM Logo">
<img src="https://www.ncbi.nlm.nih.gov/coreutils/nwds/img/logos/AgencyLogo.svg"
alt="NIH NLM Logo">
</a>
</div>
</div>
</div>
</header>
<h1>Providing Organism for Multiple Sequences</h1>
<div class="border1"><p>If you have not included organism names in your
nucleotide FASTA definition lines, BankIt can accept organism names for
multiple sequences in two ways: as a <strong>tab-delimited</strong> text
file (as described below) or by applying the same organism to all sequences
in the set using the web input form.
</p>
<p>To apply the same organism name (for example, in a population set) input
the name in the 'Name' field on the 'Organism and Source Modifiers' page. A
prompt in that field will display our most common organism names based on
your input; this is NOT an exhaustive list. Unique descriptive information
(such as strains, isolates, clones, or specimen vouchers) can then be
provided in a Source Modifier table.
</p>
<p>To provide different organism names, a two-column, tab-delimited table
containing the Sequence_ID and organism name for every sequence in the
submission is used.</p>
</div>
<h2>Contents of the Organism Table</h2>
<div class="border1"><p>The first row in the table contains the headers for
each column: Sequence_ID and Organism, which must be verbatim, as in the
example table below.</p>
<p>The first column contains the Sequence_IDs used to identify each sequence in the <a href="fasta.html">nucleotide FASTA file</a>.</p>
<p>The second column contains the name of the organism from which the
corresponding sequence was isolated.</p>
<p>Each sequence in the set must have a line in the Organism table. </p>
<p>Each Sequence_ID may appear only once in the Organism table.</p>
<table class="example">
<caption>Sample Organism Table</caption>
<tr>
<td nowrap> Sequence_ID </td>
<td nowrap> Organism </td>
</tr>
<tr>
<td nowrap> Seq1 </td>
<td nowrap> Carpodacus mexicanus </td>
</tr>
<tr>
<td nowrap> Seq2 </td>
<td nowrap> Uncultured Bacillus sp. </td>
</tr>
<tr>
<td nowrap> Seq3 </td>
<td nowrap> Phalaenopsis equestris var. leucaspis</td>
</tr>
<tr>
<td nowrap> Seq4 </td>
<td nowrap> Uncultured Archaeon </td>
</tr>
<tr>
<td nowrap> Seq5 </td>
<td nowrap> Influenza A virus </td>
</tr>
<tr>
<td nowrap> Seq6 </td>
<td nowrap> Vireo gilvus </td>
</tr>
<tr>
<td nowrap> Seq7 </td>
<td nowrap> Pelecanus erythrorhynchos </td>
</tr>
<tr>
<td nowrap> Seq8 </td>
<td nowrap> Hippodamia tredecimpunctata tibialis</td>
</tr>
<tr>
<td nowrap> Seq9 </td>
<td nowrap> Petunia integrifolia subsp. inflata</td>
</tr>
<tr>
<td nowrap> Seq10 </td>
<td nowrap> Fusarium oxysporum f. tuberosi</td>
</tr>
<tr>
<td nowrap> Seq11 </td>
<td nowrap> Dendroica tigrina </td>
</tr>
</table>
<p><a href="sample_files/org-table-sample.txt">Sample Organism Table</a> (right-click to save) as a <strong>tab-delimited</strong> text file.</p>
</div>
<h2>Saving the Organism Table</h2>
<div class="border1"><p>When using a spreadsheet program, be sure to save your file as <strong>tab-delimited</strong> text. If you are not sure that the "Save" option in your program will do this for you, use "Save As..."</p>
<p>In Excel, select "Save As..." from the File menu. In the "Save as type:" pull-down menu, select "Text (Tab delimited) (*.txt)."</p>
</div>
</body>
</html>