<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<title>BankIt Submission Help: TPA Table</title>
<link rel="stylesheet" href="../../css/bankit.13.6.css" type="text/css">
<link rel="stylesheet" type="text/css" href="../../css/sp_3_74_ncbi_header.13.6.css">
<link rel="stylesheet" type="text/css" href="../../css/sp_1_82_layout.13.6.css">
</head>
<body class="help">
<header id="ncbi_header" class="ncbi-header" role="banner">
<div class="usa-grid">
<div class="usa-width-one-whole">
<div class="ncbi-header__logo">
<a href="https://www.ncbi.nlm.nih.gov/" class="logo" aria-label="NCBI Logo"
data-ga-action="click_image" data-ga-label="NIH NLM Logo">
<img src="https://www.ncbi.nlm.nih.gov/coreutils/nwds/img/logos/AgencyLogo.svg"
alt="NIH NLM Logo">
</a>
</div>
</div>
</div>
</header>
<h1>Setting TPA Accessions</h1>
<div class="border1"><p>BankIt accepts accessions for a TPA submission in two
ways: as a <strong>tab-delimited</strong> text file (as described below), or
by applying the same accession(s) to all sequences in the set through a web form.</p>
</div>
<h2>Reporting Primary Accession Numbers for Multiple TPA Sequence Submissions</h2>
<div class="border1"><p>BankIt accepts primary accession numbers for multiple
sequences in a two-column, tab-delimited table format. The table contains
the Sequence_ID and Primary Accession Numbers for every sequence in the
submission, as described below.</p>
<p>Accession numbers for GenBank primary nucleotide data can take the
following formats:</p>
<ul>
<li>GenBank: U12345, AY987654, FJ345678, GQ123456</li>
<li>GenBank EST data: CJ012345, CK334455</li>
<li>WGS (Whole Genome Shotgun): AABB01000123, ABCD02000789 (must be the
specific WGS record)</li>
<li>Trace Archive: TI123456789, TI1133557799</li>
</ul>
<p>Accession numbers of Reference Sequence (RefSeq) and CON (assembled) records
cannot be cited because they do not represent primary sequence data.
</p>
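<p>As a rough pre-submission check, the example formats above can be turned into patterns. The following is a minimal Python sketch: the patterns are inferred only from the examples on this page and are assumptions, not the official accession grammar, and <code>classify_accession</code> is a hypothetical helper name. EST accessions have the same shape as other GenBank accessions here, so they are reported as "GenBank".</p>

```python
import re

# Patterns inferred only from the examples listed on this page; they are
# illustrative assumptions, not NCBI's official accession grammar.
PATTERNS = {
    "GenBank": re.compile(r"[A-Z]\d{5}|[A-Z]{2}\d{6}"),  # U12345, AY987654
    "WGS": re.compile(r"[A-Z]{4}\d{8}"),                 # AABB01000123
    "Trace Archive": re.compile(r"TI\d+"),               # TI123456789
}


def classify_accession(accession):
    """Return the name of the first pattern matching the whole string, or None."""
    for name, pattern in PATTERNS.items():
        if pattern.fullmatch(accession):
            return name
    return None
```

<p>An accession that matches none of these patterns, such as a RefSeq accession with an underscore (for example NM_000546), returns <code>None</code> and is worth checking by hand before submission.</p>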
</div>
<h2>Contents of the TPA Accessions Table</h2>
<div class="border1"><p>The first row in the table contains the headings for
each column: Sequence_ID and Accession. These headings must be used verbatim.</p>
<p>The first column contains the Sequence_IDs used to identify each sequence in the <a href="fasta.html">nucleotide FASTA file</a>.</p>
<p>The second column contains the primary Accession numbers from which the TPA sequence was derived or assembled.</p>
<p>Each sequence in the submission must have a line in the table. </p>
<p>Each Sequence_ID may appear only once in the table.</p>
<p>Multiple primary sequence Accession Numbers reported for a TPA sequence should be separated by commas.</p>
<table class="example">
<caption>Sample TPA Accessions Table</caption>
<tr>
<td nowrap> Sequence_ID </td>
<td nowrap> Accession </td>
</tr>
<tr>
<td nowrap> Seq1 </td>
<td nowrap> DQ434518,DQ432823,DQ434519 </td>
</tr>
<tr>
<td nowrap> Seq2 </td>
<td nowrap> DQ433267 </td>
</tr>
<tr>
<td nowrap> Seq3 </td>
<td nowrap> DQ434582 </td>
</tr>
<tr>
<td nowrap> Seq4 </td>
<td nowrap> DQ433263,DQ433262,DQ433261,DQ434806 </td>
</tr>
<tr>
<td nowrap> Seq5 </td>
<td nowrap> DQ433569 </td>
</tr>
<tr>
<td nowrap> Seq6 </td>
<td nowrap> DQ434582 </td>
</tr>
<tr>
<td nowrap> Seq7 </td>
<td nowrap> DQ433889,DQ433888,DQ433066 </td>
</tr>
</table>
<p>Download the <a href="sample_files/tpa-table-sample.txt">Sample TPA Accessions Table</a> (right-click to save) as a <strong>tab-delimited</strong> text file.</p>
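<p>The rules above can be checked locally before uploading. The following Python sketch is one possible approach, not BankIt's own validation: <code>check_tpa_table</code> and its <code>fasta_ids</code> argument (the Sequence_IDs taken from your FASTA file) are hypothetical names introduced for illustration.</p>

```python
import csv
import io

EXPECTED_HEADER = ["Sequence_ID", "Accession"]


def check_tpa_table(text, fasta_ids):
    """Return a list of problems found in a tab-delimited TPA table."""
    rows = list(csv.reader(io.StringIO(text), delimiter="\t"))
    problems = []
    # The headings must appear verbatim in the first row.
    if not rows or rows[0] != EXPECTED_HEADER:
        problems.append("first row must be Sequence_ID and Accession, tab-separated")
        return problems
    seen = set()
    for line_no, row in enumerate(rows[1:], start=2):
        if not any(cell.strip() for cell in row):
            continue  # ignore blank lines
        if len(row) != 2:
            problems.append(f"line {line_no}: expected 2 columns, found {len(row)}")
            continue
        seq_id = row[0].strip()
        accessions = [item.strip() for item in row[1].split(",")]
        # Each Sequence_ID may appear only once.
        if seq_id in seen:
            problems.append(f"line {line_no}: duplicate Sequence_ID {seq_id}")
        seen.add(seq_id)
        # Every comma-separated entry must be non-empty.
        if not all(accessions):
            problems.append(f"line {line_no}: empty accession for {seq_id}")
    # Each sequence in the submission must have a line in the table.
    for missing in sorted(set(fasta_ids) - seen):
        problems.append(f"no table line for sequence {missing}")
    return problems
```

<p>An empty list means the table passed these checks. The sketch does not verify that the accessions exist or that they refer to primary sequence data.</p>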
</div>
<h2>Saving the TPA Accessions Table</h2>
<div class="border1"><p>When using a spreadsheet program, be sure to save your file as <strong>tab-delimited</strong> text. If you are not sure that the "Save" option in your program will do this for you, use "Save As..."</p>
<p>In Excel, select "Save As..." from the File menu. In the "Save as type:" pull-down menu, select "Text (Tab delimited) (*.txt)".</p>
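<p>If the accessions are already available in a script rather than a spreadsheet, the file can also be written directly. This is a minimal Python sketch using the standard <code>csv</code> module. <code>write_tpa_table</code> and the mapping it takes are hypothetical names, and the UTF-8 encoding and line ending are assumptions, not documented BankIt requirements.</p>

```python
import csv


def write_tpa_table(path, accessions_by_id):
    """Write a two-column, tab-delimited TPA accessions table.

    accessions_by_id maps each Sequence_ID to a list of primary accessions.
    """
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
        # Headings must be used verbatim.
        writer.writerow(["Sequence_ID", "Accession"])
        for seq_id, accessions in accessions_by_id.items():
            # Multiple accessions for one sequence are joined with commas.
            writer.writerow([seq_id, ",".join(accessions)])
```

<p>Because the delimiter is a tab, the commas inside the Accession column are written as-is and are not quoted.</p>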
</div>
</body>
</html>