Setting TPA Accessions

BankIt accepts accessions for TPA submission in two ways, as a tab-delimited text file (as described below) or by applying the same accession(s) to all sequences in the set with a web form.

Reporting Primary Accession Numbers for Multiple TPA Sequence Submissions

BankIt accepts primary accession numbers for multiple sequences in a two-column, tab-delimited table format. The table contains the Sequence_ID and Primary Accession Numbers for every sequence in the submission, as described below.

Accession numbers for GenBank primary nucleotide data can be of the following format:

Accession numbers of Reference Sequence (RefSeq) and CON(assembled) records cannot be cited since they do not represent primary sequence data.

Contents of the TPA Accessions Table

The first row in the table contains the headings for each column:Sequence_ID and Accession (these headings must be used verbatim)

The first column contains the Sequence_IDs used to identify each sequence in the nucleotide FASTA file.

The second column contains the primary Accession numbers from which the TPA sequence was derived or assembled.

Each sequence in the submission must have a line in the table.

Each Sequence_ID may appear only once in the table.

Multiple primary sequence Accession Numbers reported for a TPA sequence should be separated by comma.

Sample TPA Accessions Table
Sequence_ID Accession
Seq1 DQ434518,DQ432823,DQ434519
Seq2 DQ433267
Seq3 DQ434582
Seq4 DQ433263,DQ433262,DQ433261,DQ434806
Seq5 DQ433569
Seq6 DQ434582
Seq7 DQ433889,DQ433888,DQ433066

Sample TPA Accessions Table (right-click to save) as a tab-delimited text file.

Saving the TPA Accessions Table

When using a spreadsheet program, be sure to save your file as tab-delimited text. If you are not sure that the "Save" option in your program will do this for you, use "Save As..."

In Excel, select "Save As..." from the File menu. In the "Save as type:" pull-down menu, select "Text (Tab delimited) (*.txt)."