nih-gov/www.ncbi.nlm.nih.gov/geo/info/seq.html
2025-03-17 02:05:34 +00:00

532 lines
34 KiB
HTML

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
<head>
<title>Submitting high-throughput sequence data to GEO - GEO - NCBI</title>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
<meta name="author" content="geo" />
<meta name="keywords" content="NCBI, national institutes of health, nih, database, archive, central, bioinformatics, biomedicine, geo, gene, expression, omnibus, chips, microarrays, oligonucleotide, array, sage, CGH" />
<meta name="description" content="Gene Expression Omnibus (GEO) is a database repository of high throughput gene expression data and hybridization arrays, chips, microarrays." />
<meta name="ncbiaccordion" content="collapsible: true, active: false" />
<meta name="ncbi_app" content="geo" />
<meta name="ncbi_pdid" content="documentation" />
<meta name="ncbi_page" content="Submitting high-throughput sequence data to GEO" />
<link rel="shortcut icon" href="/geo/img/OmixIconBare.ico" />
<link rel="stylesheet" type="text/css" href="/geo/css/reset.css" />
<link rel="stylesheet" type="text/css" href="/geo/css/nav.css" />
<link rel="stylesheet" type="text/css" href="/geo/css/info.css" />
<script type="text/javascript" src="/core/jig/1.15.10/js/jig.min.js"></script>
<script type="text/javascript" src="/geo/js/dd_menu.js"></script>
<script type="text/javascript" src="/geo/js/info.js"></script>
<script type="text/javascript">
jQuery.getScript("/core/alerts/alerts.js", function () {
galert(['#crumbs_login_bar', 'body &gt; *:nth-child(1)'])
});
</script>
<script type="text/javascript">
var ncbi_startTime = new Date();
</script>
</head>
<body id="info" class="seq">
<div id="all">
<div id="page">
<div id="header">
<div id="ncbi_logo">
<a href="/">
<img src="/geo/img/ncbi_logo.gif" alt="NCBI Logo" />
</a>
</div>
<div id="geo_logo">
<a href="/geo/"><img src="/geo/img/geo_main.gif" alt="GEO Logo" /></a>
</div>
</div>
<div id="nav_bar">
<ul id="geo_nav_bar">
<li><a href="#">GEO Publications</a>
<ul class="sublist">
<li><a href="/geo/info/GEOHandoutFinal.pdf">Handout</a></li>
<li><a href="/pmc/articles/PMC10767856/">NAR 2024 (latest)</a></li>
<li><a href="/pmc/articles/PMC99122/">NAR 2002 (original)</a></li>
<li><a href="/pmc/?term=10767856,4944384,3531084,3341798,3013736,2686538,2270403,1669752,1619900,1619899,539976,99122">All publications</a></li>
</ul>
</li>
<li><a href="/geo/info/faq.html">FAQ</a></li>
<li><a href="/geo/info/MIAME.html" title="Minimum Information About a Microarray Experiment">MIAME</a></li>
<li><a href="mailto:geo@ncbi.nlm.nih.gov">Email GEO</a></li>
</ul>
</div>
<div id="crumbs_login_bar"><a title="NCBI home page" href="/">NCBI</a> »
<a id="curr_page" title="GEO home page" href="/geo/">GEO</a> »
<a title="GEO documentation guide" href="/geo/info/">Info</a> »
<span>Submitting high-throughput sequence data to GEO</span><span id="login_status"><a href="/geo/submitter/" title="Click here to login. You need to do this only if you want to edit the contact information, submit data, see your unreleased data, or work with data already submitted by you. You do not need to login if you are here just to browse through public holdings">Login</a></span></div>
<div id="content">
<a name="top" id="top"></a>
<h1>Submitting high-throughput sequence data to GEO</h1>
<ul class="doc_list">
<li><a href="#deposit">Submission instructions</a><a href="https://youtu.be/RqkRPcF38Lw" target="_blank" title="Tutorial video"><img id="yt_icon" src="/geo/img/yt_icon.png" alt="Tutorial video" /></a>
<ul>
<li><a href="#metadata">Metadata spreadsheet <span class="red">REQUIRED</span></a></li>
<li><a href="#processed">Processed data files <span class="red">REQUIRED</span></a></li>
<li><a href="#raw">Raw data files <span class="red">REQUIRED</span></a></li>
</ul>
</li>
<li><a href="#video">Tutorial video</a></li>
<li><a href="#compression">Data file compression</a></li>
<li><a href="#singlecell">Single-cell studies</a></li>
<li><a href="#nanostring">NanoString GeoMx Digital Spatial Profiling (DSP) </a></li>
<li><a href="#organizing">Organizing your submission</a></li>
<li><a href="#data">Uploading your submission</a></li>
<li>General information
<ul>
<li><a href="#intro">Data provisions, standards and administration</a></li>
<li><a href="#categories">Categories of sequence submissions accepted by GEO</a></li>
</ul>
</li>
</ul>
<div class="highlight">
<b>WARNING:</b> If you are submitting human data, it is your responsibility to comply with
<a href="/geo/info/faq.html#patient">Human Subject Guidelines</a>.
</div>
<a name="deposit" id="deposit"></a>
<h2>Submission instructions <a class="arrow" title="Back to top" href="#top">Back to top</a></h2>
<p>
GEO accepts next generation sequence data that examine quantitative gene expression, gene regulation,
epigenomics or other aspects of functional genomics using methods such as RNA-seq (including single-cell), miRNA-seq, ChIP-seq,
RIP-seq, HiC-seq, methyl-seq, etc. We process all components of your study, including the samples,
project description, processed data files, and we submit the raw data files to the
<a href="/sra/">Sequence Read Archive (SRA)</a> on your behalf.
</p>
<ol>
<li>Check that GEO accepts your <a href="#categories">data type</a>. </li>
<li>Gather <a href="#raw">raw files</a>.</li>
<li>Gather <a href="#processed">processed data files</a>.</li>
<li>
Download metadata spreadsheet and fill in Metadata tab for your study.
Use one spreadsheet per data type (e.g., ChIP-seq, RNA-seq).
</li>
<li>In the metadata spreadsheet file, list the <a href="#raw">MD5 checksum</a> for all raw and processed data files in the 'MD5 Checksums' tab.</li>
<li>
Create single folder on your computer that contains all raw and processed data files for your experiment.
If you have multiple data types, please use one folder per experiment.
</li>
<li>Transfer your data to GEO by FTP using <a href="/geo/info/submissionftp.html">these instructions</a>.</li>
<li>After FTP transfer has completed, submit metadata file(s)
on the <a href="https://submit.ncbi.nlm.nih.gov/geo/submission/meta/">Submit to GEO</a> page.</li>
</ol>
<p>More information on required components:</p>
<ul id="deposit_components" class="doc_list">
<li><span>
<a name="metadata" id="metadata"></a>
<b>Metadata spreadsheet</b>
<p><span class="button"><a href="/geo/info/examples/seq_template.xlsx">Download metadata spreadsheet</a></span></p>
<p>
Metadata refers to descriptive information about the overall study, individual samples,
all protocols, and references to processed and raw data file names.
Information is supplied by completing all fields of a metadata template spreadsheet.
Guidelines on the content of each field are provided within the spreadsheet.
</p>
<p>
Provide enough details so that users can get a general understanding of the study and samples from the GEO records.
Please spell out acronyms and abbreviations. Submit a separate metadata spreadsheet for each data type.
</p>
<p>
<a name="sra" id="sra"></a><b>Have you already submitted raw data to SRA and now want to submit to GEO?</b><br />
If you already have your raw data in SRA, you do not need to submit it again to GEO.
We need only processed data and a specialized metadata file in order to create GEO
records and link them to your raw data records previously submitted to SRA.
</p>
<p><span class="button"><a href="/geo/info/examples/seq_template_with_sra_accessions.xlsx">Download metadata spreadsheet with SRA accessions</a></span></p>
<p>
You will need to enter the PRJNA, SAMN and SRX or SRR for all samples with raw data already submitted to SRA.
You can get that information for your SUB ID on <a href="https://submit.ncbi.nlm.nih.gov/subs/">the Submission Portal page</a>.
</p>
</span></li>
<li><span>
<a name="processed" id="processed"></a>
<b>Processed data files </b>
<p>
GEO requires that submitters deposit the processed data that support the findings of their study.
The processed data should have a quantitative component, such as gene abundances or other count data.
Please do not submit alignment files (e.g., BAM, SAM, BED) as processed data, as these are considered
intermediary files and do not include a quantitative component. When standard alignments are the
only processed data available, please <a href="mailto:geo@ncbi.nlm.nih.gov">write to us</a>
to inquire about whether your data are suitable for submission to GEO.
</p>
<p>
Processed data format and content will depend on the data type: RNA-seq processed data can include raw and/or normalized counts (FPKM, TPM, etc) of sequencing
reads for the features of interest (protein-coding genes, lncRNA, miRNA, circRNA, etc).
</p>
<p>
ChIP-Seq and ATAC-seq processed data can include peak files with quantitative data, tag density files, etc. Common formats include WIG, bigWig, bedGraph.
Please leave files in native format and do not paste peak data into Excel.
</p>
<p>
Methylation data are often provided as average beta values.
</p>
<p>Processed data guidelines:</p>
<ul class="geo_doc_list">
<li><span>
Processed data may be formatted either as a matrix table or individual files for each sample.
</span></li>
<li><span>
If processed data for all samples is submitted in a matrix table, column headers should match
the library name for each sample listed in the SAMPLES section of the metadata spreadsheet.
</span></li>
<li><span>
Provide <a href="/geo/info/faq.html#summarysubset">complete data</a> with values for all features (e.g., genes) and all samples.
Do not submit lists of genes identified with differential expression.
</span></li>
<li><span>
Features (e.g., genes, transcripts) in processed data files should be traceable using public accession numbers or chromosome coordinates.
The reference assembly used (e.g., hg19, mm9, GCF_000001405.13) should be provided in the metadata spreadsheet.
</span></li>
<li><span>
If you provide WIG, bedGraph, GFF, or GTF files, please refer to the UCSC file format FAQ for format requirements.
</span></li>
</ul>
</span></li>
<li><span>
<a name="raw" id="raw"></a>
<b>Raw data files</b>
<p>
Raw data are a required part of GEO submissions. The raw data files should be the original files containing reads and quality scores,
as generated by the sequencing instrument. Edited files may not be processed correctly by SRA.
</p>
<p>
Raw data for high throughput sequencing studies submitted to GEO will be brokered to SRA for you.
</p>
<p>
Raw data can instead be submitted directly to <a href="https://submit.ncbi.nlm.nih.gov/about/sra/">SRA</a>.
After you have received the SRA accessions, please see <a href="/geo/info/seq.html#deposit">above</a>
for instructions and <a href="/geo/info/examples/seq_template_with_sra_accessions.xlsx">specific template</a> for this case.
Please submit the metadata and processed data to GEO.
</p>
<p>
If your raw data files exceed 2 terabytes (TB), submit them directly to <a href="https://submit.ncbi.nlm.nih.gov/about/sra/">SRA</a>.
After you have received the SRA accessions, please see <a href="/geo/info/seq.html#deposit">above</a>
for instructions and <a href="/geo/info/examples/seq_template_with_sra_accessions.xlsx">specific template</a> for this case.
Please submit the metadata and processed data to GEO.
</p>
<p>
If the R1 and R2 files for one library exceed 600 GB, split them into smaller files so that they will load efficiently.
R1/R2 file pairs larger than 600GB will be delayed in loading.
</p>
<p>
<b>Raw Data File Formats</b>: Acceptable file formats include FASTQ, as well as other formats described in the
<a href="/sra/docs/submitformats/">SRA File Format Guide</a>.
Files that do not conform to supported format requirements will be deleted from our systems.
</p>
<p class="red">
Raw file names should not include white space or special characters such as: /, &amp;, #, % or any non-ascii characters. All raw files must have unique names.
</p>
<p>
<b>Barcode/Multiplexed Data</b>: For bulk RNA-seq studies, we require that raw data files be demultiplexed so that each
barcoded sample ends up with a dedicated run file. Most single-cell sequencing studies studies should be submitted with multiplexed raw data files
(e.g. 10x Genomics, Drop-Seq, InDrops).
</p>
<p>
<b>Paired-end Experiments</b>: We usually expect 2 files per run (3 or 4 files per run when sequences
and qualities are included in separate files).
</p>
<p>
<b>MD5 Checksums</b>: We recommend that submitters provide MD5 checksums for their raw data files.
The checksums are used to verify file integrity. Checksums can be calculated using the following methods:
<ul>
<li><b>Unix</b>: md5sum &lt;file&gt;</li>
<li><b>OS X</b>: md5 &lt;file&gt;</li>
<li><b>Windows</b>: Application required. Many are available for free download.</li>
</ul>
</p>
</span></li>
</ul>
<a name="video" id="video"></a>
<h2>Tutorial video <a class="arrow" title="Back to top" href="#top">Back to top</a></h2>
<iframe width="560" height="315" src="https://www.youtube.com/embed/RqkRPcF38Lw?si=Gqfz9NGmWWfxRul0" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen="true"></iframe>
<a name="compression" id="compression"></a>
<h2>Data file compression <a class="arrow" title="Back to top" href="#top">Back to top</a></h2>
<p>
<ul class="doc_list">
<li><span>
Individual files can be compressed to speed transfer, but this is not required.
<ul class="geo_doc_list">
<li><span>Acceptable compression formats are gzip and bzip2 (i.e. files ending with a .gz or .bz2 extension). </span></li>
<li><span>Never compress binary files (e.g., BAM, bigWig, bigBed, HDF5).</span></li>
<li><span>DO NOT upload ZIP archives (files with a .zip extension). Zip archives with fastq files are often corrupted and will delay submission processing.</span></li>
</ul>
</span></li>
<li><span>
Do not submit raw data files in tar archives. GEO must be able to easily access raw data files for QC purposes.
<ul class="geo_doc_list">
<li><span>Exception: You can submit tar.gz archives of base-called fast5 files for raw data produced by Nanopore instruments.</span></li>
</ul>
</span></li>
</ul>
</p>
<a name="singlecell" id="singlecell"></a>
<h2>Single-cell studies <a class="arrow" title="Back to top" href="#top">Back to top</a></h2>
<p>
Raw and processed data are required for single-cell data submissions. We are expecting the raw data for single-cell studies to be submitted in fastq format.
Fastq files are preferred for 10X Genomics studies so that they can be correctly archived in SRA format.
</p>
<p>
Single-cell data should be multiplexed so that each GEO sample record (GSM) represents many individual cells. If your data are not already multiplexed
and you have separate raw data files for each cell, please <a href="mailto:geo@ncbi.nlm.nih.gov">email GEO</a> for guidance with your submission.
</p>
<p>
Processed data for single-cell studies should be cell-level data. Files can be submitted as Cell Ranger software output files (barcodes.tsv, features.tsv, matrix.mtx), H5 or HDF5 archives, or RDS objects.
Processed data for single-cell TCR and BCR samples should include contig annotations and cell barcode information.
</p>
<p>
If you have used the Cell Ranger aggr pipeline and you are submitting H5/HDF5 archives, you must submit the aggregation.csv file.
</p>
<p>
When submitting multi-omics types of studies (ADT, HTO, TCR, BCR, GDO, CMO, LMO) and using 10X Genomics protocols and software you must submit the feature_reference.csv file so that the data can be correctly interpreted.
List different *omics libraries on separate rows in the SAMPLES section of the metadata spreadsheet:
</p>
<p>
<ul class="doc_list">
<li><span>sample1_GEX</span></li>
<li><span>sample1_TCR</span></li>
<li><span>sample1_ADT</span></li>
<li><span>sample1_HTO</span></li>
</ul>
</p>
<p>
If the feature_reference.csv file is not available, please submit a supplementary file named "feature_README.txt"
(<a href="/geo/info/examples/feature_README.txt">download the template for this file</a>)
with the following information as applicable for the included samples:
</p>
<div class="wide-scroll">
<table class="overview">
<thead>
<tr>
<th>id</th>
<th>name</th>
<th>read</th>
<th>pattern</th>
<th>sequence</th>
<th>feature_type</th>
<th>multiplexed_sample</th>
<th>target_gene_id</th>
<th>target_gene_name</th>
<th>mhc_allele</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADT_1</td>
<td>TotalSeq-A0106 anti-mouse CD11c</td>
<td>R2</td>
<td>5P(BC)</td>
<td>GTTATGGACGCTTGC</td>
<td>Antibody Capture</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>HTO_1</td>
<td>TotalSeq-A0301 anti-mouse Hashtag 1</td>
<td>R2</td>
<td>5P(BC)</td>
<td>ACCCACCAGTAAGAC</td>
<td>Multiplexing Capture</td>
<td>unimmunized mouse 1</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>GDO_1</td>
<td>GF1B_ch4:132977695-132977714</td>
<td>R2</td>
<td>GCATAGCTCTTTAAAC(BC)</td>
<td>TTTGGCAGGGCGTCCCATCC</td>
<td>CRISPR Guide Capture</td>
<td></td>
<td>ENSG00000165702</td>
<td>GFI1B</td>
<td></td>
</tr>
</tbody>
</table>
</div>
<a name="nanostring" id="nanostring"></a>
<h2>NanoString GeoMx Digital Spatial Profiling (DSP) <a class="arrow" title="Back to top" href="#top">Back to top</a></h2>
<p>
Some NanoString GeoMx® Digital Spatial Profiling (DSP) formats use high throughput sequencing methods to produce the raw data.
NanoString GeoMx DSP studies may analyze either RNA or Protein targets using hundreds to thousands of probes. Raw data should be submitted
in fastq format. Processed data may be submitted in supplementary text files that include matrices or you can submit the DCC files which
contain raw read counts along with RTS_IDs (probe barcodes). If DCC files are provided, you must also submit the PKC file that includes
probe metadata describing the targets and is needed to interpret RTS_IDs in the DCC files.
</p>
<a name="organizing" id="organizing"></a>
<h2>Organizing your submission <a class="arrow" title="Back to top" href="#top">Back to top</a></h2>
<p>
<ul class="doc_list">
<li><span>If you have one dataset, place all data files in a single folder given a meaningful name (e.g., geo_sub_RNAseq).</span></li>
<li><span>If you have multiple datasets, use a separate folder for each dataset (e.g., RNAseq, ChIPseq, HiC).</span></li>
<li><span>Do not use subfolders within your dataset folder.</span></li>
<li><span>All files must have unique names.</span></li>
</ul>
</p>
<p>
Here is an example of the folder structure:
</p>
<img alt="Folder structure" src="/geo/img/directory_structure_diagram.jpg" width="300" />
<a name="data" id="data"></a><a name="data_additional" id="data_additional"></a>
<h2>Uploading your submission <a class="arrow" title="Back to top" href="#top">Back to top</a></h2>
<div id="steps_accordion" data-jig="ncbiaccordion">
<h3>Submit a new high-throughput sequencing submission</h3>
<div>
<div class="submit_table">
<div class="submit_row">
<div class="submit_steps">
<p>
<b>1. Transfer all your raw and processed data files to the <a href="/geo/info/submissionftp.html">GEO FTP server</a>.</b><br /><br />
Submit a separate folder for each dataset you upload, e.g., RNAseq, ChIPseq, HiC. <br />
<span class="red">Do not upload the metadata file(s) by FTP.</span>
</p>
<span class="button"><a href="/geo/info/submissionftp.html">Transfer files</a></span>
</div>
<div class="submit_steps">
<p>
<b>2. Submit metadata.</b><br /><br />
Upload your metadata file(s) <span class="red">after FTP transfer is complete</span> (one metadata file for each dataset).
This step will place your submission into our processing queue for review.
</p>
<span class="button"><a href="https://submit.ncbi.nlm.nih.gov/geo/submission/meta/">Upload metadata</a></span>
</div>
</div>
</div>
</div>
<h3>Submit additional or replacement files for a submission in progress</h3>
<div>
<p>
You may use this option to upload files requested by curators or submit additional supplementary files for an existing record.
</p>
<div class="submit_table">
<div class="submit_row">
<div class="submit_steps">
<p>
1. Transfer the replaced or additional data files for your current GEO submission to the <a href="/geo/info/submissionftp.html">GEO FTP server</a>.
</p>
<span class="button"><a href="/geo/info/submissionftp.html">Transfer Files</a></span>
</div>
<div class="submit_steps">
<p>2. After the FTP transfer is complete, notify GEO using the <a href="https://submit.ncbi.nlm.nih.gov/geo/submission/">Submit to GEO</a> web form.</p>
<span class="button"><a href="https://submit.ncbi.nlm.nih.gov/geo/submission/">Notify GEO</a></span>
</div>
</div>
</div>
</div>
</div>
<a name="intro" id="intro"></a>
<h2>General information <a class="arrow" title="Back to top" href="#top">Back to top</a></h2>
<table class="overview columns">
<tbody>
<tr>
<td>
<b>Data provision and standards</b>
<p>GEO sequence submission procedures are designed to encourage provision of <a href="https://www.fged.org/projects/minseqe/">MINSEQE</a> elements:</p>
<ul class="geo_doc_list">
<li><span>Thorough descriptions of the biological samples under investigation, and procedures to which they were subjected</span></li>
<li><span>Thorough descriptions of the protocols used to generate and process the data</span></li>
<li><span>Final processed (or summary) data from which the conclusions in associated manuscripts are based</span></li>
<li><span>
Original raw data files containing sequence reads and quality scores, which will be uploaded to NCBI's
<a href="/sra/">Sequence Read Archive (SRA)</a> database.
</span></li>
</ul>
</td>
<td>
<b>Administration</b>
<p>All standard GEO administration and processing procedures apply to sequence submissions. These include: </p>
<ul class="geo_doc_list">
<li><span>Unique and stable GEO accession numbers are issued to studies; these accessions can be cited in manuscripts</span></li>
<li><span>GEO accession numbers are typically issued within 5 business days after completion of submission</span></li>
<li><span>Data can be held private until publication</span></li>
<li><span>Reviewers can have anonymous access to private records</span></li>
<li><span>Submitters can update their records at any time</span></li>
</ul>
<p> More information on these aspects is provided in our <a href="/geo/info/faq.html">FAQ</a>.</p>
</td>
</tr>
</tbody>
</table>
<a name="categories" id="categories"></a>
<h2>Categories of sequence submissions processed by GEO <a class="arrow" title="Back to top" href="#top">Back to top</a></h2>
<table class="overview columns">
<thead>
<tr>
<th>GEO accepts</th><th>GEO does not accept</th>
</tr>
</thead>
<tbody>
<td>
<p>
Studies concerning quantitative gene expression, gene regulation, epigenetics,
or other functional genomic studies.
</p>
<p>Examples include:</p>
<ul class="geo_doc_list">
<li><span>mRNA profiling, RNA-seq (<a href="/geo/query/acc.cgi?acc=GSE232692">example</a>) </span></li>
<li><span>small RNA profiling, miRNA-seq (<a href="/geo/query/acc.cgi?acc=GSE247194">example</a>)</span></li>
<li><span>ChIP-Seq (<a href="/geo/query/acc.cgi?acc=GSE269788">example</a>)</span></li>
<li><span>HiC-seq (<a href="/geo/query/acc.cgi?acc=GSE277492">example</a>)</span></li>
<li><span>methyl-seq, bisulfite-seq (<a href="/geo/query/acc.cgi?acc=GSE247853">example</a>)</span></li>
<li><span>single-cell RNA-seq (<a href="/geo/query/acc.cgi?acc=GSE240432">example</a>)</span></li>
<li><span>CITE-seq (<a href="/geo/query/acc.cgi?acc=GSE245950">example</a>)</span></li>
<li><span>NanoString (GeoMx Digital Spatial Profiling with HTS raw data: fastq) (<a href="/geo/query/acc.cgi?acc=GSE200563">example</a>) </span></li>
<li><span>Visium (<a href="/geo/query/acc.cgi?acc=GSE273439">example</a>)</span></li>
</ul>
<p>
If you have questions about whether GEO can accept your data type, please
<a href="mailto:geo@ncbi.nlm.nih.gov">e-mail GEO</a>.
</p>
</td>
<td>
<ul class="geo_doc_list">
<li><span>human data that require controlled access (submit to <a href="/sra/docs/submitdbgap/">dbGaP and controlled access SRA</a>)</span></li>
<li><span>transcript assemblies (submit directly to <a href="/sra/docs/submit/">SRA</a> and the <a href="/genbank/tsa/">Transcriptome Shotgun Assembly Database</a>)</span></li>
<li><span>whole genome sequencing (submit directly to <a href="/sra/docs/submit/">SRA</a> and <a href="/genbank/wgs.submit/">WGS</a>)</span></li>
<li><span>metagenomic sequencing (submit directly to <a href="/sra/docs/submit/">SRA</a>)</span></li>
<li><span>resequencing, human variation or copy number projects (submit directly to <a href="/sra/docs/submit/">SRA</a> and the appropriate NCBI <a href="/variation/">variation resource</a>)</span></li>
<li><span>survey sequencing, whole exome (submit directly to <a href="/sra/docs/submit/">SRA</a>)</span></li>
<li><span>mass spectrometry (submit directly to <a href="https://www.ebi.ac.uk/pride/archive/">PRIDE</a> resource at EBI)</span></li>
</ul>
<p>
For more information about submitting data to NCBI, please refer to the <a href="/home/submit-wizard.shtml">Submission Wizard</a>.
</p>
</td>
</tbody>
</table>
</div>
</div>
<div id="last_mod">
Last modified: October 8, 2024</div>
<div id="footer">
<span class="helpbar">|<a href="https://www.nlm.nih.gov"> NLM </a>|<a href="https://www.nih.gov"> NIH </a>|<a href="mailto:geo@ncbi.nlm.nih.gov"> Email GEO </a>|<a href="/geo/info/disclaimer.html"> Disclaimer </a>|<a href="https://www.nlm.nih.gov/accessibility.html"> Accessibility </a>|<a href="https://www.hhs.gov/vulnerability-disclosure-policy/index.html"> HHS Vulnerability Disclosure </a>|
</span>
</div>
</div>
<script type="text/javascript" src="https://www.ncbi.nlm.nih.gov/portal/portal3rc.fcgi/rlib/js/InstrumentOmnitureBaseJS/InstrumentNCBIBaseJS/InstrumentPageStarterJS.js"></script>
</body>
</html>