{"id":11265,"date":"2023-05-01T10:16:05","date_gmt":"2023-05-01T14:16:05","guid":{"rendered":"https:\/\/ncbiinsights.ncbi.nlm.nih.gov\/?p=11265"},"modified":"2023-05-01T10:16:05","modified_gmt":"2023-05-01T14:16:05","slug":"sequences-genbank-sra","status":"publish","type":"post","link":"https:\/\/ncbiinsights.ncbi.nlm.nih.gov\/2023\/05\/01\/sequences-genbank-sra\/","title":{"rendered":"Coming Soon! Including Sample Location and Collection Date and Time for Sequences Submitted to GenBank and SRA"},"content":{"rendered":"
As previously announced<\/a>, in collaboration with our partners at the International Nucleotide Sequence Database Collaboration (INSDC)<\/a>, we will begin to systematically gather \u2018location of collection\u2019 and \u2018date and time of collection\u2019 for sequence data submitted to GenBank<\/a> and the Sequence Read Archive<\/a> (SRA). Gathering information about where and when a biological sample was collected<\/a> aligns with other global sequence submission standardization efforts and will increase the utility of data made available through GenBank and SRA. These changes will be implemented in a phased approach through December 2024.<\/p>\n What\u2019s new?<\/h5>\n Sequence data submitted to GenBank and the SRA will need to include information about location and date and time of sample collection. These metadata will be entered using the pre-existing fields \u2018country\u2019 and \u2018collection_date.\u2019 Minimum information for these fields is described below. We encourage submitters to provide additional details when available: <\/p>\n Location of collection:<\/strong> Specification of where the biological sample was collected, at a minimum, by using the names for countries, oceans, or seas, from this list of locations<\/a>.<\/p>\n Date and time of collection: <\/strong>Date and time when the specimen was collected, at least to the nearest year, consistent with format guidance<\/a>.<\/p>\n In cases where this information cannot be provided (e.g., pathogen samples for which this information would lead to identifiability of a human) or is not relevant (e.g., study of a model organism lab stock or an established cell line), you can declare an appropriate exemption using the extended INSDC \u2018missing value\u2019 reporting standards<\/a>.<\/p>\n When will these changes take effect?<\/h5>\n By the end of May 2023<\/strong> for all newly registered BioSamples associated with GenBank and SRA data. 
We will update BioSample packages to require this information at the point of submission.<\/p>\n By the end of December 2024<\/strong> for all newly submitted sequence records, including sequences submitted to GenBank without BioSample references.<\/p>\n Stay up to date<\/h5>\n Follow us on Twitter\u00a0@NCBI<\/a>\u00a0and\u00a0join our mailing list<\/a>\u00a0to keep up to date with\u00a0GenBank, SRA, and other NCBI news.<\/p>\n Questions?<\/h5>\n Please send any comments or questions to info@ncbi.nlm.nih.gov<\/a>.<\/p>\n <\/p>\n","protected":false},"excerpt":{"rendered":" As previously announced, in collaboration with our partners at the International Nucleotide Sequence Database Collaboration (INSDC), we will begin to systematically gather \u2018location of collection\u2019 and \u2018date and time of collection\u2019 for sequence data submitted to GenBank and the Sequence Read Archive (SRA). Gathering information about where and when a biological sample was collected aligns … Continue reading Coming Soon! Including Sample Location and Collection Date and Time for Sequences Submitted to GenBank and SRA<\/span>