<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<HTML>
<HEAD>
<TITLE> [Refseq-announce] Important reminder of upcoming RefSeq FTP changes
</TITLE>
<LINK REL="Index" HREF="index.html" >
<LINK REL="made" HREF="mailto:refseq-announce%40ncbi.nlm.nih.gov?Subject=Re%3A%20%5BRefseq-announce%5D%20Important%20reminder%20of%20upcoming%20RefSeq%20FTP%20changes&In-Reply-To=%3Cmailman.188833.1397770972.8578.refseq-announce%40ncbi.nlm.nih.gov%3E">
<META NAME="robots" CONTENT="index,nofollow">
<style type="text/css">
pre {
white-space: pre-wrap; /* css-2.1, current FF, Opera, Safari */
}
</style>
<META http-equiv="Content-Type" content="text/html; charset=us-ascii">
<LINK REL="Next" HREF="000118.html">
</HEAD>
<BODY BGCOLOR="#ffffff">
<H1>[Refseq-announce] Important reminder of upcoming RefSeq FTP changes</H1>
<B>Public RefSeq Release announcements</B>
<A HREF="mailto:refseq-announce%40ncbi.nlm.nih.gov?Subject=Re%3A%20%5BRefseq-announce%5D%20Important%20reminder%20of%20upcoming%20RefSeq%20FTP%20changes&In-Reply-To=%3Cmailman.188833.1397770972.8578.refseq-announce%40ncbi.nlm.nih.gov%3E"
TITLE="[Refseq-announce] Important reminder of upcoming RefSeq FTP changes">refseq-announce at ncbi.nlm.nih.gov
</A><BR>
<I>Thu Apr 17 17:42:39 EDT 2014</I>
<P><UL>
<LI>Next message: <A HREF="000118.html">[Refseq-announce] Announcing RefSeq release 65
</A></li>
<LI> <B>Messages sorted by:</B>
<a href="date.html#117">[ date ]</a>
<a href="thread.html#117">[ thread ]</a>
<a href="subject.html#117">[ subject ]</a>
<a href="author.html#117">[ author ]</a>
</LI>
</UL>
<HR>
<!--beginarticle-->
<PRE>Please note that several changes to the RefSeq release FTP site will occur with RefSeq release 65, planned for early May. These changes were indicated in previous release announcements.
1. Directory name change: The 'microbial' directory will be removed. Two new directories 'archaea' and 'bacteria' will be added.
2. WGS management change: WGS accessions will no longer be processed on a per-project (WGS prefix) basis. Instead, these accessions will be processed and packaged in the same way as non-WGS accessions. This will significantly reduce the number of files in the /complete/ and (new) /archaea/ and /bacteria/ directories. As a result, there will no longer be a series of files with names like 'microbialNZ_*'. Instead, all WGS scaffolds will be found in concatenated files, just like all other accession data. We will continue to provide a separate file for the WGS master records.
Please note that this change in WGS management will also affect the /refseq/daily/ and /refseq/wgs/ directory areas. This impact was not spelled out in previous emails. Because WGS accessions will be processed in the same way as non-WGS accessions, their updates will now appear in the /daily/ update area. WGS master records will continue to be provided separately from other files because they are special metadata-only records. WGS master files will be provided with names like 'rsnc.wgs_mstr.0403.2014.bna.gz' and 'rsnc.wgs_mstr.0403.2014.gbff.gz' (where &quot;wgs_mstr&quot; indicates the type). These files will be provided in the /refseq/wgs/ directory area.
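
As a rough illustration of the WGS master file naming shown above, the following Python sketch splits such a name into its parts. The function name parse_wgs_master_name and the field labels are hypothetical, the pattern is inferred only from the two example names in this message, and the meaning of the two numeric fields is not stated here, so they are kept as plain strings.

```python
import re

# Pattern inferred from the two example names in this announcement only.
# Assumption: four parts around the literal "wgs_mstr" tag, ending in ".gz".
WGS_MASTER_RE = re.compile(r"^([a-z]+)\.wgs_mstr\.(\d{4})\.(\d{4})\.([a-z]+)\.gz$")


def parse_wgs_master_name(name):
    """Return a dict of name parts, or None if the name does not match."""
    match = WGS_MASTER_RE.match(name)
    if match is None:
        return None
    prefix, first_number, second_number, file_format = match.groups()
    return {
        "prefix": prefix,                 # e.g. "rsnc"
        "first_number": first_number,     # e.g. "0403" (meaning assumed, not documented here)
        "second_number": second_number,   # e.g. "2014"
        "format": file_format,            # e.g. "bna" or "gbff"
    }


print(parse_wgs_master_name("rsnc.wgs_mstr.0403.2014.bna.gz"))
```

A name that does not follow this shape returns None, so a script can tell master-record files apart from the concatenated data files without guessing.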
3. File name &amp; Content change:
This change will occur in both the /refseq/daily/ and /refseq/release/release-catalog/ directory areas.
*WP2genomic.mapping.gz will be renamed *AutonomousProtein2Genomic.gz
*multispecies_WP_accession_to_taxname.gz will be renamed *MultispeciesAutonomousProtein2taxname.gz
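
Scripts that look for the old names could translate them with something like the Python sketch below. The helper name new_catalog_name and the "example." prefix in the usage line are hypothetical; only the two suffix pairs come from this announcement, and the leading '*' is treated as "any prefix".

```python
# Old suffix to new suffix, copied from the two renames listed above.
RENAMED_SUFFIXES = {
    "WP2genomic.mapping.gz": "AutonomousProtein2Genomic.gz",
    "multispecies_WP_accession_to_taxname.gz": "MultispeciesAutonomousProtein2taxname.gz",
}


def new_catalog_name(old_name):
    """Map an old file name to its new name; return other names unchanged."""
    for old_suffix, new_suffix in RENAMED_SUFFIXES.items():
        if old_name.endswith(old_suffix):
            # Keep whatever precedes the old suffix and swap in the new one.
            return old_name[: len(old_name) - len(old_suffix)] + new_suffix
    return old_name


# "example." is a made-up prefix used only for illustration.
print(new_catalog_name("example.WP2genomic.mapping.gz"))
```

Names that match neither suffix pass through untouched, so the helper can be applied to a whole directory listing.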
Both file names will be changed to reflect more accurately the terminology used for the autonomous nonredundant protein dataset that uses the 'WP_' accession prefix. In addition, the content of the AutonomousProtein2Genomic file will be expanded to include:
* protein accession.version
* protein gi
* genomic accession.version (on which the autonomous WP protein is annotated)
* genomic gi
* genomic annotated strain-level taxid
* genomic species-level taxid
* genomic BioSample ID if available
* genomic organism name (e.g., species + strain)
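
The column list above could be read with a Python sketch along these lines. It assumes the file is tab-delimited, has one record per line with the columns in the order listed, and may carry comment lines starting with '#'; none of these details is confirmed in this message. The names FIELDS, Record, parse_rows, and read_mapping are hypothetical, and the sample line uses invented placeholder values rather than real RefSeq data.

```python
import gzip
from collections import namedtuple

# Column labels are hypothetical; their order follows the list above (assumed).
FIELDS = [
    "protein_acc", "protein_gi", "genomic_acc", "genomic_gi",
    "strain_taxid", "species_taxid", "biosample", "organism",
]
Record = namedtuple("Record", FIELDS)


def parse_rows(lines):
    """Yield one Record per tab-separated data line, skipping '#' comments."""
    for line in lines:
        line = line.rstrip("\r\n")
        if not line or line.startswith("#"):
            continue
        row = line.split("\t")
        if len(row) != len(FIELDS):
            raise ValueError("expected %d columns, got %d" % (len(FIELDS), len(row)))
        yield Record(*row)


def read_mapping(path):
    """Read a gzip-compressed mapping file from a local path."""
    with gzip.open(path, "rt") as handle:
        yield from parse_rows(handle)


# Invented placeholder values, not a real record.
SAMPLE_LINE = "WP_000000000.1\t1\tNZ_XXXX00000000.1\t2\t3\t4\t\tExample species strain X\n"
for record in parse_rows(["# comment line\n", SAMPLE_LINE]):
    print(record.protein_acc, record.genomic_acc, record.organism)
```

The BioSample column is left as an empty string when no ID is available, which matches the "if available" wording in the list.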
Kim D. Pruitt, Ph.D
NCBI/NLM/NIH/DHHS
45 Center Drive
Building 45 Room 4AS47B MSC 6513
Bethesda, MD 20892-6513
Phone (301)435-5898
</PRE>
<!--endarticle-->
<hr>
<a href="http://www.ncbi.nlm.nih.gov/mailman/listinfo/refseq-announce">More information about the RefSeq-announce
mailing list</a><br>
</body></html>