{"id":8357,"date":"2022-05-24T07:56:50","date_gmt":"2022-05-24T11:56:50","guid":{"rendered":"https:\/\/ncbiinsights.ncbi.nlm.nih.gov\/?p=8357"},"modified":"2022-05-24T07:56:50","modified_gmt":"2022-05-24T11:56:50","slug":"new-in-rapt","status":"publish","type":"post","link":"https:\/\/ncbiinsights.ncbi.nlm.nih.gov\/2022\/05\/24\/new-in-rapt\/","title":{"rendered":"New in RAPT: Better taxonomic assignment and GO annotation"},"content":{"rendered":"

We are excited to announce two improvements to the Read assembly and Annotation Pipeline Tool (RAPT<\/a>), which allows you to assemble genomic reads for bacterial or archaeal isolates and annotate their genes at the click of a button.<\/p>\n

Improved taxonomic assignment<\/h4>\n

Now RAPT verifies the scientific name you provide with the reads, and corrects it as needed with the Average Nucleotide Identity (ANI<\/a>) tool, which compares your genome to type strain assemblies in GenBank<\/a> to place it in the taxonomic tree. So, even if you only have a rough idea of the species you have sequenced, input datasets tailored to your genome will be used for the annotation and you will get the best possible gene set from RAPT. <\/p>\n

Gene Ontology<\/h4>\n

Genes are now assigned Gene Ontology<\/a>\u00a0(GO) terms when possible. The terms are derived from the collection of\u00a0Protein Family Models<\/a> (hidden Markov models, BlastRules and conserved domain architectures) used by RAPT to name the proteins. On average, a third of Coding Sequences (CDSs) annotated by RAPT will get at least one GO term. We are actively working on mapping more GO terms to our Protein Family Models, so this percentage will grow with time. See more information in this\u00a0blog post<\/a>.<\/p>\n

If you prefer to run RAPT on your own local machine or on the cloud, please visit our\u00a0github site<\/a>\u00a0to get started.<\/p>\n

==============<\/p>\n

New to RAPT?<\/h4>\n

RAPT is an easy-to-use pilot service for the de novo<\/em>\u00a0assembly and gene annotation of public or private Illumina genomic reads sequenced from bacterial or archaeal isolates. RAPT consists of three major components, the genome assembler\u00a0SKESA<\/a>, the taxonomic assignment tool\u00a0ANI<\/a>\u00a0and the Prokaryotic Genome Annotation Pipeline (PGAP<\/a>), and produces an annotated genome of quality comparable to RefSeq<\/a> in a couple of hours.<\/p>\n

Find more information about RAPT here<\/a> and check out our video!<\/p>\n