Fifteen new NCBI annotations in RefSeq: flies, harbor seal and more

In January and February, the NCBI Eukaryotic Genome Annotation Pipeline released new annotations in RefSeq for the following organisms:

  • Aythya fuligula (tufted duck)
  • Camelus ferus (Wild Bactrian camel)
  • Corvus moneduloides (New Caledonian crow)
  • Coturnix japonica (Japanese quail)
  • Drosophila ananassae (fly)
  • Drosophila virilis (fly)
  • Etheostoma spectabile (orangethroat darter)
  • Hylobates moloch (silvery gibbon)
  • Mustela erminea (ermine)
  • Nematostella vectensis (starlet sea anemone)
  • Nomia melanderi (Alkali bee)
  • Phoca vitulina (harbor seal)
  • Sapajus apella (Tufted capuchin)
  • Thamnophis elegans (Western terrestrial garter snake)
  • Xiphophorus hellerii (green swordtail)

See more details on the Eukaryotic RefSeq Genome Annotation Status page.

Important changes to the genomes FTP site in February

We have added the latest NCBI Eukaryotic Genome Annotation Pipeline results for the more than 580 species that we annotate to the genomes/refseq directory on the genomes FTP area. As we announced in December, we will stop publishing annotation results to the genus_species directories (example: genomes/Xenopus_tropicalis) on the genomes FTP site effective February 1, 2020. We will also move existing genus_species directories to genomes/archive/old_refseq during the month of February.X_t_assemblyFigure 1. The Assembly page for the Xenopus tropicalis UCB Xtro 10.0 (GCF_000004195.4) showing the blue download button. Annotation results such as the RefSeq transcript alignments that can be downloaded from the web page are now also under the genomes/refseq directory on the FTP site. The FTP path to the .bam alignment files is in red.

These FTP changes do not affect the Assembly download function. As always, you can download assembly data using the blue Download button on the web pages (Figure 1).


December 2019 RefSeq annotations: human, Tasmanian devil and more

tasmanian devil sits, looking to the right

In December, the NCBI Eukaryotic Genome Annotation Pipeline released new annotations in RefSeq for the following organisms:

  • Anarrhichthys ocellatus (wolf-eel)
  • Apis florea (little honeybee)
  • Contarinia nasturtii (swede midge)
  • Cucumis sativus (cucumber)
  • Galleria mellonella (greater wax moth)
  • Homo sapiens (human)
  • Nasonia vitripennis (jewel wasp)
  • Oncorhynchus kisutch (coho salmon)
  • Oreochromis aureus (blue tilapia)
  • Piliocolobus tephrosceles (Ugandan red Colobus)
  • Sarcophilus harrisii (Tasmanian devil)
  • Xenopus tropicalis (tropical clawed frog)

See more details on the Eukaryotic RefSeq Genome Annotation Status page.

August-October 2019 RefSeq annotations: mouse, firefly and more

mouse with dark brown fur, eating

Recently, the NCBI Eukaryotic Genome Annotation Pipeline has released new annotations in RefSeq for the following organisms:

  • Aedes albopictus (Asian tiger mosquito)
  • Aquila chrysaetos chrysaetos (golden eagle)
  • Archocentrus centrarchus (flier cichlid)
  • Calypte anna (Anna’s hummingbird)
  • Camarhynchus parvulus (bird)
  • Camelus dromedarius (Arabian camel)
  • Cannabis sativa (hemp)
  • Chanos chanos (milkfish)

Continue reading

New human genome annotation release with MANE Select and other improvements!

There’s a new RefSeq annotation available for the human genome, and it’s quite an update!

About the release

Annotation release 109.20190607 is the first release of our new bimonthly annotation schedule as announced in a previous post.   The annotated sequences are  the latest sequences for the GRCh38, patch 13 assembly, GRCh38.p13 (GCF_000001405.39). The chromosome backbone sequences remain the  same, but we’ve added 45 patch sequences representing novel and improved sequences that the Genome Reference Consortium will incorporate into the primary assembly in the future. The new annotation places the latest curated RefSeq transcripts and functional elements on the genome but keeps the same model dataset as in annotation release 109 except when the models have been replaced by curated RefSeqs or other review. We are also flagging MANE and other RefSeq Select transcripts.  Continue reading for more details on these improvements below. You can download the updated annotation here!

Continue reading

Human genome annotation will be updated every 2 months

NCBI will be updating the human genome RefSeq annotation more frequently to incorporate improvements made to genes and transcripts by RefSeq curation experts. Faster updates will allow us to include the latest datasets.

In the past, we’ve produced a full re-annotation of the human genome about once a year. The last full annotation, Homo sapiens Annotation Release 109, was in March 2018. A full annotation is produced by two main processes:

Continue reading

New RefSeq annotations for big brown bat, peregrine falcon and more

Hibernating brown bat

In January and February, the NCBI Eukaryotic Genome Annotation Pipeline released new annotations in RefSeq for the following organisms:

  • Aphis gossypii (cotton aphid)
  • Balaenoptera acutorostrata scammoni (minke whale)
  • Bombyx mandarina (wild silkworm)
  • Chelonia mydas (green sea turtle)
  • Corapipo altera (white-ruffed manakin)
  • Empidonax traillii (willow flycatcher)
  • Eptesicus fuscus (big brown bat)
  • Eumetopias jubatus (Steller sea lion)
  • Falco cherrug (Saker falcon)
  • Falco peregrinus (peregrine falcon)
  • Marmota flaviventris (yellow-bellied marmot)
  • Monomorium pharaonis (pharaoh ant)
  • Neopelma chrysocephalum (saffron-crested tyrant-manakin)
  • Ovis aries (sheep)
  • Pipra filicauda (wire-tailed manakin)
  • Rhopalosiphum maidis (corn leaf aphid)
  • Solanum pennellii (eudicot)
  • Tupaia chinensis (Chinese tree shrew)
  • Vigna unguiculata (cowpea)
  • Vombatus ursinus (common wombat)
  • Xiphophorus couchianus (Monterrey platyfish)

See more details on the Eukaryotic RefSeq Genome Annotation Status page.