Tag: NCBI Datasets

RefSeq Release 221

RefSeq Release 221

RefSeq release 221 is now available online and from the FTP site. You can access RefSeq data through NCBI Datasets.

What’s included in this release?

As of November 6, 2023, this full release incorporates genomic, transcript, and protein data containing:

  • 404,657,610 records
  • 300,054,945 proteins
  • 57,882,313 RNAs
  • sequences from 143,819 organisms 

Continue reading “RefSeq Release 221”

New Annotations in RefSeq!

New Annotations in RefSeq!

In July, August, and September, the NCBI Eukaryotic Genome Annotation Pipeline released fifty-six new annotations in RefSeq!

New Annotations
  • Achroia grisella (moth)
  • Acipenser ruthenus (sterlet)
  • Ahaetulla prasina (snake)
  • Alligator mississippiensis (American alligator)
  • Ammospiza caudacuta (bird)
  • Ammospiza nelsoni (bird)
  • Anopheles bellator (mosquito)
  • Anopheles coustani (mosquito)
  • Anopheles ziemanni (mosquito)
  • Arachis stenosperma (eudicot)
  • Carassius carassius (crucian carp)
  • Centropristis striata (black seabass)
  • Cornus florida (flowering dogwood) (pictured)
  • Corylus avellana (European hazelnut)
  • Corythoichthys intestinalis (scribbled pipefish) Continue reading “New Annotations in RefSeq!”
NCBI Datasets: Easily Access and Download Sequence Data and Metadata

NCBI Datasets: Easily Access and Download Sequence Data and Metadata

Effective May 2024, NCBI Datasets will replace legacy Genome and Assembly web resources 

As part of our ongoing effort to enhance your experience and modernize our services, NCBI will gradually replace the legacy Genome and Assembly resources with the newly introduced NCBI Datasets resource. NCBI Datasets is a continually evolving platform designed to provide easy and intuitive access to NCBI’s sequence data and metadata. 

  • The legacy Genome and Assembly web resources will no longer be available after May 2024
  • There will be no changes to how you access the databases using E-Utilities or EDirect 

Continue reading “NCBI Datasets: Easily Access and Download Sequence Data and Metadata”

Introducing the New NCBI Datasets Genome Annotation Table

Introducing the New NCBI Datasets Genome Annotation Table

As part of our ongoing effort to modernize and improve your experience, we are excited to introduce the new NCBI Datasets genome annotation table. You can now quickly and easily access annotated gene and protein sequences annotated by NCBI RefSeq or GenBank submitters.  

Features & Benefits
  • Easier than ever to search and download data for annotated genes  
  • Download gene, transcript and protein sequences, and metadata 
  • Annotation tables are available for ~7500 eukaryotic and ~1.5M prokaryotic annotated genomes   
  • Annotation data is now available for both RefSeq and GenBank submitted annotations 
  • Filter by gene type, gene name, and chromosome or location on the genome 

Continue reading “Introducing the New NCBI Datasets Genome Annotation Table”