Tag: RefSeq

Now Available: RefSeq Release 236

RefSeq release 236 is now available online and from the FTP site! You can access RefSeq data through NCBI Datasets. The release is provided in several directories, both as a complete dataset and divided into logical groupings.

What’s included in this release? 

As of July 6, 2026, this full release incorporates genomic, transcript, and protein data containing:  

629,953,391 records 
482,864,455 proteins 
83,806,434 RNAs 
Sequences from 182,465 organisms

Now Available! NCBI Hidden Markov Models (HMM) Release 20.0

Download release 20.0 of the NCBI protein profile Hidden Markov models (HMMs) used by the Prokaryotic Genome Annotation Pipeline (PGAP). You can search this collection against your favorite prokaryotic proteins to identify their function using the HMMER sequence analysis package.  
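
A search like the one described above could be scripted as follows. This is a minimal sketch, not NCBI's own workflow: it assumes HMMER's `hmmsearch` program and its `--tblout` tabular output, and the file names, the E-value threshold of 1e-5, and the model name and accession in the sample are all hypothetical placeholders.

```python
def build_hmmsearch_command(hmm_file, protein_fasta, tblout_path, evalue=1e-5):
    """Build the argument list for an hmmsearch run with per-target table output.

    The paths and the E-value threshold are illustrative assumptions.
    """
    return [
        "hmmsearch",
        "--tblout", tblout_path,  # write one line per matching protein
        "-E", str(evalue),        # report hits at or below this E-value
        hmm_file,
        protein_fasta,
    ]


def parse_tblout(text):
    """Parse hmmsearch --tblout text into a list of hit dictionaries.

    Comment lines start with '#'. Each data line has 18 whitespace-separated
    columns followed by a free-text description of the target.
    """
    hits = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split(maxsplit=18)
        if len(fields) < 18:
            continue  # skip malformed lines
        hits.append({
            "target": fields[0],           # protein sequence name
            "query": fields[2],            # HMM name
            "query_accession": fields[3],  # HMM accession
            "evalue": float(fields[4]),    # full-sequence E-value
            "score": float(fields[5]),     # full-sequence bit score
            "description": fields[18].strip() if len(fields) > 18 else "",
        })
    return hits
```

To run the search, pass the list from `build_hmmsearch_command` to `subprocess.run(..., check=True)`, then read the table file and hand its contents to `parse_tblout`.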

What’s new?  

Release 20.0 contains:  

18,950 HMMs maintained by NCBI  
497 new HMMs since release 19.0

Now Available: RefSeq Release 235

RefSeq release 235 is now available online and from the FTP site! You can access RefSeq data through NCBI Datasets. The release is provided in several directories, both as a complete dataset and divided into logical groupings.

What’s included in this release? 

As of May 11, 2026, this full release incorporates genomic, transcript, and protein data containing:  

616,942,961 records 
473,570,633 proteins 
81,124,747 RNAs 
Sequences from 180,620 organisms
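
For a sense of how much the collection grew between releases, the counts listed for releases 235 and 236 can be compared directly. The sketch below uses only the figures quoted in these announcements; the dictionary layout and the function name are illustrative choices.

```python
# Counts as quoted in the release 235 and release 236 announcements.
RELEASES = {
    235: {"records": 616_942_961, "proteins": 473_570_633,
          "rnas": 81_124_747, "organisms": 180_620},
    236: {"records": 629_953_391, "proteins": 482_864_455,
          "rnas": 83_806_434, "organisms": 182_465},
}


def release_growth(older, newer):
    """Return the absolute increase in each count from one release to the next."""
    return {name: newer[name] - older[name] for name in older}


growth = release_growth(RELEASES[235], RELEASES[236])
for name, delta in growth.items():
    percent = 100 * delta / RELEASES[235][name]
    print(f"{name}: +{delta:,} ({percent:.2f}%)")
```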

New Data Available! Access Hantavirus Sequences at NCBI

Sequence data from the recent Andes hantavirus outbreak are now available through NLM’s NCBI resources, including the NCBI Virus web interface and the NCBI Datasets command-line tool. These data were submitted by the University Hospitals of Geneva.

Access through NCBI Virus

To find sequence records from 2026, search for “Orthohantavirus andesense” in NCBI Virus and apply the “Collection Date” filter. For a quick overview of the Andes hantavirus data available through GenBank, visit the NCBI Virus Outbreak Statistics page and select Andes hantavirus. The page shows the collection location and host for recently collected samples.

Now Available: Updated Bacterial and Archaeal Reference Genome Collection

Download the updated bacterial and archaeal reference genome collection! We built this collection of 23,063 genomes by selecting the “best” genome assembly for each species among the 450,000+ prokaryotic genomes in RefSeq. 

What’s new? 

Two species are represented in this collection for the first time 
283 species are represented by a better assembly
Six species were removed because of changes in NCBI Taxonomy or uncertainty in their species assignment

Now Available: RefSeq Release 234 

RefSeq release 234 is now available online and from the FTP site! You can access RefSeq data through NCBI Datasets. The release is provided in several directories, both as a complete dataset and divided into logical groupings.

Now Available: RefSeq Release 233

RefSeq release 233 is now available online and from the FTP site! You can access RefSeq data through NCBI Datasets. The release is provided in several directories, both as a complete dataset and divided into logical groupings.

What’s included in this release?

As of January 26, 2026, this full release incorporates genomic, transcript, and protein data containing:

578,285,616 records
442,943,508 proteins
76,278,418 RNAs
Sequences from 174,157 organisms

Now Available! NCBI Hidden Markov Models (HMM) Release 19.0

Download release 19.0 of the NCBI protein profile Hidden Markov models (HMMs) used by the Prokaryotic Genome Annotation Pipeline (PGAP). You can search this collection against your favorite prokaryotic proteins to identify their function using the HMMER sequence analysis package. 

What’s new? 

Release 19.0 contains: 

18,513 HMMs maintained by NCBI 
465 new HMMs since release 18.0

An Updated Bacterial and Archaeal Reference Genome Collection is Available!

Download the updated bacterial and archaeal reference genome collection! We built this collection of 22,420 genomes by selecting the “best” genome assembly for each species among the 450,000+ prokaryotic genomes in RefSeq.

What’s new?

One species is represented in this collection for the first time
323 species are represented by a better assembly
Six species were removed because of changes in NCBI Taxonomy or uncertainty in their species assignment

An Updated Bacterial and Archaeal Reference Genome Collection is Available!

Download the updated bacterial and archaeal reference genome collection! We built this collection of 22,082 genomes by selecting the “best” genome assembly for each species among the 440,000+ prokaryotic genomes in RefSeq.

What’s new?

28 species are represented in this collection for the first time
228 species are represented by a better assembly
Six species were removed because of changes in NCBI Taxonomy or uncertainty in their species assignment
