GenBank release 234.0 (10/14/2019) is now available on the NCBI FTP site. This release has 6.69 trillion bases and 1.68 billion records.
The release has 216,763,706 traditional records containing 386,197,018,538 base pairs of sequence data. There are also 1,097,629,174 WGS records containing 5,985,250,251,028 base pairs of sequence data, 342,811,151 bulk-oriented TSA records containing 305,371,891,408 base pairs of sequence data, and 27,460,978 bulk-oriented TLS records containing 10,848,455,369 base pairs of sequence data.
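The headline totals can be cross-checked against the per-component figures. The following is a minimal sketch that sums the four components quoted above; the dictionary layout and variable names are illustrative assumptions, not part of any NCBI tooling.

```python
# Per-component figures copied from the announcement text above.
# The structure and names here are illustrative only.
components = {
    "traditional": {"records": 216_763_706, "bases": 386_197_018_538},
    "WGS": {"records": 1_097_629_174, "bases": 5_985_250_251_028},
    "TSA": {"records": 342_811_151, "bases": 305_371_891_408},
    "TLS": {"records": 27_460_978, "bases": 10_848_455_369},
}

total_records = sum(c["records"] for c in components.values())
total_bases = sum(c["bases"] for c in components.values())

# Print exact totals alongside the rounded headline form.
print(f"{total_records:,} records (~{total_records / 1e9:.2f} billion)")
print(f"{total_bases:,} bases (~{total_bases / 1e12:.2f} trillion)")
```

The sums round to 1.68 billion records and 6.69 trillion bases, matching the summary figures.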
During the 57 days between the close dates for GenBank releases 233.0 and 234.0, the traditional portion of GenBank grew by 19,463,100,909 base pairs and 2,898,357 sequence records. During that same period, 66,596 records were updated. An average of 52,017 traditional records were added and/or updated per day.
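The daily average follows from the added and updated record counts divided by the 57-day interval. A short sketch of that arithmetic, with variable names chosen here for illustration:

```python
# Figures for the traditional portion, taken from the paragraph above.
days_between_releases = 57
records_added = 2_898_357
records_updated = 66_596

# Added and updated records are combined before averaging.
daily_average = (records_added + records_updated) / days_between_releases
print(round(daily_average))
```

Rounded to the nearest whole record, this gives the 52,017 quoted above.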
Between releases 233.0 and 234.0, the WGS component of GenBank grew by 399,327,917,868 base pairs and by 22,356,959 sequence records. The TSA component of GenBank grew by 10,644,726,229 base pairs and by 11,463,344 sequence records. The TLS component of GenBank grew by 316,654,540 base pairs and by 1,097,033 sequence records.
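For readers who want relative rather than absolute growth, the release 233.0 sizes can be derived by subtracting the growth figures from the current totals. This is a rough sketch under the assumption that growth is simply the difference between the two releases; the names are hypothetical.

```python
# Current base-pair totals (release 234.0) and growth since 233.0,
# both copied from the announcement. Names are illustrative.
current_bases = {
    "WGS": 5_985_250_251_028,
    "TSA": 305_371_891_408,
    "TLS": 10_848_455_369,
}
growth_bases = {
    "WGS": 399_327_917_868,
    "TSA": 10_644_726_229,
    "TLS": 316_654_540,
}

previous_bases = {}
percent_growth = {}
for name, now in current_bases.items():
    # Assumes growth = current size - previous size.
    previous_bases[name] = now - growth_bases[name]
    percent_growth[name] = 100 * growth_bases[name] / previous_bases[name]
    print(f"{name}: {previous_bases[name]:,} bp before, +{percent_growth[name]:.2f}%")
```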
The total number of sequence data files increased by 76 with this release. The new files, by division, are as follows:
- BCT: 22 new files, now a total of 387
- CON: 3 new files, now a total of 211
- INV: 33 new files, now a total of 110
- PAT: 3 new files, now a total of 201
- PHG: 1 new file, now a total of 4
- PLN: 5 new files, now a total of 186
- ROD: 1 new file, now a total of 18
- VRT: 8 new files, now a total of 165
Please read section 1.3 of the Release Notes for more information.
For downloading purposes, please keep in mind that the uncompressed GenBank release 234.0 flatfiles require roughly 1091 GB (sequence files only). The ASN.1 data require approximately 827 GB.
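Before starting a download, it may help to confirm that the target volume has enough free space. The sketch below uses Python's standard `shutil.disk_usage`; the helper name `has_room` is hypothetical, and treating 1 GB as 10^9 bytes is an assumption, since the announcement does not say whether decimal or binary gigabytes are meant.

```python
import shutil

# Approximate uncompressed sizes from the announcement, in GB.
FLATFILE_GB = 1091  # flatfiles, sequence files only
ASN1_GB = 827       # ASN.1 data


def has_room(path, required_gb, bytes_per_gb=10**9):
    """Return True if the filesystem holding `path` has enough free space.

    bytes_per_gb defaults to decimal gigabytes, which is an assumption.
    """
    free_bytes = shutil.disk_usage(path).free
    return free_bytes >= required_gb * bytes_per_gb


if __name__ == "__main__":
    # "." stands in for the intended download directory.
    print("Room for flatfiles:", has_room(".", FLATFILE_GB))
    print("Room for ASN.1 data:", has_room(".", ASN1_GB))
```

These figures cover the uncompressed data only, so leave additional headroom if the compressed archives are kept alongside the extracted files.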
More information about GenBank release 234.0 is available in the release notes, as well as in the README files in the genbank and ASN.1 (ncbi-asn1) directories on FTP.