GenBank release 233.0 (8/21/2019) is now available on the NCBI FTP site. This release has 6.26 terabases and 1.65 billion records.
The release has 213,865,349 traditional records containing 366.7 billion base pairs of sequence data. There are also 1.07 billion WGS records containing 5.6 trillion base pairs of sequence data, 331.3 million bulk-oriented TSA records containing 294.7 trillion base pairs of sequence data, and 26 million bulk-oriented TLS records containing 10.5 billion base pairs of sequence data.
NCBI’s Genetic Relationship and Fingerprinting (GRAF) tool is a quality assurance tool that can quickly find duplicates and closely related subjects in your data using SNP genotypes.
The population tool GRAF-pop included in GRAF computes subject ancestries using genotypes and normalizes ancestry prediction in large datasets collected across different genotyping platforms, making it possible to generate population frequency based on more than a million dbGaP samples.
Who can use this?
GRAF is a tool for researchers; it is not designed to assess an individual’s ancestry or to find relatives.
You can use this tool against your own large datasets with results generated within hours or minutes, even when there is a very high genotype missing rate to the order of 99%. This tool can check genotype datasets obtained using different chips or platforms, plotting them in the same picture for comparison purposes.
As you may know, we have been offering a new BLAST results (Figure 1) as a test page since April. In response to your positive reception and after incorporating many improvements that you suggested, we made the new results the default today, August 1, 2019.
You will still be able to access to the traditional results for a several months. This will provide you additional time if you need it to adjust your workflows or teaching materials to the new display.