Month: July 2017

NCBI Facilitates Hackathons at the Bio-IT World Conference and Silicon Valley Artificial Intelligence

NCBI usually participates in hackathons as direct organizers and planners. However, NCBI staff recently functioned as facilitators in two hackathons organized by outside groups: one at the Bio-IT World conference, and one at the Silicon Valley Artificial Intelligence (SVAI) incubator.

Continue reading “NCBI Facilitates Hackathons at the Bio-IT World Conference and Silicon Valley Artificial Intelligence” →

Zebrafish (Danio rerio), 11 other organisms in June RefSeq genome annotations

In June, the NCBI Eukaryotic Genome Annotation Pipeline released new annotations in RefSeq for the following organisms, including Danio rerio (zebrafish):

Continue reading “Zebrafish (Danio rerio), 11 other organisms in June RefSeq genome annotations” →

July 19th NCBI Minute: Automate PubMed Searches & Save Citation Collections with My NCBI

Would you like to have new PubMed citations for your topic of interest conveniently appear in your email inbox or quickly collect those records for further review? Join NCBI on July 19, 2017, when we’ll show you how to have your PubMed searches automatically run and the results emailed to you daily, weekly or monthly. You will also learn how to create PubMed collections that you can share with others or keep privately for yourself. Don’t have a My NCBI account yet? Get started at MyNCBI.

Continue reading “July 19th NCBI Minute: Automate PubMed Searches & Save Citation Collections with My NCBI” →

dbSNP architecture redesign supports future human variation data expansion; changes to be introduced over the next year

To continue providing efficient and timely processing, annotation, and dissemination of data, dbSNP’s architecture and process flow have been redesigned. The technical redesign prepares the database for increasing data volumes and providing timely, effective and trustworthy reference SNP results as submission rates continue to increase.

Highlights of the new system include:

Use of data objects instead of a relational database
Improved algorithms for clustering data into unique Reference SNPs
Automation of the entire process to provide timely releases
Guaranteed data consistency across dbSNP data accessed using web-based products or downloaded content, such as VCF and FTP files

Continue reading “dbSNP architecture redesign supports future human variation data expansion; changes to be introduced over the next year” →

GenBank release 220.0 is available via FTP

GenBank release 220.0 (6/18/2017) has 201,663,568 traditional records containing 234,997,362,623 base pairs of sequence data. In addition, there are 487,891,767 WGS records containing 2,164,683,993,369 base pairs of sequence data, 176812130 TSA records containing 158,112,969,073 base pairs of sequence data, and 1,628,475 TLS records containing 824,191,338 base pairs of sequence data.

Continue reading “GenBank release 220.0 is available via FTP” →