NCBI usually participates in hackathons as direct organizers and planners. However, NCBI staff recently functioned as facilitators in two hackathons organized by outside groups: one at the Bio-IT World conference, and one at the Silicon Valley Artificial Intelligence (SVAI) incubator.
Month: July 2017
Zebrafish (Danio rerio), 11 other organisms in June RefSeq genome annotations
In June, the NCBI Eukaryotic Genome Annotation Pipeline released new annotations in RefSeq for the following organisms, including Danio rerio (zebrafish):
Continue reading “Zebrafish (Danio rerio), 11 other organisms in June RefSeq genome annotations”
July 19th NCBI Minute: Automate PubMed Searches & Save Citation Collections with My NCBI
Would you like to have new PubMed citations for your topic of interest conveniently appear in your email inbox or quickly collect those records for further review? Join NCBI on July 19, 2017, when we’ll show you how to have your PubMed searches automatically run and the results emailed to you daily, weekly or monthly. You will also learn how to create PubMed collections that you can share with others or keep privately for yourself. Don’t have a My NCBI account yet? Get started at MyNCBI.
dbSNP architecture redesign supports future human variation data expansion; changes to be introduced over the next year
To continue providing efficient and timely processing, annotation, and dissemination of data, dbSNP’s architecture and process flow have been redesigned. The technical redesign prepares the database for increasing data volumes and providing timely, effective and trustworthy reference SNP results as submission rates continue to increase.
Highlights of the new system include:
- Use of data objects instead of a relational database
- Improved algorithms for clustering data into unique Reference SNPs
- Automation of the entire process to provide timely releases
- Guaranteed data consistency across dbSNP data accessed using web-based products or downloaded content, such as VCF and FTP files
GenBank release 220.0 is available via FTP
GenBank release 220.0 (6/18/2017) has 201,663,568 traditional records containing 234,997,362,623 base pairs of sequence data. In addition, there are 487,891,767 WGS records containing 2,164,683,993,369 base pairs of sequence data, 176812130 TSA records containing 158,112,969,073 base pairs of sequence data, and 1,628,475 TLS records containing 824,191,338 base pairs of sequence data.
Continue reading “GenBank release 220.0 is available via FTP”