Introduction to NGS Analysis in a Cloud Environment for Novice Bioinformaticians:
We are pleased to announce a free workshop in New Orleans, Louisiana April 23 and 24. After a short cloud-onboarding session, early career computational virologists interested in extracting sequences from metagenomic samples will be exposed to new, community-generated tools!
In about a week, NCBI staff will join GeneReviews® on their home turf, Seattle, at the Annual Clinical Genetics Meeting hosted by the American College of Medical Genetics and Genomics (ACMG). While there we will have an exhibit booth (#531) where you can meet our staff, get answers to your questions, and pick-up informative handouts on our various resources for clinical practice.
Also, be sure to visit our two posters on Friday, April 5 from 10:30 AM to 12 PM.
The National Library of Medicine (NLM) seeks a Scientific Director with creative vision and strong leadership to guide its Intramural Research Program. One of the 27 Institutes and Centers (ICs) of the National Institutes of Health (NIH), NLM is a leader in computational health sciences research and the world’s largest biomedical library. The successful candidate will oversee a diverse group of some 150 scientific personnel, developing innovative new approaches to data science, biomedical informatics, and computational biology and their application to open questions in basic molecular biology, genomics, health, and healthcare.
NCBI will be updating the human genome RefSeq annotation more frequently to incorporate improvements made to genes and transcripts by RefSeq curation experts. Faster updates will allow us to include the latest datasets.
In the past, we’ve produced a full re-annotation of the human genome about once a year. The last full annotation, Homo sapiens Annotation Release 109, was in March 2018. A full annotation is produced by two main processes:
Graphs are the name of the game here! The teams will be building graphs, managing coordinates between samples and defining and identifying and marking haplotypes, and looking at population specific variants.
Please follow along on our GitHub, fork and make pull requests during and after the event, and stay tuned for updates on the findings.
RefSeq release 93 is accessible online, via FTP and through NCBI’s Entrez programming utilities, E-utilities.
This full release incorporates genomic, transcript, and protein data available as of March 13, 2019. It contains 192,722,653 records, including 135,670,032 proteins, 25,840,272 RNAs, and sequences from 88,816 organisms.
GenBank release 230.0 (2/15/2019) with 4.74 Terabases and 1.47 billion records is now available from the NCBI FTP site (flatfiles, ASN.1). There are two notable changes with this release. Because we have increased in the target maximum uncompressed file-size, the number of files dropped by about 1,000. We are also now assigning expanded WGS and protein accessions. WGS accessions now may have a six-letter Project Code prefix, a two-digit Assembly-Version number, followed by seven, eight, or nine digits, for example AAAABB010000001. Protein accessions may now have three-letter followed by seven digits, for example EAA0000001. See section 1.3.1 and 1.3.2 of the Release Notes for details.
The NCBI is in Chapel Hill, NC from March 11-13 for the “RNA-Seq in the Cloud” hackathon! We’re working hard to address public metadata in the RNA-Seq space. Please follow along on GitHub, fork and make pull requests during and after the event, and stay tuned for updates on findings.