Announcing the first ever RNA-Seq in the Cloud hackathon!


From March 11-13, 2019, the NCBI will help run a bioinformatics hackathon in the North Carolina Research Triangle hosted by the University of North Carolina, Chapel Hill (UNC).

Potential topics include:

  • technical metadata homogenization
  • a simple interface for using ontologies to make data searches more sensitive and specific
  • automated data analysis and visualization
  • novel isoform identification and comparison

We’re looking for people who have experience in working with subjects like these. If this describes you, please apply!

This event is for researchers, including students and postdocs, who use bioinformatics data or develop pipelines for large scale RNA-Seq analyses from high-throughput experiments. The event is open to anyone selected for the hackathon and willing to travel to UNC.

Florida (USF) Biological Data Science “IronHack” February 25-27, 2019


From February 25-27, 2019, NCBI will help with a Data Science hackathon at USF in Tampa, Florida!

The hackathon will focus on the genomics of Iron-linked Rare Diseases as well as large scale RNA-Seq indexing and analysis. This event is for researchers, including students and postdocs, who have already engaged in the use of large datasets or in the development of pipelines for analyses from high-throughput experiments. Some projects are available to other non-scientific developers, mathematicians, or librarians.

The event is open to anyone selected for the hackathon and willing to travel to Tampa.

Participants will be organized into five to eight teams of five to six individuals each. These teams will build or expand on pipelines and tools to analyze large datasets within a cloud infrastructure. Example subjects for such hackathons include:

  • Integrative pipelines to analyze large scale RNA-Seq experiments
  • Visualization tools for mapping phenotypes to genotypes
  • Rapid clinical diagnostics tools
  • Structural variant mining with single molecule sequencing data

Please see the application form for more details and additional projects. The project list will continue to evolve and will be updated on the application form.

Save and Share in PubMed Labs!


We’ve recently added save and share options to PubMed Labs. From your PubMed Labs search results list, you can now use the ‘Save’ button to save a selection of results in a variety of formats, including Summary and Abstract. You can also use the ‘Email’ button to share a selection of results, including abstracts, with colleagues.

Figure 1. Click on the ‘Share’ button to share to Twitter and Facebook.

Improved ClinVar search quickly connects you to information about variants


If you’ve been searching in ClinVar, you might have noticed search improvements introduced in December that reliably connect you with information on your variant of interest. ClinVar has broadened its search capability to accept many different ways of expressing the same variation, including variation described on RefSeq transcripts and proteins. If your variant expression is not reported in ClinVar, we alert you to other variants at the same genomic location or link you to related information in other NCBI resources such as dbSNP, LitVar, and PubMed. ClinVar will also now interpret expressions that contain minor errors or warn you about improper syntax that it cannot interpret.

Figure 1. Improved search results in ClinVar showing mapping of an HGVS expression to the equivalent variant in ClinVar.

Here are some example queries that show the improved search results.

NM_001318787.1:c.2258G>A – an HGVS expression that is not in ClinVar, although ClinVar has an alternate expression for the equivalent variant (Figure 1).

NM_004958.3:c.7365C>A – a variant that is not in ClinVar, although another variant at the same genomic location is.

NM_002113.2:c.19delG – a variant that is not in ClinVar, although additional information about it is available in other databases.
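
To make the structure of these example queries concrete, here is a minimal Python sketch that splits each expression into its reference sequence accession, coordinate type, and change description. It is an illustration only: the `split_hgvs` helper and its regular expression are hypothetical, cover just the simple `accession:type.change` shape seen in the three queries above, and are not how ClinVar itself interprets HGVS.

```python
import re
from typing import NamedTuple


class HgvsParts(NamedTuple):
    accession: str   # versioned reference sequence, e.g. "NM_001318787.1"
    coord_type: str  # single-letter coordinate type, e.g. "c" for coding DNA
    change: str      # description of the change, e.g. "2258G>A"


# Simplified pattern (an assumption, not the full HGVS grammar):
# two-letter prefix, underscore, digits, ".version", ":", type letter, ".", change.
_HGVS_RE = re.compile(
    r"^(?P<acc>[A-Z]{2}_\d+\.\d+):(?P<type>[cgmnpr])\.(?P<change>\S+)$"
)


def split_hgvs(expression: str) -> HgvsParts:
    """Split a simple HGVS-style expression into its three parts."""
    match = _HGVS_RE.match(expression.strip())
    if match is None:
        raise ValueError(f"not a simple HGVS expression: {expression!r}")
    return HgvsParts(match["acc"], match["type"], match["change"])


# The three example queries from the post.
for query in (
    "NM_001318787.1:c.2258G>A",
    "NM_004958.3:c.7365C>A",
    "NM_002113.2:c.19delG",
):
    print(split_hgvs(query))
```

A check like this can catch obviously malformed input before you paste a query into the search box, but it is far stricter and simpler than ClinVar's own handling, which the post says tolerates minor errors.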

We welcome your feedback on your search experience and any additional ideas on how to improve searching in ClinVar.

February 6 Webinar: New Variation Services for Normalizing, Remapping, and Annotating Variants


Join us on Wednesday, February 6, 2019, when NCBI staff will show you how to use a new set of NCBI variation services that rely on a variant data model called SPDI (Sequence Position Deletion Insertion). These services and the data model allow you to interconvert, map, and disambiguate variants in standard formats (RefSNP accessions, HGVS, and VCF). Unlike many current variant notation systems, SPDI provides unambiguous, machine-readable definitions of variants. SPDI powers not only the SNP build and mapping procedures at NCBI but also the variant sensors that are active in the global search and ClinVar. These services and the notation system provide valuable new tools for people who work with sequence variants.

Date and time: Wed, Feb 6, 2019 12:00 PM – 12:30 PM EST

Register

After registering, you will receive a confirmation email with information about attending the webinar. A few days after the live presentation, you can view the recording on the NCBI YouTube channel. You can learn about future webinars on the Webinars and Courses page.
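
As a rough illustration of the SPDI model named in this announcement, the Python sketch below treats a variant as four fields: sequence, position, deletion, and insertion. It assumes the colon-separated text form `sequence:position:deletion:insertion` with a 0-based position. The `Spdi` type, the `parse_spdi` and `format_spdi` helpers, and the example variant are hypothetical and are not part of the NCBI services.

```python
from typing import NamedTuple


class Spdi(NamedTuple):
    sequence: str   # reference sequence accession
    position: int   # assumed 0-based start of the deleted segment
    deletion: str   # deleted reference bases
    insertion: str  # inserted bases


def parse_spdi(text: str) -> Spdi:
    """Parse 'sequence:position:deletion:insertion' into an Spdi tuple."""
    parts = text.strip().split(":")
    if len(parts) != 4:
        raise ValueError(f"expected 4 colon-separated fields: {text!r}")
    sequence, position, deletion, insertion = parts
    if not sequence or not position.isdigit():
        raise ValueError(f"invalid sequence or position: {text!r}")
    return Spdi(sequence, int(position), deletion, insertion)


def format_spdi(variant: Spdi) -> str:
    """Serialize an Spdi tuple back to its colon-separated form."""
    return (
        f"{variant.sequence}:{variant.position}:"
        f"{variant.deletion}:{variant.insertion}"
    )


# Made-up example variant, for illustration only.
example = parse_spdi("NC_000001.11:99:A:G")
print(example)
print(format_spdi(example))
```

Because every field is explicit, the same variant always serializes to the same string, which is the kind of unambiguous, machine-readable definition the announcement describes.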

RefSeq release 92 updates 10,000 human transcripts


RefSeq release 92 is accessible online, via FTP and through NCBI’s Entrez programming utilities, E-utilities.

This full release incorporates genomic, transcript, and protein data available as of January 4, 2019, and contains 185,738,687 records, including 130,366,644 proteins, 25,088,890 RNAs, and sequences from 86,867 organisms. The release is provided in several directories as a complete dataset and as divided by logical groupings.
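
For readers who want to try the E-utilities route, the sketch below builds an EFetch request URL for a single RefSeq transcript in FASTA format. It only constructs the URL and makes no network call. The `refseq_fasta_url` helper is hypothetical, and the accession is borrowed from the ClinVar examples elsewhere on this page rather than from the release notes.

```python
from urllib.parse import urlencode

# Base address of the E-utilities EFetch endpoint.
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def refseq_fasta_url(accession: str, db: str = "nuccore") -> str:
    """Build an EFetch URL that requests one record as plain-text FASTA."""
    params = {
        "db": db,            # nucleotide database; use "protein" for proteins
        "id": accession,     # versioned accession
        "rettype": "fasta",  # return the sequence in FASTA format
        "retmode": "text",   # as plain text
    }
    return f"{EFETCH_URL}?{urlencode(params)}"


print(refseq_fasta_url("NM_004958.3"))
```

Opening the printed URL in a browser, or fetching it with any HTTP client, should return the record if that accession version is available.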

dbSNP build 152 uses SPDI variant notation


dbSNP build 152 is a small incremental update from build 151 provided for you to begin testing and integrating the new build products into your workflow. Build 152 uses the new system with SPDI variant notation and is now available on FTP and the new RefSNP webpage.

The release notes have more information about what’s new in build 152. If you have any questions or comments, send us an email.

Join NCBI at PAG in San Diego, January 12–16, 2019


Next week, NCBI staff will attend the Plant and Animal Genome (PAG) Conference. We have several activities planned, including 1 booth (#223), 4 workshops, 1 talk and 2 posters.

Read on to learn more about what you can look forward to if you’re attending PAG this year. (Note: The listed times are Pacific time.)

Apply now to join the Seattle Biological Data Science FHackathon February 4-6, 2019


From February 4-6, 2019, the NCBI will help with a data science hackathon at the Fred Hutchinson Cancer Research Center in Seattle. To apply, complete this form (approximately 10 minutes to complete). Initial applications are due Friday, January 11th by 11 pm ET.

The hackathon will focus on genomics as well as general data science. This event is for researchers, including students and postdocs, who have already engaged in the use of large datasets or in the development of pipelines for analyses from high-throughput experiments. Some projects are available to other non-scientific developers, mathematicians, or librarians.
