May 20 webinar: Exploring SRA metadata in the cloud with BigQuery

Join us on May 20th to learn how to use Google’s BigQuery to quickly search the data from the Sequence Read Archive (SRA) in the cloud to speed up your bioinformatic research and discovery projects. BigQuery is a tool for exploring cloud-based data tables with SQL-like queries. In this webinar, we’ll introduce you to using BigQuery to mine SRA submitter-supplied metadata and the results of taxonomic analysis for SRA runs. You’ll see real-world case studies that demonstrate how to find key information about SRA runs and identify data sets for your own analysis pipelines.

  • Date and time: Wed, May 20, 2020 12:00 PM – 12:45 PM EDT
  • Register

After registering, you will receive a confirmation email with information about attending the webinar. A few days after the live presentation, you can view the recording on the NCBI YouTube channel. You can learn about future webinars on the Webinars and Courses page.

April 8 Webinar: Accelerate genomics discovery with SRA in the cloud

On Wednesday, April 8, 2019 at 12 PM, NCBI staff will show you how to leverage the cloud to speed up your research and discovery. You’ll be introduced to new and existing tools and data including BigQuery, SRA Toolkit, and more. You’ll hear about real workflows in the cloud featuring an example of the work NCBI was able to accomplish in the cloud using SRA data and a case study from an SRA cloud customer

By the end of this webinar, you will know where to look for new cloud products from NCBI, access help information to get you started, and will see how to run your analyses efficiently in the cloud.

  • Date and time: Wed, Apr 8, 2020 12:00 PM – 12:45 PM EDT
  • Register

After registering, you will receive a confirmation email with information about attending the webinar. A few days after the live presentation, you can view the recording on the NCBI YouTube channel. You can learn about future webinars on the Webinars and Courses page.

Request for proposals: Single Cell in the Cloud codeathon at NYGC in January

The New York Genome Center is hosting an NCBI  Single Cell in the cloud codeathon from January 15-17, 2020. Submissions for project proposals are due December 2nd.

Please submit your proposal and apply here.

What topics are in scope?

This codeathon will focus on single cell data, including RNA, DNA, and chromatin accessibility.  We are particularly interested in proposals for pipelines and analysis of SRA data, data interoperability, and using machine learning techniques in clustering.  We also welcome proposals for tutorial pipelines and educational tools. You will have access to computational resources in the Cloud to turn your idea into a working prototype.   Visit our website for examples of previous codeathon projects.

Continue reading

Virus hunting in the cloud: A hackathon story at ASV 2019

Are you going to ASV 2019?

If you are, join us in a few days for a workshop on the virus hunting hackathon we helped run earlier this year.

Session: Workshop #19: Virus Discovery

Program Number: W-19-8

Time: Sunday, July 21, 7:00 PM CDT

Location: Mayo Auditorium

In this workshop, Dr. Rodney Brister will talk about how 41 scientists from 21 organizations worked to improve the usability of SRA data, identifying datasets that included known viruses and viral signals. Not only is that information now being integrated into a public search interface, but the approach used is also being refined in future hackathons so it can be applied to all SRA datasets.

We hope to see you there!

Bring your own data to the computational virology workshop at LSU New Orleans

Attention all aspirational computational virologists and cloud-curious bioinformaticians! NCBI is hosting a free workshop in New Orleans, Louisiana April 23 and 24.

Choose your own adventure: participants may bring their own data and/or work with public data housed at NCBI.

Day 1 will consist of a short cloud-onboarding session, introduction to Jupyter notebooks, SRA and BLAST intros, and more! On day 2, we’ll roll our sleeves up in a working session around phylogenetic clustering of sequences where we’ll look for unknown viruses.

BYOD (bring your own data) and apply today! Please fill out the application form in its entirety by Tuesday, April 9th.

Attendee insights will be made publicly available on GitHub.

NCBI Computational Virology Workshop at LSU New Orleans!

Introduction to NGS Analysis in a Cloud Environment for Novice Bioinformaticians:

We are pleased to announce a free workshop in New Orleans, Louisiana April 23 and 24.  After a short cloud-onboarding session, early career computational virologists interested in extracting sequences from metagenomic samples will be exposed to new, community-generated tools!

Continue reading

Follow “Pangenomics in the Cloud” hackathon projects on GitHub

NCBI is on the West Coast this week (March 25 – 27) for “Pangenomics in the Cloud,” a three-day hackathon hosted by the University of California, Santa Cruz.

Graphs are the name of the game here! The teams will be building graphs, managing coordinates between samples and defining and identifying and marking haplotypes, and looking at population specific variants.

Please follow along on our GitHub, fork and make pull requests during and after the event, and stay tuned for updates on the findings.

Pangenomics in the Cloud hackathon, March 25-27, 2019

We are pleased to announce the first ever pangenomics, graphs and haplotypes hackathon.

From March 25-27, 2019, the NCBI will help run a bioinformatics hackathon in Santa Cruz, California, hosted by the University of California, Santa Cruz (UCSC).  Potential topics include:

  • Building large scale graphs from pangenomes using several assembly methods
  • Simplification of mapping
  • Resolving haplotypes
  • Identification of population-specific structural variants
  • Defining haplotype-specific expression, visualization, and coordination with the GRC

Continue reading

Announcing the first ever RNA-Seq in the Cloud hackathon!

From March 11-13, 2019, the NCBI will help run a bioinformatics hackathon in the North Carolina Research Triangle hosted by the University of North Carolina, Chapel Hill (UNC).

Potential topics include:

  • technical metadata homogenization
  • a simple interface for using ontologies to make data searches more sensitive and specific
  • automated data analysis and visualization
  • novel isoform identification and comparison

We’re looking for people who have experience in working with subjects like these. If this describes you, please apply!

This event is for researchers, including students and postdocs, who use bioinformatics data or develop pipelines for large scale RNA-Seq analyses from high-throughput experiments. The event is open to anyone selected for the hackathon and willing to travel to UNC. Continue reading