Tag: Basic Local Alignment Search Tool (BLAST)

Web IgBLAST can now determine immunoglobulin isotypes

We have added a new function to IgBLAST on the Web. You can now search immunoglobulin (Ig) nucleotide sequences against the Constant region (C) gene database (Figure 1) to determine the Ig isotypes including subtypes (IgM, IgG, IgA1, etc.). The isotype information is reported in the rearrangement summary table, and the C gene region is displayed in the alignment section. This feature is now available on the IgBLAST web service for human and mouse sequences with possible expansion to other organisms in the future.  The feature is not yet implemented for the standalone IgBLAST package.

Figure 1.  IgBLAST constant region database selection and rearrangement summary table showing the top C gene match, IgHM in this case.  The NCBI C genes database is based on the the current NCBI human Reference Genome annotation.

NCBI events at the Bioinformatics Open Science Conference 2021 (BOSC 2021)

NCBI events at the Bioinformatics Open Science Conference 2021 (BOSC 2021)

Come visit us virtually to learn about new NCBI data access, tools and best practices at the Bioinformatics Open Science Conference  part of the ISMB/ECCB online conference from July 29 – 30, 2021. We will be presenting virtual posters on NCBI resources, offering a Birds of a Feather discussion, and participating in the BOSC  (CoFest) following the conference where you can take part in a hands-on evaluation of ElasticBLAST.

NCBI Posters, July 29, 2021, 11:20 – 12:20 PM EDT

All posters will be presented on Thursday afternoon. You can see complete abstracts on the ISMB/ECCB BOSC schedule.

Nuala O’Leary will talk about NCBI Datasets, a new resource for fast, easy access to NCBI sequence data.  You will learn about the new interface and new tools to access reference genomes, genes, and orthologs using web-based and programmatic tools.

Adelaide Rhodes will present Open access NCBI cloud resources to accelerate scientific insights where you can learn about recent developments in transferring > 20 petabytes of NCBI Sequence Read Archive (SRA) data to the cloud.

Deacon Sweeney will describe the web RAPT service for assembling and annotating bacterial genomes at the click of a button in RAPT, The Read assembly and Annotation Pipeline Tool: building a prokaryotic genome annotation package for users of all backgrounds.

Roberto Vera Alvarez will talk about best practices for using cloud tools for transcriptomics in his poster Transcriptome annotation in the cloud: complexity, best practices, and cost.

Greg Boratyn will discuss improvements to the BLAST-based short read aligner, Magic-Blast, in Recent improvements in Magic-BLAST 1.6.

Visit Christiam Camacho’s poster ElasticBLAST: Using the power of the cloud to speed up science to get an introduction to  ElasticBLAST, a Kubernetes-based approach for high throughput BLAST tasks. Join us following the conference in the CoFest to try out ElasticBLAST yourself and provide input. See the section on the CoFest below and our companion post.

Birds of a Feather, July 29, 2021, 11:20 – 12:20 PM EDT

We will host a Birds of Feather public feedback session on Thursday, where you can provide feedback and participate in discussions on all aspects of NCBI’s new data access options: NCBI Datasets, SRA, BLAST, and the Genome Data Viewer (GDV) — our genome browser for sequence visualization. We welcome your input!  Come and see us!

CollaborationFest (CoFest), July 31 – August 1, 2021

The ElasticBlast team will attend the BOSC CoFest following the conference. Sign up to participate on July 31 and August 1 to get an in-depth orientation and opportunity to test the capabilities of ElasticBlast on the Amazon Web Services (AWS) cloud. You do not have to register for the conference to attend the CoFest. See our post on the CoFest for more information.

 

Try out ElasticBLAST at the BOSC2021 CoFest!

Try out ElasticBLAST at the BOSC2021 CoFest!

Join the BLAST team at the virtual CollaborationFest (July 31 -August 1, 2021) after the BOSC 2021 conference to help test and improve ElasticBLAST, a new cloud-based tool designed to speed up high throughput BLAST searches. We would love to have your help with real world testing of our alpha release of ElasticBLAST with you own workflows and data. You may sign up for the CoFest even if you aren’t registered for BOSC 2021.

Here are suggestions for how you can participate. See the FAQs below for additional information.

  1. Try it out and let us know how well it works. You can be blunt.
  2. Help us improve the documentation.
  3. Write a script to make ElasticBLAST part of your workflow.
  4. Try to process ElasticBLAST results with cloud-native tools. Here is an example.
  5. Bring your own high throughput BLAST search problem to use with ElasticBLAST!  Please discuss it with us first to make sure you don’t blow our budget and get the ElasticBLAST team in trouble!

Continue reading “Try out ElasticBLAST at the BOSC2021 CoFest!”

Introducing the new NCBI Datasets Genomes page

The updated NCBI Datasets Genomes page now has genome data for all domains of life, including bacterial and viral genomes.

The genomes table (Figure 1) now offers filters for:

  • Reference genomes — switch it on to only show reference or representative genomes
  • Annotated — switch it on to only show annotated genomes
  • Assembly level — use the assembly level slider to select higher-quality genomes
  • Year released — use the slider to limit your results to recent genomes

In addition, the new Actions column connects you to NCBI’s Genome Data Viewer, BLAST, and Assembly. The Text filter box lets you search by the name of the assembly, species/infraspecies, or submitter.Figure 1. The new Datasets Genomes page with primate assemblies showing the STATUS switches (reference genomes, annotated); expanded filters section with ASSEMBLY LEVEL and YEAR RELEASED sliding selectors; and the Actions column menu with access to Assembly details, BLAST, the Genome Data Viewer, and Download options. Continue reading “Introducing the new NCBI Datasets Genomes page”

BLAST+ 2.12.0 now available with more efficient multithreaded searches

BLAST+ 2.12.0  programs feature better multithreaded searches and support a different threading model, threading by query, that can be more efficient in some situations.  The new release is also fully compatible the increase in the numeric range for the GI identifier, which will take effect in the nucleotide database later this year.  The list below shows details of the new features and bug fixes.  You can download the new BLAST release from the FTP site.

Continue reading “BLAST+ 2.12.0 now available with more efficient multithreaded searches”

June 2 Webinar: Quickly upload and view your own data in genomic context at NCBI

June 2 Webinar: Quickly upload and view your own data in genomic context at NCBI

Join us on June 2, 2021 at 12PM eastern time to learn how to how to upload and display your own genomic data in the context of annotated genome assemblies. You will use the Genome Data Viewer and the Sequence viewer to visualize your own uploaded data (indexed BAM, VCF, BED, wig, GFF formats), data from public track hubs, and your BLAST and Primer-BLAST results. You will also learn to take advantage of features of the viewers including optimizing display settings, sharing a view with collaborators, exporting images, and downloading genes or other features in the view.

  • Date and time: Wed, June 2, 2021 12:00 PM – 12:45 PM EDT
  • Register

After registering, you will receive a confirmation email with information about attending the webinar. A few days after the live presentation, you can view the recording on the NCBI webinars playlist on the NLM YouTube channel. You can learn about future webinars on the Webinars and Courses page.

Magic-BLAST version 1.6.0 is here!

Magic-BLAST version 1.6.0 is here!

We’ve just released  a new version (1.6.0) of Magic-BLAST, the BLAST-based next-gen alignment tool, with these improvements:

  • Usage reporting — you can help improve Magic-BLAST by sharing limited information about your search. The BLAST User Manual has details on the information collected, how it is used, and how to opt-out.
  • Magic BLAST can access NCBI SRA next-gen reads from the cloud when  you use the -sra or -sra_batch options.  See the Magic-BLAST cookbook for more details.
  • NCBI taxonomy IDs are reported in SAM output if they are present in the target BLAST database.
  • You can get unaligned reads reported separately from the aligned ones by using the -out_unaligned <file name> option.  You can also select the format ( SAM, tabular, or FASTA) with the -unaligned_fmt option. The default format is the same as one for the main report .

The version 1.6.0 executables are available from the NCBI FTP site.  See the release notes , the NCBI GitHub site , and the Magic-BLAST publication for more information.

NCBI on YouTube: Tips for My Bibliography, Genome Data Viewer and more

Missed our latest YouTube videos? Scroll down to see what we’ve been up to.

Add Preprint Citations in My Bibliography

The National Institutes of Health encourages investigators to post preprints to public repositories in order to speed the dissemination and enhance the rigor of their work. This video demonstrates how to add preprint citations to My Bibliography.

Continue reading “NCBI on YouTube: Tips for My Bibliography, Genome Data Viewer and more”

Remembering Mark Boguski

It is with much sadness that we recently learned of the passing of Mark Boguski, MD, PhD, a former Senior Investigator in the Computational Biology Branch at NCBI. Mark worked at the NCBI from 1989-2000 and made a lasting impression on the staff who are still with NCBI and who overlapped with his time here. Many of them have commented on social media about their personal interactions and fond memories of Mark.

Figure 1. Part of an alignment from a translating BLAST (blastx) search of a modified chicken translation factor sequence that Mark provided to Michael Crichton for The Lost World. Mark had edited the sequence by inserting DNA codons that BLAST translates to ‘MARK WAS HERE NIH’ thus leaving his autograph. Continue reading “Remembering Mark Boguski”

April 7 Webinar: Recent and upcoming enhancements to NCBI BLAST and Primer-BLAST services!

April 7 Webinar: Recent and upcoming enhancements to NCBI BLAST and Primer-BLAST services!

Join us on April 7, 2021 at 12PM eastern time to learn about new web BLAST and Primer-BLAST enhancements that improve your BLAST experience. You’ll also see a preview of some planned improvements to the databases that make it easier to find relevant matches.

Recent changes to web BLAST include added data columns on the descriptions table, so you can quickly find and sort your matches. Primer-BLAST now offers direct links from genome assembly pages, so you can easily select the specificity database. Primer-BLAST also now accepts multiple target templates making it easy to design primers that can amplify several similar sequences such as all splice variants of gene or the same target (16S, COI) from different strains or species.

  • Date and time: Wed, April 7, 2021 12:00 PM – 12:45 PM EDT
  • Register

After registering, you will receive a confirmation email with information about attending the webinar. A few days after the live presentation, you can view the recording on the NCBI webinars playlist on the NLM YouTube channel. You can learn about future webinars on the Webinars and Courses page.