Are you looking for SARS-CoV-2 sequence data? Look no further! The NCBI Virus SARS-CoV-2 Data Hub now has an interactive data dashboard (Figure 1) that shows the collection location (country and US state), the date of collection, and the date of public availability for SARS-CoV-2 sequence data. You can view available nucleotide and protein sequences based on criteria you select and send these to a data table. You can further filter by normalized source information including sequence length, protein content, host, anatomical isolation source. The sequence records have links to related SRA records and publications in PubMed when available. You can download the data as FASTA-formatted sequences with customizable titles, accession lists, or as a table including data descriptors. See the help documentation for more details.
The sequences in NCBI Virus were submitted to members of the International Sequence Database Consortium (INSDC) – GenBank, EMBL, and DDBJ. This collaborative effort ensures that data is freely available to the scientific and public health communities where it can be used to understand the biology, evolution, and spread of SARS-CoV-2.
Figure 1. The NCBI Virus SARS-CoV-2 Data Hub Dashboard. You can narrow down sequence data using collection location, collection date, or the public release date. After making your selections, click “View results, Analyze, or Download” near the top of the page to see your dataset in the results table, which shows nucleotide, protein, and RefSeq sequences as well as associated metadata.