Search Results for: datasets

PubMed Central Article Datasets are Now Available on the Cloud

To enhance machine access to biomedical literature and drive impactful analyses and reuse, the National Library of Medicine (NLM) is pleased to announce the availability of the PubMed Central (PMC) Article Datasets on Amazon Web Services (AWS) Registry of Open Data as part of AWS’s Open Data Sponsorship Program (ODP). These datasets collectively span 4 million of PMC’s 7 million …

Introducing the new NCBI Datasets Genomes page

The updated NCBI Datasets Genomes page now has genome data for all domains of life, including bacterial and viral genomes. The genomes table (Figure 1) now offers filters for:
Reference genomes: switch it on to only show reference or representative genomes
Annotated: switch it on to only show annotated genomes
Assembly level: use the assembly level …

New NCBI Datasets home and documentation pages provide easier access

NCBI Datasets, the new set of services for downloading genome assembly and annotation data (previous Datasets posts), has redesigned and reorganized web pages to make it easier to find and access the services and documentation you need. NCBI Datasets has a fresh new homepage (Figure 1) highlighting the types of data available through our tools. Available …

The Datasets command-line tool now provides ortholog data

You can now get gene ortholog data with the NCBI Datasets command-line tool using a gene ID, gene symbol, or RefSeq nucleotide or protein accession. Data are available for vertebrates and insects. The vertebrate orthologs include a specialized set for fish. (See our recent post for more information on the orthologs for fish and insects.) You …

Programmatic access to Gene data using Datasets command-line and API

In March, we announced NCBI Datasets, a new resource that lets you easily retrieve and download data from across NCBI databases. Did you know you can now fetch NCBI Gene data programmatically using the NCBI Datasets API or command-line tool? Quickly retrieve both metadata and gene sequence data for multiple Gene records including transcripts and proteins …

Easily download large amounts of genomic data with NCBI Datasets

Do you need to download a lot of genomic data? Maybe you need all primate reference genomes, or maybe you need just a few really big genomes? Before NCBI Datasets, downloading such a large amount of data could be a frustrating and time-consuming experience involving failed downloads and custom scripts. NCBI Datasets …

NCBI on YouTube: Customize MSA Viewer, SciENcv, plants and RNA-Seq data, Datasets and PubMed

Missed a few videos on YouTube? Here’s the latest from our channel.

Customize the MSA Viewer to Make Your Analysis Easier

We’re constantly improving the Multiple Sequence Alignment (MSA) Viewer. This video demonstrates several new and popular features, including the ability to change data columns, hide selected rows, analyze polymorphisms, and more.

Sept 22 Webinar: Using NCBI Datasets command-line tools to access data and metadata for genomes

Join us on September 22, 2021, at 12 PM Eastern Time to learn how to use the Datasets command-line tools (datasets and dataformat) to access, filter, download, and format data and metadata for genomes. Through examples from eukaryotes and the SARS-CoV-2 coronavirus, you will see how to use metadata to filter for genome sequences with desired properties such …