The Sequence Read Archive (SRA), NCBI’s largest growing repository of molecular data, archives raw sequencing data and alignment information from high-throughput sequencing platforms, including Roche 454 GS Systems®, Illumina’s Genome Analyzer®, and Complete Genomics® systems.
Researchers commonly use SRA data to make discoveries via comparison of data sets. Data sets can be compared through the SRA web interface, but if you want to integrate these downloads and file conversions into an already existing pipeline, or you simply prefer using a command-line interface, we recommend using the SRA Toolkit.
Run Selector is a tool available through the Sequence Read Archive (SRA) that allows you to fine-tune your web-based search results. There are over two dozen fields that can be used to filter SRA data in Run Selector. For example, if you need to look at data from a particular sequencing platform and genome assembly, you can use these fields as filters.
After running a web-based search for any keyword in the SRA database, users can dump all the results (up to a maximum of 20,000 experiments) into the Run Selector for fine-tuning. In addition, Run Selector shows you how many runs fall into each of the categories even before a filtering category is selected, allowing you to investigate the data further by noting what is contained within the database.
Figure 1. After searching with SRA, click on “Send to” to open the drop-down menu. Then click on the radio button labeled “Run Selector” to send your search results to Run Selector. Note that you can already see how many runs are in each of the categories to the left.