My Bibliography is a component of the My NCBI service and allows authors to create an online collection of their published work. While editing their bibliographies, authors can import citations for their articles directly from PubMed, and the system will automatically check for duplicates and will remove citations imported more than once. However, authors may still end up with duplicates in certain situations, and sometimes it is not obvious how to remove these duplicates. In this post we will describe three situations where duplicates may persist and will discuss ways to remove them.
On a typical day, researchers download about 30 terabytes of data from NCBI in an effort to make discoveries. NCBI began providing online access to data in the early 1990s, starting with the GenBank database of DNA sequences. Over the years we’ve greatly expanded the types and quantity of data available. You can now find on our site descriptions and data from experimental studies such as next-generation sequencing projects, bioactivity assays for small molecules, microarray datasets and genome-wide association studies.
The White House recently recognized these efforts by awarding NCBI Director David J. Lipman with the “Open Science” Champion of Change Award . The scientific community has recognized the benefits of open data. Access to this information serves as a source of both original and supplemental data for exploration and validation [2-4], which improves the power of experimental data  while increasing the speed and decreasing the cost of discovery .
In this post, we summarize three recent cases where researchers used data from an NCBI resource/database to make significant discoveries.