rRNA, snRNA and snoRNA models now predicted by the NCBI Eukaryotic Genome Annotation Pipeline

The NCBI Eukaryotic Genome Annotation Pipeline now includes the prediction of more non-coding RNAs. Starting with software release 8.0, rRNAs, snRNAs and snoRNAs are predicted by searching eukaryotic genomes with HMM models from RFAM. Below is an example of a rRNA cassette predicted in maize Annotation Release 102. These new small RNA types come in addition to the miRNAs and tRNAs that have long been annotated by the pipeline.

rRNA cassette on maize scaffold NW_017972167.1 of assembly B73 RefGen_v4
Fig.1: rRNA cassette on maize scaffold NW_017972167.1 of assembly B73 RefGen_v4. The top track displays the annotated 18S, 5.8S and 28S rRNA subunits in Annotation Release 102. These three genes were missing from the previous annotation, and replaced incorrect non-coding gene predictions (see Annotation Release 101, middle track). The bottom track shows the repeats identified by RepeatMasker. The boundaries of the rRNA repeats match precisely the predicted 18S and 28S rRNA genes.

See what we are annotating now on the Eukaryotic RefSeq Genome Annotation Status page.

One thought on “rRNA, snRNA and snoRNA models now predicted by the NCBI Eukaryotic Genome Annotation Pipeline

Leave a Reply