Genome Annotations

Annotation sources

MAAGE provides two basic annotation sources, RefSeq, and MAAGE. RefSeq sequences from NCBI, have each been annotated by the submitting researcher using methodologies of their choice. MAAGE re-annotates all bacterial genomes using RAST tool kit (RASTtk) and viral genomes with VIGOR4 to provide annotation consistency across a wide variety of genomes. The original RefSeq annotations have been retained and are still available for comparison purposes.

Bacterial genomes that are in more than 500 contigs are not annotated by RAST, nor are plasmid-only genomes. There is approximately a two-month interval between when sequences are submitted to RefSeq and re-annotation/integration with MAAGE, thus there may be a significant difference in the number of genomes at NCBI and at MAAGE.

Annotated Features

Currently, MAAGE supports the following bacterial genomic feature types:

  • -10_signal

  • -35_signal

  • 5’UTR

  • attenuator

  • CDS

  • conflict

  • enhancer

  • exon

  • gene

  • intron

  • LTR

  • mat_peptide

  • misc_binding

  • misc_difference

  • misc_feature

  • misc_recomb

  • misc_RNA

  • misc_signal

  • misc_structure

  • mRNA

  • ncRNA

  • old_sequence

  • prim_transcript

  • primer_bind

  • promoter

  • protein_bind

  • pseudogene

  • pseudogenic_region

  • RBS

  • region

  • rep_origin

  • repeat_region

  • repeat_unit

  • ribosome_entry_site

  • rRNA

  • sig_peptide

  • source

  • stem_loop

  • terminator

  • tmRNA

  • transcript

  • tRNA

  • unsure

  • variation