MicroSatellite DataBase (MSDB) is a collection of simple sequence repeats (SSRs), also known as microsatellites. SSRs are short tandem repeats of 1 to 6 base long motifs present in all genomes, particularly in Eukaryotes. Many studies have pointed to the role of SSRs in gene regulation, and have shown that SSRs can act as transcription factor binding sites, enhancer-blockers, insulators etc., making them an interesting class of DNA elements to study. MSDB aims to be the go-to resource for both accessing as well as visualizing SSR-related information.
Currently, MSDB has information of 6,893 species belonging to various Kingdoms, as outlined below:
Kingdom or Group | Number of species |
Archaea | 514 |
Bacteria | 5732 |
Plants | 74 |
Fungi | 191 |
Protozoa | 72 |
Invertebrates | 112 |
Vertebrates | 198 |
Species name | Common name | Genome Size | Total number of SSRs | Genome covered(%) |
Homo sapiens | Human | 3236.34MB | 4585519 | 2.13% |
Mus musculus | Mouse | 2807.72MB | 5194607 | 3.38% |
Rattus norvegicus | Rat | 2616.42MB | 4232429 | 2.88% |
Gallus gallus | Chicken | 1230.26MB | 1698863 | 2.17% |
Danio rerio | Zebrafish | 1371.1MB | 2937095 | 4.23% |
Drosophila melanogaster | Fruitfly | 143.73MB | 263465 | 2.58% |
Caenorhabditis elegans | Worm | 100.29MB | 100997 | 1.49% |
Saccharomyces cerevisiae | Yeast | 12.16MB | 12073 | 1.38% |