The ACLAME database is dedicated to the collection, analysis and classification of sequenced mobile genetic elements (MGEs, in particular phages and plasmids). Classifications are available at various levels of organization. At the gene/protein level, families group similar sequences that are expected to share the same function. Families of four or more proteins are manually assigned with a functional annotation using the GeneOntology and the locally developed ontology MeGO dedicated to MGEs. At the genome level, evolutionary cohesive modules group sets of protein families shared among MGEs. At the population level, networks display the reticulate evolutionary relationships among MGEs.
ALLGEN server provides various tools for multiple sequence alignments, clustering, and assembly of ESTs. It also includes search tools for transcription factor binding sites (TFBS), repeated patterns, and transposons.
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) Finder detects this family of direct repeats found in the DNA of many bacteria and archaea.
Gibbs Motif Sampler allows you to identify motifs, conserved regions, in DNA or protein sequences. This tool can be applied for the detection of transcription factor binding sites (TFBS).
Lots of information on fugu including complete draft sequence, annotations, comparative vertebrate genomics, phylogenies, publications, and information about the fugu genome project; has some very useful tools including annotation tools, an Ensembl mirror, and BLAST.
The Multiple alignment Manipulator (MaM) takes a multiple alignment of genomic sequences as input and calculates the locations of exons, common repeat elements and unique regions based on user-selected programs/information. The graphical display also allows users to focus an assessment of sequence variation on the identified regions.
The MEME Suite web server is a unified portal for discovery and analysis of sequence motifs (e.g. DNA binding sites, protein interaction domains). Motifs with/without gaps may be discovered, searched for in DNA and protein databases, compared to other motifs and analyzed for putative function by association with gene ontology terms.
MEME is a tool designed for discovering and searching for DNA motifs such as transcription factor binding sites (TFBS) or protein domains.
Non-B DB is a non-canonical (non-B) structure database database integrating annotations and analysis of non-B DNA-forming sequence motifs. The database provides alternative DNA structure predictions, including Z-DNA motifs, quadruplex-forming motifs, inverted repeats, mirror repeats and direct repeats and their associated subsets of cruciforms, triplex and slipped structures, respectively. The database also contains motifs predicted to form static DNA bends, short tandem repeats and homo(purine•pyrimidine) tracts that have been associated with disease. The database is accessed through query pages that produce results with links to the UCSC browser and a GBrowse-based genomic viewer.
SNPmasker masks SNPs in a given sequence using information from the dbSNP database. This tool can also be used to mask repeat sequences. SNPmasker is primarily designed for masking the sequences before primer and probe design.
VISTA is a comprehensive suite of programs and databases for comparative analysis of genomic sequences. There are two ways of using VISTA - you can submit your own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species.