ARGO is a tool for the detection and visualization of sets of region-specific degenerate oligonucleotide motifs in the regulatory regions of eukaryotic genes.
includes: TRANSFAC - transcription factor database; Patho DB - mutated forms of transcription factors and binding sites that are pathologically relevant; S/MARt DB - scaffold matrix transaction database; TRANSPATH - gene regulatory pathway database.
Server which attempts to identify any motifs related to genes predicted to share regulatory elements. It alters Gibbs sampling through biasing searches towards conserved sequences across multiple species.
ESLpred is a tool for predicting subcellular localization of proteins using support vector machines. The predictions are based on dipeptide and amino acid composition, and physico-chemical properties.
EUGENE'HOM is a gene prediction software for eukaryotic organisms based on comparative analysis. The data is currently tuned for plant sequences of up to 400kb.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
The InParanoid project gathers proteomes of completely sequenced eukaryotic species plus Escherichia coli and calculates pairwise ortholog relationships among them. To facilitate data exchange and comparisons among ortholog databases, we have developed and are making available two XML schemas: SeqXML for the input sequences and OrthoXML for the output ortholog clusters.
JASPAR is a non-redundant, curated collection of transcription factor binding profiles. Each profile is generated from published, experimentally defined eukaryotic transcription factor binding sites.
LOC3D is a database of predicted subcellular localization for eukaryotic proteins of known three-dimensional (3D) structure and includes tools to predict the subcellular localization for submitted protein sequences.
LOCtarget is a tool for predicting, and a database of pre-computed predictions for, sub-cellular localization of eukaryotic and prokaryotic proteins. Several methods are employed to make the predictions, including text analysis of SWISS-PROT keywords, nuclear localization signals, and the use of neural networks.
MADAP is a clustering tool for the interpretation annotation data mapped onto complete or partial genome sequences. Initially developed for determining transcription start sites (TSS) by defining 5\' and 3\'ends of mRNA, MADAP also has utility in clustering other annotation data types (ChIP-chip data, for example).
MetaPath Online gives users access to an algorithm for network expansion. Starting with given metabolites, called seed compounds, the algorithm uses known information about metabolic reactions and pathways to expand the network and define scope. This systematic investigation of structure-function relationships of metabolic networks can offer evolutionary insights.