Server which scans upstream of genes in the same gene expression cluster for regulatory sequence motifs using a Gibbs sampling strategy. The Markov background model is used for non-motif bases, improving specificity of predicted motif locations.
Server which attempts to identify any motifs related to genes predicted to share regulatory elements. It alters Gibbs sampling through biasing searches towards conserved sequences across multiple species.
CORUM is a database that provides a manually curated repository of experimentally characterized protein complexes from mammalian organisms, mainly human (64%), mouse (16%) and rat (12%). The CORUM dataset is built from 3198 different genes, representing approximately 16% of the protein coding genes in humans. Each protein complex is described by a protein complex name, subunit composition, function as well as the literature reference that characterizes the respective protein complex. A 'Phylogenetic Conservation' analysis tool allows one to predict the occurrence of protein complexes in different phylogenetic groups.
The DAnCER (Disease Annotated Chromatin Epigenetic Resource) resource integrates information on genes with chromatin modification (CM) function from five model organisms, including human. Currently integrated are gene functional annotations, Pfam domain architecture, protein interaction networks and associated human diseases. Additional supporting evidence includes orthology relationships across organisms, membership in protein complexes, and information on protein 3D structure. DAnCER allows visual explorations of the integrated data and flexible query capabilities using a variety of data filters. In particular, disease information and functional annotations are mapped onto the protein interaction networks.
Tool which divides both strands of an input DNA sequence into a set of contiguous oligonucleotides. These oligos are designed with complimentary regions so that the complete set can be combined to synthesize the input DNA sequence in vitro (using oligonucleotide synthesis and assembly PCR).
Database of protein orthologs that interact (interologs) and proteins with conserved regulatory relationships across species (regulogs). Contains data for C. elegans, Drosophila, Arabidopsis, and Yeast.
IsoBase is a database identifying functionally related proteins across five major eukaryotic model organisms: yeast, fly, worm, mouse, and human. Clusters were computed using the IsoRankN algorithm, which uses spectral methods, to combine sequence and PPI data to produce clusters of functionally related proteins. IsoBase may be browsed for functionally related proteins across two or more species and may also be queried by accession numbers, species-specific identifiers, gene name or keyword.
MARQ is a web based application that allows users to compare a query set of genes (query signature) against a gene signature database built from GEO datasets for different organisms and platforms.
Server designed to pinpoint protein-DNA interaction sites at the base pair level. Uses ChIP-array data, word enumeration and position-specific weight matrix updating to search for motifs representing these interaction sites.
MitoGenesisDB focuses on the dynamic of mitochondrial protein formation through global mRNA analyses. Three main parameters confer a global view of mitochondrial biogenesis: (i) time-course of mRNA production in highly synchronized yeast cell cultures, (ii) microarray analyses of mRNA localization that define translation sites and (iii) mRNA transcription rate and stability which characterize genes that are more dependent on post-transcriptional regulation processes. MitoGenesisDB integrates and establishes cross-comparisons between these data. Several model organisms can be analyzed via orthologous relationships between interspecies genes.