includes: TRANSFAC - transcription factor database; Patho DB - mutated forms of transcription factors and binding sites that are pathologically relevant; S/MARt DB - scaffold matrix transaction database; TRANSPATH - gene regulatory pathway database.
The Center for Information Biology gene EXpression database (CIBEX) is a public repository for gene expression experimental data. The database system is compliant with the MIAME standard.
The Database of Interacting Proteins (DIP) allows users to search for interacting proteins. Results lists can be searched and/or visualized (statically or dynamically). Users can submit new protein-protein interactions and update database entries.
Summary of gene and genomic information automatically maintained from primary eukaryotic organism databases: gene symbol, full name, chromosome, genetic and molecular maps, gene products, links to extended gene information.
GlyNest and CASPER are two independent services, available through this common interface, for predicting NMR spectra given a glycan chemical structure.
The Gene Expression Omnibus (GEO) database stores over 20,000 microarray- and sequence-based functional genomics studies, and continues to handle the majority of direct high-throughput data submissions from the research community. Multiple mechanisms are provided to help users effectively search, browse, download and visualize the data at the level of individual genes or entire studies.
Protein Blocks Expert (PBE) uses a structural alphabet of short structural motifs to compare protein structures. PBE uses as its structural alphabet a set of protein blocks derived from structurally aligned homologous proteins present in the PALI Database.
The Structural Classification of RNA (SCOR) is a database designed to provide a comprehensive perspective and understanding of RNA motif structure, function, tertiary interactions and their relationships.
Twinscan is a system for predicting gene-structure in eukaryotic genomic sequences. In order to make its predictions, Twinscan combines the information from predicted coding regions and splice sites with conserservation measurements between the target sequence and sequences from a closely related genome. Currently sequences from mammalian genomes, and those of Arabidopsis thaliana, C. elegans, C. briggsae and strains JEC21 and H99 of Cryptococcus neoformans can be processed using Twinscan.