ChemMine Tools is an online service for analyzing and clustering small molecules. A suite of cheminformatics and data mining tools for data visualization, structure comparison, similarity searching, compound clustering and prediction of chemical properties are available.
ConSeq is a tool for predicting functionally and structurally important amino acid residues in protein sequences. The predictions are based on the assumptions that residues of functional importance are often conserved and solvent-accessible, and those of structural importance are often conserved and located in the protein core. A multiple sequence alignment is used to predict the relative solvent accessibility state and the evolutionary rate at each residue.
ConSeq has been integrated into ConSurf (2010).
DBAli is a database that includes a comprehensive all-against-all comparison of protein structures in the PDB database. Tools associated with the database allow users to mine this protein structure space by establishing relationships between protein structures and their functions.
DiANNA (DiAminoacid Neural Network Application) a tool that predicts cysteine states of a protein. It can predict whether a particular cysteine is reduced, is forming a disulfide bond, or is bound to a metallic ligand.
eF-seek predicts protein functional sites by searching for similar electrostatic potential and molecular surface shapes against eF-site, a database of electrostatic surfaces for representative ligand binding sites.
MutDB is a database that associates protein structural information with mutations and polymorphisms in gene sequences. The data is derived from dbSNP and Swiss-Prot, and can be browsed by gene name or searched by keyword or by various identifiers.
Online Guassian Network Model (oGNM) calculates the equilibrium dynamics of any structure submitted in PDB format, using the Gaussian Network Model (GNM), a network model for representing biomolecules.
One-Block CYRCA is a program for identifying blocks (local ungapped profiles of the most conserved regions of protein families and domains) in a multiple sequence alignment based on the LAMA and CYRCA block-to-block alignment methods.