AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
The Berkeley Phylogenomics Group provides a series of
web servers for phylogenomic analysis: classification of sequences to pre-computed families and subfamilies using the PhyloFacts Phylogenomic Encyclopedia, FlowerPower clustering of proteins sharing the same domain architecture, MUSCLE multiple sequence alignment, SATCHMO simultaneous alignment and tree construction, and SCI-PHY subfamily identification.
DECA is a species disambiguation system for biological named entities. The system will identify protein entities and disambiguate them according to species, by assigning NCBI Taxonomy organism IDs.
DiANNA (DiAminoacid Neural Network Application) a tool that predicts cysteine states of a protein. It can predict whether a particular cysteine is reduced, is forming a disulfide bond, or is bound to a metallic ligand.
ESLpred is a tool for predicting subcellular localization of proteins using support vector machines. The predictions are based on dipeptide and amino acid composition, and physico-chemical properties.
eTBLAST is a textual similarity search engine. This server can parse and summarize the results of an abstract similarity search to find appropriate journals for publication, authors with expertise in a given field, and documents similar to a submitted query.
FatiGOplus is a tool for the functional profiling of genome-scale experiments oriented to the interpretation of microarray experiments. As part of the Babelomics suite, FatiGOplus finds differential distributions of biological terms (GO, KEGG pathways, Interpro motifs, Transfac motifs, CisRed motifs, etc.) between two groups of genes.
The Gene Expression Data Analyzer (GEDA) is a tool for discovering differential gene expression in a subset of patients. It is tailored to cancer-related microarray studies and offers extensive options for visualization, classification and normalization.