EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
Asterias is a set of tools for the analyses of high throughput genomic data that includes applications for microarray data normalization, filtering, detection of differential gene expression, class and survival prediction model building, and analysis of array CGH data. Most applications use parallel computing resulting in significant increases in speed.
CARGO (Cancer And Related Genes Online) is a portal that uses widgets to allow users to mine literature using iHOP, retrieve disease information from OMIM, visualize 3D SNPs, query protein interactions, and view summarized gene annotation information for cancer related genes in human.
The Database for Annotation, Visualization and Integrated Discovery (DAVID) provides a comprehensive set of functional annotation tools for investigators to understand biological meaning behind large list of genes.
FatiGOplus is a tool for the functional profiling of genome-scale experiments oriented to the interpretation of microarray experiments. As part of the Babelomics suite, FatiGOplus finds differential distributions of biological terms (GO, KEGG pathways, Interpro motifs, Transfac motifs, CisRed motifs, etc.) between two groups of genes.
GeneTrail analyzes gene sets for statistically significant accumulations of genes belonging to functional categories. Two common statistical approaches are implemented including: Over-Representation Analysis (ORA) for comparing a reference set of genes to a test set; and, Gene Set Enrichment Analysis (GSEA) for scoring sorted lists of genes.
KAAS (KEGG Automatic Annotation Server) is a tool for ortholog assignment and pathway mapping. KAAS provides functional annotation of genes by BLAST comparisons against the manually curated KEGG GENES database. The result contains KO (KEGG Orthology) assignments and automatically generated KEGG pathways.
MADAP is a clustering tool for the interpretation annotation data mapped onto complete or partial genome sequences. Initially developed for determining transcription start sites (TSS) by defining 5\' and 3\'ends of mRNA, MADAP also has utility in clustering other annotation data types (ChIP-chip data, for example).
The NCBI Influenza Virus Sequence Annotation Tool is a web application for user-provided sequences. It can predict protein sequences encoded by an input flu sequence and produce a feature table that can be used for sequence submission to GenBank.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
g:Profiler is a set of tools for functional annotation of gene lists that includes: g:GOSt, which retrieves the most significant Gene Ontology (GO) terms, KEGG and REACTOME pathways, and TRANSFAC motifs; g:Convert, for conversion between gene or protein names/IDs; g:Orth, for retrieving orthologs; and, g:Sorter, which searches for similar expression profiles. Functional analysis of SNPs and other DNA polymorphisms is also supported, as is enrichment analysis.
BLASTO (BLAST on Orthologous groups) is a modified BLAST tool for searching orthologous group data. It treats each orthologous group as a unit and outputs a ranked list of orthologous groups instead of single sequences.
The Berkeley Phylogenomics Group provides a series of
web servers for phylogenomic analysis: classification of sequences to pre-computed families and subfamilies using the PhyloFacts Phylogenomic Encyclopedia, FlowerPower clustering of proteins sharing the same domain architecture, MUSCLE multiple sequence alignment, SATCHMO simultaneous alignment and tree construction, and SCI-PHY subfamily identification.
MultiPhyl is a high-throughput Maximum Likelihood based phylogeny analysis program that allows researchers to create a virtual phylogenetic supercomputer from a group semi-idle desktop machines.
Phylemon is a server that integrates a suite of tools for multiple sequence alignment, phylogeny, and evolutionary tests from the most popular stand-alone phylogenetic and evolutionary analysis programs. Phylemon 2.0 has an integrated environment to allow for evolutionary analyses, format conversion, file storage and editing capacity, and has saved phylogenetic pipeline function.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
CARGO (Cancer And Related Genes Online) is a portal that uses widgets to allow users to mine literature using iHOP, retrieve disease information from OMIM, visualize 3D SNPs, query protein interactions, and view summarized gene annotation information for cancer related genes in human.
StSNP (Structure SNP) is a server which provides the ability to analyze and compare human nonsynonymous SNPs (nsSNP) in protein structures, protein complexes, protein-protein interfaces and metabolic networks.
WhETS (Wheat Estimated Transcript Server) is a resource that combines Triticeae ESTs/mRNAs with rice genes to find the best estimate of hexaploid wheat transcript sequences for a target gene, supplemented with information on tissue distribution and likely gene structure, to aid in primer design.
g:Profiler is a set of tools for functional annotation of gene lists that includes: g:GOSt, which retrieves the most significant Gene Ontology (GO) terms, KEGG and REACTOME pathways, and TRANSFAC motifs; g:Convert, for conversion between gene or protein names/IDs; g:Orth, for retrieving orthologs; and, g:Sorter, which searches for similar expression profiles. Functional analysis of SNPs and other DNA polymorphisms is also supported, as is enrichment analysis.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
The NCBI Influenza Virus Sequence Annotation Tool is a web application for user-provided sequences. It can predict protein sequences encoded by an input flu sequence and produce a feature table that can be used for sequence submission to GenBank.
WhETS (Wheat Estimated Transcript Server) is a resource that combines Triticeae ESTs/mRNAs with rice genes to find the best estimate of hexaploid wheat transcript sequences for a target gene, supplemented with information on tissue distribution and likely gene structure, to aid in primer design.
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) Finder detects this family of direct repeats found in the DNA of many bacteria and archaea.
Gibbs Motif Sampler allows you to identify motifs, conserved regions, in DNA or protein sequences. This tool can be applied for the detection of transcription factor binding sites (TFBS).
MYBS (Mining Yeast Binding Sites) is a tool for mapping Transcription factors (TFs) and their binding sites (TFBSs) that integrates both experimentally verified (ChIP-chip data) and predicted position weight matrixes (PWMs) from several databases.
MelinaII (Motif Elucidator in Nucleotide Sequence Assembly) can run multiple motif prediction tools (Consensus, MEME, Gibbs Sampler, MDScan, Weeder) simultaneously. Graphical results can be used to compare predictions of potential DNA motifs (such as transcription factor binding sites, TFBS) in promoter regions.
OPTIMIZER is an application that optimizes the codon usage of a DNA sequence. This server can be used to predict and optimize the level expression of a gene in heterologous gene expression systems.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
Promoter Analysis Pipeline (PAP) can identify potential transcriptional regulators for sets of co-expressed genes and identify potential regulatory targets of transcription factors.
RF_DYMHC (Random Forest-Based System for Detecting Yeast Meiotic Recombination Hotspots and Coldspots) predicts recombination hot/cold spots in the yeast genome.
SCOPE (Suite for Computational identification Of Promoter Elements) is a set of programs aimed at identifying novel cis-regulatory elements from groups of upstream sequences. SCOPE does not have any nuisance parameters so does not require the user to guess what the results might be. SCOPE is also very robust to noisy data and is still 50% as accurate even in the presence of 4-fold noise. SCOPE can also identify other genes in the genome that might be coregulated with the original gene set.
VISTA is a comprehensive suite of programs and databases for comparative analysis of genomic sequences. There are two ways of using VISTA - you can submit your own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species.
WebMOTIFS is a tool for motif discovery, scoring, analysis, and visualization. It allows you to use different programs (AlignACE, MDscan, MEME, and Weeder) to search for DNA-sequence motifs, and evaluate the results.
ZiFiT (Zinc Finger Targeter) assists in the
design of zinc finger proteins that can bind to specific DNA sequences. ZiFiT has been updated to integrate additional tools and resources including scoring schemes for modular assembly, an interface with the Zinc Finger Database of engineered ZFPs, and direct query of NCBI BLAST servers for identification of potential off-target sites within a host genome.
OligoCalc calculates the physical properties of single- and double-stranded DNA and RNA molecules including melting temperature (multiple methods), concentration, molecular weight, and absorbance at 260 nanometers. OligoCalc can also account for 5’ and 3’ chemical modifications and predict potential hairpin loop formation.
REPK (Restriction Endonuclease Picker) assists in the choice of restriction endonucleases for terminal restriction fragment length polymorphism (T-RFLP) by finding sets of four restriction endonucleases which together uniquely differentiate user-designated sequence groups.
WebTraceMiner is a tool for processing and mining Expressed Sequence Tag (EST) trace files. It can help characterize 3\' and 5\' termini of cDNA inserts by detecting sequence features such as vector fragments, adapter/linker sequences, insert-flanking restriction endonuclease recognition sites, and polyA or polyT tails.
The Bioinformatics Links Directory features curated links to molecular resources, tools and databases. All of the resources are free or available for a nominal fee.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
The Database for Annotation, Visualization and Integrated Discovery (DAVID) provides a comprehensive set of functional annotation tools for investigators to understand biological meaning behind large list of genes.
DEEP (Differential Expression Effector Prediction) is a tool that can identify effectors of specific expression profiles by combining gene expression data with biological expert knowledge about biomolecular interaction networks (provided by resources like TRANSPATH, for example).
FatiGOplus is a tool for the functional profiling of genome-scale experiments oriented to the interpretation of microarray experiments. As part of the Babelomics suite, FatiGOplus finds differential distributions of biological terms (GO, KEGG pathways, Interpro motifs, Transfac motifs, CisRed motifs, etc.) between two groups of genes.
Gibbs Motif Sampler allows you to identify motifs, conserved regions, in DNA or protein sequences. This tool can be applied for the detection of transcription factor binding sites (TFBS).
MADAP is a clustering tool for the interpretation annotation data mapped onto complete or partial genome sequences. Initially developed for determining transcription start sites (TSS) by defining 5\' and 3\'ends of mRNA, MADAP also has utility in clustering other annotation data types (ChIP-chip data, for example).
MYBS (Mining Yeast Binding Sites) is a tool for mapping Transcription factors (TFs) and their binding sites (TFBSs) that integrates both experimentally verified (ChIP-chip data) and predicted position weight matrixes (PWMs) from several databases.
MelinaII (Motif Elucidator in Nucleotide Sequence Assembly) can run multiple motif prediction tools (Consensus, MEME, Gibbs Sampler, MDScan, Weeder) simultaneously. Graphical results can be used to compare predictions of potential DNA motifs (such as transcription factor binding sites, TFBS) in promoter regions.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
Promoter Analysis Pipeline (PAP) can identify potential transcriptional regulators for sets of co-expressed genes and identify potential regulatory targets of transcription factors.
SCOPE (Suite for Computational identification Of Promoter Elements) is a set of programs aimed at identifying novel cis-regulatory elements from groups of upstream sequences. SCOPE does not have any nuisance parameters so does not require the user to guess what the results might be. SCOPE is also very robust to noisy data and is still 50% as accurate even in the presence of 4-fold noise. SCOPE can also identify other genes in the genome that might be coregulated with the original gene set.
WebMOTIFS is a tool for motif discovery, scoring, analysis, and visualization. It allows you to use different programs (AlignACE, MDscan, MEME, and Weeder) to search for DNA-sequence motifs, and evaluate the results.
ZiFiT (Zinc Finger Targeter) assists in the
design of zinc finger proteins that can bind to specific DNA sequences. ZiFiT has been updated to integrate additional tools and resources including scoring schemes for modular assembly, an interface with the Zinc Finger Database of engineered ZFPs, and direct query of NCBI BLAST servers for identification of potential off-target sites within a host genome.
g:Profiler is a set of tools for functional annotation of gene lists that includes: g:GOSt, which retrieves the most significant Gene Ontology (GO) terms, KEGG and REACTOME pathways, and TRANSFAC motifs; g:Convert, for conversion between gene or protein names/IDs; g:Orth, for retrieving orthologs; and, g:Sorter, which searches for similar expression profiles. Functional analysis of SNPs and other DNA polymorphisms is also supported, as is enrichment analysis.
VisANT is an integrative visual analysis tool for biological networks and pathways that contains modules for querying and integrating KEGG pathways with expression data. VisANT is java-based, and can be run as a java applet, as a java web application, or downloaded and run locally. New functions in VisANT 3.5 allow determination of over-representation and expression enrichment.
VisANT is an integrative visual analysis tool for biological networks and pathways that contains modules for querying and integrating KEGG pathways with expression data. VisANT is java-based, and can be run as a java applet, as a java web application, or downloaded and run locally. New functions in VisANT 3.5 allow determination of over-representation and expression enrichment.
OPTIMIZER is an application that optimizes the codon usage of a DNA sequence. This server can be used to predict and optimize the level expression of a gene in heterologous gene expression systems.
VisANT is an integrative visual analysis tool for biological networks and pathways that contains modules for querying and integrating KEGG pathways with expression data. VisANT is java-based, and can be run as a java applet, as a java web application, or downloaded and run locally. New functions in VisANT 3.5 allow determination of over-representation and expression enrichment.
AStalavista (alternative splicing transcriptional landscape visualization tool) dynamically identifies, extracts, and displays alternative splicing events from whole genome annotations and user provided gene sets.
SplicePort makes splice-site predictions for submitted sequences, and allows browsing of the motifs used in the prediction. Feature selection is optimized for human splice sites.
Asterias is a set of tools for the analyses of high throughput genomic data that includes applications for microarray data normalization, filtering, detection of differential gene expression, class and survival prediction model building, and analysis of array CGH data. Most applications use parallel computing resulting in significant increases in speed.
DEEP (Differential Expression Effector Prediction) is a tool that can identify effectors of specific expression profiles by combining gene expression data with biological expert knowledge about biomolecular interaction networks (provided by resources like TRANSPATH, for example).
FatiGOplus is a tool for the functional profiling of genome-scale experiments oriented to the interpretation of microarray experiments. As part of the Babelomics suite, FatiGOplus finds differential distributions of biological terms (GO, KEGG pathways, Interpro motifs, Transfac motifs, CisRed motifs, etc.) between two groups of genes.
GeneTrail analyzes gene sets for statistically significant accumulations of genes belonging to functional categories. Two common statistical approaches are implemented including: Over-Representation Analysis (ORA) for comparing a reference set of genes to a test set; and, Gene Set Enrichment Analysis (GSEA) for scoring sorted lists of genes.
MAGMA can be used for the analyses of two-channel microarray experiments and computes genes with significant differential expression using R and Bioconductor. Automatically generated R-scripts are available for download.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
Promoter Analysis Pipeline (PAP) can identify potential transcriptional regulators for sets of co-expressed genes and identify potential regulatory targets of transcription factors.
PathExpress is a tool developed to interpret gene expression data obtained from microarray experiments by identifying and visualizing the most relevant metabolic pathways associated with a subset of genes (e.g. differentially expressed genes). Enzyme Neighborhood (a sub-network of linked enzymes with a limited path length) allows identification of the most relevant sub-network affected in a gene expression experiment.
VisANT is an integrative visual analysis tool for biological networks and pathways that contains modules for querying and integrating KEGG pathways with expression data. VisANT is java-based, and can be run as a java applet, as a java web application, or downloaded and run locally. New functions in VisANT 3.5 allow determination of over-representation and expression enrichment.
g:Profiler is a set of tools for functional annotation of gene lists that includes: g:GOSt, which retrieves the most significant Gene Ontology (GO) terms, KEGG and REACTOME pathways, and TRANSFAC motifs; g:Convert, for conversion between gene or protein names/IDs; g:Orth, for retrieving orthologs; and, g:Sorter, which searches for similar expression profiles. Functional analysis of SNPs and other DNA polymorphisms is also supported, as is enrichment analysis.
DEEP (Differential Expression Effector Prediction) is a tool that can identify effectors of specific expression profiles by combining gene expression data with biological expert knowledge about biomolecular interaction networks (provided by resources like TRANSPATH, for example).
GeneHub-GEPIS is a tool for inferring human and mouse gene expression patterns based on normalized EST abundance in various normal and cancerous tissues.
WebTraceMiner is a tool for processing and mining Expressed Sequence Tag (EST) trace files. It can help characterize 3\' and 5\' termini of cDNA inserts by detecting sequence features such as vector fragments, adapter/linker sequences, insert-flanking restriction endonuclease recognition sites, and polyA or polyT tails.
CARGO (Cancer And Related Genes Online) is a portal that uses widgets to allow users to mine literature using iHOP, retrieve disease information from OMIM, visualize 3D SNPs, query protein interactions, and view summarized gene annotation information for cancer related genes in human.
The Database for Annotation, Visualization and Integrated Discovery (DAVID) provides a comprehensive set of functional annotation tools for investigators to understand biological meaning behind large list of genes.
MADAP is a clustering tool for the interpretation annotation data mapped onto complete or partial genome sequences. Initially developed for determining transcription start sites (TSS) by defining 5\' and 3\'ends of mRNA, MADAP also has utility in clustering other annotation data types (ChIP-chip data, for example).
CARGO (Cancer And Related Genes Online) is a portal that uses widgets to allow users to mine literature using iHOP, retrieve disease information from OMIM, visualize 3D SNPs, query protein interactions, and view summarized gene annotation information for cancer related genes in human.
CanPredict uses a combination of computational methods to predict whether specific sequence changes in a protein are likely to be cancer-associated mutations.
G2D (Candidate Genes to Inherited Diseases) scans a human genomic region for genes related to an inherited disease. The G2D server also presents precomputed candidate genes for more than 600 genetically inherited diseases that have been mapped onto chromosomal regions without assignment of a particular gene.
GeneHub-GEPIS is a tool for inferring human and mouse gene expression patterns based on normalized EST abundance in various normal and cancerous tissues.
LIPID Metabolites And Pathways Strategy (LIPID MAPS) is a consortium that presents a collection of tools for performing tasks such as drawing lipid structures and predicting possible structures from mass spectrometry data.
CARGO (Cancer And Related Genes Online) is a portal that uses widgets to allow users to mine literature using iHOP, retrieve disease information from OMIM, visualize 3D SNPs, query protein interactions, and view summarized gene annotation information for cancer related genes in human.
CanPredict uses a combination of computational methods to predict whether specific sequence changes in a protein are likely to be cancer-associated mutations.
StSNP (Structure SNP) is a server which provides the ability to analyze and compare human nonsynonymous SNPs (nsSNP) in protein structures, protein complexes, protein-protein interfaces and metabolic networks.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
eTBLAST is a textual similarity search engine. This server can parse and summarize the results of an abstract similarity search to find appropriate journals for publication, authors with expertise in a given field, and documents similar to a submitted query.
CARGO (Cancer And Related Genes Online) is a portal that uses widgets to allow users to mine literature using iHOP, retrieve disease information from OMIM, visualize 3D SNPs, query protein interactions, and view summarized gene annotation information for cancer related genes in human.
iHOP (Information Hyperlinked over Proteins) allows researchers to explore a network of gene and protein interactions based on published scientific literature. For each gene search, iHOP reports sentences from abstracts associating it with other genes, links out to full abstracts, and reports experimental evidence for the interactions, if available. You can also select sentences to create and visualize your own gene model.
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) Finder detects this family of direct repeats found in the DNA of many bacteria and archaea.
REPK (Restriction Endonuclease Picker) assists in the choice of restriction endonucleases for terminal restriction fragment length polymorphism (T-RFLP) by finding sets of four restriction endonucleases which together uniquely differentiate user-designated sequence groups.
GeneHub-GEPIS is a tool for inferring human and mouse gene expression patterns based on normalized EST abundance in various normal and cancerous tissues.
The NCBI Influenza Virus Sequence Annotation Tool is a web application for user-provided sequences. It can predict protein sequences encoded by an input flu sequence and produce a feature table that can be used for sequence submission to GenBank.
WhETS (Wheat Estimated Transcript Server) is a resource that combines Triticeae ESTs/mRNAs with rice genes to find the best estimate of hexaploid wheat transcript sequences for a target gene, supplemented with information on tissue distribution and likely gene structure, to aid in primer design.
MYBS (Mining Yeast Binding Sites) is a tool for mapping Transcription factors (TFs) and their binding sites (TFBSs) that integrates both experimentally verified (ChIP-chip data) and predicted position weight matrixes (PWMs) from several databases.
RF_DYMHC (Random Forest-Based System for Detecting Yeast Meiotic Recombination Hotspots and Coldspots) predicts recombination hot/cold spots in the yeast genome.
LIPID Metabolites And Pathways Strategy (LIPID MAPS) is a consortium that presents a collection of tools for performing tasks such as drawing lipid structures and predicting possible structures from mass spectrometry data.
MetaPath Online gives users access to an algorithm for network expansion. Starting with given metabolites, called seed compounds, the algorithm uses known information about metabolic reactions and pathways to expand the network and define scope. This systematic investigation of structure-function relationships of metabolic networks can offer evolutionary insights.
Frog (Free Online Drug Conformation) is a service aimed at generating 3D conformations for drug-like compounds starting from their 1D or 2D descriptions (smiles or sdf).
Open HUSAR (Heidelberg Unix Sequence Analysis Resources) contains suite of bioinformatics pipelines including tasks oriented to protein annotation by homology (ProtSweep), by domain analysis (DomainSweep), and by secondary structure elements (2Dsweep).
Pcons.net is a protein structure prediction meta server that provides automated tools for
protein structure prediction and analysis using a consensus approach.
Server that allows users to convert PDB files into PQR files by adding missing atoms, optimizing hydrogen bonding and assigning atomic charge and radius parameters. The resulting PQR file can be used for electrostatic calculations which can give insight into the influence of electrostatics on biomolecular structures.
PFP (Patch Finder Plus) is a tool for extracting and displaying positive electrostatic patches on protein surfaces which can be indicative of nucleic acid binding interfaces.
PI2PE (Protein Interface/Interior Prediction Engine) consists of three predictors (WESA, cons-PPISP, and DISPLAR) for predicting solvent accessibility and interface residues from the structure of a protein-binding or DNA-binding protein.
Protein Interactions Calculator (PIC) is a server which recognizes various kinds of interactions; such as disulphide bonds, hydrophobic interactions, hydrogen bonds, aromatic interactions, and ionic interactions within a protein or between proteins in a complex.
ProMateus extends ProMate, a protein binding site predictor, and allows the user to suggest new features that might improve the prediction of the binding site locations.
StSNP (Structure SNP) is a server which provides the ability to analyze and compare human nonsynonymous SNPs (nsSNP) in protein structures, protein complexes, protein-protein interfaces and metabolic networks.
eF-seek predicts protein functional sites by searching for similar electrostatic potential and molecular surface shapes against eF-site, a database of electrostatic surfaces for representative ligand binding sites.
DBAli is a database that includes a comprehensive all-against-all comparison of protein structures in the PDB database. Tools associated with the database allow users to mine this protein structure space by establishing relationships between protein structures and their functions.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
QSCOP-BLAST is a service that uses BLAST to search the QSCOP (Quantitative SCOP) server, an extension of classic SCOP (Structural Classification of Proteins).
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
eF-seek predicts protein functional sites by searching for similar electrostatic potential and molecular surface shapes against eF-site, a database of electrostatic surfaces for representative ligand binding sites.
fastSCOP identifies structural domains and determines evolutionary superfamilies of a query protein structure using 3D-BLAST to scan SCOP and MAMMOTH to create a structural alignment for refining domain boundaries.
Firestar predicts functionally important residues, such as catalytic and ligand binding sites, using structural templates and alignment reliability. Prediction coverage has increased with the extension of the FireDB and ligands are now classified for biological relevance.
The Berkeley Phylogenomics Group provides a series of
web servers for phylogenomic analysis: classification of sequences to pre-computed families and subfamilies using the PhyloFacts Phylogenomic Encyclopedia, FlowerPower clustering of proteins sharing the same domain architecture, MUSCLE multiple sequence alignment, SATCHMO simultaneous alignment and tree construction, and SCI-PHY subfamily identification.
M4T (Multiple Mapping Method with Multiple Templates) is a comparative protein structure modeling server that uses a combination of multiple templates and iterative optimization of alternative alignments.
Pcons.net is a protein structure prediction meta server that provides automated tools for
protein structure prediction and analysis using a consensus approach.
ProSA-web (Protein Structure Analysis) is an extension of the classic ProSA program used for the refinement and validation of experimental protein structures and in structure prediction and modeling.
MolProbity is a structure validation web service for diagnosing problems in 3D models of proteins, nucleic acids or complexes. It adds and optimizes H atoms (correcting 180° flipped Asn/Gln/His sidechains), and then calculates global and local validation for all-atom contacts (steric clashes, H-bonds & vdW), covalent geometry, and conformation (Ramachandran & rotamers for protein, ribose puckers & suite conformers for RNA). Results are displayed online as 3D graphics and sortable charts.
QSCOP-BLAST is a service that uses BLAST to search the QSCOP (Quantitative SCOP) server, an extension of classic SCOP (Structural Classification of Proteins).
DBAli is a database that includes a comprehensive all-against-all comparison of protein structures in the PDB database. Tools associated with the database allow users to mine this protein structure space by establishing relationships between protein structures and their functions.
GeneTrail analyzes gene sets for statistically significant accumulations of genes belonging to functional categories. Two common statistical approaches are implemented including: Over-Representation Analysis (ORA) for comparing a reference set of genes to a test set; and, Gene Set Enrichment Analysis (GSEA) for scoring sorted lists of genes.
Sequence Harmony detects subfamily specific functional sites from a multiple sequence alignment by scoring compositional differences between given subfamily groupings.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
Server that allows users to convert PDB files into PQR files by adding missing atoms, optimizing hydrogen bonding and assigning atomic charge and radius parameters. The resulting PQR file can be used for electrostatic calculations which can give insight into the influence of electrostatics on biomolecular structures.
PI2PE (Protein Interface/Interior Prediction Engine) consists of three predictors (WESA, cons-PPISP, and DISPLAR) for predicting solvent accessibility and interface residues from the structure of a protein-binding or DNA-binding protein.
ProSight PTM allows identification and characterization of intact proteins and their post- translational modifications (PTMs) using data from the \'Top-Down\' tandem mass spectrometry (MS/MS) approach.
The Berkeley Phylogenomics Group provides a series of
web servers for phylogenomic analysis: classification of sequences to pre-computed families and subfamilies using the PhyloFacts Phylogenomic Encyclopedia, FlowerPower clustering of proteins sharing the same domain architecture, MUSCLE multiple sequence alignment, SATCHMO simultaneous alignment and tree construction, and SCI-PHY subfamily identification.
DBAli is a database that includes a comprehensive all-against-all comparison of protein structures in the PDB database. Tools associated with the database allow users to mine this protein structure space by establishing relationships between protein structures and their functions.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
GeneTrail analyzes gene sets for statistically significant accumulations of genes belonging to functional categories. Two common statistical approaches are implemented including: Over-Representation Analysis (ORA) for comparing a reference set of genes to a test set; and, Gene Set Enrichment Analysis (GSEA) for scoring sorted lists of genes.
Gibbs Motif Sampler allows you to identify motifs, conserved regions, in DNA or protein sequences. This tool can be applied for the detection of transcription factor binding sites (TFBS).
MelinaII (Motif Elucidator in Nucleotide Sequence Assembly) can run multiple motif prediction tools (Consensus, MEME, Gibbs Sampler, MDScan, Weeder) simultaneously. Graphical results can be used to compare predictions of potential DNA motifs (such as transcription factor binding sites, TFBS) in promoter regions.
The MyHits server integrates several tools with a focus on protein annotation and the analysis of protein domains. Guest users have access to tools such as ClustalW and T-Coffee and databases like Swiss-Prot, Prosite and Interpro. Registration allows users to store their results in private databases, and is free for academic users.
Open HUSAR (Heidelberg Unix Sequence Analysis Resources) contains suite of bioinformatics pipelines including tasks oriented to protein annotation by homology (ProtSweep), by domain analysis (DomainSweep), and by secondary structure elements (2Dsweep).
Pcons.net is a protein structure prediction meta server that provides automated tools for
protein structure prediction and analysis using a consensus approach.
QSCOP-BLAST is a service that uses BLAST to search the QSCOP (Quantitative SCOP) server, an extension of classic SCOP (Structural Classification of Proteins).
SLiMDisc (Short Linear Motif Discovery) is a tool for finding shared motifs in proteins with a common attribute such as sub-cellular location or a common interaction partner.
WebMOTIFS is a tool for motif discovery, scoring, analysis, and visualization. It allows you to use different programs (AlignACE, MDscan, MEME, and Weeder) to search for DNA-sequence motifs, and evaluate the results.
fastSCOP identifies structural domains and determines evolutionary superfamilies of a query protein structure using 3D-BLAST to scan SCOP and MAMMOTH to create a structural alignment for refining domain boundaries.
Firestar predicts functionally important residues, such as catalytic and ligand binding sites, using structural templates and alignment reliability. Prediction coverage has increased with the extension of the FireDB and ligands are now classified for biological relevance.
g:Profiler is a set of tools for functional annotation of gene lists that includes: g:GOSt, which retrieves the most significant Gene Ontology (GO) terms, KEGG and REACTOME pathways, and TRANSFAC motifs; g:Convert, for conversion between gene or protein names/IDs; g:Orth, for retrieving orthologs; and, g:Sorter, which searches for similar expression profiles. Functional analysis of SNPs and other DNA polymorphisms is also supported, as is enrichment analysis.
CARGO (Cancer And Related Genes Online) is a portal that uses widgets to allow users to mine literature using iHOP, retrieve disease information from OMIM, visualize 3D SNPs, query protein interactions, and view summarized gene annotation information for cancer related genes in human.
The Database for Annotation, Visualization and Integrated Discovery (DAVID) provides a comprehensive set of functional annotation tools for investigators to understand biological meaning behind large list of genes.
DEEP (Differential Expression Effector Prediction) is a tool that can identify effectors of specific expression profiles by combining gene expression data with biological expert knowledge about biomolecular interaction networks (provided by resources like TRANSPATH, for example).
FatiGOplus is a tool for the functional profiling of genome-scale experiments oriented to the interpretation of microarray experiments. As part of the Babelomics suite, FatiGOplus finds differential distributions of biological terms (GO, KEGG pathways, Interpro motifs, Transfac motifs, CisRed motifs, etc.) between two groups of genes.
GeneTrail analyzes gene sets for statistically significant accumulations of genes belonging to functional categories. Two common statistical approaches are implemented including: Over-Representation Analysis (ORA) for comparing a reference set of genes to a test set; and, Gene Set Enrichment Analysis (GSEA) for scoring sorted lists of genes.
KAAS (KEGG Automatic Annotation Server) is a tool for ortholog assignment and pathway mapping. KAAS provides functional annotation of genes by BLAST comparisons against the manually curated KEGG GENES database. The result contains KO (KEGG Orthology) assignments and automatically generated KEGG pathways.
MetaPath Online gives users access to an algorithm for network expansion. Starting with given metabolites, called seed compounds, the algorithm uses known information about metabolic reactions and pathways to expand the network and define scope. This systematic investigation of structure-function relationships of metabolic networks can offer evolutionary insights.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
PFP (Patch Finder Plus) is a tool for extracting and displaying positive electrostatic patches on protein surfaces which can be indicative of nucleic acid binding interfaces.
PI2PE (Protein Interface/Interior Prediction Engine) consists of three predictors (WESA, cons-PPISP, and DISPLAR) for predicting solvent accessibility and interface residues from the structure of a protein-binding or DNA-binding protein.
Protein Interactions Calculator (PIC) is a server which recognizes various kinds of interactions; such as disulphide bonds, hydrophobic interactions, hydrogen bonds, aromatic interactions, and ionic interactions within a protein or between proteins in a complex.
PathExpress is a tool developed to interpret gene expression data obtained from microarray experiments by identifying and visualizing the most relevant metabolic pathways associated with a subset of genes (e.g. differentially expressed genes). Enzyme Neighborhood (a sub-network of linked enzymes with a limited path length) allows identification of the most relevant sub-network affected in a gene expression experiment.
ProMateus extends ProMate, a protein binding site predictor, and allows the user to suggest new features that might improve the prediction of the binding site locations.
SLiMDisc (Short Linear Motif Discovery) is a tool for finding shared motifs in proteins with a common attribute such as sub-cellular location or a common interaction partner.
StSNP (Structure SNP) is a server which provides the ability to analyze and compare human nonsynonymous SNPs (nsSNP) in protein structures, protein complexes, protein-protein interfaces and metabolic networks.
VisANT is an integrative visual analysis tool for biological networks and pathways that contains modules for querying and integrating KEGG pathways with expression data. VisANT is java-based, and can be run as a java applet, as a java web application, or downloaded and run locally. New functions in VisANT 3.5 allow determination of over-representation and expression enrichment.
Firestar predicts functionally important residues, such as catalytic and ligand binding sites, using structural templates and alignment reliability. Prediction coverage has increased with the extension of the FireDB and ligands are now classified for biological relevance.
g:Profiler is a set of tools for functional annotation of gene lists that includes: g:GOSt, which retrieves the most significant Gene Ontology (GO) terms, KEGG and REACTOME pathways, and TRANSFAC motifs; g:Convert, for conversion between gene or protein names/IDs; g:Orth, for retrieving orthologs; and, g:Sorter, which searches for similar expression profiles. Functional analysis of SNPs and other DNA polymorphisms is also supported, as is enrichment analysis.
iHOP (Information Hyperlinked over Proteins) allows researchers to explore a network of gene and protein interactions based on published scientific literature. For each gene search, iHOP reports sentences from abstracts associating it with other genes, links out to full abstracts, and reports experimental evidence for the interactions, if available. You can also select sentences to create and visualize your own gene model.
GeneTrail analyzes gene sets for statistically significant accumulations of genes belonging to functional categories. Two common statistical approaches are implemented including: Over-Representation Analysis (ORA) for comparing a reference set of genes to a test set; and, Gene Set Enrichment Analysis (GSEA) for scoring sorted lists of genes.
Open HUSAR (Heidelberg Unix Sequence Analysis Resources) contains suite of bioinformatics pipelines including tasks oriented to protein annotation by homology (ProtSweep), by domain analysis (DomainSweep), and by secondary structure elements (2Dsweep).
SLiMDisc (Short Linear Motif Discovery) is a tool for finding shared motifs in proteins with a common attribute such as sub-cellular location or a common interaction partner.
BLASTO (BLAST on Orthologous groups) is a modified BLAST tool for searching orthologous group data. It treats each orthologous group as a unit and outputs a ranked list of orthologous groups instead of single sequences.
The Berkeley Phylogenomics Group provides a series of
web servers for phylogenomic analysis: classification of sequences to pre-computed families and subfamilies using the PhyloFacts Phylogenomic Encyclopedia, FlowerPower clustering of proteins sharing the same domain architecture, MUSCLE multiple sequence alignment, SATCHMO simultaneous alignment and tree construction, and SCI-PHY subfamily identification.
FGF (Fishing Gene Family) is a resource for finding gene families given a set of protein sequences by investigating phylogenetic trees, duplication fate, and selective pressure (ka/ks calculations).
MultiPhyl is a high-throughput Maximum Likelihood based phylogeny analysis program that allows researchers to create a virtual phylogenetic supercomputer from a group semi-idle desktop machines.
Phylemon is a server that integrates a suite of tools for multiple sequence alignment, phylogeny, and evolutionary tests from the most popular stand-alone phylogenetic and evolutionary analysis programs. Phylemon 2.0 has an integrated environment to allow for evolutionary analyses, format conversion, file storage and editing capacity, and has saved phylogenetic pipeline function.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
ProSight PTM allows identification and characterization of intact proteins and their post- translational modifications (PTMs) using data from the \'Top-Down\' tandem mass spectrometry (MS/MS) approach.
CanPredict uses a combination of computational methods to predict whether specific sequence changes in a protein are likely to be cancer-associated mutations.
WebMOTIFS is a tool for motif discovery, scoring, analysis, and visualization. It allows you to use different programs (AlignACE, MDscan, MEME, and Weeder) to search for DNA-sequence motifs, and evaluate the results.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
taveRNA hosts three RNA web services: alteRNA, inteRNA and pRuNA. alteRNA is an alternative to many RNA folding methods. inteRNA predicts the joint secondary structure of two RNA sequences. pRuNA is an database pruning method which given a query RNA returns only a few ncRNAs as potential regulators.
RADAR (RNA Data Analysis and Research) provides multiple tools for RNA structure analysis including: pairwise structure alignment, multiple structure alignment, constrained structure alignment, database searching, and prediction of the consensus structure for a set of RNA sequences.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
MolProbity is a structure validation web service for diagnosing problems in 3D models of proteins, nucleic acids or complexes. It adds and optimizes H atoms (correcting 180° flipped Asn/Gln/His sidechains), and then calculates global and local validation for all-atom contacts (steric clashes, H-bonds & vdW), covalent geometry, and conformation (Ramachandran & rotamers for protein, ribose puckers & suite conformers for RNA). Results are displayed online as 3D graphics and sortable charts.
RADAR (RNA Data Analysis and Research) provides multiple tools for RNA structure analysis including: pairwise structure alignment, multiple structure alignment, constrained structure alignment, database searching, and prediction of the consensus structure for a set of RNA sequences.
RNA Movies is a tool for the visualization of RNA secondary structure spaces. The program creates an interpolated animation of user provided sets of secondary structure data.
pknotsRG is a server for single sequence RNA secondary structure prediction including pseudoknots; source code, binary executable, and web service available.
taveRNA hosts three RNA web services: alteRNA, inteRNA and pRuNA. alteRNA is an alternative to many RNA folding methods. inteRNA predicts the joint secondary structure of two RNA sequences. pRuNA is an database pruning method which given a query RNA returns only a few ncRNAs as potential regulators.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
The Berkeley Phylogenomics Group provides a series of
web servers for phylogenomic analysis: classification of sequences to pre-computed families and subfamilies using the PhyloFacts Phylogenomic Encyclopedia, FlowerPower clustering of proteins sharing the same domain architecture, MUSCLE multiple sequence alignment, SATCHMO simultaneous alignment and tree construction, and SCI-PHY subfamily identification.
COMPASS (Comparison of Multiple Protein Alignments with Assessment of Statistical Significance) is a tool for detecting remote levels of sequence similarity using profile-based comparison of multiple sequence alignments. Statistics and speed have been improved, and visualization tools introduced.
FGF (Fishing Gene Family) is a resource for finding gene families given a set of protein sequences by investigating phylogenetic trees, duplication fate, and selective pressure (ka/ks calculations).
MultiPhyl is a high-throughput Maximum Likelihood based phylogeny analysis program that allows researchers to create a virtual phylogenetic supercomputer from a group semi-idle desktop machines.
Sequence Harmony detects subfamily specific functional sites from a multiple sequence alignment by scoring compositional differences between given subfamily groupings.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
VISTA is a comprehensive suite of programs and databases for comparative analysis of genomic sequences. There are two ways of using VISTA - you can submit your own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species.
eProbalign computes maximal expected accuracy multiple sequence alignments from partition function posterior probabilities. eProbalign also provides a platform
to visualize the alignment, generate images, and
manipulate the output.
The Berkeley Phylogenomics Group provides a series of
web servers for phylogenomic analysis: classification of sequences to pre-computed families and subfamilies using the PhyloFacts Phylogenomic Encyclopedia, FlowerPower clustering of proteins sharing the same domain architecture, MUSCLE multiple sequence alignment, SATCHMO simultaneous alignment and tree construction, and SCI-PHY subfamily identification.
The Database for Annotation, Visualization and Integrated Discovery (DAVID) provides a comprehensive set of functional annotation tools for investigators to understand biological meaning behind large list of genes.
KAAS (KEGG Automatic Annotation Server) is a tool for ortholog assignment and pathway mapping. KAAS provides functional annotation of genes by BLAST comparisons against the manually curated KEGG GENES database. The result contains KO (KEGG Orthology) assignments and automatically generated KEGG pathways.
VISTA is a comprehensive suite of programs and databases for comparative analysis of genomic sequences. There are two ways of using VISTA - you can submit your own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species.
BLASTO (BLAST on Orthologous groups) is a modified BLAST tool for searching orthologous group data. It treats each orthologous group as a unit and outputs a ranked list of orthologous groups instead of single sequences.
The Berkeley Phylogenomics Group provides a series of
web servers for phylogenomic analysis: classification of sequences to pre-computed families and subfamilies using the PhyloFacts Phylogenomic Encyclopedia, FlowerPower clustering of proteins sharing the same domain architecture, MUSCLE multiple sequence alignment, SATCHMO simultaneous alignment and tree construction, and SCI-PHY subfamily identification.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
FGF (Fishing Gene Family) is a resource for finding gene families given a set of protein sequences by investigating phylogenetic trees, duplication fate, and selective pressure (ka/ks calculations).
The MyHits server integrates several tools with a focus on protein annotation and the analysis of protein domains. Guest users have access to tools such as ClustalW and T-Coffee and databases like Swiss-Prot, Prosite and Interpro. Registration allows users to store their results in private databases, and is free for academic users.
PROMALS (Profile Multiple Alignment with Local Structure) is a progressive mutliple sequence alignment method that improves alignment quality by using additional homologs from PSI-BLAST searches
and secondary structure predictions from PSIPRED.
PROMALS3D constructs alignments for multiple protein sequences and/or structures using information from sequence database searches, secondary structure prediction, available homologs with 3D structures and user-defined constraints.
Phylemon is a server that integrates a suite of tools for multiple sequence alignment, phylogeny, and evolutionary tests from the most popular stand-alone phylogenetic and evolutionary analysis programs. Phylemon 2.0 has an integrated environment to allow for evolutionary analyses, format conversion, file storage and editing capacity, and has saved phylogenetic pipeline function.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
VISTA is a comprehensive suite of programs and databases for comparative analysis of genomic sequences. There are two ways of using VISTA - you can submit your own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species.
eProbalign computes maximal expected accuracy multiple sequence alignments from partition function posterior probabilities. eProbalign also provides a platform
to visualize the alignment, generate images, and
manipulate the output.
Firestar predicts functionally important residues, such as catalytic and ligand binding sites, using structural templates and alignment reliability. Prediction coverage has increased with the extension of the FireDB and ligands are now classified for biological relevance.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
BLASTO (BLAST on Orthologous groups) is a modified BLAST tool for searching orthologous group data. It treats each orthologous group as a unit and outputs a ranked list of orthologous groups instead of single sequences.
COMPASS (Comparison of Multiple Protein Alignments with Assessment of Statistical Significance) is a tool for detecting remote levels of sequence similarity using profile-based comparison of multiple sequence alignments. Statistics and speed have been improved, and visualization tools introduced.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
FGF (Fishing Gene Family) is a resource for finding gene families given a set of protein sequences by investigating phylogenetic trees, duplication fate, and selective pressure (ka/ks calculations).
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
Open HUSAR (Heidelberg Unix Sequence Analysis Resources) contains suite of bioinformatics pipelines including tasks oriented to protein annotation by homology (ProtSweep), by domain analysis (DomainSweep), and by secondary structure elements (2Dsweep).
Pcons.net is a protein structure prediction meta server that provides automated tools for
protein structure prediction and analysis using a consensus approach.
QSCOP-BLAST is a service that uses BLAST to search the QSCOP (Quantitative SCOP) server, an extension of classic SCOP (Structural Classification of Proteins).
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
WhETS (Wheat Estimated Transcript Server) is a resource that combines Triticeae ESTs/mRNAs with rice genes to find the best estimate of hexaploid wheat transcript sequences for a target gene, supplemented with information on tissue distribution and likely gene structure, to aid in primer design.
fastSCOP identifies structural domains and determines evolutionary superfamilies of a query protein structure using 3D-BLAST to scan SCOP and MAMMOTH to create a structural alignment for refining domain boundaries.
Firestar predicts functionally important residues, such as catalytic and ligand binding sites, using structural templates and alignment reliability. Prediction coverage has increased with the extension of the FireDB and ligands are now classified for biological relevance.