EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
BASys (Bacterial Annotation System) is a tool for automated annotation of bacterial genomic (chromosomal and plasmid) sequences including gene/protein names, GO functions, COG functions, possible paralogues and orthologues, molecular weights, isoelectric points, operon structures, subcellular localization, signal peptides, transmembrane regions, secondary structures, 3-D structures, reactions, and pathways.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
Bioverse is a system that uses computational techniques to facilitate exploring the relationships between molecular, genomic, proteomic, systems and organismal information.
The FeatureExtract server extracts sequence and feature annotations, such as intron/exon structure, from GenBank entries and other GenBank format files.
InterProScan allows you to query using different protein signature recognition methods to look up InterPro annotations for your sequence. These annotations results often include gene ontology terms that you can associate with your sequence.
MutDB is a database that associates protein structural information with mutations and polymorphisms in gene sequences. The data is derived from dbSNP and Swiss-Prot, and can be browsed by gene name or searched by keyword or by various identifiers.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
WebGestalt (WEB-based GEne SeT AnaLysis Toolkit) is a system facilitating the analysis of sets of genes. Gene sets can be compared using set operations (intersection, union, etc.), different annotations can be selected and retrieved for the set, and sets can be visualized and organized by a user-selected method (Gene Ontology, chromosomal distribution, etc.). WebGestalt can also perform a statistical analysis to suggest areas of interest with respect to the set of genes selected.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
The GeneMark family of programs employ Markov models and are specifically tuned for gene prediction for sequences from prokaryotes, viral genomes and eukaryotes.
Projector 2 allows users to map completed portions of the genome sequence of an organism onto the finished (or unfinished) genome of a closely-related species or strain. Using the related genome sequence as a template can facilitate sequence assembly and the sequencing of the remaining gaps.
MLST (Multi Locus Sequence Typing) is a nucleotide sequence based approach for the unambiguous characterisation of isolates of bacteria and other organisms using the sequences of internal fragments of seven house-keeping genes.
The Phylogenetic Web Repeater (POWER) allows users to perform phylogenetic analysis using the PHYLIP package. The POWER pipeline can start with processing either multiple sequence alignments (MSA) or can proceed directly with aligned sequences.
ARTS (Advanced Retrieval Tool for SNPs) can be used to retrieve SNP that are polymorphic between several different mouse strains to aid in the design of genome-wide SNP marker panels.
SNPServer combines BLAST, cap3 and a SNP discovery module into a single pipeline for the discovery of SNPs in user submitted files or dynamically created assemblies.
The Gene Expression Pattern Analysis Suite (GEPAS) is a collection of tools for the analysis of microarray data. GEPAS includes tools for data pre-processing, clustering, differential gene expression, predictors, array CGH and functional annotation. A new pipeline module allows for automation of sequential analysis steps.
ISTECH SNPAnalyzer is a tool for the statistical analysis of SNP data that includes Hardy Weinberg equilibrium (HWE), haplotype estimation, linkage disequilibrium (LD), and QTL analyses. Registration required; some browser requirements and set-up necessary.
MutDB is a database that associates protein structural information with mutations and polymorphisms in gene sequences. The data is derived from dbSNP and Swiss-Prot, and can be browsed by gene name or searched by keyword or by various identifiers.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
The FeatureExtract server extracts sequence and feature annotations, such as intron/exon structure, from GenBank entries and other GenBank format files.
MRS is a biological data retrieval system that can be accessed over the web, or installed and used locally. MRS indexes several flat-file data sets for searching, including EMBL nucleotide, UniProt, PDB and KEGG. Searches can be performed globally, or on one or more flat file fields per data set.
ARGO is a tool for the detection and visualization of sets of region-specific degenerate oligonucleotide motifs in the regulatory regions of eukaryotic genes.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
CONREAL (Conserved Regulatory Elements Anchored Alignment) allows identification of transcription factor binding sites (TFBS) that are conserved between two orthologous promoter sequences.
CisMols (Cis-regulatory Modules) is a tool that identifies compositionally predicted cis-clusters that occur in groups of co-regulated genes within each of their ortholog-pair evolutionarily conserved cis-regulatory regions.
Footer is a tool for identifying highly-probable binding sites of known transcription factors using phylogenetic footprinting principles to analyse two homologous DNA sequences.
JCAT (Java Codon Adaptation Tool) is a tool that can adjust the codon usage of an input sequence to the selected organism. Useful for improving the expression of foreign genes in hosts with different codon usage.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
PatMatch is a pattern matching tool that allows you to search for short (<20 residues) nucleotide or peptide sequences and can accomodate ambiguous/degenerate patterns.
PromoterPlot takes the output from a TransFac search as input, and finds similarities between groups of promoters in an attempt to simplify the results of transcription factor searches. FASTA/Affymetrix IDs can also be used as input for a local installation of the tool.
T-Reg Comparator is a tool for the analysis of transcriptional regulation that allows you to compare a set of position weight matrices (PWM) against the T-Reg database (a collection of PWMs built from Transfac and Jaspar).
TOUCAN 2 is a regulatory sequence analysis workbench for Metazoan sequences, directly linked with the Ensembl database and implementing SOAP clients for diverse alignment and motif detection algorithms.
WordSpy allows the user to search for over-represented words in a set of sequences and to search for discriminative words using negative sequence data. WordSpy employs this functionality as a means to search for transcription factor binding motifs.
ASePCR (Alternative Splicing electronic PCR) is a tool for carrying out e-PCR to detect differences in amplicon sizes in transcripts from different tissues and organs.
E-RNAi is a tool for designing and evaluating dsRNA constructs suitable for RNAi experiments in 12 organisms including Drosophila, C. elegans, human and other emerging model organisms. E-RNAi also facilitates the design of secondary RNAi reagents for validation experiments, evaluation of pooled siRNA reagents and batch design.
OligoWiz 2.0 is a client for microarray probe design that allows for the integration of sequence annotations, probe quality parameters, and the placement of multiple probes per transcript.
PriFi is a tool for designing and evaluating primer pairs based on the input of a DNA sequence alignment; useful for the PCR amplification of homologs.
Stitchprofiles.uio.no is a server that performs web-based computations on DNA melting. In addition to creating stitch profile diagrams representing the alternative conformations that partly melted DNA can adopt, the server can also plot the classical melting curves, probability profiles, and temperature profiles.
dsCheck takes a nucleotide sequence as input and estimates off-target effects caused by dsRNA (double-stranded RNA) employed in RNAi studies. dsCheck can be used either to verify previously designed dsRNA sequences, or to design off-target minimized dsRNAs.
The Bioinformatics Links Directory features curated links to molecular resources, tools and databases. All of the resources are free or available for a nominal fee.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
ARGO is a tool for the detection and visualization of sets of region-specific degenerate oligonucleotide motifs in the regulatory regions of eukaryotic genes.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
CONREAL (Conserved Regulatory Elements Anchored Alignment) allows identification of transcription factor binding sites (TFBS) that are conserved between two orthologous promoter sequences.
CisMols (Cis-regulatory Modules) is a tool that identifies compositionally predicted cis-clusters that occur in groups of co-regulated genes within each of their ortholog-pair evolutionarily conserved cis-regulatory regions.
E-RNAi is a tool for designing and evaluating dsRNA constructs suitable for RNAi experiments in 12 organisms including Drosophila, C. elegans, human and other emerging model organisms. E-RNAi also facilitates the design of secondary RNAi reagents for validation experiments, evaluation of pooled siRNA reagents and batch design.
Footer is a tool for identifying highly-probable binding sites of known transcription factors using phylogenetic footprinting principles to analyse two homologous DNA sequences.
MicroInspector is a tool that detects miRNA (microRNA) binding sites in your input sequence by searching against databases of known miRNA binding sites.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
PatMatch is a pattern matching tool that allows you to search for short (<20 residues) nucleotide or peptide sequences and can accomodate ambiguous/degenerate patterns.
PromoterPlot takes the output from a TransFac search as input, and finds similarities between groups of promoters in an attempt to simplify the results of transcription factor searches. FASTA/Affymetrix IDs can also be used as input for a local installation of the tool.
T-Reg Comparator is a tool for the analysis of transcriptional regulation that allows you to compare a set of position weight matrices (PWM) against the T-Reg database (a collection of PWMs built from Transfac and Jaspar).
TOUCAN 2 is a regulatory sequence analysis workbench for Metazoan sequences, directly linked with the Ensembl database and implementing SOAP clients for diverse alignment and motif detection algorithms.
WordSpy allows the user to search for over-represented words in a set of sequences and to search for discriminative words using negative sequence data. WordSpy employs this functionality as a means to search for transcription factor binding motifs.
dsCheck takes a nucleotide sequence as input and estimates off-target effects caused by dsRNA (double-stranded RNA) employed in RNAi studies. dsCheck can be used either to verify previously designed dsRNA sequences, or to design off-target minimized dsRNAs.
miRU is a tool that takes as an input a small miRNA sequence and then searches for complementary matches in TIGR plant-specific gene data sets to predict potential target genes.
VisANT is an integrative visual analysis tool for biological networks and pathways that contains modules for querying and integrating KEGG pathways with expression data. VisANT is java-based, and can be run as a java applet, as a java web application, or downloaded and run locally. New functions in VisANT 3.5 allow determination of over-representation and expression enrichment.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
VisANT is an integrative visual analysis tool for biological networks and pathways that contains modules for querying and integrating KEGG pathways with expression data. VisANT is java-based, and can be run as a java applet, as a java web application, or downloaded and run locally. New functions in VisANT 3.5 allow determination of over-representation and expression enrichment.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
VisANT is an integrative visual analysis tool for biological networks and pathways that contains modules for querying and integrating KEGG pathways with expression data. VisANT is java-based, and can be run as a java applet, as a java web application, or downloaded and run locally. New functions in VisANT 3.5 allow determination of over-representation and expression enrichment.
ASePCR (Alternative Splicing electronic PCR) is a tool for carrying out e-PCR to detect differences in amplicon sizes in transcripts from different tissues and organs.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
ArrayXPath is a web-based service for matching microarray gene-expression profiles with known biological pathways. Input is a clustered gene-expression profile in a tab-delimited text format. Output includes pathway diagrams.
The Bioinformatics Resource for Oral Pathogens (BROP) contains tools for genomics of oral pathogens including Genome Viewer, GOAL (genome wide ORF alignment), an oral pathogen microarray database, an entrez counter, oral pathogen specific BLAST, and a codon usage database.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
Expression Profiler is a web based platform for microarray data analysis developed at the EBI. This resource is integrated with the ArrayExpress database, a public repository for microarray data.
GEMS (Gene Expression Mining Server) is a tool for biclustering microarray data. It is available as a web tool and as a standalone command-line program.
The Gene Expression Pattern Analysis Suite (GEPAS) is a collection of tools for the analysis of microarray data. GEPAS includes tools for data pre-processing, clustering, differential gene expression, predictors, array CGH and functional annotation. A new pipeline module allows for automation of sequential analysis steps.
Genome Functional INtegrated Discoverer (GFINDer) takes a list of gene/clone IDs with classification information as input, and allows the user to characterize the different gene classes in the list using annotations of various types from several different sources.
OligoWiz 2.0 is a client for microarray probe design that allows for the integration of sequence annotations, probe quality parameters, and the placement of multiple probes per transcript.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
Pathway Explorer is a tool for visualizing high throughput expression data simultaneously with biological pathway data available from KEGG, BioCarta, and GenMAPP.
RACE (Remote Analysis Computation for gene Expression data) is a collection of web tools designed to assist with the analysis of DNA microarray data and results.
T-profiler is a tool for the analysis of gene expression data from yeast that uses the t-test to score changes in the average activity of pre-defined groups of genes.
VisANT is an integrative visual analysis tool for biological networks and pathways that contains modules for querying and integrating KEGG pathways with expression data. VisANT is java-based, and can be run as a java applet, as a java web application, or downloaded and run locally. New functions in VisANT 3.5 allow determination of over-representation and expression enrichment.
WebGestalt (WEB-based GEne SeT AnaLysis Toolkit) is a system facilitating the analysis of sets of genes. Gene sets can be compared using set operations (intersection, union, etc.), different annotations can be selected and retrieved for the set, and sets can be visualized and organized by a user-selected method (Gene Ontology, chromosomal distribution, etc.). WebGestalt can also perform a statistical analysis to suggest areas of interest with respect to the set of genes selected.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
PatMatch is a pattern matching tool that allows you to search for short (<20 residues) nucleotide or peptide sequences and can accomodate ambiguous/degenerate patterns.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
Bioverse is a system that uses computational techniques to facilitate exploring the relationships between molecular, genomic, proteomic, systems and organismal information.
The CBS DAS protein viewer uses the distributed annotation system (DAS) to integrate and present annotation data from multiple sources for a protein sequence.
The FeatureExtract server extracts sequence and feature annotations, such as intron/exon structure, from GenBank entries and other GenBank format files.
InterProScan allows you to query using different protein signature recognition methods to look up InterPro annotations for your sequence. These annotations results often include gene ontology terms that you can associate with your sequence.
GeneSeeker allows you to generate a list of candidate genes related to a human genetic disorder by searching against localization and expression databases.
PEPVAC is a tool to assist in the development of multi-epitope vaccines. It is optimized for this task by predicting peptides with the ability to bind to several human leukocyte antigens that have similar peptide binding specificity.
The dcode.org website provides access to tools for comparative genomic analyses developed by the Comparative Genomics Center at the Lawerence Livermore National Laboratory. Tools include: zPicture, Mulan, eShadow, rVista, CREME, and the ECR Browser.
SNPServer combines BLAST, cap3 and a SNP discovery module into a single pipeline for the discovery of SNPs in user submitted files or dynamically created assemblies.
ISTECH SNPAnalyzer is a tool for the statistical analysis of SNP data that includes Hardy Weinberg equilibrium (HWE), haplotype estimation, linkage disequilibrium (LD), and QTL analyses. Registration required; some browser requirements and set-up necessary.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
LitMiner is a literature data mining tool that is based on the annotation of key terms in article abstracts followed by statistical co-citation analysis of annotated key terms in order to predict relationships between genes, compounds, diseases and phenotypes, and tissues and organs.
E-RNAi is a tool for designing and evaluating dsRNA constructs suitable for RNAi experiments in 12 organisms including Drosophila, C. elegans, human and other emerging model organisms. E-RNAi also facilitates the design of secondary RNAi reagents for validation experiments, evaluation of pooled siRNA reagents and batch design.
InterProScan allows you to query using different protein signature recognition methods to look up InterPro annotations for your sequence. These annotations results often include gene ontology terms that you can associate with your sequence.
JCAT (Java Codon Adaptation Tool) is a tool that can adjust the codon usage of an input sequence to the selected organism. Useful for improving the expression of foreign genes in hosts with different codon usage.
BASys (Bacterial Annotation System) is a tool for automated annotation of bacterial genomic (chromosomal and plasmid) sequences including gene/protein names, GO functions, COG functions, possible paralogues and orthologues, molecular weights, isoelectric points, operon structures, subcellular localization, signal peptides, transmembrane regions, secondary structures, 3-D structures, reactions, and pathways.
The Bioinformatics Resource for Oral Pathogens (BROP) contains tools for genomics of oral pathogens including Genome Viewer, GOAL (genome wide ORF alignment), an oral pathogen microarray database, an entrez counter, oral pathogen specific BLAST, and a codon usage database.
MLST (Multi Locus Sequence Typing) is a nucleotide sequence based approach for the unambiguous characterisation of isolates of bacteria and other organisms using the sequences of internal fragments of seven house-keeping genes.
Projector 2 allows users to map completed portions of the genome sequence of an organism onto the finished (or unfinished) genome of a closely-related species or strain. Using the related genome sequence as a template can facilitate sequence assembly and the sequencing of the remaining gaps.
ARTS (Advanced Retrieval Tool for SNPs) can be used to retrieve SNP that are polymorphic between several different mouse strains to aid in the design of genome-wide SNP marker panels.
E-RNAi is a tool for designing and evaluating dsRNA constructs suitable for RNAi experiments in 12 organisms including Drosophila, C. elegans, human and other emerging model organisms. E-RNAi also facilitates the design of secondary RNAi reagents for validation experiments, evaluation of pooled siRNA reagents and batch design.
E-RNAi is a tool for designing and evaluating dsRNA constructs suitable for RNAi experiments in 12 organisms including Drosophila, C. elegans, human and other emerging model organisms. E-RNAi also facilitates the design of secondary RNAi reagents for validation experiments, evaluation of pooled siRNA reagents and batch design.
The Sacharomyces Cerevisiae Morphological Database (SCMD) is a collection of micrographs of budding yeast mutants; visualization and data mining tools are provided.
T-profiler is a tool for the analysis of gene expression data from yeast that uses the t-test to score changes in the average activity of pre-defined groups of genes.
SEARCHGTr is a tool for the analysis of glycosyltransferases (GTrs) that allows you to compare a query sequence with the sequences of characterized GTrs.
Bioverse is a system that uses computational techniques to facilitate exploring the relationships between molecular, genomic, proteomic, systems and organismal information.
The CBS DAS protein viewer uses the distributed annotation system (DAS) to integrate and present annotation data from multiple sources for a protein sequence.
Protinfo takes a protein sequence as input and returns the atomic coordinates for a prediction of tertiary structure for that protein. Predictions can be made by comparative or de novo modelling.
SCRATCH is a suite of protein structure software and servers for the prediction of secondary structure,
solvent accessibility, disulphide bridges, stability effects of single amino acid mutations, disordered regions, domains, beta-residue and beta-strand pairings, amino acid contact maps, and tertiary structure.
A suite of tools that includes: PSIPRED, a protein structure prediction server; GenTHREADER, for genomic protein fold recognition; MEMSAT2, for transmembrane protein structure prediction; GTD, the genomic threading database; DISOPRED, a dynamic disorder prediction server; DomPred, a domain prediction server; and COPS, for the comparison of protein structure classifications.
UCL web portal has been updated to include a fully automated 3-D modeling pipeline, BioSerf.
CHC_FIND is a tool to analyze conserved hydrophobic contacts (CHC) from multiple structural alignments. In addition to a multiple sequence alignment, a structural alignment which provides the superposition of structurally conserved regions (SCRs) (i.e., results from SCR_FIND) needs to be provided.
The ConSurf server allows one to map levels of amino acid conservation to known protein structures in order to study areas of potential functional importance on the surface of the protein. A PDB file is required as input, and a multiple sequence alignment is optional. If an alignment is not provided, ConSurf will build one by performing a search for homologous sequences and then aligning them.
ConSurf 2010 combines ConSurf and ConSeq for an easier, more intuitive interface.
FF (Fragment Finder) takes a PDB ID, a structure coordinate file or a list of phi and psi angles as input and searches for matches to a specified structural fragment. Users can tailor the search based on sequence similarity and vary the dataset searched against.
Interface-to-Interface (I2I)-SiteEngine compares pairs of interacting protein binding sites by recognizing similarity of physico-chemical properties and shapes in the protein-protein interfaces.
ICGEBnet Protein Tools is a resource for analysis of 3D protein structures including domain predictors, a protein fold similiarity server (PRIDE), and tools for calculating atom protusion (CX) and/or depth (DPX) indexes in 3D models.
ProFunc takes a PDB-format structure as input and predicts the likely function of the protein based on various sequence and structural analysis methods.
SCRATCH is a suite of protein structure software and servers for the prediction of secondary structure,
solvent accessibility, disulphide bridges, stability effects of single amino acid mutations, disordered regions, domains, beta-residue and beta-strand pairings, amino acid contact maps, and tertiary structure.
A suite of tools that includes: PSIPRED, a protein structure prediction server; GenTHREADER, for genomic protein fold recognition; MEMSAT2, for transmembrane protein structure prediction; GTD, the genomic threading database; DISOPRED, a dynamic disorder prediction server; DomPred, a domain prediction server; and COPS, for the comparison of protein structure classifications.
UCL web portal has been updated to include a fully automated 3-D modeling pipeline, BioSerf.
SiteEngine is a server that searches protein structures for regions that can potentially function as binding sites by scanning for regions on the surface of one protein structure that resemble a specific binding site on the other.
pdbFun allows the user to search the PDB as a database of annotated residues. One can limit the search by annotations such as domains, binding sites, active sites, solvent exposure and residue type.
CHC_FIND is a tool to analyze conserved hydrophobic contacts (CHC) from multiple structural alignments. In addition to a multiple sequence alignment, a structural alignment which provides the superposition of structurally conserved regions (SCRs) (i.e., results from SCR_FIND) needs to be provided.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
Based on the comparison of profile HMMs, HHpred takes a protein sequence or multiple sequence alignment as input and searches for remote homologues in an assortment of databases such as PDB, SMART and Pfam. The user can select either a local or global alignment method, and the search results can be used to generate 3D structural models.
PISCES (Protein Sequence Culling Server) allows the user to weed out sequences from a set in order to obtain a subset of relatively high PDB structure quality and/or mutual sequence identity.
pdbFun allows the user to search the PDB as a database of annotated residues. One can limit the search by annotations such as domains, binding sites, active sites, solvent exposure and residue type.
Bioverse is a system that uses computational techniques to facilitate exploring the relationships between molecular, genomic, proteomic, systems and organismal information.
DIAL (Domain Identification Algorithm) is a web server for the automatic identification of structural domains given the three-dimensional coordinates of a protein.
The Fold & Function Assignment System (FFAS) takes an amino-acid sequence as input and generates a profile which is compared to several sets of sequence profiles including PDB, SCOP, and PFAM. Public metagenomic sequence data has been added to FFAS, so that the profile of a user's protein can be compared to 20 additional profile databases including several complete proteomes, etc.
Based on the comparison of profile HMMs, HHpred takes a protein sequence or multiple sequence alignment as input and searches for remote homologues in an assortment of databases such as PDB, SMART and Pfam. The user can select either a local or global alignment method, and the search results can be used to generate 3D structural models.
ICGEBnet Protein Tools is a resource for analysis of 3D protein structures including domain predictors, a protein fold similiarity server (PRIDE), and tools for calculating atom protusion (CX) and/or depth (DPX) indexes in 3D models.
Protinfo takes a protein sequence as input and returns the atomic coordinates for a prediction of tertiary structure for that protein. Predictions can be made by comparative or de novo modelling.
Ressource Parisienne en Bioinformatique Structurale (RPBS) is a structural bioinformatics resource with several types of specific services including tools for searching sequence (AUTOMAT) and structure (YAKUSA) databases and for homology modelling (WLOOP).
SPACE (Structure Prediction and Analysis based on Complementarity with Environment) is a suite of tools for predicting and analyzing structures of biomolecules and their complexes.
A suite of tools that includes: PSIPRED, a protein structure prediction server; GenTHREADER, for genomic protein fold recognition; MEMSAT2, for transmembrane protein structure prediction; GTD, the genomic threading database; DISOPRED, a dynamic disorder prediction server; DomPred, a domain prediction server; and COPS, for the comparison of protein structure classifications.
UCL web portal has been updated to include a fully automated 3-D modeling pipeline, BioSerf.
The CBS DAS protein viewer uses the distributed annotation system (DAS) to integrate and present annotation data from multiple sources for a protein sequence.
MRS is a biological data retrieval system that can be accessed over the web, or installed and used locally. MRS indexes several flat-file data sets for searching, including EMBL nucleotide, UniProt, PDB and KEGG. Searches can be performed globally, or on one or more flat file fields per data set.
MovieMaker is a web server that accepts PDB files or PDB accession numbers as input and allows short (~10 sec) downloadable movies to be generated of protein motions and interactions.
Ressource Parisienne en Bioinformatique Structurale (RPBS) is a structural bioinformatics resource with several types of specific services including tools for searching sequence (AUTOMAT) and structure (YAKUSA) databases and for homology modelling (WLOOP).
Bioverse is a system that uses computational techniques to facilitate exploring the relationships between molecular, genomic, proteomic, systems and organismal information.
The CBS DAS protein viewer uses the distributed annotation system (DAS) to integrate and present annotation data from multiple sources for a protein sequence.
GRIFFIN (G-protein-Receptor Interacting Feature Finding INstrument) uses a support vector machine and hidden markov model to predict G-protein coupled receptors (GPCRs) and G-protein coupling selectivity.
InterProScan allows you to query using different protein signature recognition methods to look up InterPro annotations for your sequence. These annotations results often include gene ontology terms that you can associate with your sequence.
PEPVAC is a tool to assist in the development of multi-epitope vaccines. It is optimized for this task by predicting peptides with the ability to bind to several human leukocyte antigens that have similar peptide binding specificity.
ProFunc takes a PDB-format structure as input and predicts the likely function of the protein based on various sequence and structural analysis methods.
WebGestalt (WEB-based GEne SeT AnaLysis Toolkit) is a system facilitating the analysis of sets of genes. Gene sets can be compared using set operations (intersection, union, etc.), different annotations can be selected and retrieved for the set, and sets can be visualized and organized by a user-selected method (Gene Ontology, chromosomal distribution, etc.). WebGestalt can also perform a statistical analysis to suggest areas of interest with respect to the set of genes selected.
The CBS DAS protein viewer uses the distributed annotation system (DAS) to integrate and present annotation data from multiple sources for a protein sequence.
DiANNA (DiAminoacid Neural Network Application) a tool that predicts cysteine states of a protein. It can predict whether a particular cysteine is reduced, is forming a disulfide bond, or is bound to a metallic ligand.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
Ressource Parisienne en Bioinformatique Structurale (RPBS) is a structural bioinformatics resource with several types of specific services including tools for searching sequence (AUTOMAT) and structure (YAKUSA) databases and for homology modelling (WLOOP).
WebProAnalyst is a tool for searching for residues whose substitutions are correlated with variations in protein activity. Key physicochemical characteristics of these sites can be calculated.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
The CBS DAS protein viewer uses the distributed annotation system (DAS) to integrate and present annotation data from multiple sources for a protein sequence.
DIAL (Domain Identification Algorithm) is a web server for the automatic identification of structural domains given the three-dimensional coordinates of a protein.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
FF (Fragment Finder) takes a PDB ID, a structure coordinate file or a list of phi and psi angles as input and searches for matches to a specified structural fragment. Users can tailor the search based on sequence similarity and vary the dataset searched against.
The Fold & Function Assignment System (FFAS) takes an amino-acid sequence as input and generates a profile which is compared to several sets of sequence profiles including PDB, SCOP, and PFAM. Public metagenomic sequence data has been added to FFAS, so that the profile of a user's protein can be compared to 20 additional profile databases including several complete proteomes, etc.
Using datasets of known phosphorylation sites, the Group based Phosphorylation Scoring method (GPS) allows the prediction of kinase specific phosphorylation sites from primary protein sequences.
InterProScan allows you to query using different protein signature recognition methods to look up InterPro annotations for your sequence. These annotations results often include gene ontology terms that you can associate with your sequence.
One-Block CYRCA is a program for identifying blocks (local ungapped profiles of the most conserved regions of protein families and domains) in a multiple sequence alignment based on the LAMA and CYRCA block-to-block alignment methods.
PatMatch is a pattern matching tool that allows you to search for short (<20 residues) nucleotide or peptide sequences and can accomodate ambiguous/degenerate patterns.
ProFunc takes a PDB-format structure as input and predicts the likely function of the protein based on various sequence and structural analysis methods.
QuasiMotiFinder uses physicochemical similarity to PROSITE motifs and evolutionary conservation to predict sequence locations with possible function (pseudo-motifs).
SCRATCH is a suite of protein structure software and servers for the prediction of secondary structure,
solvent accessibility, disulphide bridges, stability effects of single amino acid mutations, disordered regions, domains, beta-residue and beta-strand pairings, amino acid contact maps, and tertiary structure.
SEARCHGTr is a tool for the analysis of glycosyltransferases (GTrs) that allows you to compare a query sequence with the sequences of characterized GTrs.
Scooby-domain (sequence hydrophobicity predicts domains) is a method to identify globular domains in protein sequence, based on the observed lengths and hydrophobicities of domains from proteins with known tertiary structure.
A suite of tools that includes: PSIPRED, a protein structure prediction server; GenTHREADER, for genomic protein fold recognition; MEMSAT2, for transmembrane protein structure prediction; GTD, the genomic threading database; DISOPRED, a dynamic disorder prediction server; DomPred, a domain prediction server; and COPS, for the comparison of protein structure classifications.
UCL web portal has been updated to include a fully automated 3-D modeling pipeline, BioSerf.
WebGestalt (WEB-based GEne SeT AnaLysis Toolkit) is a system facilitating the analysis of sets of genes. Gene sets can be compared using set operations (intersection, union, etc.), different annotations can be selected and retrieved for the set, and sets can be visualized and organized by a user-selected method (Gene Ontology, chromosomal distribution, etc.). WebGestalt can also perform a statistical analysis to suggest areas of interest with respect to the set of genes selected.
pdbFun allows the user to search the PDB as a database of annotated residues. One can limit the search by annotations such as domains, binding sites, active sites, solvent exposure and residue type.
ArrayXPath is a web-based service for matching microarray gene-expression profiles with known biological pathways. Input is a clustered gene-expression profile in a tab-delimited text format. Output includes pathway diagrams.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
Interface-to-Interface (I2I)-SiteEngine compares pairs of interacting protein binding sites by recognizing similarity of physico-chemical properties and shapes in the protein-protein interfaces.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
PEPVAC is a tool to assist in the development of multi-epitope vaccines. It is optimized for this task by predicting peptides with the ability to bind to several human leukocyte antigens that have similar peptide binding specificity.
POPSCOMP is a system for analysing the interaction between components of complexes based on calculations of the accessible surface that is buried when the complex is formed.
Pathway Explorer is a tool for visualizing high throughput expression data simultaneously with biological pathway data available from KEGG, BioCarta, and GenMAPP.
The pathway extraction tool is a part of the network analysis tool suite (NeAT). Given a metabolic network, the tool extracts a subnetwork that connects a list of input compounds, reactions, EC numbers or enzyme-coding genes. Three metabolic networks (KEGG LIGAND, KEGG RPAIR and MetaCyc) are preloaded, but custom networks can be submitted as well.
SiteEngine is a server that searches protein structures for regions that can potentially function as binding sites by scanning for regions on the surface of one protein structure that resemble a specific binding site on the other.
VisANT is an integrative visual analysis tool for biological networks and pathways that contains modules for querying and integrating KEGG pathways with expression data. VisANT is java-based, and can be run as a java applet, as a java web application, or downloaded and run locally. New functions in VisANT 3.5 allow determination of over-representation and expression enrichment.
WebGestalt (WEB-based GEne SeT AnaLysis Toolkit) is a system facilitating the analysis of sets of genes. Gene sets can be compared using set operations (intersection, union, etc.), different annotations can be selected and retrieved for the set, and sets can be visualized and organized by a user-selected method (Gene Ontology, chromosomal distribution, etc.). WebGestalt can also perform a statistical analysis to suggest areas of interest with respect to the set of genes selected.
The CBS DAS protein viewer uses the distributed annotation system (DAS) to integrate and present annotation data from multiple sources for a protein sequence.
A suite of tools that includes: PSIPRED, a protein structure prediction server; GenTHREADER, for genomic protein fold recognition; MEMSAT2, for transmembrane protein structure prediction; GTD, the genomic threading database; DISOPRED, a dynamic disorder prediction server; DomPred, a domain prediction server; and COPS, for the comparison of protein structure classifications.
UCL web portal has been updated to include a fully automated 3-D modeling pipeline, BioSerf.
PatchDock is an algorithm for molecular docking. The input is two molecules of any type: proteins, DNA, peptides, drugs. The output is a list of potential complexes sorted by shape complementarity criteria.
The Phylogenetic Web Repeater (POWER) allows users to perform phylogenetic analysis using the PHYLIP package. The POWER pipeline can start with processing either multiple sequence alignments (MSA) or can proceed directly with aligned sequences.
Bioverse is a system that uses computational techniques to facilitate exploring the relationships between molecular, genomic, proteomic, systems and organismal information.
The CBS DAS protein viewer uses the distributed annotation system (DAS) to integrate and present annotation data from multiple sources for a protein sequence.
DiANNA (DiAminoacid Neural Network Application) a tool that predicts cysteine states of a protein. It can predict whether a particular cysteine is reduced, is forming a disulfide bond, or is bound to a metallic ligand.
REPPER (REPeats and their PERiodicities) is a tool for detecting and analysing regions in protein sequences or sequence alignements that have short gapless repeats.
Ressource Parisienne en Bioinformatique Structurale (RPBS) is a structural bioinformatics resource with several types of specific services including tools for searching sequence (AUTOMAT) and structure (YAKUSA) databases and for homology modelling (WLOOP).
Bioverse is a system that uses computational techniques to facilitate exploring the relationships between molecular, genomic, proteomic, systems and organismal information.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
The FeatureExtract server extracts sequence and feature annotations, such as intron/exon structure, from GenBank entries and other GenBank format files.
MRS is a biological data retrieval system that can be accessed over the web, or installed and used locally. MRS indexes several flat-file data sets for searching, including EMBL nucleotide, UniProt, PDB and KEGG. Searches can be performed globally, or on one or more flat file fields per data set.
PISCES (Protein Sequence Culling Server) allows the user to weed out sequences from a set in order to obtain a subset of relatively high PDB structure quality and/or mutual sequence identity.
MicroInspector is a tool that detects miRNA (microRNA) binding sites in your input sequence by searching against databases of known miRNA binding sites.
miRU is a tool that takes as an input a small miRNA sequence and then searches for complementary matches in TIGR plant-specific gene data sets to predict potential target genes.
MicroInspector is a tool that detects miRNA (microRNA) binding sites in your input sequence by searching against databases of known miRNA binding sites.
PatMatch is a pattern matching tool that allows you to search for short (<20 residues) nucleotide or peptide sequences and can accomodate ambiguous/degenerate patterns.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
The FeatureExtract server extracts sequence and feature annotations, such as intron/exon structure, from GenBank entries and other GenBank format files.
E-RNAi is a tool for designing and evaluating dsRNA constructs suitable for RNAi experiments in 12 organisms including Drosophila, C. elegans, human and other emerging model organisms. E-RNAi also facilitates the design of secondary RNAi reagents for validation experiments, evaluation of pooled siRNA reagents and batch design.
The CHAOS/DIALIGN WWW server is a multiple sequence alignment site which passes input sequences through CHAOS to create a list of local similarites. These similarities serve as anchor points, allowing DIALIGN to conduct global alignments faster. ABC can then be used for the interactive visualization of the alignment.
enoLOGOS creates sequence logos based on a variety of input, including sequence alignments, probability and alignment matrices and energy measurements.
CHC_FIND is a tool to analyze conserved hydrophobic contacts (CHC) from multiple structural alignments. In addition to a multiple sequence alignment, a structural alignment which provides the superposition of structurally conserved regions (SCRs) (i.e., results from SCR_FIND) needs to be provided.
The ConSurf server allows one to map levels of amino acid conservation to known protein structures in order to study areas of potential functional importance on the surface of the protein. A PDB file is required as input, and a multiple sequence alignment is optional. If an alignment is not provided, ConSurf will build one by performing a search for homologous sequences and then aligning them.
ConSurf 2010 combines ConSurf and ConSeq for an easier, more intuitive interface.
The Fold & Function Assignment System (FFAS) takes an amino-acid sequence as input and generates a profile which is compared to several sets of sequence profiles including PDB, SCOP, and PFAM. Public metagenomic sequence data has been added to FFAS, so that the profile of a user's protein can be compared to 20 additional profile databases including several complete proteomes, etc.
The Multiple alignment Manipulator (MaM) takes a multiple alignment of genomic sequences as input and calculates the locations of exons, common repeat elements and unique regions based on user-selected programs/information. The graphical display also allows users to focus an assessment of sequence variation on the identified regions.
One-Block CYRCA is a program for identifying blocks (local ungapped profiles of the most conserved regions of protein families and domains) in a multiple sequence alignment based on the LAMA and CYRCA block-to-block alignment methods.
PriFi is a tool for designing and evaluating primer pairs based on the input of a DNA sequence alignment; useful for the PCR amplification of homologs.
WebProAnalyst is a tool for searching for residues whose substitutions are correlated with variations in protein activity. Key physicochemical characteristics of these sites can be calculated.
CisMols (Cis-regulatory Modules) is a tool that identifies compositionally predicted cis-clusters that occur in groups of co-regulated genes within each of their ortholog-pair evolutionarily conserved cis-regulatory regions.
The dcode.org website provides access to tools for comparative genomic analyses developed by the Comparative Genomics Center at the Lawerence Livermore National Laboratory. Tools include: zPicture, Mulan, eShadow, rVista, CREME, and the ECR Browser.
MLST (Multi Locus Sequence Typing) is a nucleotide sequence based approach for the unambiguous characterisation of isolates of bacteria and other organisms using the sequences of internal fragments of seven house-keeping genes.
Projector 2 allows users to map completed portions of the genome sequence of an organism onto the finished (or unfinished) genome of a closely-related species or strain. Using the related genome sequence as a template can facilitate sequence assembly and the sequencing of the remaining gaps.
The CHAOS/DIALIGN WWW server is a multiple sequence alignment site which passes input sequences through CHAOS to create a list of local similarites. These similarities serve as anchor points, allowing DIALIGN to conduct global alignments faster. ABC can then be used for the interactive visualization of the alignment.
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
The Fold & Function Assignment System (FFAS) takes an amino-acid sequence as input and generates a profile which is compared to several sets of sequence profiles including PDB, SCOP, and PFAM. Public metagenomic sequence data has been added to FFAS, so that the profile of a user's protein can be compared to 20 additional profile databases including several complete proteomes, etc.
The Fold & Function Assignment System (FFAS) takes an amino-acid sequence as input and generates a profile which is compared to several sets of sequence profiles including PDB, SCOP, and PFAM. Public metagenomic sequence data has been added to FFAS, so that the profile of a user's protein can be compared to 20 additional profile databases including several complete proteomes, etc.
ParAlign provides a service for sequence similarity searching powered by parallel computing technology. The two comparison algorithms used are Smith-Waterman and ParAlign (a heuristic method for sequence alignment).
EBI Tools is a project that aims to provide programmatic access to the various databases and retrieval and analysis services that the European Bioinformatics Institute (EBI) provides through Simple Object Access Protocol (SOAP) and other related web service technologies.
Based on the comparison of profile HMMs, HHpred takes a protein sequence or multiple sequence alignment as input and searches for remote homologues in an assortment of databases such as PDB, SMART and Pfam. The user can select either a local or global alignment method, and the search results can be used to generate 3D structural models.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
ParAlign provides a service for sequence similarity searching powered by parallel computing technology. The two comparison algorithms used are Smith-Waterman and ParAlign (a heuristic method for sequence alignment).