Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
KEGG Orthology-Based Annotation System (KOBAS) annotates sequences with KEGG Orthology terms and identifies the frequently occurring (or significantly enriched) pathways among the queried sequences compared against a background distribution. 5 pathway databases are used (KEGG Pathway, PID, BioCyc, Reactome, Panther) and 5 human databases (OMIM, KEGG Disease, FunDO, GAD, NHGRI GWAS).
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function. The PANTHER research tools allow you to score proteins against the PANTHER HMM library, use PANTHER to do gene expression analyses, and download PANTHER tools and data.
Transcriptome Auto-annotation Conducting Tool (TACT) is an automated tool for conducting functional annotation of transcripts that integrates sequence similarity searches and functional motif predictions. TACT was originally developed as an automatic annotation pipeline for the Human Full-length cDNA Annotation Invitational (H-Inv) project.
Web Gene Ontology Annotation Plot (WEGO) helps visualize the annotation of sets of genes. WEGO can compare gene annotation datasets by plotting the distribution of gene ontology (GO) annotation results into a histogram.
The Virtual Ribosome is a DNA translation tool with a built-in ORF finder that allows the use of alternative start codons, the IUPAC degenerate DNA alphabet, and all translation tables defined by the NCBI taxonomy group. The tool can also highlight the intron/exon structure of genes using information found in the feature table of GenBank flatfiles in the final translation results.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
BActeriocin GEnome mining tooL (BAGEL) identifies putative bacteriocin ORFs (antimicrobial peptides) based on a database containing information about known bacteriocins and adjacent genes involved in bacteriocin activity.
Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
Transcriptome Auto-annotation Conducting Tool (TACT) is an automated tool for conducting functional annotation of transcripts that integrates sequence similarity searches and functional motif predictions. TACT was originally developed as an automatic annotation pipeline for the Human Full-length cDNA Annotation Invitational (H-Inv) project.
TIS Correction (TiCo) is a tool for improving predictions of prokaryotic Translation Initiation Sites (TIS). TiCo can be used to analyze and reannotate predictions obtained by the program GLIMMER.
The Virtual Ribosome is a DNA translation tool with a built-in ORF finder that allows the use of alternative start codons, the IUPAC degenerate DNA alphabet, and all translation tables defined by the NCBI taxonomy group. The tool can also highlight the intron/exon structure of genes using information found in the feature table of GenBank flatfiles in the final translation results.
The Internal Transcribed Spacer 2 Database (ITS2) includes a homology based RNA structure prediction algorithm which allows the detection and secondary structure prediction of ITS2 sequences. This resource also contains more than 25,000 pre-calculated secondary structures for currently known ITS2 sequences that can be searched and browsed via taxonomy.
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
Max-Planck Institute Bioinformatics Toolkit provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction.
Nearest Alignment Space Termination (NAST) is a multiple sequence alignment server for comparative analysis of 16S rRNA gene sequences from bacteria and archaea.
PAL2NAL converts a multiple sequence alignment of proteins and the corresponding DNA (or mRNA) sequences into a codon alignment. Synonymous (Ks) and non-synonymous (Ka) substitution rates can be calculated.
Sorting Permutation by Reversals and Block Interchanges (SPRING) is a tool for the analysis of genome rearrangements. SPRING takes two or more chromosomes as its input and then computes a minimum series of reversals and/or block-interchanges for transforming one chromosome into another. Phylogenetic trees based on the rearrangement analysis are also shown as part of the results.
Sliding Window Analysis of Ka and Ks (SWAKK) is tool for detecting positive selection in proteins using
a sliding window substitution rate analysis. The program can display the results on a 3D protein structure.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
TreeDomViewer is a tool for the visualization of phylogeny and protein domain structure. TreeDomViewer constructs phylogenetic trees and projects the corresponding protein domain information onto the multiple sequence alignment.
Function Analysis and Selection Tool for Single Nucleotide Polymorphisms (FastSNP) allows users to identify and prioritize SNPs that are likely to have functional effects.
The Gene Expression Pattern Analysis Suite (GEPAS) is a collection of tools for the analysis of microarray data. GEPAS includes tools for data pre-processing, clustering, differential gene expression, predictors, array CGH and functional annotation. A new pipeline module allows for automation of sequential analysis steps.
The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function. The PANTHER research tools allow you to score proteins against the PANTHER HMM library, use PANTHER to do gene expression analyses, and download PANTHER tools and data.
PDA (Pipeline Diversity Analysis) is a server which searches for polymorphic sequences in large databases, and estimates their genetic diversity. Results contain sequence alignments (generated by ClustalW) and include statistics on polymorphism, synonymous and non-synonymous substitutions, linkage disequilibrium, and codon bias for your alignment. The full results set is also available for download as a MySQL database.
PupaSuite is a SNP analysis tool that allows for the selection of relevant SNPs within a gene based on the characteristics of the SNP. PupaSuite also provides information about LD parameters (based on genotype data from HapMap) and identifies haplotype blocks and tag SNPs. PupaSuite was created to join PupaSNP & PupasView.
SNP@Domain identifies SNPs within human protein domains allowing users to investigate SNPs in the context of two dimensional and three dimensional maps. Links to external databases (Pfam, SCOP, Ensembl, dbSNP, OMIM, SIFT) are also presented.
SNPmasker masks SNPs in a given sequence using information from the dbSNP database. This tool can also be used to mask repeat sequences. SNPmasker is primarily designed for masking the sequences before primer and probe design.
SVMHC is a server for prediction of MHC class I and class II binding peptides. SVMHC can be used to find likely binders in a protein sequence and to investigate the effects of single nucleotide polymorphisms on MHC-peptide binding.
Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
Eukaryotic Orthology (YOGY) is a resource for retrieving orthologous proteins from nine eukaryotic organisms. Using a gene or protein identifier as a query, this database provides comprehensive, combined information on orthologs in other species using data from five independent resources: KOGs, Inparanoid, Homologene, OrthoMCL, and a table of curated orthologs between budding yeast and fission yeast. Associated Gene Ontology (GO) terms of orthologs can also be
retrieved.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
Cis-regulatory Element Annotation System (CEAS) is a resource for ChIP-chip analyses that retrieves repeat-masked genomic sequences, calculates GC content, plots evolutionary conservation, maps nearby genes, and identifies enriched transcription factor binding (TFBS) motifs.
Composite Module Analyst (CMA) defines promoter models based on the composition of transcription factor binding sites (TFBS) in regulatory sequences for sets of coregulated genes.
Composite Regulatory Signature Database (CRSD) is a microarray analysis pipeline aimed at the discovery of motifs involved in gene regulation including microRNA signatures and transcription factor binding sites (TFBS).
CorGenmeasures long-range correlations in DNA sequences and can generate random sequences with the same (or user-specified) correlation and composition parameters.
FeatureScan searches for sequence similarity by comparing physico-chemical properties of DNA such as charge, melting enthalpy, conformational parameters, etc.
GenDecoder is a prediction server for animal mitochondrial genetic codes. It provides information about codon-usage, amino acid composition, GC content and a final genetic code prediction for a mitochondrial genome sequence.
The MEME Suite web server is a unified portal for discovery and analysis of sequence motifs (e.g. DNA binding sites, protein interaction domains). Motifs with/without gaps may be discovered, searched for in DNA and protein databases, compared to other motifs and analyzed for putative function by association with gene ontology terms.
MEME is a tool designed for discovering and searching for DNA motifs such as transcription factor binding sites (TFBS) or protein domains.
Tools for Motif Discovery (MoD) in nucleotide sequences that includes: Weeder, a program for detecting transcription factor binding sites (TFBS) in coregulated genes; WeederH, a tool for detecting TFBS and regulatory regions from homologous genes; and RNA profile, a tool for secondary structure motif discovery in RNA sequences.
Nucleosome eXclusion Sensor (NXSensor) finds potential nucleosome-free regions of DNA sequences. NXSensor can be used to assess the likelihood of nucleosome formation in regions involved in gene regulation and other aspects of chromatin function.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
PlotRep is a tool for the visual analysis of dispersed genomic repeats. The tool is designed to merge fragments belonging to the same copy of a repeat and allows for the inspection of the results using a dot-plot like graphical output.
Quadruplex forming G-Rich Sequences (QGRS) Mapper searches nucleotide sequences for the presence of G-quartet motifs. QGRS Mapper allows you to view the distribution and composition of QGRS in the context of alternatively spliced isoforms of your gene.
Readout is a server for the calculation of direct and indirect readout energy Z-scores which estimate the degree of sequence specificity of the protein-DNA complex. Readout can be useful for checking the quality of protein-DNA interactions in three dimensional (3D) structures.
Regulatory RNA (RegRNA) is a server for identifying regulatory RNA motifs and elements in mRNA sequences including: motifs located in the 5\'-UTR and 3\'-UTR; motifs involved in mRNA splicing and transcriptional regulation; other motifs in mRNA such as riboswitches and splice donor/acceptor sites; RNA structural features; and miRNA target sites.
SNPmasker masks SNPs in a given sequence using information from the dbSNP database. This tool can also be used to mask repeat sequences. SNPmasker is primarily designed for masking the sequences before primer and probe design.
Transcriptome Auto-annotation Conducting Tool (TACT) is an automated tool for conducting functional annotation of transcripts that integrates sequence similarity searches and functional motif predictions. TACT was originally developed as an automatic annotation pipeline for the Human Full-length cDNA Annotation Invitational (H-Inv) project.
TFBScluster is a web tool designed to identify clusters of transcription factor binding sites (TFBSs) conserved in mammalian and mouse genomes. By entering a specified range of TFBS, you can retrieve a list of SWISS-PROT characterized genes to which the clusters are localized.
VOMBAT predicts transcription factor binding sites (TFBS) and other motifs in sequence data based on Variable Order Markov models and variable order BAyesian Trees.
Identify Conserved targets of a Regulon (iCR) predicts regulatory binding sites based on conservation across multiple species. The user provides an ungapped multiple sequence alignment representing the regulatory binding site. iCR reports back with predicted binding sites that are conserved and highlights the downstream co-regulated genes.
jPREdictor predicts cis-regulatory elements using short motifs that are known to bind regulatory proteins. Predictions are made by searching for clusters of these motifs, and weighting these clusters by applying a positive or negative training sets to score the sequence.
DyNAVacS is a tool for designing DNA vaccines that includes steps for chosing a suitable expression vector, ensuring optimal expression by codon optimization, engineering CpG motifs for enhancing immune responses, and providing additional sequence signals for efficient translation. It also allows includes restriction enzyme mapping and design of primers spanning user specified sequences.
GeMprospector is designed to find cross-species genetic marker candidates in legumes and grasses. GeMprospector automates PCR primer design based on multiple sequence alignments of submitted ESTs and their homologues in sequence databases from legumes or grasses.
MutScreener is a human-specific primer design tool that considers gene annotation information to design primers for mutation screening. Users can provide custom gene annotation or use existing public gene annotations.
SNPmasker masks SNPs in a given sequence using information from the dbSNP database. This tool can also be used to mask repeat sequences. SNPmasker is primarily designed for masking the sequences before primer and probe design.
The Bioinformatics Links Directory features curated links to molecular resources, tools and databases. All of the resources are free or available for a nominal fee.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
Cis-regulatory Element Annotation System (CEAS) is a resource for ChIP-chip analyses that retrieves repeat-masked genomic sequences, calculates GC content, plots evolutionary conservation, maps nearby genes, and identifies enriched transcription factor binding (TFBS) motifs.
Composite Module Analyst (CMA) defines promoter models based on the composition of transcription factor binding sites (TFBS) in regulatory sequences for sets of coregulated genes.
Composite Regulatory Signature Database (CRSD) is a microarray analysis pipeline aimed at the discovery of motifs involved in gene regulation including microRNA signatures and transcription factor binding sites (TFBS).
The MEME Suite web server is a unified portal for discovery and analysis of sequence motifs (e.g. DNA binding sites, protein interaction domains). Motifs with/without gaps may be discovered, searched for in DNA and protein databases, compared to other motifs and analyzed for putative function by association with gene ontology terms.
MEME is a tool designed for discovering and searching for DNA motifs such as transcription factor binding sites (TFBS) or protein domains.
Tools for Motif Discovery (MoD) in nucleotide sequences that includes: Weeder, a program for detecting transcription factor binding sites (TFBS) in coregulated genes; WeederH, a tool for detecting TFBS and regulatory regions from homologous genes; and RNA profile, a tool for secondary structure motif discovery in RNA sequences.
Nucleosome eXclusion Sensor (NXSensor) finds potential nucleosome-free regions of DNA sequences. NXSensor can be used to assess the likelihood of nucleosome formation in regions involved in gene regulation and other aspects of chromatin function.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
RNAhybrid is a tool for predicting miRNA (microRNA) targets by calculating the minimum free energy of hybridization between target RNA and miRNA sequences.
Readout is a server for the calculation of direct and indirect readout energy Z-scores which estimate the degree of sequence specificity of the protein-DNA complex. Readout can be useful for checking the quality of protein-DNA interactions in three dimensional (3D) structures.
Regulatory RNA (RegRNA) is a server for identifying regulatory RNA motifs and elements in mRNA sequences including: motifs located in the 5\'-UTR and 3\'-UTR; motifs involved in mRNA splicing and transcriptional regulation; other motifs in mRNA such as riboswitches and splice donor/acceptor sites; RNA structural features; and miRNA target sites.
TFBScluster is a web tool designed to identify clusters of transcription factor binding sites (TFBSs) conserved in mammalian and mouse genomes. By entering a specified range of TFBS, you can retrieve a list of SWISS-PROT characterized genes to which the clusters are localized.
VOMBAT predicts transcription factor binding sites (TFBS) and other motifs in sequence data based on Variable Order Markov models and variable order BAyesian Trees.
Identify Conserved targets of a Regulon (iCR) predicts regulatory binding sites based on conservation across multiple species. The user provides an ungapped multiple sequence alignment representing the regulatory binding site. iCR reports back with predicted binding sites that are conserved and highlights the downstream co-regulated genes.
jPREdictor predicts cis-regulatory elements using short motifs that are known to bind regulatory proteins. Predictions are made by searching for clusters of these motifs, and weighting these clusters by applying a positive or negative training sets to score the sequence.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
Alternative Splicing Prediction (ASPic) can predict alternative splicing of user submitted genes based on comparative analysis of available transcript and genome data from a variety of species. Results include graphical and tabular views of the splicing patterns of full-length mRNA isoforms compatible with the detected splice sites of the genes as well as structural and functional annotations.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
Quadruplex forming G-Rich Sequences (QGRS) Mapper searches nucleotide sequences for the presence of G-quartet motifs. QGRS Mapper allows you to view the distribution and composition of QGRS in the context of alternatively spliced isoforms of your gene.
Regulatory RNA (RegRNA) is a server for identifying regulatory RNA motifs and elements in mRNA sequences including: motifs located in the 5\'-UTR and 3\'-UTR; motifs involved in mRNA splicing and transcriptional regulation; other motifs in mRNA such as riboswitches and splice donor/acceptor sites; RNA structural features; and miRNA target sites.
Arabidopsis Co-expression Tool (ACT) is a resource for investigating the co-expression of genes in the NASC/GARNet microarray-based gene expression dataset from Arabidopsis.
Boltzmann Time Warping (BTW) computes time warping distances and Boltzmann\' pair probabilities for a given input gene expression time series. This tool helps to identify genes that may share similar expression patterns over time.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
BiologicalNetworks allows retrieval, construction, and visualization of biological networks and allows for the analysis of high-throughput expression data by mapping onto network graphs. BiologicalNetworks queries the PathSys system, a combined database of biological pathways, gene regulatory networks, and protein interaction maps.
Comprehensive R based Microarray Analysis web service (CARMAweb) is a resource for the analyses of microarray data including data preprocessing, detection for differentially expressed genes, cluster analysis, and GO analysis. Calculations are performed in R using functions provided by BioConductor.
Cis-regulatory Element Annotation System (CEAS) is a resource for ChIP-chip analyses that retrieves repeat-masked genomic sequences, calculates GC content, plots evolutionary conservation, maps nearby genes, and identifies enriched transcription factor binding (TFBS) motifs.
Composite Module Analyst (CMA) defines promoter models based on the composition of transcription factor binding sites (TFBS) in regulatory sequences for sets of coregulated genes.
Composite Regulatory Signature Database (CRSD) is a microarray analysis pipeline aimed at the discovery of motifs involved in gene regulation including microRNA signatures and transcription factor binding sites (TFBS).
The Gene Expression Pattern Analysis Suite (GEPAS) is a collection of tools for the analysis of microarray data. GEPAS includes tools for data pre-processing, clustering, differential gene expression, predictors, array CGH and functional annotation. A new pipeline module allows for automation of sequential analysis steps.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function. The PANTHER research tools allow you to score proteins against the PANTHER HMM library, use PANTHER to do gene expression analyses, and download PANTHER tools and data.
SNPmasker masks SNPs in a given sequence using information from the dbSNP database. This tool can also be used to mask repeat sequences. SNPmasker is primarily designed for masking the sequences before primer and probe design.
Alternative Splicing Prediction (ASPic) can predict alternative splicing of user submitted genes based on comparative analysis of available transcript and genome data from a variety of species. Results include graphical and tabular views of the splicing patterns of full-length mRNA isoforms compatible with the detected splice sites of the genes as well as structural and functional annotations.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
GeMprospector is designed to find cross-species genetic marker candidates in legumes and grasses. GeMprospector automates PCR primer design based on multiple sequence alignments of submitted ESTs and their homologues in sequence databases from legumes or grasses.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function. The PANTHER research tools allow you to score proteins against the PANTHER HMM library, use PANTHER to do gene expression analyses, and download PANTHER tools and data.
Transcriptome Auto-annotation Conducting Tool (TACT) is an automated tool for conducting functional annotation of transcripts that integrates sequence similarity searches and functional motif predictions. TACT was originally developed as an automatic annotation pipeline for the Human Full-length cDNA Annotation Invitational (H-Inv) project.
Web Gene Ontology Annotation Plot (WEGO) helps visualize the annotation of sets of genes. WEGO can compare gene annotation datasets by plotting the distribution of gene ontology (GO) annotation results into a histogram.
KEGG Orthology-Based Annotation System (KOBAS) annotates sequences with KEGG Orthology terms and identifies the frequently occurring (or significantly enriched) pathways among the queried sequences compared against a background distribution. 5 pathway databases are used (KEGG Pathway, PID, BioCyc, Reactome, Panther) and 5 human databases (OMIM, KEGG Disease, FunDO, GAD, NHGRI GWAS).
DyNAVacS is a tool for designing DNA vaccines that includes steps for chosing a suitable expression vector, ensuring optimal expression by codon optimization, engineering CpG motifs for enhancing immune responses, and providing additional sequence signals for efficient translation. It also allows includes restriction enzyme mapping and design of primers spanning user specified sequences.
SNP@Domain identifies SNPs within human protein domains allowing users to investigate SNPs in the context of two dimensional and three dimensional maps. Links to external databases (Pfam, SCOP, Ensembl, dbSNP, OMIM, SIFT) are also presented.
SVMHC is a server for prediction of MHC class I and class II binding peptides. SVMHC can be used to find likely binders in a protein sequence and to investigate the effects of single nucleotide polymorphisms on MHC-peptide binding.
TFBScluster is a web tool designed to identify clusters of transcription factor binding sites (TFBSs) conserved in mammalian and mouse genomes. By entering a specified range of TFBS, you can retrieve a list of SWISS-PROT characterized genes to which the clusters are localized.
Function Analysis and Selection Tool for Single Nucleotide Polymorphisms (FastSNP) allows users to identify and prioritize SNPs that are likely to have functional effects.
The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function. The PANTHER research tools allow you to score proteins against the PANTHER HMM library, use PANTHER to do gene expression analyses, and download PANTHER tools and data.
PDA (Pipeline Diversity Analysis) is a server which searches for polymorphic sequences in large databases, and estimates their genetic diversity. Results contain sequence alignments (generated by ClustalW) and include statistics on polymorphism, synonymous and non-synonymous substitutions, linkage disequilibrium, and codon bias for your alignment. The full results set is also available for download as a MySQL database.
PupaSuite is a SNP analysis tool that allows for the selection of relevant SNPs within a gene based on the characteristics of the SNP. PupaSuite also provides information about LD parameters (based on genotype data from HapMap) and identifies haplotype blocks and tag SNPs. PupaSuite was created to join PupaSNP & PupasView.
SNP@Domain identifies SNPs within human protein domains allowing users to investigate SNPs in the context of two dimensional and three dimensional maps. Links to external databases (Pfam, SCOP, Ensembl, dbSNP, OMIM, SIFT) are also presented.
SNPmasker masks SNPs in a given sequence using information from the dbSNP database. This tool can also be used to mask repeat sequences. SNPmasker is primarily designed for masking the sequences before primer and probe design.
SVMHC is a server for prediction of MHC class I and class II binding peptides. SVMHC can be used to find likely binders in a protein sequence and to investigate the effects of single nucleotide polymorphisms on MHC-peptide binding.
HubMed uses information from PubMed's database, provided by the NCBI through the EUtils web service, to produce a search interface focused on browsing, organising and gathering information from the biomedical literature.
botXminer is a literature searching tool that allows you to search BotDB. BotDB citations contain only those articles with either \'botulinum\' or \'tetanus\' anywhere in their text. botXminer is an interface to this subset of complete medline xml files loaded into an Oracle XMLDB.
botXminer is a literature searching tool that allows you to search BotDB. BotDB citations contain only those articles with either \'botulinum\' or \'tetanus\' anywhere in their text. botXminer is an interface to this subset of complete medline xml files loaded into an Oracle XMLDB.
The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function. The PANTHER research tools allow you to score proteins against the PANTHER HMM library, use PANTHER to do gene expression analyses, and download PANTHER tools and data.
Eukaryotic Orthology (YOGY) is a resource for retrieving orthologous proteins from nine eukaryotic organisms. Using a gene or protein identifier as a query, this database provides comprehensive, combined information on orthologs in other species using data from five independent resources: KOGs, Inparanoid, Homologene, OrthoMCL, and a table of curated orthologs between budding yeast and fission yeast. Associated Gene Ontology (GO) terms of orthologs can also be
retrieved.
Web Gene Ontology Annotation Plot (WEGO) helps visualize the annotation of sets of genes. WEGO can compare gene annotation datasets by plotting the distribution of gene ontology (GO) annotation results into a histogram.
BActeriocin GEnome mining tooL (BAGEL) identifies putative bacteriocin ORFs (antimicrobial peptides) based on a database containing information about known bacteriocins and adjacent genes involved in bacteriocin activity.
Nearest Alignment Space Termination (NAST) is a multiple sequence alignment server for comparative analysis of 16S rRNA gene sequences from bacteria and archaea.
PROFtmb predicts transmembrane beta-barrel (TMB) proteins in Gram-negative bacteria. In addition to running your own predictions, you can also download predictions for all proteins in 78 Gram-Negative bacterial genomes.
TIS Correction (TiCo) is a tool for improving predictions of prokaryotic Translation Initiation Sites (TIS). TiCo can be used to analyze and reannotate predictions obtained by the program GLIMMER.
Eukaryotic Orthology (YOGY) is a resource for retrieving orthologous proteins from nine eukaryotic organisms. Using a gene or protein identifier as a query, this database provides comprehensive, combined information on orthologs in other species using data from five independent resources: KOGs, Inparanoid, Homologene, OrthoMCL, and a table of curated orthologs between budding yeast and fission yeast. Associated Gene Ontology (GO) terms of orthologs can also be
retrieved.
Jumping Profile Hidden Markov Model (jpHMM) takes a HIV-1 genome sequence and uses a pre-calculated multiple alignment of the major HIV-1 subtypes to predict the phylogenetic breakpoints and HIV subtype of the submitted sequence.
sgTarget is a structural genomics resource that helps to select and prioritize good targets for structure determination from a list of sequences. Target selection is based on multiple factors including homology searches and a range of physiochemical properties.
The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function. The PANTHER research tools allow you to score proteins against the PANTHER HMM library, use PANTHER to do gene expression analyses, and download PANTHER tools and data.
TFBScluster is a web tool designed to identify clusters of transcription factor binding sites (TFBSs) conserved in mammalian and mouse genomes. By entering a specified range of TFBS, you can retrieve a list of SWISS-PROT characterized genes to which the clusters are localized.
Eukaryotic Orthology (YOGY) is a resource for retrieving orthologous proteins from nine eukaryotic organisms. Using a gene or protein identifier as a query, this database provides comprehensive, combined information on orthologs in other species using data from five independent resources: KOGs, Inparanoid, Homologene, OrthoMCL, and a table of curated orthologs between budding yeast and fission yeast. Associated Gene Ontology (GO) terms of orthologs can also be
retrieved.
Jumping Profile Hidden Markov Model (jpHMM) takes a HIV-1 genome sequence and uses a pre-calculated multiple alignment of the major HIV-1 subtypes to predict the phylogenetic breakpoints and HIV subtype of the submitted sequence.
Arabidopsis Co-expression Tool (ACT) is a resource for investigating the co-expression of genes in the NASC/GARNet microarray-based gene expression dataset from Arabidopsis.
GeMprospector is designed to find cross-species genetic marker candidates in legumes and grasses. GeMprospector automates PCR primer design based on multiple sequence alignments of submitted ESTs and their homologues in sequence databases from legumes or grasses.
Eukaryotic Orthology (YOGY) is a resource for retrieving orthologous proteins from nine eukaryotic organisms. Using a gene or protein identifier as a query, this database provides comprehensive, combined information on orthologs in other species using data from five independent resources: KOGs, Inparanoid, Homologene, OrthoMCL, and a table of curated orthologs between budding yeast and fission yeast. Associated Gene Ontology (GO) terms of orthologs can also be
retrieved.
Eukaryotic Orthology (YOGY) is a resource for retrieving orthologous proteins from nine eukaryotic organisms. Using a gene or protein identifier as a query, this database provides comprehensive, combined information on orthologs in other species using data from five independent resources: KOGs, Inparanoid, Homologene, OrthoMCL, and a table of curated orthologs between budding yeast and fission yeast. Associated Gene Ontology (GO) terms of orthologs can also be
retrieved.
GlyNest and CASPER are two independent services, available through this common interface, for predicting NMR spectra given a glycan chemical structure.
Free ADME/tox Filtering (FAF-Drugs) allows users to process their own compound collections via simple absorption, distribution, metabolism, excretion and toxicity (ADME/tox) filtering rules to aid in the drug discovery process.
Target Fishing Dock (TarFisDock) is a web server that docks small molecules with protein structures in the Potential Drug Target Database (PDTD) in an effort to discover new drug targets.
HHrep is a tool for the de novo identification
of repeats in protein sequences based on the pairwise comparison of profile hidden Markov models (HMMs).
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
Max-Planck Institute Bioinformatics Toolkit provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction.
PONGO allows users to compare topology predictions of membrane proteins made by four different predictors (MEMSAT, TMHMM, ENSEMBLE, and PRODIV). It also displays a signal peptide prediction determined with SPEP.
Protein Features (PROFEAT) is a tool for computing commonly-used structural and physicochemical features of proteins and peptides from amino acid sequence. New functions include computation of protein-protein and protein-small molecule interactions, topological descriptors for peptide sequences and small molecule structures. New feature groups of descriptors have also been added.
PROFtmb predicts transmembrane beta-barrel (TMB) proteins in Gram-negative bacteria. In addition to running your own predictions, you can also download predictions for all proteins in 78 Gram-Negative bacterial genomes.
transFold predicts the super-secondary structure of transmembrane beta-barrel (TMB) proteins including side-chain orientation and topology of transmembrane segments.
ArchPRED predicts the structure of loop regions in protein structures based on a fragment-search based method. Given a query loop of unknown structure, ArchPRED identifies which loops of know structures are likely to share conformational similarity with the query loop.
Computed Atlas of Surface Topography of proteins (CASTp) locates and measures concave surface regions on 3D protein structures. This tool can be used to study surface features, binding sites, and functional regions of proteins.
KinDOCK is a tool for the analysis of ATP-binding sites of protein kinases based on a structural library of protein kinase-ligand complexes extracted from the Protein Data Bank (PDB).
Polypeptide Angles Suffix Tree (PAST) provides methods for searching three-dimensional protein structures and
identifying frequent common substructures (structural motifs) using an approach independent of the sequence of amino acids.
PDB_Hydro provides tools for mutating and solvating structures of biological macromolecules like proteins or DNA. It identifies strongly and poorly solvated exposed surfaces of structures in an effort to help identify potential interaction zones.
pH-dependent Protein Electrostatics Server (PHEPS) performs global and local pH-dependent electrostatic analysis of protein structures. The user can supply a PDB ID or upload a coordinate file.
PREDITOR predicts torsion angles in proteins from NMR chemical shift data and sequence homology. PREDITOR combines sequence alignment methods with advanced chemical shift analysis techniques to generate its torsion angle predictions.
Protein Features (PROFEAT) is a tool for computing commonly-used structural and physicochemical features of proteins and peptides from amino acid sequence. New functions include computation of protein-protein and protein-small molecule interactions, topological descriptors for peptide sequences and small molecule structures. New feature groups of descriptors have also been added.
Prediction of Biological Interfaces (PreBI) is a server for predicting protein-protein interfaces in protein structures that maximizes both the degree of complementarity and the interface area.
ProKware is a tool for investigating protein structural properties such as domains, functional sites, and post-translational modifications. To use some features a stand-alone piece of software is necessary; currently available for windows only.
Protein Structure Annotation Tool (ProSAT2) is a tool for the visualization of sequence based, residue-specific annotations mapped onto three-dimensional (3D) protein structures
The Protein motif (Protemot) server predicts protein binding sites based on structural templates automatically extracted from crystal structures of protein-ligand complexes in the PDB.
Readout is a server for the calculation of direct and indirect readout energy Z-scores which estimate the degree of sequence specificity of the protein-DNA complex. Readout can be useful for checking the quality of protein-DNA interactions in three dimensional (3D) structures.
Sliding Window Analysis of Ka and Ks (SWAKK) is tool for detecting positive selection in proteins using
a sliding window substitution rate analysis. The program can display the results on a 3D protein structure.
3-Dimensional Structural Superposition (3DSS) is a tool for superposing two or more protein structures that uses RASMOL for visualization; some browser configuration is necessary.
Optimal, Permuted, and Other Alternative
Alignments of Protein Structures (OPAAS) is a tool for investigating structural neighbors that can calculate a structural alignment for two protein structures.
Polypeptide Angles Suffix Tree (PAST) provides methods for searching three-dimensional protein structures and
identifying frequent common substructures (structural motifs) using an approach independent of the sequence of amino acids.
Protein Blocks Expert (PBE) uses a structural alphabet of short structural motifs to compare protein structures. PBE uses as its structural alphabet a set of protein blocks derived from structurally aligned homologous proteins present in the PALI Database.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
(PS)2 Protein Structure Prediction Server performs automated homology modeling by combining PSI-BLAST, IMPALA, and T-Coffee for template selection and target-template alignment. The final three-dimensional (3D) structure is built using RAMP or MODELLER.
ArchPRED predicts the structure of loop regions in protein structures based on a fragment-search based method. Given a query loop of unknown structure, ArchPRED identifies which loops of know structures are likely to share conformational similarity with the query loop.
Cologne University Protein Stability Analysis Tool (CUPSAT) is a tool to analyse and predict protein stability changes upon point mutations (single amino acid mutations) for known protein structures.
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
Max-Planck Institute Bioinformatics Toolkit provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction.
PREDITOR predicts torsion angles in proteins from NMR chemical shift data and sequence homology. PREDITOR combines sequence alignment methods with advanced chemical shift analysis techniques to generate its torsion angle predictions.
RosettaDesign identifies low energy sequences for specified protein three dimensional (3D) structures and can been used to predict stable side chain conformations.
Structural Alphabet based protein Backbone Builder from Alpha Carbon trace (SABBAC) is a tool for the reconstruction of complete protein structures from their alpha-carbon descriptions.
sgTarget is a structural genomics resource that helps to select and prioritize good targets for structure determination from a list of sequences. Target selection is based on multiple factors including homology searches and a range of physiochemical properties.
3-Dimensional Structural Superposition (3DSS) is a tool for superposing two or more protein structures that uses RASMOL for visualization; some browser configuration is necessary.
Computed Atlas of Surface Topography of proteins (CASTp) locates and measures concave surface regions on 3D protein structures. This tool can be used to study surface features, binding sites, and functional regions of proteins.
Family Identification with Structure Anchored HMMs (FISH) is a server for the identification of remote sequence homologues on the basis of protein domains.
ProKware is a tool for investigating protein structural properties such as domains, functional sites, and post-translational modifications. To use some features a stand-alone piece of software is necessary; currently available for windows only.
Protein Structure Annotation Tool (ProSAT2) is a tool for the visualization of sequence based, residue-specific annotations mapped onto three-dimensional (3D) protein structures
REinforced Merging techniques for Unique peptide
Segments (ReMus) is designed for identification of the locations and compositions of unique peptide segments from a set of protein family sequences.
SNP@Domain identifies SNPs within human protein domains allowing users to investigate SNPs in the context of two dimensional and three dimensional maps. Links to external databases (Pfam, SCOP, Ensembl, dbSNP, OMIM, SIFT) are also presented.
The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function. The PANTHER research tools allow you to score proteins against the PANTHER HMM library, use PANTHER to do gene expression analyses, and download PANTHER tools and data.
SNP@Domain identifies SNPs within human protein domains allowing users to investigate SNPs in the context of two dimensional and three dimensional maps. Links to external databases (Pfam, SCOP, Ensembl, dbSNP, OMIM, SIFT) are also presented.
Web Gene Ontology Annotation Plot (WEGO) helps visualize the annotation of sets of genes. WEGO can compare gene annotation datasets by plotting the distribution of gene ontology (GO) annotation results into a histogram.
Cologne University Protein Stability Analysis Tool (CUPSAT) is a tool to analyse and predict protein stability changes upon point mutations (single amino acid mutations) for known protein structures.
DiANNA (DiAminoacid Neural Network Application) a tool that predicts cysteine states of a protein. It can predict whether a particular cysteine is reduced, is forming a disulfide bond, or is bound to a metallic ligand.
Max-Planck Institute Bioinformatics Toolkit provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
PDB_Hydro provides tools for mutating and solvating structures of biological macromolecules like proteins or DNA. It identifies strongly and poorly solvated exposed surfaces of structures in an effort to help identify potential interaction zones.
pH-dependent Protein Electrostatics Server (PHEPS) performs global and local pH-dependent electrostatic analysis of protein structures. The user can supply a PDB ID or upload a coordinate file.
Protein Features (PROFEAT) is a tool for computing commonly-used structural and physicochemical features of proteins and peptides from amino acid sequence. New functions include computation of protein-protein and protein-small molecule interactions, topological descriptors for peptide sequences and small molecule structures. New feature groups of descriptors have also been added.
RosettaDesign identifies low energy sequences for specified protein three dimensional (3D) structures and can been used to predict stable side chain conformations.
Max-Planck Institute Bioinformatics Toolkit provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
Cascade PSI-BLAST detects distant protein similarities using a cascade search protocol where PSI-BLAST searches are carried out on each hit, until no new hits are found in the selected database (SwissProt, SCOP, or Pfam).
Family Identification with Structure Anchored HMMs (FISH) is a server for the identification of remote sequence homologues on the basis of protein domains.
MAGIIC-PRO is a tool for detecting patterns in protein sequences. MAGIIC-PRO takes a protein sequence as input and helps users prepare training data for the pattern mining experiments by using Swiss-Prot annotations or by PSI-BLAST. Multiple resources for analysis of the detected patterns are also presented.
The MEME Suite web server is a unified portal for discovery and analysis of sequence motifs (e.g. DNA binding sites, protein interaction domains). Motifs with/without gaps may be discovered, searched for in DNA and protein databases, compared to other motifs and analyzed for putative function by association with gene ontology terms.
MEME is a tool designed for discovering and searching for DNA motifs such as transcription factor binding sites (TFBS) or protein domains.
Max-Planck Institute Bioinformatics Toolkit provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction.
The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function. The PANTHER research tools allow you to score proteins against the PANTHER HMM library, use PANTHER to do gene expression analyses, and download PANTHER tools and data.
ProKware is a tool for investigating protein structural properties such as domains, functional sites, and post-translational modifications. To use some features a stand-alone piece of software is necessary; currently available for windows only.
SNP@Domain identifies SNPs within human protein domains allowing users to investigate SNPs in the context of two dimensional and three dimensional maps. Links to external databases (Pfam, SCOP, Ensembl, dbSNP, OMIM, SIFT) are also presented.
SUMOylation Sites Prediction (SUMOsp) predicts sites for sumoylation, a reversible post-translational modification of proteins by the small ubiquitin-related modifiers (SUMO).
Transcriptome Auto-annotation Conducting Tool (TACT) is an automated tool for conducting functional annotation of transcripts that integrates sequence similarity searches and functional motif predictions. TACT was originally developed as an automatic annotation pipeline for the Human Full-length cDNA Annotation Invitational (H-Inv) project.
TreeDomViewer is a tool for the visualization of phylogeny and protein domain structure. TreeDomViewer constructs phylogenetic trees and projects the corresponding protein domain information onto the multiple sequence alignment.
Agile Protein Interaction DataAnalyzer (APID) allows you to query protein-protein interactions using a common platform facilitating comparison across different datasets. Currently supported interaction databases are BIND, HPRD, DIP, IntAct, and MINT.
Babelomics is a suite of web tools for the functional annotation and analysis of groups of genes in high throughput experiments. Tools include: FatiGO, FatiGOplus, Fatiscan, Gene Set Enrichment Analysis (GSEA), Marmite, and the Tissues Mining Tool (TMT). Other tools include Biocarta pathways, Transfac and a tool de novo functional annotation of sequences.
BiologicalNetworks allows retrieval, construction, and visualization of biological networks and allows for the analysis of high-throughput expression data by mapping onto network graphs. BiologicalNetworks queries the PathSys system, a combined database of biological pathways, gene regulatory networks, and protein interaction maps.
KEGG Orthology-Based Annotation System (KOBAS) annotates sequences with KEGG Orthology terms and identifies the frequently occurring (or significantly enriched) pathways among the queried sequences compared against a background distribution. 5 pathway databases are used (KEGG Pathway, PID, BioCyc, Reactome, Panther) and 5 human databases (OMIM, KEGG Disease, FunDO, GAD, NHGRI GWAS).
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function. The PANTHER research tools allow you to score proteins against the PANTHER HMM library, use PANTHER to do gene expression analyses, and download PANTHER tools and data.
SVMHC is a server for prediction of MHC class I and class II binding peptides. SVMHC can be used to find likely binders in a protein sequence and to investigate the effects of single nucleotide polymorphisms on MHC-peptide binding.
Identify Conserved targets of a Regulon (iCR) predicts regulatory binding sites based on conservation across multiple species. The user provides an ungapped multiple sequence alignment representing the regulatory binding site. iCR reports back with predicted binding sites that are conserved and highlights the downstream co-regulated genes.
Metabolic Search And Reconstruction Kit (metaSHARK) provides an interactive visualisation platform for the
KEGG metabolic network in the form of the SHARKview applet.
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
PONGO allows users to compare topology predictions of membrane proteins made by four different predictors (MEMSAT, TMHMM, ENSEMBLE, and PRODIV). It also displays a signal peptide prediction determined with SPEP.
PROFtmb predicts transmembrane beta-barrel (TMB) proteins in Gram-negative bacteria. In addition to running your own predictions, you can also download predictions for all proteins in 78 Gram-Negative bacterial genomes.
pTARGET predicts the subcellular localization of eukaryotic proteins based on the occurrence patterns of location-specific protein functional domains and the amino acid compositional differences in proteins from nine distinct subcellular locations.
transFold predicts the super-secondary structure of transmembrane beta-barrel (TMB) proteins including side-chain orientation and topology of transmembrane segments.
KinDOCK is a tool for the analysis of ATP-binding sites of protein kinases based on a structural library of protein kinase-ligand complexes extracted from the Protein Data Bank (PDB).
Normal Mode Analysis, Deformation, and Refinement (NOMAD-Ref) provides tools for calculating functionally relevant movements in biological macromolecules. This resource provides tools for investigating different conformations of large protein structures, for modeling receptor/ligand docking complexes, and for calculating the associated molecular motions.
Target Fishing Dock (TarFisDock) is a web server that docks small molecules with protein structures in the Potential Drug Target Database (PDTD) in an effort to discover new drug targets.
Online Guassian Network Model (oGNM) calculates the equilibrium dynamics of any structure submitted in PDB format, using the Gaussian Network Model (GNM), a network model for representing biomolecules.
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
Max-Planck Institute Bioinformatics Toolkit provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction.
PAL2NAL converts a multiple sequence alignment of proteins and the corresponding DNA (or mRNA) sequences into a codon alignment. Synonymous (Ks) and non-synonymous (Ka) substitution rates can be calculated.
Sliding Window Analysis of Ka and Ks (SWAKK) is tool for detecting positive selection in proteins using
a sliding window substitution rate analysis. The program can display the results on a 3D protein structure.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
TreeDomViewer is a tool for the visualization of phylogeny and protein domain structure. TreeDomViewer constructs phylogenetic trees and projects the corresponding protein domain information onto the multiple sequence alignment.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
NeuroPred is a tool designed to predict cleavage sites at basic amino acid locations in neuropeptide precursor sequences. Neuropred also computes the mass of the predicted peptides with or without selected post-translational modifications.
sgTarget is a structural genomics resource that helps to select and prioritize good targets for structure determination from a list of sequences. Target selection is based on multiple factors including homology searches and a range of physiochemical properties.
DiANNA (DiAminoacid Neural Network Application) a tool that predicts cysteine states of a protein. It can predict whether a particular cysteine is reduced, is forming a disulfide bond, or is bound to a metallic ligand.
HHrep is a tool for the de novo identification
of repeats in protein sequences based on the pairwise comparison of profile hidden Markov models (HMMs).
MAGIIC-PRO is a tool for detecting patterns in protein sequences. MAGIIC-PRO takes a protein sequence as input and helps users prepare training data for the pattern mining experiments by using Swiss-Prot annotations or by PSI-BLAST. Multiple resources for analysis of the detected patterns are also presented.
The MEME Suite web server is a unified portal for discovery and analysis of sequence motifs (e.g. DNA binding sites, protein interaction domains). Motifs with/without gaps may be discovered, searched for in DNA and protein databases, compared to other motifs and analyzed for putative function by association with gene ontology terms.
MEME is a tool designed for discovering and searching for DNA motifs such as transcription factor binding sites (TFBS) or protein domains.
Methylation Modification Prediction Server (MeMo) is a server that predicts arginine and lysine sites that undergo methylation using a support vector machine (SVM).
NeuroPred is a tool designed to predict cleavage sites at basic amino acid locations in neuropeptide precursor sequences. Neuropred also computes the mass of the predicted peptides with or without selected post-translational modifications.
Protein Features (PROFEAT) is a tool for computing commonly-used structural and physicochemical features of proteins and peptides from amino acid sequence. New functions include computation of protein-protein and protein-small molecule interactions, topological descriptors for peptide sequences and small molecule structures. New feature groups of descriptors have also been added.
REinforced Merging techniques for Unique peptide
Segments (ReMus) is designed for identification of the locations and compositions of unique peptide segments from a set of protein family sequences.
Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
Cascade PSI-BLAST detects distant protein similarities using a cascade search protocol where PSI-BLAST searches are carried out on each hit, until no new hits are found in the selected database (SwissProt, SCOP, or Pfam).
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
Eukaryotic Orthology (YOGY) is a resource for retrieving orthologous proteins from nine eukaryotic organisms. Using a gene or protein identifier as a query, this database provides comprehensive, combined information on orthologs in other species using data from five independent resources: KOGs, Inparanoid, Homologene, OrthoMCL, and a table of curated orthologs between budding yeast and fission yeast. Associated Gene Ontology (GO) terms of orthologs can also be
retrieved.
Composite Regulatory Signature Database (CRSD) is a microarray analysis pipeline aimed at the discovery of motifs involved in gene regulation including microRNA signatures and transcription factor binding sites (TFBS).
Crosslink is a tool for visualizing pairwise sequence similiarity relationships determined using BLAST, Vmatch, and RNAhybrid. The nucleotide sequences and their relationships are displayed as nodes and links of a network, respectively. CrossLink can be useful for investigating potential interactions between microRNAs and their targets.
RNAhybrid is a tool for predicting miRNA (microRNA) targets by calculating the minimum free energy of hybridization between target RNA and miRNA sequences.
Regulatory RNA (RegRNA) is a server for identifying regulatory RNA motifs and elements in mRNA sequences including: motifs located in the 5\'-UTR and 3\'-UTR; motifs involved in mRNA splicing and transcriptional regulation; other motifs in mRNA such as riboswitches and splice donor/acceptor sites; RNA structural features; and miRNA target sites.
Nearest Alignment Space Termination (NAST) is a multiple sequence alignment server for comparative analysis of 16S rRNA gene sequences from bacteria and archaea.
The Internal Transcribed Spacer 2 Database (ITS2) includes a homology based RNA structure prediction algorithm which allows the detection and secondary structure prediction of ITS2 sequences. This resource also contains more than 25,000 pre-calculated secondary structures for currently known ITS2 sequences that can be searched and browsed via taxonomy.
Tools for Motif Discovery (MoD) in nucleotide sequences that includes: Weeder, a program for detecting transcription factor binding sites (TFBS) in coregulated genes; WeederH, a tool for detecting TFBS and regulatory regions from homologous genes; and RNA profile, a tool for secondary structure motif discovery in RNA sequences.
Quadruplex forming G-Rich Sequences (QGRS) Mapper searches nucleotide sequences for the presence of G-quartet motifs. QGRS Mapper allows you to view the distribution and composition of QGRS in the context of alternatively spliced isoforms of your gene.
Regulatory RNA (RegRNA) is a server for identifying regulatory RNA motifs and elements in mRNA sequences including: motifs located in the 5\'-UTR and 3\'-UTR; motifs involved in mRNA splicing and transcriptional regulation; other motifs in mRNA such as riboswitches and splice donor/acceptor sites; RNA structural features; and miRNA target sites.
Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
Alignment of RNA Tertiary Structures (ARTS) is a method for aligning two nucleic acid structures (RNAs or DNAs) and detecting a-priori unknown common substructures.
CorreLogo generates a 3D sequence logo for RNA or DNA alignments. The first two dimensions of the sequence logo display information about the information content and residue composition of individual columns of the alignment. The 3D sequence logo consists of a square matrix that shows columns with high mutual information, a measure of how much the residues in two alignment columns are correlated.
The Internal Transcribed Spacer 2 Database (ITS2) includes a homology based RNA structure prediction algorithm which allows the detection and secondary structure prediction of ITS2 sequences. This resource also contains more than 25,000 pre-calculated secondary structures for currently known ITS2 sequences that can be searched and browsed via taxonomy.
Quadruplex forming G-Rich Sequences (QGRS) Mapper searches nucleotide sequences for the presence of G-quartet motifs. QGRS Mapper allows you to view the distribution and composition of QGRS in the context of alternatively spliced isoforms of your gene.
Kalign is a fast and accurate multiple sequence alignment algorithm designed to align large numbers of protein sequences. Kalignvu is an xml based alignment viewer that allows users to resize alignments and choose different colour schemes. Mumsa is a tool for automatic assessment of alignment quality.
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
CorreLogo generates a 3D sequence logo for RNA or DNA alignments. The first two dimensions of the sequence logo display information about the information content and residue composition of individual columns of the alignment. The 3D sequence logo consists of a square matrix that shows columns with high mutual information, a measure of how much the residues in two alignment columns are correlated.
Kalign is a fast and accurate multiple sequence alignment algorithm designed to align large numbers of protein sequences. Kalignvu is an xml based alignment viewer that allows users to resize alignments and choose different colour schemes. Mumsa is a tool for automatic assessment of alignment quality.
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
PAL2NAL converts a multiple sequence alignment of proteins and the corresponding DNA (or mRNA) sequences into a codon alignment. Synonymous (Ks) and non-synonymous (Ka) substitution rates can be calculated.
PDA (Pipeline Diversity Analysis) is a server which searches for polymorphic sequences in large databases, and estimates their genetic diversity. Results contain sequence alignments (generated by ClustalW) and include statistics on polymorphism, synonymous and non-synonymous substitutions, linkage disequilibrium, and codon bias for your alignment. The full results set is also available for download as a MySQL database.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
The MEME Suite web server is a unified portal for discovery and analysis of sequence motifs (e.g. DNA binding sites, protein interaction domains). Motifs with/without gaps may be discovered, searched for in DNA and protein databases, compared to other motifs and analyzed for putative function by association with gene ontology terms.
MEME is a tool designed for discovering and searching for DNA motifs such as transcription factor binding sites (TFBS) or protein domains.
Sorting Permutation by Reversals and Block Interchanges (SPRING) is a tool for the analysis of genome rearrangements. SPRING takes two or more chromosomes as its input and then computes a minimum series of reversals and/or block-interchanges for transforming one chromosome into another. Phylogenetic trees based on the rearrangement analysis are also shown as part of the results.
Eukaryotic Orthology (YOGY) is a resource for retrieving orthologous proteins from nine eukaryotic organisms. Using a gene or protein identifier as a query, this database provides comprehensive, combined information on orthologs in other species using data from five independent resources: KOGs, Inparanoid, Homologene, OrthoMCL, and a table of curated orthologs between budding yeast and fission yeast. Associated Gene Ontology (GO) terms of orthologs can also be
retrieved.
Family Identification with Structure Anchored HMMs (FISH) is a server for the identification of remote sequence homologues on the basis of protein domains.
GenDecoder is a prediction server for animal mitochondrial genetic codes. It provides information about codon-usage, amino acid composition, GC content and a final genetic code prediction for a mitochondrial genome sequence.
HHsenser is a tool for sensitive iterative sequence searching based on HMM-HMM comparison. Starting from a single sequence or an alignment, HHSenser is able to build alignments with as many near or remote homologs as possible allowing users to explore protein superfamilies.
Kalign is a fast and accurate multiple sequence alignment algorithm designed to align large numbers of protein sequences. Kalignvu is an xml based alignment viewer that allows users to resize alignments and choose different colour schemes. Mumsa is a tool for automatic assessment of alignment quality.
MAGIIC-PRO is a tool for detecting patterns in protein sequences. MAGIIC-PRO takes a protein sequence as input and helps users prepare training data for the pattern mining experiments by using Swiss-Prot annotations or by PSI-BLAST. Multiple resources for analysis of the detected patterns are also presented.
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
Max-Planck Institute Bioinformatics Toolkit provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction.
Nearest Alignment Space Termination (NAST) is a multiple sequence alignment server for comparative analysis of 16S rRNA gene sequences from bacteria and archaea.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
TreeDomViewer is a tool for the visualization of phylogeny and protein domain structure. TreeDomViewer constructs phylogenetic trees and projects the corresponding protein domain information onto the multiple sequence alignment.
Alternative Splicing Prediction (ASPic) can predict alternative splicing of user submitted genes based on comparative analysis of available transcript and genome data from a variety of species. Results include graphical and tabular views of the splicing patterns of full-length mRNA isoforms compatible with the detected splice sites of the genes as well as structural and functional annotations.
HHrep is a tool for the de novo identification
of repeats in protein sequences based on the pairwise comparison of profile hidden Markov models (HMMs).
PAL2NAL converts a multiple sequence alignment of proteins and the corresponding DNA (or mRNA) sequences into a codon alignment. Synonymous (Ks) and non-synonymous (Ka) substitution rates can be calculated.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
Crosslink is a tool for visualizing pairwise sequence similiarity relationships determined using BLAST, Vmatch, and RNAhybrid. The nucleotide sequences and their relationships are displayed as nodes and links of a network, respectively. CrossLink can be useful for investigating potential interactions between microRNAs and their targets.
Family Identification with Structure Anchored HMMs (FISH) is a server for the identification of remote sequence homologues on the basis of protein domains.
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
Max-Planck Institute Bioinformatics Toolkit provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction.
PlotRep is a tool for the visual analysis of dispersed genomic repeats. The tool is designed to merge fragments belonging to the same copy of a repeat and allows for the inspection of the results using a dot-plot like graphical output.
Sliding Window Analysis of Ka and Ks (SWAKK) is tool for detecting positive selection in proteins using
a sliding window substitution rate analysis. The program can display the results on a 3D protein structure.
Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
Cascade PSI-BLAST detects distant protein similarities using a cascade search protocol where PSI-BLAST searches are carried out on each hit, until no new hits are found in the selected database (SwissProt, SCOP, or Pfam).
Crosslink is a tool for visualizing pairwise sequence similiarity relationships determined using BLAST, Vmatch, and RNAhybrid. The nucleotide sequences and their relationships are displayed as nodes and links of a network, respectively. CrossLink can be useful for investigating potential interactions between microRNAs and their targets.
Family Identification with Structure Anchored HMMs (FISH) is a server for the identification of remote sequence homologues on the basis of protein domains.
FeatureScan searches for sequence similarity by comparing physico-chemical properties of DNA such as charge, melting enthalpy, conformational parameters, etc.
HHrep is a tool for the de novo identification
of repeats in protein sequences based on the pairwise comparison of profile hidden Markov models (HMMs).
HHsenser is a tool for sensitive iterative sequence searching based on HMM-HMM comparison. Starting from a single sequence or an alignment, HHSenser is able to build alignments with as many near or remote homologs as possible allowing users to explore protein superfamilies.
Max-Planck Integrated Gene Analysis System (MIGenAS) provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction. Users can also configure \"meta\"-tools as a pipeline of individual tools and intermediate filters.
Max-Planck Institute Bioinformatics Toolkit provides access to many different bioinformatics software tools and databases for sequence similarity searching, multiple sequence alignments, phylogenetic analysis, and protein structure prediction.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
PlotRep is a tool for the visual analysis of dispersed genomic repeats. The tool is designed to merge fragments belonging to the same copy of a repeat and allows for the inspection of the results using a dot-plot like graphical output.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
Transcriptome Auto-annotation Conducting Tool (TACT) is an automated tool for conducting functional annotation of transcripts that integrates sequence similarity searches and functional motif predictions. TACT was originally developed as an automatic annotation pipeline for the Human Full-length cDNA Annotation Invitational (H-Inv) project.