Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
Web server that automatically generates and annotates circular plasmid maps. The tool has: a built in set of features that can be displayed (ie. RE sites, tags, ORFs, etc.); allows users to define custom features to display; contains a library of commonly used plasmids; and, generates nice looking images in a variety of output formats.
The Web Alignment Visualization Server (WAViS) provides various web tools to enhance the presentation of amino acid or nucleotide multiple sequence alignments.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
AGenDA is a web tool that compares the genomic sequences from evolutionarily related organisms in order to make gene predictions. It takes pairs of genomic sequences as input, aligns the sequences, and makes predictions based on splice signals, start and stop codons, and areas of conserved sequence.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
Server which aims to extract proximal promoter sequences from mammalian genomes. Does so by mapping mRNA and EST sequences and tracking overlapping alignments to find the transcription start site.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
The Gene Expression Pattern Analysis Suite (GEPAS) is a collection of tools for the analysis of microarray data. GEPAS includes tools for data pre-processing, clustering, differential gene expression, predictors, array CGH and functional annotation. A new pipeline module allows for automation of sequential analysis steps.
PDA (Pipeline Diversity Analysis) is a server which searches for polymorphic sequences in large databases, and estimates their genetic diversity. Results contain sequence alignments (generated by ClustalW) and include statistics on polymorphism, synonymous and non-synonymous substitutions, linkage disequilibrium, and codon bias for your alignment. The full results set is also available for download as a MySQL database.
PupaSuite is a SNP analysis tool that allows for the selection of relevant SNPs within a gene based on the characteristics of the SNP. PupaSuite also provides information about LD parameters (based on genotype data from HapMap) and identifies haplotype blocks and tag SNPs. PupaSuite was created to join PupaSNP & PupasView.
The SBT (Sequencing-based typing) interface is a tool accompanying the NCBI's Major Histocompatibility Complex database (dbMHC). It identifies the allelic composition of sequence-based typing (SBT) results of cDNA or genomic sequences. Samples are compared with databases containing all known alleles for several human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptors (KIR) loci. An interactive sequence viewer allows inspection of the alignments.
Tool for large-scale standardized primer design with specific modules for SNP and exon PCR amplification. Results are presented as HTML with genomic position, primer sequence, and PCR conditions. Precomputed SNPbox results for all Ensembl exons are also available on the server.
Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
It goes to the library. You go to the pub; receive email alerts for current contents of PubMed and GenBank; e.g. use accession number of htg record as query to receive sequence updates (as the version number changes).
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
Batch Extraction and Analysis of cis-Regulatory Regions (BEARR) takes a list of gene identifiers (such as RefSeq and Unigene IDs), consensus patterns, and (optionally) a position weight matrix as input and returns a list of matches for the patterns in both the sense and anti-sense strands of the relevant genomic sequence. The user specifies what distance up and downstream from both the transcription start site and the 3\' terminus to look for the patterns.
Server which scans upstream of genes in the same gene expression cluster for regulatory sequence motifs using a Gibbs sampling strategy. The Markov background model is used for non-motif bases, improving specificity of predicted motif locations.
CREME (Cis-Regulatory Module Explorer for the human genome) is a tool for identifying and visualizing cis-regulatory modules for a given set of genes that are potentially co-expressed or co-regulated. It takes as input a list of accession numbers, and reports back common modules, grouping genes from the list by which modules are found in their promoter regions.
CSTminer searches for Conserved Sequence Tags (CST) in pairwise comparisons of genomic sequences. Output is a graphical map showing the location of CSTs overlayed with known Ensembl exons and ESTs. A scoring system reflects coding or non-coding potential of these conserved blocks.
Server which attempts to identify any motifs related to genes predicted to share regulatory elements. It alters Gibbs sampling through biasing searches towards conserved sequences across multiple species.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
LowComplexity is a tool that searches for low complexity regions of DNA or protein sequences. Using LowComplexity you can search long sequences (chromosomes, genomes) or a set of aligned sequences. This resource also contains links to other algorithms for evaluating the complexity of sequences.
Server designed to pinpoint protein-DNA interaction sites at the base pair level. Uses ChIP-array data, word enumeration and position-specific weight matrix updating to search for motifs representing these interaction sites.
MoViES (Molecular Vibrations Evaluation Server) allows the computation of the vibrational and thermal fluctuations dynamics for the chemical bonds of biomolecules. The server takes as input a structure file in PDB format and returns results by email. Analysis of vibrational motions and thermal fluctuational dynamics can give insights into the functional properties of both proteins and nucleic acids.
MotifViz is a tool for detecting overrepresented transcription factor binding motifs. Four motif discovery programs are accessible from the MotifViz web interface: Clover, Rover, Motifish and Possum.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
POBO is a tool to summarize, verify and screen predetermined cis-element motifs from a set of sequences. POBO reports the results in as understandable format as possible for biologists.
PredictRegulon is a tool that, for a given regulatory protein and prokaryotic genome, predicts binding sites and operons. The user selects a prokaryote genome and provides an ungapped multiple sequence alignment representing the binding site. PredictRegulon reports back with predicted binding sites and downstream co-regulated genes.
Server which aims to extract proximal promoter sequences from mammalian genomes. Does so by mapping mRNA and EST sequences and tracking overlapping alignments to find the transcription start site.
Tool for sequence feature visualization and comparison; integrates with Internet Explorer; accepts GenBank Flat Files, GenBank HTML, FASTA files; plugins exist for visualization of output from RepeatMasker, PsiPred, and Cister.
VISTA is a comprehensive suite of programs and databases for comparative analysis of genomic sequences. There are two ways of using VISTA - you can submit your own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species.
Server which identifies conserved motifs within sets of related regulatory DNA sequences, which are likely to be transcription factor binding sites. The interface will automatically conduct several runs using different parameters and output a summary.
Tool which aids in the design and quality control of small interfering RNAs (siRNAs) for RNA interference (RNAi) and gene silencing. It evaluates the inhibitory potency of potential siRNA sequences as well as identifying gene regions that have a high silencing capacity.
Tool which divides both strands of an input DNA sequence into a set of contiguous oligonucleotides. These oligos are designed with complimentary regions so that the complete set can be combined to synthesize the input DNA sequence in vitro (using oligonucleotide synthesis and assembly PCR).
Web server that automatically generates and annotates circular plasmid maps. The tool has: a built in set of features that can be displayed (ie. RE sites, tags, ORFs, etc.); allows users to define custom features to display; contains a library of commonly used plasmids; and, generates nice looking images in a variety of output formats.
Using current releases of genomic sequence data, ProbeLynx allows users to assess the specificity of probe sequences used for microarray experiments. The user provides probe sequences in FASTA or tab-delimited format, and ProbeLynx reports specificity information for each probe and functional annotations from Ensembl or TIGR for each probe target hit.
Tool for large-scale standardized primer design with specific modules for SNP and exon PCR amplification. Results are presented as HTML with genomic position, primer sequence, and PCR conditions. Precomputed SNPbox results for all Ensembl exons are also available on the server.
Server with three tools for the rational design of small interfering RNAs (Sirna), antisense oligonucleotides (Soligo), and trans-cleaving ribozymes (Sribo). A fourth tool, Srna, returns output including general folding features.
T7 RNAi Oligo Designer (TROD) aids in the design of DNA oligonucleotides for short interfering RNA (siRNA) synthesis with T7 RNA polymerase. It takes an input of a cDNA sequence and outputs a list of DNA oligos for ordering.
Server for computing small interfering RNA (siRNA) sequences which are best suited for mammalian RNA interference (RNAi). The site accepts a sequence as input and returns a list of siRNA candidates.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
The Bioinformatics Links Directory features curated links to molecular resources, tools and databases. All of the resources are free or available for a nominal fee.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
Batch Extraction and Analysis of cis-Regulatory Regions (BEARR) takes a list of gene identifiers (such as RefSeq and Unigene IDs), consensus patterns, and (optionally) a position weight matrix as input and returns a list of matches for the patterns in both the sense and anti-sense strands of the relevant genomic sequence. The user specifies what distance up and downstream from both the transcription start site and the 3\' terminus to look for the patterns.
Server which scans upstream of genes in the same gene expression cluster for regulatory sequence motifs using a Gibbs sampling strategy. The Markov background model is used for non-motif bases, improving specificity of predicted motif locations.
Server which analyzes microarray and promoter sequence data associated with a response to a specific stimulus. After analysis a potential transcriptional regulatory network is created. CARRIE also determines which transcription factors were likely involved in regulation and which genes they regulated.
The Conserved Transcription Factor Binding Site Finder (CONFAC) takes a list of human gene names and identifiers as input, and compares them with their mouse orthologues to identify conserved transcription factor binding sites. Further information from the user allows CONFAC to identify binding sites that are enriched in the promoter regions of gene clusters from microarray analyses when compared to control gene sets.
CREME (Cis-Regulatory Module Explorer for the human genome) is a tool for identifying and visualizing cis-regulatory modules for a given set of genes that are potentially co-expressed or co-regulated. It takes as input a list of accession numbers, and reports back common modules, grouping genes from the list by which modules are found in their promoter regions.
CSTminer searches for Conserved Sequence Tags (CST) in pairwise comparisons of genomic sequences. Output is a graphical map showing the location of CSTs overlayed with known Ensembl exons and ESTs. A scoring system reflects coding or non-coding potential of these conserved blocks.
Server which attempts to identify any motifs related to genes predicted to share regulatory elements. It alters Gibbs sampling through biasing searches towards conserved sequences across multiple species.
Tool which aids in the design and quality control of small interfering RNAs (siRNAs) for RNA interference (RNAi) and gene silencing. It evaluates the inhibitory potency of potential siRNA sequences as well as identifying gene regions that have a high silencing capacity.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
Server designed to pinpoint protein-DNA interaction sites at the base pair level. Uses ChIP-array data, word enumeration and position-specific weight matrix updating to search for motifs representing these interaction sites.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
POBO is a tool to summarize, verify and screen predetermined cis-element motifs from a set of sequences. POBO reports the results in as understandable format as possible for biologists.
PredictRegulon is a tool that, for a given regulatory protein and prokaryotic genome, predicts binding sites and operons. The user selects a prokaryote genome and provides an ungapped multiple sequence alignment representing the binding site. PredictRegulon reports back with predicted binding sites and downstream co-regulated genes.
RNA motif search program that identifies RNA motifs called riboswitches which are metabolic binding domains in mRNA that regulate gene expression. The program was originally designed around a set of riboswitches found in Bacillus subtilis.
Server with three tools for the rational design of small interfering RNAs (Sirna), antisense oligonucleotides (Soligo), and trans-cleaving ribozymes (Sribo). A fourth tool, Srna, returns output including general folding features.
T7 RNAi Oligo Designer (TROD) aids in the design of DNA oligonucleotides for short interfering RNA (siRNA) synthesis with T7 RNA polymerase. It takes an input of a cDNA sequence and outputs a list of DNA oligos for ordering.
Server which identifies conserved motifs within sets of related regulatory DNA sequences, which are likely to be transcription factor binding sites. The interface will automatically conduct several runs using different parameters and output a summary.
Server for computing small interfering RNA (siRNA) sequences which are best suited for mammalian RNA interference (RNAi). The site accepts a sequence as input and returns a list of siRNA candidates.
ProMoST (Protein Modification Screening Tool) is a program to calculate accurate MW and pI values from proteins considering the effects of post-translational modifications. Results are displayed as calculated values of pI and MW for each protein and are also plotted on two-dimensional (2D) gel images.
Gene modeling server which focuses on the modeling of alternative splicing. It is based on the alignment of mRNA, EST and protein sequences and combines genome-based clustering and transcript assembly. Supports human, mouse and rat genomes.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
ArrayPipe allows users to customize a processing pipeline for the analysis of microarray data. Includes methods for quality assessment of slides, data visualization, normalization, and detection of differentially expressed genes. Output consists of reports formatted as standard web pages and tab-delimited lists of calculated values.
ArrayXPath is a web-based service for matching microarray gene-expression profiles with known biological pathways. Input is a clustered gene-expression profile in a tab-delimited text format. Output includes pathway diagrams.
Batch Extraction and Analysis of cis-Regulatory Regions (BEARR) takes a list of gene identifiers (such as RefSeq and Unigene IDs), consensus patterns, and (optionally) a position weight matrix as input and returns a list of matches for the patterns in both the sense and anti-sense strands of the relevant genomic sequence. The user specifies what distance up and downstream from both the transcription start site and the 3\' terminus to look for the patterns.
Server which scans upstream of genes in the same gene expression cluster for regulatory sequence motifs using a Gibbs sampling strategy. The Markov background model is used for non-motif bases, improving specificity of predicted motif locations.
Server which analyzes microarray and promoter sequence data associated with a response to a specific stimulus. After analysis a potential transcriptional regulatory network is created. CARRIE also determines which transcription factors were likely involved in regulation and which genes they regulated.
The Conserved Transcription Factor Binding Site Finder (CONFAC) takes a list of human gene names and identifiers as input, and compares them with their mouse orthologues to identify conserved transcription factor binding sites. Further information from the user allows CONFAC to identify binding sites that are enriched in the promoter regions of gene clusters from microarray analyses when compared to control gene sets.
Server which attempts to identify any motifs related to genes predicted to share regulatory elements. It alters Gibbs sampling through biasing searches towards conserved sequences across multiple species.
Expression Profiler is a web based platform for microarray data analysis developed at the EBI. This resource is integrated with the ArrayExpress database, a public repository for microarray data.
The Gene Expression Pattern Analysis Suite (GEPAS) is a collection of tools for the analysis of microarray data. GEPAS includes tools for data pre-processing, clustering, differential gene expression, predictors, array CGH and functional annotation. A new pipeline module allows for automation of sequential analysis steps.
Genome Functional INtegrated Discoverer (GFINDer) takes a list of gene/clone IDs with classification information as input, and allows the user to characterize the different gene classes in the list using annotations of various types from several different sources.
KARMA (Keck Array Manager and Annotator) allows you compare and annotate your own microarrays against other available arrays. Comparison of arrays can be achieved within the same species as well as across species (array comparison is based on UniGene Cluster ID and/or Homologene data). Annotation data includes LocusLink, SwissProt, and Gene Ontology.
Server designed to pinpoint protein-DNA interaction sites at the base pair level. Uses ChIP-array data, word enumeration and position-specific weight matrix updating to search for motifs representing these interaction sites.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
POBO is a tool to summarize, verify and screen predetermined cis-element motifs from a set of sequences. POBO reports the results in as understandable format as possible for biologists.
Using current releases of genomic sequence data, ProbeLynx allows users to assess the specificity of probe sequences used for microarray experiments. The user provides probe sequences in FASTA or tab-delimited format, and ProbeLynx reports specificity information for each probe and functional annotations from Ensembl or TIGR for each probe target hit.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
E2G is a tool that maps a large set of EST and cDNA sequences to a user-supplied genomic sequence. The use of pre-computed indexed data structures increases the efficiency of the sequence comparison process, allowing a large amount of data to be mapped within a reasonable timeframe.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
Batch Extraction and Analysis of cis-Regulatory Regions (BEARR) takes a list of gene identifiers (such as RefSeq and Unigene IDs), consensus patterns, and (optionally) a position weight matrix as input and returns a list of matches for the patterns in both the sense and anti-sense strands of the relevant genomic sequence. The user specifies what distance up and downstream from both the transcription start site and the 3\' terminus to look for the patterns.
CREME (Cis-Regulatory Module Explorer for the human genome) is a tool for identifying and visualizing cis-regulatory modules for a given set of genes that are potentially co-expressed or co-regulated. It takes as input a list of accession numbers, and reports back common modules, grouping genes from the list by which modules are found in their promoter regions.
The SBT (Sequencing-based typing) interface is a tool accompanying the NCBI's Major Histocompatibility Complex database (dbMHC). It identifies the allelic composition of sequence-based typing (SBT) results of cDNA or genomic sequences. Samples are compared with databases containing all known alleles for several human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptors (KIR) loci. An interactive sequence viewer allows inspection of the alignments.
PDA (Pipeline Diversity Analysis) is a server which searches for polymorphic sequences in large databases, and estimates their genetic diversity. Results contain sequence alignments (generated by ClustalW) and include statistics on polymorphism, synonymous and non-synonymous substitutions, linkage disequilibrium, and codon bias for your alignment. The full results set is also available for download as a MySQL database.
PupaSuite is a SNP analysis tool that allows for the selection of relevant SNPs within a gene based on the characteristics of the SNP. PupaSuite also provides information about LD parameters (based on genotype data from HapMap) and identifies haplotype blocks and tag SNPs. PupaSuite was created to join PupaSNP & PupasView.
The SBT (Sequencing-based typing) interface is a tool accompanying the NCBI's Major Histocompatibility Complex database (dbMHC). It identifies the allelic composition of sequence-based typing (SBT) results of cDNA or genomic sequences. Samples are compared with databases containing all known alleles for several human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptors (KIR) loci. An interactive sequence viewer allows inspection of the alignments.
Tool for large-scale standardized primer design with specific modules for SNP and exon PCR amplification. Results are presented as HTML with genomic position, primer sequence, and PCR conditions. Precomputed SNPbox results for all Ensembl exons are also available on the server.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
It goes to the library. You go to the pub; receive email alerts for current contents of PubMed and GenBank; e.g. use accession number of htg record as query to receive sequence updates (as the version number changes).
NLProt is a tool for finding protein names in natural language text. This data-mining method is a useful approach for extracting protein UniprotIDs from research articles for the construction of custom datasets and/or databases.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
The ECR (Evolutionary Conserved Regions) browser is a web-based tool for visualizing and navigating through whole genome alignments of several vertebrate species. Users can also submit sequences for alignment with one of the genomes represented.
The ECR (Evolutionary Conserved Regions) browser is a web-based tool for visualizing and navigating through whole genome alignments of several vertebrate species. Users can also submit sequences for alignment with one of the genomes represented.
Composition Vector Tree (CVTree) infers phylogenetic relationships between microbial organisms by comparing their proteomes using a composition vector approach.
Phydbac2 (Phylogenomic display of bacterial genes) is a tool to visualize and explore the phylogenomic profiles of bacterial protein sequences. It also allows the user to view sequence similarity across different organisms, access other genes with similar conservation profiles, and view genes that are found nearby a selected gene in multiple genomes.
Gene modeling server which focuses on the modeling of alternative splicing. It is based on the alignment of mRNA, EST and protein sequences and combines genome-based clustering and transcript assembly. Supports human, mouse and rat genomes.
Batch Extraction and Analysis of cis-Regulatory Regions (BEARR) takes a list of gene identifiers (such as RefSeq and Unigene IDs), consensus patterns, and (optionally) a position weight matrix as input and returns a list of matches for the patterns in both the sense and anti-sense strands of the relevant genomic sequence. The user specifies what distance up and downstream from both the transcription start site and the 3\' terminus to look for the patterns.
The ECR (Evolutionary Conserved Regions) browser is a web-based tool for visualizing and navigating through whole genome alignments of several vertebrate species. Users can also submit sequences for alignment with one of the genomes represented.
The Genotyping tool at the NCBI identifies the genotype (or subtype) of viral sequences by using a sliding window approach to BLAST analysis against reference sequences for different viral subtypes. Results are shown as a graphical output plotting the top-scoring genotype. An alignment tool is also available.
The ECR (Evolutionary Conserved Regions) browser is a web-based tool for visualizing and navigating through whole genome alignments of several vertebrate species. Users can also submit sequences for alignment with one of the genomes represented.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
The beta-barrel Outer Membrane protein Predictor (BOMP) takes one or more fasta-formatted polypeptide sequences from Gram-negative bacteria as input and predicts whether or not they are beta-barrel integral outer membrane proteins.
A server that supports several different algorithms for the analysis of Circular Dichroism (CD) spectra for the prediction of protein secondary structure. Results also contain a graphical comparison of calculated versus experimental results.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
PRED-TMBB is a tool that takes a Gram-negative bacteria protein sequence as input and predicts the transmembrane strands and the probability of it being an outer membrane beta-barrel protein. The user has a choice of three different decoding methods.
An automated Protein Structure Prediction Pipeline (PSPP) based on multiple structure prediction tools. A key component of the pipeline is the fold recognition program, PROSPECT. Server supports genome scale analyses.
PredictProtein is a protein sequence analysis and structure prediction tool. Users provide a protein sequence, and PredictProtein reports similar sequences, PROSITE sequence motifs, and various types of structure prediction information. You can also use META PredictProtein to submit a sequence to one web form and query several other servers at once, retrieving the results by e-mail.
BSDD (Biomolecules Segment Display Device) searches for and displays user defined sequence motifs in known protein structures. This web based tool incorporates the graphics package of RASMOL for visualization.
BioInfo3D is a collection of tools for the structural analysis of proteins, including tools for structural alignments and prediction of protein interactions.
COLORADO-3D allows you to color your protein structures to indicate the presence of potential errors in protein structure (detected by ANOLEA, PROSAII, PROVE or VERIFY3D), buried residues, and sequence conservation. The server returns a PDB-formatted file which can be displayed in RASMOL.
FoldMiner aligns a user-supplied or identified query structure to a database of single domain targets to discover structural neighbours and characteristic motifs. Query structures can also be aligned to one or more user-specified structures using the LOCK 2 algorithm.
Server that allows users to convert PDB files into PQR files by adding missing atoms, optimizing hydrogen bonding and assigning atomic charge and radius parameters. The resulting PQR file can be used for electrostatic calculations which can give insight into the influence of electrostatics on biomolecular structures.
SCit is a set of tools facilitating the analysis and editing of protein side chain conformations. Using a PDB file as input, the tools allow the user to perform such tasks as listing and/or modifying the values of the dihedral angles, listing structurally compatible rotamers for each residue and substituting side chains resulting in a new structure.
BioInfo3D is a collection of tools for the structural analysis of proteins, including tools for structural alignments and prediction of protein interactions.
A multiple protein structure alignment server which creates an all-to-all pairwise alignment using a combinatorial extension program and then using Monte Carlo optimization methods conducts an iterative global optimization. Results are formatted using JOY.
FATCAT provides the means to compare two PDB-format protein structures, or to search for structures similar to a given PDB structure. The user can supply a PDB ID or upload a structure file. The FATCAT web server employs the Flexible structure AlignmenT by Chaining AFPs (Aligned Fragment Pairs) with Twists (FATCAT) algorithm.
FoldMiner aligns a user-supplied or identified query structure to a database of single domain targets to discover structural neighbours and characteristic motifs. Query structures can also be aligned to one or more user-specified structures using the LOCK 2 algorithm.
ProteinDBS takes a PDB ID or structure as input, and searches for similar protein tertiary structures using computer visualization techniques. The superposition of structures and the aligned component of the sequence can then be viewed over the web.
ProteinDBS v2.0 applies advanced feature extraction algorithms and scalable indexing techniques to achieve a fast computation time.
SA-Search is a tool which first converts a PDB structure file into a one-dimensional representation using a structural alphabet, and then searches for similarities using standard methods for sequence alignment.
SuperPose is a tool for superpositions of protein structures. SuperPose generates sequence alignments, structure alignments, PDB coordinates and RMSD statistics, difference distance plots, and images of the superimposed molecules.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
COLORADO-3D allows you to color your protein structures to indicate the presence of potential errors in protein structure (detected by ANOLEA, PROSAII, PROVE or VERIFY3D), buried residues, and sequence conservation. The server returns a PDB-formatted file which can be displayed in RASMOL.
CaspR is a web tool for building structural models for protein sequences using molecular replacement and homology modelling. The software implements an automated approach that uses T-COFFEE to produce alignments, MODELLER to produce homology models, and finally uses AMoRe and CNS to produce the structural model.
An automated Protein Structure Prediction Pipeline (PSPP) based on multiple structure prediction tools. A key component of the pipeline is the fold recognition program, PROSPECT. Server supports genome scale analyses.
PepBuild is a tool that facilitates the construction, from known sequence and secondary/tertiary structures, of capped or uncapped proteins. The output generated is in PDB format.
The Robetta server provides protein structure prediction tools and interface alanine scanning. The structure prediction is accomplished by either comparative modelling or the de novo Rosetta fragment insertion method. Interface alanine scanning is employed to predict how protein-protein interactions could be affected by mutations.
BSDD (Biomolecules Segment Display Device) searches for and displays user defined sequence motifs in known protein structures. This web based tool incorporates the graphics package of RASMOL for visualization.
CD-Search allows the user to search the Conserved Domain Database (CDD) for conserved structural and functional domains in a sequence of interest; links to 3-D structures where applicable.
COLORADO-3D allows you to color your protein structures to indicate the presence of potential errors in protein structure (detected by ANOLEA, PROSAII, PROVE or VERIFY3D), buried residues, and sequence conservation. The server returns a PDB-formatted file which can be displayed in RASMOL.
ElNemo (The Elastic Network Model) is a tool for predicting the possible movements (ie. conformational changes and other structural changes) of macromolecules. This tool allows users to compute, visualize, and analyse low-frequency normal modes of a protein.
MONSTER takes a PDB structure file as input, and predicts non-bonding interactions that have the potential to provide stability to the molecule. The output can be viewed or downloaded as XML or text, or can be viewed as a 3D structure or a 2D diagram. There is also a good tutorial available at the site.
MolProbity is a structure validation web service for diagnosing problems in 3D models of proteins, nucleic acids or complexes. It adds and optimizes H atoms (correcting 180° flipped Asn/Gln/His sidechains), and then calculates global and local validation for all-atom contacts (steric clashes, H-bonds & vdW), covalent geometry, and conformation (Ramachandran & rotamers for protein, ribose puckers & suite conformers for RNA). Results are displayed online as 3D graphics and sortable charts.
ProteinDBS takes a PDB ID or structure as input, and searches for similar protein tertiary structures using computer visualization techniques. The superposition of structures and the aligned component of the sequence can then be viewed over the web.
ProteinDBS v2.0 applies advanced feature extraction algorithms and scalable indexing techniques to achieve a fast computation time.
SuperPose is a tool for superpositions of protein structures. SuperPose generates sequence alignments, structure alignments, PDB coordinates and RMSD statistics, difference distance plots, and images of the superimposed molecules.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
GOblet allows the user to BLAST one or more protein or nucleotide sequences against databases that have sequences mapped to Gene Ontology (GO) terms. Results can be viewed for individual sequences, or as survey statistics for the group if more than one sequence is submitted.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
LOCtarget is a tool for predicting, and a database of pre-computed predictions for, sub-cellular localization of eukaryotic and prokaryotic proteins. Several methods are employed to make the predictions, including text analysis of SWISS-PROT keywords, nuclear localization signals, and the use of neural networks.
MITOPRED uses Pfam domains, pI values and amino acid composition to predict nuclear-encoded mitochondrial proteins. Predictions have been precomputed for a number of proteomes, as well as for all Eukaryotic sequences in Swiss-Prot and TrEMBL. Users may directly enter or upload a file with a list of protein sequences or Swiss-Prot/TrEMBL accession numbers.
MoViES (Molecular Vibrations Evaluation Server) allows the computation of the vibrational and thermal fluctuations dynamics for the chemical bonds of biomolecules. The server takes as input a structure file in PDB format and returns results by email. Analysis of vibrational motions and thermal fluctuational dynamics can give insights into the functional properties of both proteins and nucleic acids.
PRED-GPCR is a tool which queries user-supplied sequences against a database of HMMs corresponding to G-protein coupled receptor (GPCR) families in order to determine which GPCR family the query sequence most resembles.
Phydbac2 (Phylogenomic display of bacterial genes) is a tool to visualize and explore the phylogenomic profiles of bacterial protein sequences. It also allows the user to view sequence similarity across different organisms, access other genes with similar conservation profiles, and view genes that are found nearby a selected gene in multiple genomes.
PrediSi (Prediction of Signal Peptides) takes one or more amino acid sequences as input and predicts the likelihood that they are signal peptides as well as their cleavage positions. It can be used to analyse whole proteome datasets.
Proteome Analyst is a high-throughput tool for predicting properties for each protein in a proteome. The user provides a proteome in fasta format, and the system employs Psi-blast, Psipred and Modeller to predict protein function and subcellular localization. Proteome Analyst uses machine-learned classifiers to predict things such as GO molecular function. User-supplied training data can also be used to create custom classifiers.
SDPpred is a tool for predicting which residues of a protein determine functional differences relative to its homologues. It takes as input an multiple sequence alignment of a protein family divided into groups based on the perceived functional differences.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
The Genomic Disulfide Analysis Program (GDAP) predicts disulfide bonds for a user-supplied protein sequence. GDAP also provides access to pre-computed predictions of disulfide bonds for over 100 microbial genomes.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
MONSTER takes a PDB structure file as input, and predicts non-bonding interactions that have the potential to provide stability to the molecule. The output can be viewed or downloaded as XML or text, or can be viewed as a 3D structure or a 2D diagram. There is also a good tutorial available at the site.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
Server that allows users to convert PDB files into PQR files by adding missing atoms, optimizing hydrogen bonding and assigning atomic charge and radius parameters. The resulting PQR file can be used for electrostatic calculations which can give insight into the influence of electrostatics on biomolecular structures.
ProMoST (Protein Modification Screening Tool) is a program to calculate accurate MW and pI values from proteins considering the effects of post-translational modifications. Results are displayed as calculated values of pI and MW for each protein and are also plotted on two-dimensional (2D) gel images.
ProSight PTM allows identification and characterization of intact proteins and their post- translational modifications (PTMs) using data from the \'Top-Down\' tandem mass spectrometry (MS/MS) approach.
Server which provides cluster tree diagrams of a protein based on the charged atoms or hydrophobicity of each of its residues. The diagram allows for visual inspection of the distribution of hydrophobic and charged regions in proteins.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
CD-Search allows the user to search the Conserved Domain Database (CDD) for conserved structural and functional domains in a sequence of interest; links to 3-D structures where applicable.
CHOP takes a protein sequence as input, and returns a list of protein sequence fragments with homology to PDB and Pfam domains and to proteins from the SWISS-PROT database.
FoldMiner aligns a user-supplied or identified query structure to a database of single domain targets to discover structural neighbours and characteristic motifs. Query structures can also be aligned to one or more user-specified structures using the LOCK 2 algorithm.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
The MyHits server integrates several tools with a focus on protein annotation and the analysis of protein domains. Guest users have access to tools such as ClustalW and T-Coffee and databases like Swiss-Prot, Prosite and Interpro. Registration allows users to store their results in private databases, and is free for academic users.
NRPS-PKS is an tool comprising four integrated databases for the analysis of large multi-enzymatic multi-domain megasynthases. The user can submit a query sequence to search for domains or view the properties of the products.
PRED-GPCR is a tool which queries user-supplied sequences against a database of HMMs corresponding to G-protein coupled receptor (GPCR) families in order to determine which GPCR family the query sequence most resembles.
PredictProtein is a protein sequence analysis and structure prediction tool. Users provide a protein sequence, and PredictProtein reports similar sequences, PROSITE sequence motifs, and various types of structure prediction information. You can also use META PredictProtein to submit a sequence to one web form and query several other servers at once, retrieving the results by e-mail.
SledgeHMMER is a tool for searching the Pfam database using a parallelized version of the program hmmpfam. The user can perform queries with one or more sequences at a time and then receive the results by e-mail.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
ArrayXPath is a web-based service for matching microarray gene-expression profiles with known biological pathways. Input is a clustered gene-expression profile in a tab-delimited text format. Output includes pathway diagrams.
BioInfo3D is a collection of tools for the structural analysis of proteins, including tools for structural alignments and prediction of protein interactions.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
PathBLAST is a tool for cross-species comparison of protein interaction networks. PathBLAST takes a short protein interaction path as input and searches against an available protein-protein interation network specified by the user.
The Robetta server provides protein structure prediction tools and interface alanine scanning. The structure prediction is accomplished by either comparative modelling or the de novo Rosetta fragment insertion method. Interface alanine scanning is employed to predict how protein-protein interactions could be affected by mutations.
The SBT (Sequencing-based typing) interface is a tool accompanying the NCBI's Major Histocompatibility Complex database (dbMHC). It identifies the allelic composition of sequence-based typing (SBT) results of cDNA or genomic sequences. Samples are compared with databases containing all known alleles for several human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptors (KIR) loci. An interactive sequence viewer allows inspection of the alignments.
WebInterViewer is a tool for visualizing molecular interaction networks. It employs a fast-layout algorithm, and can visualize data from interaction databases directly.
The beta-barrel Outer Membrane protein Predictor (BOMP) takes one or more fasta-formatted polypeptide sequences from Gram-negative bacteria as input and predicts whether or not they are beta-barrel integral outer membrane proteins.
ESLpred is a tool for predicting subcellular localization of proteins using support vector machines. The predictions are based on dipeptide and amino acid composition, and physico-chemical properties.
PRED-TMBB is a tool that takes a Gram-negative bacteria protein sequence as input and predicts the transmembrane strands and the probability of it being an outer membrane beta-barrel protein. The user has a choice of three different decoding methods.
PrediSi (Prediction of Signal Peptides) takes one or more amino acid sequences as input and predicts the likelihood that they are signal peptides as well as their cleavage positions. It can be used to analyse whole proteome datasets.
BioInfo3D is a collection of tools for the structural analysis of proteins, including tools for structural alignments and prediction of protein interactions.
ClusPro is a tool for automatically computing the docking of two protein structures supplied by the user (or as PDB IDs). The result set is a ranked list of putative complexes, ordered by clustering properties.
ElNemo (The Elastic Network Model) is a tool for predicting the possible movements (ie. conformational changes and other structural changes) of macromolecules. This tool allows users to compute, visualize, and analyse low-frequency normal modes of a protein.
MoViES (Molecular Vibrations Evaluation Server) allows the computation of the vibrational and thermal fluctuations dynamics for the chemical bonds of biomolecules. The server takes as input a structure file in PDB format and returns results by email. Analysis of vibrational motions and thermal fluctuational dynamics can give insights into the functional properties of both proteins and nucleic acids.
Composition Vector Tree (CVTree) infers phylogenetic relationships between microbial organisms by comparing their proteomes using a composition vector approach.
Phydbac2 (Phylogenomic display of bacterial genes) is a tool to visualize and explore the phylogenomic profiles of bacterial protein sequences. It also allows the user to view sequence similarity across different organisms, access other genes with similar conservation profiles, and view genes that are found nearby a selected gene in multiple genomes.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
The Web Alignment Visualization Server (WAViS) provides various web tools to enhance the presentation of amino acid or nucleotide multiple sequence alignments.
ProMoST (Protein Modification Screening Tool) is a program to calculate accurate MW and pI values from proteins considering the effects of post-translational modifications. Results are displayed as calculated values of pI and MW for each protein and are also plotted on two-dimensional (2D) gel images.
Tools developed to aid in the identification of mass spectrum that allow the calculation of mass values with isotopic distributions based on molecular formulas, peptides/proteins, DNA/RNA, carbohydrate sequences or combinations thereof. A viewer for visualizing results is also available.
ProMoST (Protein Modification Screening Tool) is a program to calculate accurate MW and pI values from proteins considering the effects of post-translational modifications. Results are displayed as calculated values of pI and MW for each protein and are also plotted on two-dimensional (2D) gel images.
ProSight PTM allows identification and characterization of intact proteins and their post- translational modifications (PTMs) using data from the \'Top-Down\' tandem mass spectrometry (MS/MS) approach.
Proteome Analyst is a high-throughput tool for predicting properties for each protein in a proteome. The user provides a proteome in fasta format, and the system employs Psi-blast, Psipred and Modeller to predict protein function and subcellular localization. Proteome Analyst uses machine-learned classifiers to predict things such as GO molecular function. User-supplied training data can also be used to create custom classifiers.
The Genomic Disulfide Analysis Program (GDAP) predicts disulfide bonds for a user-supplied protein sequence. GDAP also provides access to pre-computed predictions of disulfide bonds for over 100 microbial genomes.
LowComplexity is a tool that searches for low complexity regions of DNA or protein sequences. Using LowComplexity you can search long sequences (chromosomes, genomes) or a set of aligned sequences. This resource also contains links to other algorithms for evaluating the complexity of sequences.
Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
NLProt is a tool for finding protein names in natural language text. This data-mining method is a useful approach for extracting protein UniprotIDs from research articles for the construction of custom datasets and/or databases.
It goes to the library. You go to the pub; receive email alerts for current contents of PubMed and GenBank; e.g. use accession number of htg record as query to receive sequence updates (as the version number changes).
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
ERPIN (Easy RNA Profile IdentificatioN) takes as input an RNA sequence alignment and secondary structure annotation and will identify a wide variety of known RNA motifs (such as tRNAs, 5S rRNAs, SRP RNA, C/D box snoRNAs, hammerhead motifs, miRNAs and others) in your sequence(s) of interest. Also contains tool for drawing secondary structure of motifs.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
RNA motif search program that identifies RNA motifs called riboswitches which are metabolic binding domains in mRNA that regulate gene expression. The program was originally designed around a set of riboswitches found in Bacillus subtilis.
Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
It goes to the library. You go to the pub; receive email alerts for current contents of PubMed and GenBank; e.g. use accession number of htg record as query to receive sequence updates (as the version number changes).
Server which predicts conserved secondary structure elements of homologous RNAs. The input of a set of RNA sequences are not required to be previously aligned.
Tool which aids in the design and quality control of small interfering RNAs (siRNAs) for RNA interference (RNAi) and gene silencing. It evaluates the inhibitory potency of potential siRNA sequences as well as identifying gene regions that have a high silencing capacity.
ERPIN (Easy RNA Profile IdentificatioN) takes as input an RNA sequence alignment and secondary structure annotation and will identify a wide variety of known RNA motifs (such as tRNAs, 5S rRNAs, SRP RNA, C/D box snoRNAs, hammerhead motifs, miRNAs and others) in your sequence(s) of interest. Also contains tool for drawing secondary structure of motifs.
Server which provides iterated loop matching and maximum weighted matching algorithms for pseudoknot containing RNA secondary structure prediction. Algorithms can apply thermodynamic and comparative information, and thus can be used for either aligned or individual sequences.
MoViES (Molecular Vibrations Evaluation Server) allows the computation of the vibrational and thermal fluctuations dynamics for the chemical bonds of biomolecules. The server takes as input a structure file in PDB format and returns results by email. Analysis of vibrational motions and thermal fluctuational dynamics can give insights into the functional properties of both proteins and nucleic acids.
MolProbity is a structure validation web service for diagnosing problems in 3D models of proteins, nucleic acids or complexes. It adds and optimizes H atoms (correcting 180° flipped Asn/Gln/His sidechains), and then calculates global and local validation for all-atom contacts (steric clashes, H-bonds & vdW), covalent geometry, and conformation (Ramachandran & rotamers for protein, ribose puckers & suite conformers for RNA). Results are displayed online as 3D graphics and sortable charts.
Server with three tools for the rational design of small interfering RNAs (Sirna), antisense oligonucleotides (Soligo), and trans-cleaving ribozymes (Sribo). A fourth tool, Srna, returns output including general folding features.
T7 RNAi Oligo Designer (TROD) aids in the design of DNA oligonucleotides for short interfering RNA (siRNA) synthesis with T7 RNA polymerase. It takes an input of a cDNA sequence and outputs a list of DNA oligos for ordering.
Server for computing small interfering RNA (siRNA) sequences which are best suited for mammalian RNA interference (RNAi). The site accepts a sequence as input and returns a list of siRNA candidates.
The CHAOS/DIALIGN WWW server is a multiple sequence alignment site which passes input sequences through CHAOS to create a list of local similarites. These similarities serve as anchor points, allowing DIALIGN to conduct global alignments faster. ABC can then be used for the interactive visualization of the alignment.
Tool for sequence feature visualization and comparison; integrates with Internet Explorer; accepts GenBank Flat Files, GenBank HTML, FASTA files; plugins exist for visualization of output from RepeatMasker, PsiPred, and Cister.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
The Web Alignment Visualization Server (WAViS) provides various web tools to enhance the presentation of amino acid or nucleotide multiple sequence alignments.
AGenDA is a web tool that compares the genomic sequences from evolutionarily related organisms in order to make gene predictions. It takes pairs of genomic sequences as input, aligns the sequences, and makes predictions based on splice signals, start and stop codons, and areas of conserved sequence.
Gene modeling server which focuses on the modeling of alternative splicing. It is based on the alignment of mRNA, EST and protein sequences and combines genome-based clustering and transcript assembly. Supports human, mouse and rat genomes.
Correlation analysis of the amino acid substitutions in protein sequences (CRASP) takes multiple alignments of amino acid sequences as input, and detects coordinated residue substitutions. These substitutions may suggest dependent evolution of functionally related pairs of amino acids.
CSTminer searches for Conserved Sequence Tags (CST) in pairwise comparisons of genomic sequences. Output is a graphical map showing the location of CSTs overlayed with known Ensembl exons and ESTs. A scoring system reflects coding or non-coding potential of these conserved blocks.
The ECR (Evolutionary Conserved Regions) browser is a web-based tool for visualizing and navigating through whole genome alignments of several vertebrate species. Users can also submit sequences for alignment with one of the genomes represented.
GOblet allows the user to BLAST one or more protein or nucleotide sequences against databases that have sequences mapped to Gene Ontology (GO) terms. Results can be viewed for individual sequences, or as survey statistics for the group if more than one sequence is submitted.
LowComplexity is a tool that searches for low complexity regions of DNA or protein sequences. Using LowComplexity you can search long sequences (chromosomes, genomes) or a set of aligned sequences. This resource also contains links to other algorithms for evaluating the complexity of sequences.
PDA (Pipeline Diversity Analysis) is a server which searches for polymorphic sequences in large databases, and estimates their genetic diversity. Results contain sequence alignments (generated by ClustalW) and include statistics on polymorphism, synonymous and non-synonymous substitutions, linkage disequilibrium, and codon bias for your alignment. The full results set is also available for download as a MySQL database.
Phydbac2 (Phylogenomic display of bacterial genes) is a tool to visualize and explore the phylogenomic profiles of bacterial protein sequences. It also allows the user to view sequence similarity across different organisms, access other genes with similar conservation profiles, and view genes that are found nearby a selected gene in multiple genomes.
SDPpred is a tool for predicting which residues of a protein determine functional differences relative to its homologues. It takes as input an multiple sequence alignment of a protein family divided into groups based on the perceived functional differences.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
VISTA is a comprehensive suite of programs and databases for comparative analysis of genomic sequences. There are two ways of using VISTA - you can submit your own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species.
Phydbac2 (Phylogenomic display of bacterial genes) is a tool to visualize and explore the phylogenomic profiles of bacterial protein sequences. It also allows the user to view sequence similarity across different organisms, access other genes with similar conservation profiles, and view genes that are found nearby a selected gene in multiple genomes.
VISTA is a comprehensive suite of programs and databases for comparative analysis of genomic sequences. There are two ways of using VISTA - you can submit your own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species.
A multiple protein structure alignment server which creates an all-to-all pairwise alignment using a combinatorial extension program and then using Monte Carlo optimization methods conducts an iterative global optimization. Results are formatted using JOY.
The CHAOS/DIALIGN WWW server is a multiple sequence alignment site which passes input sequences through CHAOS to create a list of local similarites. These similarities serve as anchor points, allowing DIALIGN to conduct global alignments faster. ABC can then be used for the interactive visualization of the alignment.
Multiple alignment program which assembles a global sequence alignment from gap-free local pairwise alignments. This method could be especially useful when comparing large sequences that have only local similarities.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
The MyHits server integrates several tools with a focus on protein annotation and the analysis of protein domains. Guest users have access to tools such as ClustalW and T-Coffee and databases like Swiss-Prot, Prosite and Interpro. Registration allows users to store their results in private databases, and is free for academic users.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.
VISTA is a comprehensive suite of programs and databases for comparative analysis of genomic sequences. There are two ways of using VISTA - you can submit your own sequences and alignments for analysis (VISTA servers) or examine pre-computed whole-genome alignments of different species.
E2G is a tool that maps a large set of EST and cDNA sequences to a user-supplied genomic sequence. The use of pre-computed indexed data structures increases the efficiency of the sequence comparison process, allowing a large amount of data to be mapped within a reasonable timeframe.
The Genotyping tool at the NCBI identifies the genotype (or subtype) of viral sequences by using a sliding window approach to BLAST analysis against reference sequences for different viral subtypes. Results are shown as a graphical output plotting the top-scoring genotype. An alignment tool is also available.
The international ImMunoGeneTics information system (IMGT) V-QUEry and STandardization (V-QUEST) tool compares a user-provided fasta-formatted germline or rearranged T cell receptor or immunoglobulin variable sequence with a reference set of sequences.
The Iccare (Interspecific Comparative Clustering and Annotation foR Est) web server compares all available EST and mRNA sequences for a query organism against the set of transcripts for a reference organism. The results are presented graphically and relative to the location of genes on the chromosomes of the reference organism.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
CHOP takes a protein sequence as input, and returns a list of protein sequence fragments with homology to PDB and Pfam domains and to proteins from the SWISS-PROT database.
Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.
The European Bioinformatics Institute (EMBL-EBI) provides access to a comprehensive, integrated set of biological data. Resources include information on genes, genomes, proteins, expression, literature, and molecular interactions as well as many other useful resources.