NAR Web Server Issue 2004

DNA

Education

Expression

Human Genome

Literature

Model Organisms

Other Molecules

Protein

RNA

Sequence Comparison

DNA > Annotations

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

Onto-ToolsTool Content

http://vortex.cs.wayne.edu/Projects.html

Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.

DNA > DNA and Genomic Analysis

PlasMapperTool Content

http://wishart.biology.ualberta.ca/PlasMapper/index.html

Web server that automatically generates and annotates circular plasmid maps. The tool has: a built in set of features that can be displayed (ie. RE sites, tags, ORFs, etc.); allows users to define custom features to display; contains a library of commonly used plasmids; and, generates nice looking images in a variety of output formats.

WAViSTool Content

http://wavis.img.cas.cz/

The Web Alignment Visualization Server (WAViS) provides various web tools to enhance the presentation of amino acid or nucleotide multiple sequence alignments.

DNA > Databases

DNA > Gene Prediction

AGenDATool Content

http://bibiserv.techfak.uni-bielefeld.de/agenda/

AGenDA is a web tool that compares the genomic sequences from evolutionarily related organisms in order to make gene predictions. It takes pairs of genomic sequences as input, aligns the sequences, and makes predictions based on splice signals, start and stop codons, and areas of conserved sequence.

BLASTTool Content

http://www.ncbi.nlm.nih.gov/BLAST/

Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

DNA > Mapping and Assembly

DNA > Phylogeny Reconstruction

T-COFFEETool Content

http://www.tcoffee.org/

The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.

DNA > Sequence Polymorphisms

GEPASTool Content

http://www.gepas.org

The Gene Expression Pattern Analysis Suite (GEPAS) is a collection of tools for the analysis of microarray data. GEPAS includes tools for data pre-processing, clustering, differential gene expression, predictors, array CGH and functional annotation. A new pipeline module allows for automation of sequential analysis steps.

PDATool Content

http://pda.uab.es/pda2/

PDA (Pipeline Diversity Analysis) is a server which searches for polymorphic sequences in large databases, and estimates their genetic diversity. Results contain sequence alignments (generated by ClustalW) and include statistics on polymorphism, synonymous and non-synonymous substitutions, linkage disequilibrium, and codon bias for your alignment. The full results set is also available for download as a MySQL database.

This content is being maintained by soniacas.

PupaSuiteTool Content

http://pupasuite.bioinfo.cipf.es

PupaSuite is a SNP analysis tool that allows for the selection of relevant SNPs within a gene based on the characteristics of the SNP. PupaSuite also provides information about LD parameters (based on genotype data from HapMap) and identifies haplotype blocks and tag SNPs. PupaSuite was created to join PupaSNP & PupasView.

SBT Interface - dbMHCTool Content

http://www.ncbi.nlm.nih.gov/gv/mhc/sbt.cgi?cmd=main

The SBT (Sequencing-based typing) interface is a tool accompanying the NCBI's Major Histocompatibility Complex database (dbMHC). It identifies the allelic composition of sequence-based typing (SBT) results of cDNA or genomic sequences. Samples are compared with databases containing all known alleles for several human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptors (KIR) loci. An interactive sequence viewer allows inspection of the alignments.

SNPboxTool Content

http://www.SNPbox.org

Tool for large-scale standardized primer design with specific modules for SNP and exon PCR amplification. Results are presented as HTML with genomic position, primer sequence, and PCR conditions. Precomputed SNPbox results for all Ensembl exons are also available on the server.

DNA > Sequence Retrieval and Submission

BLASTTool Content

http://www.ncbi.nlm.nih.gov/BLAST/

Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.

PubCrawlerTool Content

http://pubcrawler.gen.tcd.ie/

It goes to the library. You go to the pub; receive email alerts for current contents of PubMed and GenBank; e.g. use accession number of htg record as query to receive sequence updates (as the version number changes).

DNA > Structure and Sequence Feature Detection

BEARRTool Content

http://giscompute.gis.a-star.edu.sg/~vega/BEARR1.0/

Batch Extraction and Analysis of cis-Regulatory Regions (BEARR) takes a list of gene identifiers (such as RefSeq and Unigene IDs), consensus patterns, and (optionally) a position weight matrix as input and returns a list of matches for the patterns in both the sense and anti-sense strands of the relevant genomic sequence. The user specifies what distance up and downstream from both the transcription start site and the 3\' terminus to look for the patterns.

CREMETool Content

http://creme.dcode.org/

CREME (Cis-Regulatory Module Explorer for the human genome) is a tool for identifying and visualizing cis-regulatory modules for a given set of genes that are potentially co-expressed or co-regulated. It takes as input a list of accession numbers, and reports back common modules, grouping genes from the list by which modules are found in their promoter regions.

CSTminerTool Content

http://www.caspur.it/CSTminer/

CSTminer searches for Conserved Sequence Tags (CST) in pairwise comparisons of genomic sequences. Output is a graphical map showing the location of CSTs overlayed with known Ensembl exons and ESTs. A scoring system reflects coding or non-coding potential of these conserved blocks.

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

IsoFinderTool Content

http://isofinder.sourceforge.net/

IsoFinder is a tool for the prediction of isochores for a user-supplied sequence.

This content is being maintained by oliver.

LowComplexityTool Content

http://wwwmgs.bionet.nsc.ru/mgs/programs/low_complexity/

LowComplexity is a tool that searches for low complexity regions of DNA or protein sequences. Using LowComplexity you can search long sequences (chromosomes, genomes) or a set of aligned sequences. This resource also contains links to other algorithms for evaluating the complexity of sequences.

MoViESTool Content

http://ang.cz3.nus.edu.sg/cgi-bin/prog/norm.pl

MoViES (Molecular Vibrations Evaluation Server) allows the computation of the vibrational and thermal fluctuations dynamics for the chemical bonds of biomolecules. The server takes as input a structure file in PDB format and returns results by email. Analysis of vibrational motions and thermal fluctuational dynamics can give insights into the functional properties of both proteins and nucleic acids.

MotifVizTool Content

http://biowulf.bu.edu/MotifViz/

MotifViz is a tool for detecting overrepresented transcription factor binding motifs. Four motif discovery programs are accessible from the MotifViz web interface: Clover, Rover, Motifish and Possum.

Onto-ToolsTool Content

http://vortex.cs.wayne.edu/Projects.html

Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.

POBOTool Content

http://ekhidna.biocenter.helsinki.fi/poxo/pobo/

POBO is a tool to summarize, verify and screen predetermined cis-element motifs from a set of sequences. POBO reports the results in as understandable format as possible for biologists.

This content is being maintained by mkankain.

PredictRegulonTool Content

http://www.cdfd.org.in/predictregulon/

PredictRegulon is a tool that, for a given regulatory protein and prokaryotic genome, predicts binding sites and operons. The user selects a prokaryote genome and provides an ungapped multiple sequence alignment representing the binding site. PredictRegulon reports back with predicted binding sites and downstream co-regulated genes.

Web WeederTool Content

http://159.149.109.9/modtools/

Server which identifies conserved motifs within sets of related regulatory DNA sequences, which are likely to be transcription factor binding sites. The interface will automatically conduct several runs using different parameters and output a summary.

rVISTATool Content

http://rvista.dcode.org/

Server which detects transcription factor binding sites(TFBS) through combining TFBS prediction, sequence comparison and cluster analysis.

DNA > Tools For the Bench

DEQORDatabase Content

http://deqor.mpi-cbg.de/deqor_new/input.html

Tool which aids in the design and quality control of small interfering RNAs (siRNAs) for RNA interference (RNAi) and gene silencing. It evaluates the inhibitory potency of potential siRNA sequences as well as identifying gene regions that have a high silencing capacity.

Gene2OligoTool Content

http://berry.engin.umich.edu/gene2oligo/

Tool which divides both strands of an input DNA sequence into a set of contiguous oligonucleotides. These oligos are designed with complimentary regions so that the complete set can be combined to synthesize the input DNA sequence in vitro (using oligonucleotide synthesis and assembly PCR).

PlasMapperTool Content

http://wishart.biology.ualberta.ca/PlasMapper/index.html

Web server that automatically generates and annotates circular plasmid maps. The tool has: a built in set of features that can be displayed (ie. RE sites, tags, ORFs, etc.); allows users to define custom features to display; contains a library of commonly used plasmids; and, generates nice looking images in a variety of output formats.

ProbeLynxTool Content

http://www.pathogenomics.ca/probelynx

Using current releases of genomic sequence data, ProbeLynx allows users to assess the specificity of probe sequences used for microarray experiments. The user provides probe sequences in FASTA or tab-delimited format, and ProbeLynx reports specificity information for each probe and functional annotations from Ensembl or TIGR for each probe target hit.

SNPboxTool Content

http://www.SNPbox.org

Tool for large-scale standardized primer design with specific modules for SNP and exon PCR amplification. Results are presented as HTML with genomic position, primer sequence, and PCR conditions. Precomputed SNPbox results for all Ensembl exons are also available on the server.

siDirectTool Content

http://design.RNAi.jp/

Server for computing small interfering RNA (siRNA) sequences which are best suited for mammalian RNA interference (RNAi). The site accepts a sequence as input and returns a list of siRNA candidates.

siRNA Selection ServerTool Content

http://jura.wi.mit.edu/bioc/siRNA

Server aiding the design of short interfering RNAs (siRNAs) by providing information on stability, SNPs and specificity of a potential siRNA.

This content is being maintained by lewitter.

Education > Community

Education > Directories and Portals

Expression > Databases

Expression > Gene Regulation

BEARRTool Content

http://giscompute.gis.a-star.edu.sg/~vega/BEARR1.0/

Batch Extraction and Analysis of cis-Regulatory Regions (BEARR) takes a list of gene identifiers (such as RefSeq and Unigene IDs), consensus patterns, and (optionally) a position weight matrix as input and returns a list of matches for the patterns in both the sense and anti-sense strands of the relevant genomic sequence. The user specifies what distance up and downstream from both the transcription start site and the 3\' terminus to look for the patterns.

CARRIETool Content

http://zlab.bu.edu/CARRIE-web

Server which analyzes microarray and promoter sequence data associated with a response to a specific stimulus. After analysis a potential transcriptional regulatory network is created. CARRIE also determines which transcription factors were likely involved in regulation and which genes they regulated.

CONFACTool Content

http://morenolab.whitehead.emory.edu/cgi-bin/confac/login.pl

The Conserved Transcription Factor Binding Site Finder (CONFAC) takes a list of human gene names and identifiers as input, and compares them with their mouse orthologues to identify conserved transcription factor binding sites. Further information from the user allows CONFAC to identify binding sites that are enriched in the promoter regions of gene clusters from microarray analyses when compared to control gene sets.

CREMETool Content

http://creme.dcode.org/

CREME (Cis-Regulatory Module Explorer for the human genome) is a tool for identifying and visualizing cis-regulatory modules for a given set of genes that are potentially co-expressed or co-regulated. It takes as input a list of accession numbers, and reports back common modules, grouping genes from the list by which modules are found in their promoter regions.

CSTminerTool Content

http://www.caspur.it/CSTminer/

CSTminer searches for Conserved Sequence Tags (CST) in pairwise comparisons of genomic sequences. Output is a graphical map showing the location of CSTs overlayed with known Ensembl exons and ESTs. A scoring system reflects coding or non-coding potential of these conserved blocks.

DEQORDatabase Content

http://deqor.mpi-cbg.de/deqor_new/input.html

Tool which aids in the design and quality control of small interfering RNAs (siRNAs) for RNA interference (RNAi) and gene silencing. It evaluates the inhibitory potency of potential siRNA sequences as well as identifying gene regions that have a high silencing capacity.

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

Onto-ToolsTool Content

http://vortex.cs.wayne.edu/Projects.html

Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.

POBOTool Content

http://ekhidna.biocenter.helsinki.fi/poxo/pobo/

POBO is a tool to summarize, verify and screen predetermined cis-element motifs from a set of sequences. POBO reports the results in as understandable format as possible for biologists.

This content is being maintained by mkankain.

PredictRegulonTool Content

http://www.cdfd.org.in/predictregulon/

PredictRegulon is a tool that, for a given regulatory protein and prokaryotic genome, predicts binding sites and operons. The user selects a prokaryote genome and provides an ungapped multiple sequence alignment representing the binding site. PredictRegulon reports back with predicted binding sites and downstream co-regulated genes.

Riboswitch finderTool Content

http://riboswitch.bioapps.biozentrum.uni-wuerzburg.de/server.html

RNA motif search program that identifies RNA motifs called riboswitches which are metabolic binding domains in mRNA that regulate gene expression. The program was originally designed around a set of riboswitches found in Bacillus subtilis.

Web WeederTool Content

http://159.149.109.9/modtools/

Server which identifies conserved motifs within sets of related regulatory DNA sequences, which are likely to be transcription factor binding sites. The interface will automatically conduct several runs using different parameters and output a summary.

rVISTATool Content

http://rvista.dcode.org/

Server which detects transcription factor binding sites(TFBS) through combining TFBS prediction, sequence comparison and cluster analysis.

siDirectTool Content

http://design.RNAi.jp/

Server for computing small interfering RNA (siRNA) sequences which are best suited for mammalian RNA interference (RNAi). The site accepts a sequence as input and returns a list of siRNA candidates.

siRNA Selection ServerTool Content

http://jura.wi.mit.edu/bioc/siRNA

Server aiding the design of short interfering RNAs (siRNAs) by providing information on stability, SNPs and specificity of a potential siRNA.

This content is being maintained by lewitter.

Expression > Protein Expression

ProMoSTTool Content

http://proteomics.mcw.edu/promost

ProMoST (Protein Modification Screening Tool) is a program to calculate accurate MW and pI values from proteins considering the effects of post-translational modifications. Results are displayed as calculated values of pI and MW for each protein and are also plotted on two-dimensional (2D) gel images.

This content is being maintained by halligan.

Expression > Splicing

ASmodelerTool Content

http://genome.ewha.ac.kr/ECgene/ASmodeler/

Gene modeling server which focuses on the modeling of alternative splicing. It is based on the alignment of mRNA, EST and protein sequences and combines genome-based clustering and transcript assembly. Supports human, mouse and rat genomes.

Expression > Transcript Expression Analysis

ArrayPipeTool Content

http://koch.pathogenomics.ca/cgi-bin/pub/arraypipe.pl

ArrayPipe allows users to customize a processing pipeline for the analysis of microarray data. Includes methods for quality assessment of slides, data visualization, normalization, and detection of differentially expressed genes. Output consists of reports formatted as standard web pages and tab-delimited lists of calculated values.

BEARRTool Content

http://giscompute.gis.a-star.edu.sg/~vega/BEARR1.0/

Batch Extraction and Analysis of cis-Regulatory Regions (BEARR) takes a list of gene identifiers (such as RefSeq and Unigene IDs), consensus patterns, and (optionally) a position weight matrix as input and returns a list of matches for the patterns in both the sense and anti-sense strands of the relevant genomic sequence. The user specifies what distance up and downstream from both the transcription start site and the 3\' terminus to look for the patterns.

CARRIETool Content

http://zlab.bu.edu/CARRIE-web

Server which analyzes microarray and promoter sequence data associated with a response to a specific stimulus. After analysis a potential transcriptional regulatory network is created. CARRIE also determines which transcription factors were likely involved in regulation and which genes they regulated.

CONFACTool Content

http://morenolab.whitehead.emory.edu/cgi-bin/confac/login.pl

The Conserved Transcription Factor Binding Site Finder (CONFAC) takes a list of human gene names and identifiers as input, and compares them with their mouse orthologues to identify conserved transcription factor binding sites. Further information from the user allows CONFAC to identify binding sites that are enriched in the promoter regions of gene clusters from microarray analyses when compared to control gene sets.

Expression ProfilerTool Content

http://www.ebi.ac.uk/expressionprofiler/

Expression Profiler is a web based platform for microarray data analysis developed at the EBI. This resource is integrated with the ArrayExpress database, a public repository for microarray data.

GEPASTool Content

http://www.gepas.org

The Gene Expression Pattern Analysis Suite (GEPAS) is a collection of tools for the analysis of microarray data. GEPAS includes tools for data pre-processing, clustering, differential gene expression, predictors, array CGH and functional annotation. A new pipeline module allows for automation of sequential analysis steps.

GFINDerTool Content

http://www.medinfopoli.polimi.it/GFINDer/

Genome Functional INtegrated Discoverer (GFINDer) takes a list of gene/clone IDs with classification information as input, and allows the user to characterize the different gene classes in the list using annotations of various types from several different sources.

KARMATool Content

http://biryani.med.yale.edu/karma/cgi-bin/mysql/karma.pl

KARMA (Keck Array Manager and Annotator) allows you compare and annotate your own microarrays against other available arrays. Comparison of arrays can be achieved within the same species as well as across species (array comparison is based on UniGene Cluster ID and/or Homologene data). Annotation data includes LocusLink, SwissProt, and Gene Ontology.

Onto-ToolsTool Content

http://vortex.cs.wayne.edu/Projects.html

Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.

POBOTool Content

http://ekhidna.biocenter.helsinki.fi/poxo/pobo/

POBO is a tool to summarize, verify and screen predetermined cis-element motifs from a set of sequences. POBO reports the results in as understandable format as possible for biologists.

This content is being maintained by mkankain.

ProbeLynxTool Content

http://www.pathogenomics.ca/probelynx

Using current releases of genomic sequence data, ProbeLynx allows users to assess the specificity of probe sequences used for microarray experiments. The user provides probe sequences in FASTA or tab-delimited format, and ProbeLynx reports specificity information for each probe and functional annotations from Ensembl or TIGR for each probe target hit.

Expression > cDNA, EST, SAGE

E2GTool Content

http://bibiserv.techfak.uni-bielefeld.de/e2g/

E2G is a tool that maps a large set of EST and cDNA sequences to a user-supplied genomic sequence. The use of pre-computed indexed data structures increases the efficiency of the sequence comparison process, allowing a large amount of data to be mapped within a reasonable timeframe.

Human Genome > Annotations

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

Human Genome > Databases

Human Genome > Other Resources

BEARRTool Content

http://giscompute.gis.a-star.edu.sg/~vega/BEARR1.0/

Batch Extraction and Analysis of cis-Regulatory Regions (BEARR) takes a list of gene identifiers (such as RefSeq and Unigene IDs), consensus patterns, and (optionally) a position weight matrix as input and returns a list of matches for the patterns in both the sense and anti-sense strands of the relevant genomic sequence. The user specifies what distance up and downstream from both the transcription start site and the 3\' terminus to look for the patterns.

CREMETool Content

http://creme.dcode.org/

CREME (Cis-Regulatory Module Explorer for the human genome) is a tool for identifying and visualizing cis-regulatory modules for a given set of genes that are potentially co-expressed or co-regulated. It takes as input a list of accession numbers, and reports back common modules, grouping genes from the list by which modules are found in their promoter regions.

SBT Interface - dbMHCTool Content

http://www.ncbi.nlm.nih.gov/gv/mhc/sbt.cgi?cmd=main

The SBT (Sequencing-based typing) interface is a tool accompanying the NCBI's Major Histocompatibility Complex database (dbMHC). It identifies the allelic composition of sequence-based typing (SBT) results of cDNA or genomic sequences. Samples are compared with databases containing all known alleles for several human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptors (KIR) loci. An interactive sequence viewer allows inspection of the alignments.

Human Genome > Sequence Polymorphisms

PDATool Content

http://pda.uab.es/pda2/

PDA (Pipeline Diversity Analysis) is a server which searches for polymorphic sequences in large databases, and estimates their genetic diversity. Results contain sequence alignments (generated by ClustalW) and include statistics on polymorphism, synonymous and non-synonymous substitutions, linkage disequilibrium, and codon bias for your alignment. The full results set is also available for download as a MySQL database.

This content is being maintained by soniacas.

PupaSuiteTool Content

http://pupasuite.bioinfo.cipf.es

PupaSuite is a SNP analysis tool that allows for the selection of relevant SNPs within a gene based on the characteristics of the SNP. PupaSuite also provides information about LD parameters (based on genotype data from HapMap) and identifies haplotype blocks and tag SNPs. PupaSuite was created to join PupaSNP & PupasView.

SBT Interface - dbMHCTool Content

http://www.ncbi.nlm.nih.gov/gv/mhc/sbt.cgi?cmd=main

The SBT (Sequencing-based typing) interface is a tool accompanying the NCBI's Major Histocompatibility Complex database (dbMHC). It identifies the allelic composition of sequence-based typing (SBT) results of cDNA or genomic sequences. Samples are compared with databases containing all known alleles for several human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptors (KIR) loci. An interactive sequence viewer allows inspection of the alignments.

SNPboxTool Content

http://www.SNPbox.org

Tool for large-scale standardized primer design with specific modules for SNP and exon PCR amplification. Results are presented as HTML with genomic position, primer sequence, and PCR conditions. Precomputed SNPbox results for all Ensembl exons are also available on the server.

Literature > Databases

Literature > Search Tools

PubCrawlerTool Content

http://pubcrawler.gen.tcd.ie/

It goes to the library. You go to the pub; receive email alerts for current contents of PubMed and GenBank; e.g. use accession number of htg record as query to receive sequence updates (as the version number changes).

Literature > Text Mining and Semantics

NLProtTool Content

http://cubic.bioc.columbia.edu/services/nlprot/index.html

NLProt is a tool for finding protein names in natural language text. This data-mining method is a useful approach for extracting protein UniprotIDs from research articles for the construction of custom datasets and/or databases.

Model Organisms > Databases

Model Organisms > Fish

ECR BrowserTool Content

http://ecrbrowser.dcode.org/

The ECR (Evolutionary Conserved Regions) browser is a web-based tool for visualizing and navigating through whole genome alignments of several vertebrate species. Users can also submit sequences for alignment with one of the genomes represented.

Model Organisms > Fly

ECR BrowserTool Content

http://ecrbrowser.dcode.org/

The ECR (Evolutionary Conserved Regions) browser is a web-based tool for visualizing and navigating through whole genome alignments of several vertebrate species. Users can also submit sequences for alignment with one of the genomes represented.

Model Organisms > Microbes

CVTreeTool Content

http://tlife.fudan.edu.cn/cvtree

Composition Vector Tree (CVTree) infers phylogenetic relationships between microbial organisms by comparing their proteomes using a composition vector approach.

Phydbac2Tool Content

http://igs-server.cnrs-mrs.fr/phydbac/

Phydbac2 (Phylogenomic display of bacterial genes) is a tool to visualize and explore the phylogenomic profiles of bacterial protein sequences. It also allows the user to view sequence similarity across different organisms, access other genes with similar conservation profiles, and view genes that are found nearby a selected gene in multiple genomes.

Model Organisms > Mouse and Rat

ASmodelerTool Content

http://genome.ewha.ac.kr/ECgene/ASmodeler/

Gene modeling server which focuses on the modeling of alternative splicing. It is based on the alignment of mRNA, EST and protein sequences and combines genome-based clustering and transcript assembly. Supports human, mouse and rat genomes.

BEARRTool Content

http://giscompute.gis.a-star.edu.sg/~vega/BEARR1.0/

Batch Extraction and Analysis of cis-Regulatory Regions (BEARR) takes a list of gene identifiers (such as RefSeq and Unigene IDs), consensus patterns, and (optionally) a position weight matrix as input and returns a list of matches for the patterns in both the sense and anti-sense strands of the relevant genomic sequence. The user specifies what distance up and downstream from both the transcription start site and the 3\' terminus to look for the patterns.

ECR BrowserTool Content

http://ecrbrowser.dcode.org/

The ECR (Evolutionary Conserved Regions) browser is a web-based tool for visualizing and navigating through whole genome alignments of several vertebrate species. Users can also submit sequences for alignment with one of the genomes represented.

Model Organisms > Other Organisms

Genotyping - NCBITool Content

http://www.ncbi.nih.gov/projects/genotyping/formpage.cgi

The Genotyping tool at the NCBI identifies the genotype (or subtype) of viral sequences by using a sliding window approach to BLAST analysis against reference sequences for different viral subtypes. Results are shown as a graphical output plotting the top-scoring genotype. An alignment tool is also available.

Model Organisms > Other Vertebrates

ECR BrowserTool Content

http://ecrbrowser.dcode.org/

The ECR (Evolutionary Conserved Regions) browser is a web-based tool for visualizing and navigating through whole genome alignments of several vertebrate species. Users can also submit sequences for alignment with one of the genomes represented.

Other Molecules > Databases

Protein > 2-D Structure Prediction

BOMPTool Content

http://www.bioinfo.no/tools/bomp

The beta-barrel Outer Membrane protein Predictor (BOMP) takes one or more fasta-formatted polypeptide sequences from Gram-negative bacteria as input and predicts whether or not they are beta-barrel integral outer membrane proteins.

DICHROWEBTool Content

http://dichroweb.cryst.bbk.ac.uk/html/home.shtml

A server that supports several different algorithms for the analysis of Circular Dichroism (CD) spectra for the prediction of protein secondary structure. Results also contain a graphical comparison of calculated versus experimental results.

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

PRED-TMBBTool Content

http://bioinformatics.biol.uoa.gr/PRED-TMBB/

PRED-TMBB is a tool that takes a Gram-negative bacteria protein sequence as input and predicts the transmembrane strands and the probability of it being an outer membrane beta-barrel protein. The user has a choice of three different decoding methods.

This content is being maintained by pbagos.

PROSPECT-PSPPTool Content

http://csbl.bmb.uga.edu/protein_pipeline

An automated Protein Structure Prediction Pipeline (PSPP) based on multiple structure prediction tools. A key component of the pipeline is the fold recognition program, PROSPECT. Server supports genome scale analyses.

PredictProteinTool Content

http://www.predictprotein.org

PredictProtein is a protein sequence analysis and structure prediction tool. Users provide a protein sequence, and PredictProtein reports similar sequences, PROSITE sequence motifs, and various types of structure prediction information. You can also use META PredictProtein to submit a sequence to one web form and query several other servers at once, retrieving the results by e-mail.

STRIDETool Content

http://webclu.bio.wzw.tum.de/stride/

STRIDE takes a PDB structure as input and reports back either secondary structure assignments, a Ramachandran plot or a contact map.

Protein > 3-D Structural Features

BSDDDatabase Content

http://iris.physics.iisc.ernet.in/bsdd/

BSDD (Biomolecules Segment Display Device) searches for and displays user defined sequence motifs in known protein structures. This web based tool incorporates the graphics package of RASMOL for visualization.

This content is being maintained by kitty.pillai.

BioInfo3DTool Content

http://bioinfo3d.cs.tau.ac.il/

BioInfo3D is a collection of tools for the structural analysis of proteins, including tools for structural alignments and prediction of protein interactions.

This content is being maintained by BioInfo3D.

COLORADO-3DTool Content

http://asia.genesilico.pl/colorado3d/

COLORADO-3D allows you to color your protein structures to indicate the presence of potential errors in protein structure (detected by ANOLEA, PROSAII, PROVE or VERIFY3D), buried residues, and sequence conservation. The server returns a PDB-formatted file which can be displayed in RASMOL.

FoldMiner and LOCK 2Tool Content

http://foldminer.stanford.edu/

FoldMiner aligns a user-supplied or identified query structure to a database of single domain targets to discover structural neighbours and characteristic motifs. Query structures can also be aligned to one or more user-specified structures using the LOCK 2 algorithm.

PDB2PQR ServerTool Content

http://pdb2pqr.sourceforge.net/

Server that allows users to convert PDB files into PQR files by adding missing atoms, optimizing hydrogen bonding and assigning atomic charge and radius parameters. The resulting PQR file can be used for electrostatic calculations which can give insight into the influence of electrostatics on biomolecular structures.

SCitTool Content

http://bioserv.rpbs.jussieu.fr/cgi-bin/SCit

SCit is a set of tools facilitating the analysis and editing of protein side chain conformations. Using a PDB file as input, the tools allow the user to perform such tasks as listing and/or modifying the values of the dihedral angles, listing structurally compatible rotamers for each residue and substituting side chains resulting in a new structure.

Protein > 3-D Structure Comparison

BioInfo3DTool Content

http://bioinfo3d.cs.tau.ac.il/

BioInfo3D is a collection of tools for the structural analysis of proteins, including tools for structural alignments and prediction of protein interactions.

This content is being maintained by BioInfo3D.

CE-MCTool Content

http://bioinformatics.albany.edu/~cemc/

A multiple protein structure alignment server which creates an all-to-all pairwise alignment using a combinatorial extension program and then using Monte Carlo optimization methods conducts an iterative global optimization. Results are formatted using JOY.

FATCATTool Content

http://fatcat.burnham.org/

FATCAT provides the means to compare two PDB-format protein structures, or to search for structures similar to a given PDB structure. The user can supply a PDB ID or upload a structure file. The FATCAT web server employs the Flexible structure AlignmenT by Chaining AFPs (Aligned Fragment Pairs) with Twists (FATCAT) algorithm.

FoldMiner and LOCK 2Tool Content

http://foldminer.stanford.edu/

FoldMiner aligns a user-supplied or identified query structure to a database of single domain targets to discover structural neighbours and characteristic motifs. Query structures can also be aligned to one or more user-specified structures using the LOCK 2 algorithm.

ProteinDBSTool Content

http://proteindbs.rnet.missouri.edu/

ProteinDBS takes a PDB ID or structure as input, and searches for similar protein tertiary structures using computer visualization techniques. The superposition of structures and the aligned component of the sequence can then be viewed over the web. ProteinDBS v2.0 applies advanced feature extraction algorithms and scalable indexing techniques to achieve a fast computation time.

SA-SearchTool Content

http://bioserv.rpbs.jussieu.fr/cgi-bin/SA-Search

SA-Search is a tool which first converts a PDB structure file into a one-dimensional representation using a structural alphabet, and then searches for similarities using standard methods for sequence alignment.

SuperPoseTool Content

http://wishart.biology.ualberta.ca/SuperPose/

SuperPose is a tool for superpositions of protein structures. SuperPose generates sequence alignments, structure alignments, PDB coordinates and RMSD statistics, difference distance plots, and images of the superimposed molecules.

T-COFFEETool Content

http://www.tcoffee.org/

The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.

Protein > 3-D Structure Prediction

COLORADO-3DTool Content

http://asia.genesilico.pl/colorado3d/

COLORADO-3D allows you to color your protein structures to indicate the presence of potential errors in protein structure (detected by ANOLEA, PROSAII, PROVE or VERIFY3D), buried residues, and sequence conservation. The server returns a PDB-formatted file which can be displayed in RASMOL.

CaspRTool Content

http://igs-server.cnrs-mrs.fr/Caspr/index.cgi

CaspR is a web tool for building structural models for protein sequences using molecular replacement and homology modelling. The software implements an automated approach that uses T-COFFEE to produce alignments, MODELLER to produce homology models, and finally uses AMoRe and CNS to produce the structural model.

PROSPECT-PSPPTool Content

http://csbl.bmb.uga.edu/protein_pipeline

An automated Protein Structure Prediction Pipeline (PSPP) based on multiple structure prediction tools. A key component of the pipeline is the fold recognition program, PROSPECT. Server supports genome scale analyses.

PepBuildTool Content

http://www.imtech.res.in/bvs/pepbuild/

PepBuild is a tool that facilitates the construction, from known sequence and secondary/tertiary structures, of capped or uncapped proteins. The output generated is in PDB format.

RobettaTool Content

http://robetta.bakerlab.org/

The Robetta server provides protein structure prediction tools and interface alanine scanning. The structure prediction is accomplished by either comparative modelling or the de novo Rosetta fragment insertion method. Interface alanine scanning is employed to predict how protein-protein interactions could be affected by mutations.

WurstTool Content

http://www.zbh.uni-hamburg.de/wurst/

Wurst is a protein structure prediction tool that uses threading, aligning submitted sequences to thousands of PDB template structures.

Protein > 3-D Structure Retrieval/Viewing

BSDDDatabase Content

http://iris.physics.iisc.ernet.in/bsdd/

BSDD (Biomolecules Segment Display Device) searches for and displays user defined sequence motifs in known protein structures. This web based tool incorporates the graphics package of RASMOL for visualization.

This content is being maintained by kitty.pillai.

CD-SearchDatabase Content

http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi

CD-Search allows the user to search the Conserved Domain Database (CDD) for conserved structural and functional domains in a sequence of interest; links to 3-D structures where applicable.

COLORADO-3DTool Content

http://asia.genesilico.pl/colorado3d/

COLORADO-3D allows you to color your protein structures to indicate the presence of potential errors in protein structure (detected by ANOLEA, PROSAII, PROVE or VERIFY3D), buried residues, and sequence conservation. The server returns a PDB-formatted file which can be displayed in RASMOL.

ElNemoTool Content

http://igs-server.cnrs-mrs.fr/elnemo/start.html

ElNemo (The Elastic Network Model) is a tool for predicting the possible movements (ie. conformational changes and other structural changes) of macromolecules. This tool allows users to compute, visualize, and analyse low-frequency normal modes of a protein.

MONSTERTool Content

http://monster.northwestern.edu/

MONSTER takes a PDB structure file as input, and predicts non-bonding interactions that have the potential to provide stability to the molecule. The output can be viewed or downloaded as XML or text, or can be viewed as a 3D structure or a 2D diagram. There is also a good tutorial available at the site.

This content is being maintained by samseaver.

MolProbityTool Content

http://molprobity.biochem.duke.edu/

MolProbity is a structure validation web service for diagnosing problems in 3D models of proteins, nucleic acids or complexes. It adds and optimizes H atoms (correcting 180° flipped Asn/Gln/His sidechains), and then calculates global and local validation for all-atom contacts (steric clashes, H-bonds & vdW), covalent geometry, and conformation (Ramachandran & rotamers for protein, ribose puckers & suite conformers for RNA). Results are displayed online as 3D graphics and sortable charts.

This content is being maintained by dcrjsr.

ProteinDBSTool Content

http://proteindbs.rnet.missouri.edu/

ProteinDBS takes a PDB ID or structure as input, and searches for similar protein tertiary structures using computer visualization techniques. The superposition of structures and the aligned component of the sequence can then be viewed over the web. ProteinDBS v2.0 applies advanced feature extraction algorithms and scalable indexing techniques to achieve a fast computation time.

SuperPoseTool Content

http://wishart.biology.ualberta.ca/SuperPose/

SuperPose is a tool for superpositions of protein structures. SuperPose generates sequence alignments, structure alignments, PDB coordinates and RMSD statistics, difference distance plots, and images of the superimposed molecules.

Protein > Annotation and Function

GObletTool Content

http://goblet.molgen.mpg.de/

GOblet allows the user to BLAST one or more protein or nucleotide sequences against databases that have sequences mapped to Gene Ontology (GO) terms. Results can be viewed for individual sequences, or as survey statistics for the group if more than one sequence is submitted.

GPCRpredTool Content

http://www.imtech.res.in/raghava/gpcrpred/

GPCRpred is a tool that uses a support vector machine based method to make GPCR family and subfamily predictions for a user-supplied query sequence.

This content is being maintained by raghavagps.

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

LOCtargetTool Content

http://www.rostlab.org/services/LOCtarget/

LOCtarget is a tool for predicting, and a database of pre-computed predictions for, sub-cellular localization of eukaryotic and prokaryotic proteins. Several methods are employed to make the predictions, including text analysis of SWISS-PROT keywords, nuclear localization signals, and the use of neural networks.

MITOPREDTool Content

http://bioinformatics.albany.edu/~mitopred/

MITOPRED uses Pfam domains, pI values and amino acid composition to predict nuclear-encoded mitochondrial proteins. Predictions have been precomputed for a number of proteomes, as well as for all Eukaryotic sequences in Swiss-Prot and TrEMBL. Users may directly enter or upload a file with a list of protein sequences or Swiss-Prot/TrEMBL accession numbers.

MoViESTool Content

http://ang.cz3.nus.edu.sg/cgi-bin/prog/norm.pl

MoViES (Molecular Vibrations Evaluation Server) allows the computation of the vibrational and thermal fluctuations dynamics for the chemical bonds of biomolecules. The server takes as input a structure file in PDB format and returns results by email. Analysis of vibrational motions and thermal fluctuational dynamics can give insights into the functional properties of both proteins and nucleic acids.

PRED-GPCRTool Content

http://athina.biol.uoa.gr/bioinformatics/PRED-GPCR/

PRED-GPCR is a tool which queries user-supplied sequences against a database of HMMs corresponding to G-protein coupled receptor (GPCR) families in order to determine which GPCR family the query sequence most resembles.

Phydbac2Tool Content

http://igs-server.cnrs-mrs.fr/phydbac/

Phydbac2 (Phylogenomic display of bacterial genes) is a tool to visualize and explore the phylogenomic profiles of bacterial protein sequences. It also allows the user to view sequence similarity across different organisms, access other genes with similar conservation profiles, and view genes that are found nearby a selected gene in multiple genomes.

PrediSiTool Content

http://www.predisi.de/

PrediSi (Prediction of Signal Peptides) takes one or more amino acid sequences as input and predicts the likelihood that they are signal peptides as well as their cleavage positions. It can be used to analyse whole proteome datasets.

Proteome AnalystTool Content

http://www.cs.ualberta.ca/~bioinfo/PA/

Proteome Analyst is a high-throughput tool for predicting properties for each protein in a proteome. The user provides a proteome in fasta format, and the system employs Psi-blast, Psipred and Modeller to predict protein function and subcellular localization. Proteome Analyst uses machine-learned classifiers to predict things such as GO molecular function. User-supplied training data can also be used to create custom classifiers.

SDPpredTool Content

http://bioinf.fbb.msu.ru/SDPpred/

SDPpred is a tool for predicting which residues of a protein determine functional differences relative to its homologues. It takes as input an multiple sequence alignment of a protein family divided into groups based on the perceived functional differences.

This content is being maintained by okalinina.

Protein > Biochemical Features

GDAPTool Content

http://www.doe-mbi.ucla.edu/Services/GDAP

The Genomic Disulfide Analysis Program (GDAP) predicts disulfide bonds for a user-supplied protein sequence. GDAP also provides access to pre-computed predictions of disulfide bonds for over 100 microbial genomes.

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

MONSTERTool Content

http://monster.northwestern.edu/

MONSTER takes a PDB structure file as input, and predicts non-bonding interactions that have the potential to provide stability to the molecule. The output can be viewed or downloaded as XML or text, or can be viewed as a 3D structure or a 2D diagram. There is also a good tutorial available at the site.

This content is being maintained by samseaver.

Onto-ToolsTool Content

http://vortex.cs.wayne.edu/Projects.html

Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.

PDB2PQR ServerTool Content

http://pdb2pqr.sourceforge.net/

Server that allows users to convert PDB files into PQR files by adding missing atoms, optimizing hydrogen bonding and assigning atomic charge and radius parameters. The resulting PQR file can be used for electrostatic calculations which can give insight into the influence of electrostatics on biomolecular structures.

ProMoSTTool Content

http://proteomics.mcw.edu/promost

ProMoST (Protein Modification Screening Tool) is a program to calculate accurate MW and pI values from proteins considering the effects of post-translational modifications. Results are displayed as calculated values of pI and MW for each protein and are also plotted on two-dimensional (2D) gel images.

This content is being maintained by halligan.

QgridTool Content

http://www.netasa.org/qgrid/index.html

Server which provides cluster tree diagrams of a protein based on the charged atoms or hydrophobicity of each of its residues. The diagram allows for visual inspection of the distribution of hydrophobic and charged regions in proteins.

Protein > Databases

Protein > Domains and Motifs

CD-SearchDatabase Content

http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi

CD-Search allows the user to search the Conserved Domain Database (CDD) for conserved structural and functional domains in a sequence of interest; links to 3-D structures where applicable.

CHOPTool Content

http://www.rostlab.org/services/CHOP/

CHOP takes a protein sequence as input, and returns a list of protein sequence fragments with homology to PDB and Pfam domains and to proteins from the SWISS-PROT database.

FoldMiner and LOCK 2Tool Content

http://foldminer.stanford.edu/

FoldMiner aligns a user-supplied or identified query structure to a database of single domain targets to discover structural neighbours and characteristic motifs. Query structures can also be aligned to one or more user-specified structures using the LOCK 2 algorithm.

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

MyHitsTool Content

http://myhits.isb-sib.ch

The MyHits server integrates several tools with a focus on protein annotation and the analysis of protein domains. Guest users have access to tools such as ClustalW and T-Coffee and databases like Swiss-Prot, Prosite and Interpro. Registration allows users to store their results in private databases, and is free for academic users.

This content is being maintained by myhits.

NRPS-PKSTool Content

http://www.nii.res.in/nrps-pks.html

NRPS-PKS is an tool comprising four integrated databases for the analysis of large multi-enzymatic multi-domain megasynthases. The user can submit a query sequence to search for domains or view the properties of the products.

PRED-GPCRTool Content

http://athina.biol.uoa.gr/bioinformatics/PRED-GPCR/

PRED-GPCR is a tool which queries user-supplied sequences against a database of HMMs corresponding to G-protein coupled receptor (GPCR) families in order to determine which GPCR family the query sequence most resembles.

PredictProteinTool Content

http://www.predictprotein.org

PredictProtein is a protein sequence analysis and structure prediction tool. Users provide a protein sequence, and PredictProtein reports similar sequences, PROSITE sequence motifs, and various types of structure prediction information. You can also use META PredictProtein to submit a sequence to one web form and query several other servers at once, retrieving the results by e-mail.

SledgeHMMERTool Content

http://bioapps.rit.albany.edu/sledgeHMMER/

SledgeHMMER is a tool for searching the Pfam database using a parallelized version of the program hmmpfam. The user can perform queries with one or more sequences at a time and then receive the results by e-mail.

Protein > Interactions, Pathways, Enzymes

BioInfo3DTool Content

http://bioinfo3d.cs.tau.ac.il/

BioInfo3D is a collection of tools for the structural analysis of proteins, including tools for structural alignments and prediction of protein interactions.

This content is being maintained by BioInfo3D.

Onto-ToolsTool Content

http://vortex.cs.wayne.edu/Projects.html

Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.

PathBLASTTool Content

http://www.pathblast.org/

PathBLAST is a tool for cross-species comparison of protein interaction networks. PathBLAST takes a short protein interaction path as input and searches against an available protein-protein interation network specified by the user.

RobettaTool Content

http://robetta.bakerlab.org/

The Robetta server provides protein structure prediction tools and interface alanine scanning. The structure prediction is accomplished by either comparative modelling or the de novo Rosetta fragment insertion method. Interface alanine scanning is employed to predict how protein-protein interactions could be affected by mutations.

SBT Interface - dbMHCTool Content

http://www.ncbi.nlm.nih.gov/gv/mhc/sbt.cgi?cmd=main

The SBT (Sequencing-based typing) interface is a tool accompanying the NCBI's Major Histocompatibility Complex database (dbMHC). It identifies the allelic composition of sequence-based typing (SBT) results of cDNA or genomic sequences. Samples are compared with databases containing all known alleles for several human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptors (KIR) loci. An interactive sequence viewer allows inspection of the alignments.

WebInterViewerTool Content

http://interviewer.inha.ac.kr/

WebInterViewer is a tool for visualizing molecular interaction networks. It employs a fast-layout algorithm, and can visualize data from interaction databases directly.

This content is being maintained by khan.

Protein > Localization and Targeting

BOMPTool Content

http://www.bioinfo.no/tools/bomp

The beta-barrel Outer Membrane protein Predictor (BOMP) takes one or more fasta-formatted polypeptide sequences from Gram-negative bacteria as input and predicts whether or not they are beta-barrel integral outer membrane proteins.

ESLpredTool Content

http://www.imtech.res.in/raghava/eslpred/

ESLpred is a tool for predicting subcellular localization of proteins using support vector machines. The predictions are based on dipeptide and amino acid composition, and physico-chemical properties.

This content is being maintained by raghavagps.

PRED-TMBBTool Content

http://bioinformatics.biol.uoa.gr/PRED-TMBB/

PRED-TMBB is a tool that takes a Gram-negative bacteria protein sequence as input and predicts the transmembrane strands and the probability of it being an outer membrane beta-barrel protein. The user has a choice of three different decoding methods.

This content is being maintained by pbagos.

PrediSiTool Content

http://www.predisi.de/

PrediSi (Prediction of Signal Peptides) takes one or more amino acid sequences as input and predicts the likelihood that they are signal peptides as well as their cleavage positions. It can be used to analyse whole proteome datasets.

Protein > Molecular Dynamics and Docking

BioInfo3DTool Content

http://bioinfo3d.cs.tau.ac.il/

BioInfo3D is a collection of tools for the structural analysis of proteins, including tools for structural alignments and prediction of protein interactions.

This content is being maintained by BioInfo3D.

ElNemoTool Content

http://igs-server.cnrs-mrs.fr/elnemo/start.html

ElNemo (The Elastic Network Model) is a tool for predicting the possible movements (ie. conformational changes and other structural changes) of macromolecules. This tool allows users to compute, visualize, and analyse low-frequency normal modes of a protein.

MoViESTool Content

http://ang.cz3.nus.edu.sg/cgi-bin/prog/norm.pl

MoViES (Molecular Vibrations Evaluation Server) allows the computation of the vibrational and thermal fluctuations dynamics for the chemical bonds of biomolecules. The server takes as input a structure file in PDB format and returns results by email. Analysis of vibrational motions and thermal fluctuational dynamics can give insights into the functional properties of both proteins and nucleic acids.

Protein > Phylogeny Reconstruction

CVTreeTool Content

http://tlife.fudan.edu.cn/cvtree

Composition Vector Tree (CVTree) infers phylogenetic relationships between microbial organisms by comparing their proteomes using a composition vector approach.

Phydbac2Tool Content

http://igs-server.cnrs-mrs.fr/phydbac/

Phydbac2 (Phylogenomic display of bacterial genes) is a tool to visualize and explore the phylogenomic profiles of bacterial protein sequences. It also allows the user to view sequence similarity across different organisms, access other genes with similar conservation profiles, and view genes that are found nearby a selected gene in multiple genomes.

T-COFFEETool Content

http://www.tcoffee.org/

The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.

Protein > Presentation and Format

T-COFFEETool Content

http://www.tcoffee.org/

The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.

WAViSTool Content

http://wavis.img.cas.cz/

The Web Alignment Visualization Server (WAViS) provides various web tools to enhance the presentation of amino acid or nucleotide multiple sequence alignments.

Protein > Protein Expression

ProMoSTTool Content

http://proteomics.mcw.edu/promost

ProMoST (Protein Modification Screening Tool) is a program to calculate accurate MW and pI values from proteins considering the effects of post-translational modifications. Results are displayed as calculated values of pI and MW for each protein and are also plotted on two-dimensional (2D) gel images.

This content is being maintained by halligan.

Protein > Proteomics

ISOTOPICATool Content

http://coco.protein.osaka-u.ac.jp/isotopica/

Tools developed to aid in the identification of mass spectrum that allow the calculation of mass values with isotopic distributions based on molecular formulas, peptides/proteins, DNA/RNA, carbohydrate sequences or combinations thereof. A viewer for visualizing results is also available.

ProMoSTTool Content

http://proteomics.mcw.edu/promost

ProMoST (Protein Modification Screening Tool) is a program to calculate accurate MW and pI values from proteins considering the effects of post-translational modifications. Results are displayed as calculated values of pI and MW for each protein and are also plotted on two-dimensional (2D) gel images.

This content is being maintained by halligan.

Proteome AnalystTool Content

http://www.cs.ualberta.ca/~bioinfo/PA/

Proteome Analyst is a high-throughput tool for predicting properties for each protein in a proteome. The user provides a proteome in fasta format, and the system employs Psi-blast, Psipred and Modeller to predict protein function and subcellular localization. Proteome Analyst uses machine-learned classifiers to predict things such as GO molecular function. User-supplied training data can also be used to create custom classifiers.

Protein > Sequence Features

GDAPTool Content

http://www.doe-mbi.ucla.edu/Services/GDAP

The Genomic Disulfide Analysis Program (GDAP) predicts disulfide bonds for a user-supplied protein sequence. GDAP also provides access to pre-computed predictions of disulfide bonds for over 100 microbial genomes.

LowComplexityTool Content

http://wwwmgs.bionet.nsc.ru/mgs/programs/low_complexity/

LowComplexity is a tool that searches for low complexity regions of DNA or protein sequences. Using LowComplexity you can search long sequences (chromosomes, genomes) or a set of aligned sequences. This resource also contains links to other algorithms for evaluating the complexity of sequences.

Protein > Sequence Retrieval

BLASTTool Content

http://www.ncbi.nlm.nih.gov/BLAST/

Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.

NLProtTool Content

http://cubic.bioc.columbia.edu/services/nlprot/index.html

NLProt is a tool for finding protein names in natural language text. This data-mining method is a useful approach for extracting protein UniprotIDs from research articles for the construction of custom datasets and/or databases.

PubCrawlerTool Content

http://pubcrawler.gen.tcd.ie/

It goes to the library. You go to the pub; receive email alerts for current contents of PubMed and GenBank; e.g. use accession number of htg record as query to receive sequence updates (as the version number changes).

RNA > Databases

RNA > Motifs

ERPINTool Content

http://tagc.univ-mrs.fr/erpin/

ERPIN (Easy RNA Profile IdentificatioN) takes as input an RNA sequence alignment and secondary structure annotation and will identify a wide variety of known RNA motifs (such as tRNAs, 5S rRNAs, SRP RNA, C/D box snoRNAs, hammerhead motifs, miRNAs and others) in your sequence(s) of interest. Also contains tool for drawing secondary structure of motifs.

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

Riboswitch finderTool Content

http://riboswitch.bioapps.biozentrum.uni-wuerzburg.de/server.html

RNA motif search program that identifies RNA motifs called riboswitches which are metabolic binding domains in mRNA that regulate gene expression. The program was originally designed around a set of riboswitches found in Bacillus subtilis.

RNA > Sequence Retrieval

BLASTTool Content

http://www.ncbi.nlm.nih.gov/BLAST/

Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.

PubCrawlerTool Content

http://pubcrawler.gen.tcd.ie/

It goes to the library. You go to the pub; receive email alerts for current contents of PubMed and GenBank; e.g. use accession number of htg record as query to receive sequence updates (as the version number changes).

RNA > Structure Prediction, Visualization, and Design

DEQORDatabase Content

http://deqor.mpi-cbg.de/deqor_new/input.html

Tool which aids in the design and quality control of small interfering RNAs (siRNAs) for RNA interference (RNAi) and gene silencing. It evaluates the inhibitory potency of potential siRNA sequences as well as identifying gene regions that have a high silencing capacity.

ERPINTool Content

http://tagc.univ-mrs.fr/erpin/

ERPIN (Easy RNA Profile IdentificatioN) takes as input an RNA sequence alignment and secondary structure annotation and will identify a wide variety of known RNA motifs (such as tRNAs, 5S rRNAs, SRP RNA, C/D box snoRNAs, hammerhead motifs, miRNAs and others) in your sequence(s) of interest. Also contains tool for drawing secondary structure of motifs.

ILMTool Content

http://cic.cs.wustl.edu/RNA/

Server which provides iterated loop matching and maximum weighted matching algorithms for pseudoknot containing RNA secondary structure prediction. Algorithms can apply thermodynamic and comparative information, and thus can be used for either aligned or individual sequences.

MoViESTool Content

http://ang.cz3.nus.edu.sg/cgi-bin/prog/norm.pl

MoViES (Molecular Vibrations Evaluation Server) allows the computation of the vibrational and thermal fluctuations dynamics for the chemical bonds of biomolecules. The server takes as input a structure file in PDB format and returns results by email. Analysis of vibrational motions and thermal fluctuational dynamics can give insights into the functional properties of both proteins and nucleic acids.

MolProbityTool Content

http://molprobity.biochem.duke.edu/

MolProbity is a structure validation web service for diagnosing problems in 3D models of proteins, nucleic acids or complexes. It adds and optimizes H atoms (correcting 180° flipped Asn/Gln/His sidechains), and then calculates global and local validation for all-atom contacts (steric clashes, H-bonds & vdW), covalent geometry, and conformation (Ramachandran & rotamers for protein, ribose puckers & suite conformers for RNA). Results are displayed online as 3D graphics and sortable charts.

This content is being maintained by dcrjsr.

siDirectTool Content

http://design.RNAi.jp/

Server for computing small interfering RNA (siRNA) sequences which are best suited for mammalian RNA interference (RNAi). The site accepts a sequence as input and returns a list of siRNA candidates.

siRNA Selection ServerTool Content

http://jura.wi.mit.edu/bioc/siRNA

Server aiding the design of short interfering RNAs (siRNAs) by providing information on stability, SNPs and specificity of a potential siRNA.

This content is being maintained by lewitter.

Sequence Comparison > Alignment Editing and Visualization

CHAOS/DIALIGN WWW serverTool Content

http://dialign.gobics.de/chaos-dialign-submission

The CHAOS/DIALIGN WWW server is a multiple sequence alignment site which passes input sequences through CHAOS to create a list of local similarites. These similarities serve as anchor points, allowing DIALIGN to conduct global alignments faster. ABC can then be used for the interactive visualization of the alignment.

T-COFFEETool Content

http://www.tcoffee.org/

The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.

WAViSTool Content

http://wavis.img.cas.cz/

The Web Alignment Visualization Server (WAViS) provides various web tools to enhance the presentation of amino acid or nucleotide multiple sequence alignments.

Sequence Comparison > Analysis of Aligned Sequences

AGenDATool Content

http://bibiserv.techfak.uni-bielefeld.de/agenda/

AGenDA is a web tool that compares the genomic sequences from evolutionarily related organisms in order to make gene predictions. It takes pairs of genomic sequences as input, aligns the sequences, and makes predictions based on splice signals, start and stop codons, and areas of conserved sequence.

ASmodelerTool Content

http://genome.ewha.ac.kr/ECgene/ASmodeler/

Gene modeling server which focuses on the modeling of alternative splicing. It is based on the alignment of mRNA, EST and protein sequences and combines genome-based clustering and transcript assembly. Supports human, mouse and rat genomes.

CRASPTool Content

http://wwwmgs.bionet.nsc.ru/mgs/programs/crasp/

Correlation analysis of the amino acid substitutions in protein sequences (CRASP) takes multiple alignments of amino acid sequences as input, and detects coordinated residue substitutions. These substitutions may suggest dependent evolution of functionally related pairs of amino acids.

CSTminerTool Content

http://www.caspur.it/CSTminer/

CSTminer searches for Conserved Sequence Tags (CST) in pairwise comparisons of genomic sequences. Output is a graphical map showing the location of CSTs overlayed with known Ensembl exons and ESTs. A scoring system reflects coding or non-coding potential of these conserved blocks.

ECR BrowserTool Content

http://ecrbrowser.dcode.org/

The ECR (Evolutionary Conserved Regions) browser is a web-based tool for visualizing and navigating through whole genome alignments of several vertebrate species. Users can also submit sequences for alignment with one of the genomes represented.

GObletTool Content

http://goblet.molgen.mpg.de/

GOblet allows the user to BLAST one or more protein or nucleotide sequences against databases that have sequences mapped to Gene Ontology (GO) terms. Results can be viewed for individual sequences, or as survey statistics for the group if more than one sequence is submitted.

LowComplexityTool Content

http://wwwmgs.bionet.nsc.ru/mgs/programs/low_complexity/

LowComplexity is a tool that searches for low complexity regions of DNA or protein sequences. Using LowComplexity you can search long sequences (chromosomes, genomes) or a set of aligned sequences. This resource also contains links to other algorithms for evaluating the complexity of sequences.

PDATool Content

http://pda.uab.es/pda2/

PDA (Pipeline Diversity Analysis) is a server which searches for polymorphic sequences in large databases, and estimates their genetic diversity. Results contain sequence alignments (generated by ClustalW) and include statistics on polymorphism, synonymous and non-synonymous substitutions, linkage disequilibrium, and codon bias for your alignment. The full results set is also available for download as a MySQL database.

This content is being maintained by soniacas.

Phydbac2Tool Content

http://igs-server.cnrs-mrs.fr/phydbac/

Phydbac2 (Phylogenomic display of bacterial genes) is a tool to visualize and explore the phylogenomic profiles of bacterial protein sequences. It also allows the user to view sequence similarity across different organisms, access other genes with similar conservation profiles, and view genes that are found nearby a selected gene in multiple genomes.

SDPpredTool Content

http://bioinf.fbb.msu.ru/SDPpred/

SDPpred is a tool for predicting which residues of a protein determine functional differences relative to its homologues. It takes as input an multiple sequence alignment of a protein family divided into groups based on the perceived functional differences.

This content is being maintained by okalinina.

T-COFFEETool Content

http://www.tcoffee.org/

The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.

Sequence Comparison > Comparative Genomics

Phydbac2Tool Content

http://igs-server.cnrs-mrs.fr/phydbac/

Phydbac2 (Phylogenomic display of bacterial genes) is a tool to visualize and explore the phylogenomic profiles of bacterial protein sequences. It also allows the user to view sequence similarity across different organisms, access other genes with similar conservation profiles, and view genes that are found nearby a selected gene in multiple genomes.

Sequence Comparison > Multiple Sequence Alignments

CE-MCTool Content

http://bioinformatics.albany.edu/~cemc/

A multiple protein structure alignment server which creates an all-to-all pairwise alignment using a combinatorial extension program and then using Monte Carlo optimization methods conducts an iterative global optimization. Results are formatted using JOY.

CHAOS/DIALIGN WWW serverTool Content

http://dialign.gobics.de/chaos-dialign-submission

The CHAOS/DIALIGN WWW server is a multiple sequence alignment site which passes input sequences through CHAOS to create a list of local similarites. These similarities serve as anchor points, allowing DIALIGN to conduct global alignments faster. ABC can then be used for the interactive visualization of the alignment.

IBM Bioinformatics and Pattern Discovery GroupTool Content

http://cbcsrv.watson.ibm.com/Tspd.html

Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.

MyHitsTool Content

http://myhits.isb-sib.ch

The MyHits server integrates several tools with a focus on protein annotation and the analysis of protein domains. Guest users have access to tools such as ClustalW and T-Coffee and databases like Swiss-Prot, Prosite and Interpro. Registration allows users to store their results in private databases, and is free for academic users.

This content is being maintained by myhits.

T-COFFEETool Content

http://www.tcoffee.org/

The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.

Sequence Comparison > Other Alignment Tools

E2GTool Content

http://bibiserv.techfak.uni-bielefeld.de/e2g/

E2G is a tool that maps a large set of EST and cDNA sequences to a user-supplied genomic sequence. The use of pre-computed indexed data structures increases the efficiency of the sequence comparison process, allowing a large amount of data to be mapped within a reasonable timeframe.

Genotyping - NCBITool Content

http://www.ncbi.nih.gov/projects/genotyping/formpage.cgi

The Genotyping tool at the NCBI identifies the genotype (or subtype) of viral sequences by using a sliding window approach to BLAST analysis against reference sequences for different viral subtypes. Results are shown as a graphical output plotting the top-scoring genotype. An alignment tool is also available.

IMGT/V-QUESTTool Content

http://imgt.cines.fr/

The international ImMunoGeneTics information system (IMGT) V-QUEry and STandardization (V-QUEST) tool compares a user-provided fasta-formatted germline or rearranged T cell receptor or immunoglobulin variable sequence with a reference set of sequences.

IccareTool Content

http://genopole.toulouse.inra.fr/bioinfo/Iccare/

The Iccare (Interspecific Comparative Clustering and Annotation foR Est) web server compares all available EST and mRNA sequences for a query organism against the set of transcripts for a reference organism. The results are presented graphically and relative to the location of genes on the chromosomes of the reference organism.

T-COFFEETool Content

http://www.tcoffee.org/

The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.

Sequence Comparison > Similarity Searching

BLASTTool Content

http://www.ncbi.nlm.nih.gov/BLAST/

Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.

CHOPTool Content

http://www.rostlab.org/services/CHOP/

CHOP takes a protein sequence as input, and returns a list of protein sequence fragments with homology to PDB and Pfam domains and to proteins from the SWISS-PROT database.

Onto-ToolsTool Content

http://vortex.cs.wayne.edu/Projects.html

Onto-Tools is a suite of tools for data mining based on information from Gene Ontology (GO). Onto-Tools includes an annotation database and the data mining tools: Onto-Express, Onto-Compare, Onto-Design, Onto-Translate, Onto-Miner, Pathway-Express, Promoter-Express, nsSNPCounter, TAQ, and OE2GO; free registration is required.

T-COFFEETool Content

http://www.tcoffee.org/

The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.