Alternative Splicing Prediction (ASPic) can predict alternative splicing of user submitted genes based on comparative analysis of available transcript and genome data from a variety of species. Results include graphical and tabular views of the splicing patterns of full-length mRNA isoforms compatible with the detected splice sites of the genes as well as structural and functional annotations.
E2G is a tool that maps a large set of EST and cDNA sequences to a user-supplied genomic sequence. The use of pre-computed indexed data structures increases the efficiency of the sequence comparison process, allowing a large amount of data to be mapped within a reasonable timeframe.
The Genotyping tool at the NCBI identifies the genotype (or subtype) of viral sequences by using a sliding window approach to BLAST analysis against reference sequences for different viral subtypes. Results are shown as a graphical output plotting the top-scoring genotype. An alignment tool is also available.
HHrep is a tool for the de novo identification
of repeats in protein sequences based on the pairwise comparison of profile hidden Markov models (HMMs).
HMMER Web Server is a tool for interactive sequence similarity searching. Methods are available for single protein sequence, multiple protein sequence alignment or profile HMM against a target sequence databases, and for searching a protein sequence against Pfam.
The Iccare (Interspecific Comparative Clustering and Annotation foR Est) web server compares all available EST and mRNA sequences for a query organism against the set of transcripts for a reference organism. The results are presented graphically and relative to the location of genes on the chromosomes of the reference organism.
The international ImMunoGeneTics information system (IMGT) V-QUEry and STandardization (V-QUEST) tool compares a user-provided fasta-formatted germline or rearranged T cell receptor or immunoglobulin variable sequence with a reference set of sequences.
MaxAlign is a web server for maximizing usable data in an alignment. It maximizes the number of nucleotide (or amino acid) symbols present in gap-free columns by selecting the optimal subset of sequences to exclude from the alignment.
PAL2NAL converts a multiple sequence alignment of proteins and the corresponding DNA (or mRNA) sequences into a codon alignment. Synonymous (Ks) and non-synonymous (Ka) substitution rates can be calculated.
The T-COFFEE site includes links to a collection of tools for computing, evaluating, and manipulating multiple alignments of protein sequences and structures. T-COFFEE is a protein multiple sequence alignment tool for sequences with less than 30 percent identity. Expresso (or 3DCoffee) aligns sequences using structural information. M-Coffee is a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Other algorithms provide RNA sequence alignment, and distantly related sequence alignment by homology extension. A consistency based multiple sequence alignment program has been added.