Allows correlation of GeneChip array results with array design and annotation information; provides access to array content information, including probe sequences and gene annotations; free registration is required.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
The Berkeley Phylogenomics Group provides a series of
web servers for phylogenomic analysis: classification of sequences to pre-computed families and subfamilies using the PhyloFacts Phylogenomic Encyclopedia, FlowerPower clustering of proteins sharing the same domain architecture, MUSCLE multiple sequence alignment, SATCHMO simultaneous alignment and tree construction, and SCI-PHY subfamily identification.
Server which scans upstream of genes in the same gene expression cluster for regulatory sequence motifs using a Gibbs sampling strategy. The Markov background model is used for non-motif bases, improving specificity of predicted motif locations.
Family Identification with Structure Anchored HMMs (FISH) is a server for the identification of remote sequence homologues on the basis of protein domains.
Gene3D provides accurate structural domain family assignments for over 1100 genomes and nearly 10,000,000 proteins. A hidden Markov model library, constructed from the manually curated CATH structural domain hierarchy, is used to search UniProt, RefSeq and Ensembl protein sequences. The resulting matches are refined into simple multi-domain architectures. The domain assignments are integrated with multiple external protein function descriptions (e.g. Gene Ontology and KEGG), structural annotations (e.g. coiled coils, disordered regions and sequence polymorphisms) and family resources (e.g. Pfam and eggNog). Gene3D also provides a set of services, including an interactive genome coverage graph visualizer, DAS annotation resources, sequence search facilities and SOAP services.
The GeneMark family of programs employ Markov models and are specifically tuned for gene prediction for sequences from prokaryotes, viral genomes and eukaryotes.
Gibbs Motif Sampler allows you to identify motifs, conserved regions, in DNA or protein sequences. This tool can be applied for the detection of transcription factor binding sites (TFBS).
Gene Locator and Interpolated Markov Modeler; this prokaryote-gene finding tool is the primary microbial gene finder used at TIGR; free (including source code) with registration for non-commercial use.
GRIFFIN (G-protein-Receptor Interacting Feature Finding INstrument) uses a support vector machine and hidden markov model to predict G-protein coupled receptors (GPCRs) and G-protein coupling selectivity.
Based on the comparison of profile HMMs, HHpred takes a protein sequence or multiple sequence alignment as input and searches for remote homologues in an assortment of databases such as PDB, SMART and Pfam. The user can select either a local or global alignment method, and the search results can be used to generate 3D structural models.
HHrep is a tool for the de novo identification
of repeats in protein sequences based on the pairwise comparison of profile hidden Markov models (HMMs).
HHsenser is a tool for sensitive iterative sequence searching based on HMM-HMM comparison. Starting from a single sequence or an alignment, HHSenser is able to build alignments with as many near or remote homologs as possible allowing users to explore protein superfamilies.