BActeriocin GEnome mining tooL (BAGEL) identifies putative bacteriocin ORFs (antimicrobial peptides) based on a database containing information about known bacteriocins and adjacent genes involved in bacteriocin activity.
BAGEL2 (BActeriocin GEnome mining tooL) identifies putative bacteriocins on the basis of conserved domains, physical properties and genomic context. Improved genome mining capacity from BAGEL.
The Bioinformatics Resource for Oral Pathogens (BROP) contains tools for genomics of oral pathogens including Genome Viewer, GOAL (genome wide ORF alignment), an oral pathogen microarray database, an entrez counter, oral pathogen specific BLAST, and a codon usage database.
The Candida Genome Database provides online access to genomic sequence data and manually curated functional information about genes and proteins of the human pathogen Candida albicans. Two tools have been added to the DB. The Biochemical Pathways tool provides visualization of metabolic pathways and analysis tools that facilitate interpretation of experimental data, including results of large-scale experiments, in the context of Candida metabolism. Textpresso for Candida allows searching through the full-text of Candida-specific literature, including clinical and epidemiological studies.
CyanoBase is the genome database for cyanobacteria, which are model organisms for photosynthesis. The database houses cyanobacteria species information, complete genome sequences, genome-scale experiment data, gene information, gene annotations and mutant information.
The Ensembl project provides high quality, integrated annotation on chordate and selected eukaryotic genomes within a consistent and accessible infrastructure. All supported species include comprehensive, evidence-based gene annotations and a selected set of genomes includes additional data focused on variation, comparative, evolutionary, functional and regulatory annotation. The most advanced resources are provided for key species including human, mouse, rat and zebrafish.
Ensembl Genomes is a new portal offering integrated access to genome-scale data from non-vertebrate species of scientific interest, developed using the Ensembl genome annotation and visualisation platform. Ensembl Genomes consists of five sub-portals (for bacteria, protists, fungi, plants and invertebrate metazoa) designed to complement the availability of vertebrate genomes in Ensembl. Data types incorporated include annotation of (protein and non-protein coding) genes, cross references to external resources, and high throughput experimental data (e.g. data from large scale studies of gene expression and polymorphism visualised in their genomic context). Additionally, extensive comparative analysis has been performed, both within defined clades and across the wider taxonomy, and sequence alignments and gene trees resulting from this can be accessed through the site.
Gene Locator and Interpolated Markov Modeler; this prokaryote-gene finding tool is the primary microbial gene finder used at TIGR; free (including source code) with registration for non-commercial use.
The H-Invitational Database (H-InvDB) is a comprehensive annotation resource of human genes and transcripts. The latest release of H-InvDB (release 6.2) provides the annotation for 219,765 human transcripts in 43,159 human gene clusters based on human full-length cDNAs and mRNAs. H-InvDB now provides several new annotation features, such as mapping of microarray probes, new gene models, relation to known ncRNAs and information from the Glycogene database. H-InvDB also provides useful data mining resources-'Navigation search', 'H-InvDB Enrichment Analysis Tool (HEAT)' and web service APIs.
Extensive server possessing a wide range of tools for pattern discovery in DNA and protein sequences as well as in text. Tools for multiple sequence alignment, gene discovery, protein annotation, and other applications also exist on this server. A detailed help page is provided for all tools.
Lots of information on fugu including complete draft sequence, annotations, comparative vertebrate genomics, phylogenies, publications, and information about the fugu genome project; has some very useful tools including annotation tools, an Ensembl mirror, and BLAST.
MADAP is a clustering tool for the interpretation annotation data mapped onto complete or partial genome sequences. Initially developed for determining transcription start sites (TSS) by defining 5\' and 3\'ends of mRNA, MADAP also has utility in clustering other annotation data types (ChIP-chip data, for example).