The antibiotic and Secondary Metabolite Analysis Shell (antiSMASH) is a resource for identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences.
The Aspergillus Genome Database (AspGD) is an online genomics resource for researchers studying the genetics and molecular biology of the Aspergilli. AspGD combines high-quality manual curation of the experimental scientific literature examining the genetics and molecular biology of Aspergilli, cutting-edge comparative genomics approaches to iteratively refine and improve structural gene annotations across multiple Aspergillus species, and web-based research tools for accessing and exploring the data.
The Candida Genome Database provides online access to genomic sequence data and manually curated functional information about genes and proteins of the human pathogen Candida albicans. Two tools have been added to the DB. The Biochemical Pathways tool provides visualization of metabolic pathways and analysis tools that facilitate interpretation of experimental data, including results of large-scale experiments, in the context of Candida metabolism. Textpresso for Candida allows searching through the full-text of Candida-specific literature, including clinical and epidemiological studies.
Server which attempts to identify any motifs related to genes predicted to share regulatory elements. It alters Gibbs sampling through biasing searches towards conserved sequences across multiple species.
DRYGIN (Data Repository of Yeast Genetic Interactions)-a web database system that provides a central platform for yeast genetic network analysis and visualization. DRYGIN searches SGA interactions and integrates other data sources, in order to associate the genetic interactions with pathway information, protein complexes, other binary genetic and physical interactions, and Gene Ontology functional annotation.
Gene Set Builder is a database-driven, web-based tool designed to help researchers compile, store, export, and share sets of genes. This application supports the 17 eukaryotic genomes found in version 32 of the Ensembl database, which includes species from yeast to human. User-created information such as sets and customized annotations are stored and can be shared to facilitate easy access. Gene sets stored in the system can be exported in a variety of output formats - as lists of identifiers, in tables, or as sequences.
Tool which divides both strands of an input DNA sequence into a set of contiguous oligonucleotides. These oligos are designed with complimentary regions so that the complete set can be combined to synthesize the input DNA sequence in vitro (using oligonucleotide synthesis and assembly PCR).
GeneTrail analyzes gene sets for statistically significant accumulations of genes belonging to functional categories. Two common statistical approaches are implemented including: Over-Representation Analysis (ORA) for comparing a reference set of genes to a test set; and, Gene Set Enrichment Analysis (GSEA) for scoring sorted lists of genes.
Server designed to pinpoint protein-DNA interaction sites at the base pair level. Uses ChIP-array data, word enumeration and position-specific weight matrix updating to search for motifs representing these interaction sites.
Online Resource for EST (OREST) analysis is a EST analysis pipeline which allows rapid analysis of large amounts of ESTs or cDNAs from mammals and fungi. Functional annotation of the dataset is also included via either FunCat or GO annotation.
Regulatory Sequence Analysis Tools (RSAT) detects regulatory signals in non-coding sequences; tools include sequence retrieval, pattern discovery, pattern matching, genome-scale pattern matching, feature-map drawing, random sequence generation and other utilities. Tools may be accessed separately or connected to other tools. As well, the tools are available as web services for integration into programmatic work flows.
New tools incorporated include an automated sequence retrieval from EnsEMBL, motif comparison and motif quality tools, and the capacity to process ChIP-sequence data.
The Saccharomyces Genome Database (SGD) is a scientific database for the molecular biology and genetics of the yeast Saccharomyces cerevisiae. The information in SGD includes functional annotations, mapping and sequence information, protein domains and structure, expression data, mutant phenotypes, physical and genetic interactions and the primary literature from which these data are derived. Phenotypes and genetic interaction data are also annotated and displayed in SGD.