AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
BeeSpace Question/Answering (BSQA) system is a web server that performs integrated text mining for insect biology, covering diverse aspects from molecular interactions of genes to insect behavior. BSQA recognizes a number of entities and relations in Medline documents about the model insect, Drosophila melanogaster.
The DAnCER (Disease Annotated Chromatin Epigenetic Resource) resource integrates information on genes with chromatin modification (CM) function from five model organisms, including human. Currently integrated are gene functional annotations, Pfam domain architecture, protein interaction networks and associated human diseases. Additional supporting evidence includes orthology relationships across organisms, membership in protein complexes, and information on protein 3D structure. DAnCER allows visual explorations of the integrated data and flexible query capabilities using a variety of data filters. In particular, disease information and functional annotations are mapped onto the protein interaction networks.
E-RNAi is a tool for designing and evaluating dsRNA constructs suitable for RNAi experiments in 12 organisms including Drosophila, C. elegans, human and other emerging model organisms. E-RNAi also facilitates the design of secondary RNAi reagents for validation experiments, evaluation of pooled siRNA reagents and batch design.
ENDEAVOUR is a computational approach to prioritize candidate genes for further study. Users input training genes already known to be involved in the process under study and the candidate genes to prioritize. The web server supports multiple species and integrates multiple genomic data to generate the final prioritization.
ENDEAVOUR has been benchmarked in silico and experimentally validated leading to the discovery of novel disease genes in several genetic diseases.
FlyBase is a database of genetic and molecular data for Drosophila. FlyBase includes data on all species from the family Drosophilidae; the primary species represented is Drosophila melanogaster.
FlyTF is a database of computationally predicted and/or experimentally verified site-specific transcription factors (TFs) in the fruit fly Drosophila melanogaster. This database contains information on the manual curation of 1052 FlyBase identifiers, which are putative site-specific transcription factors, based on FlyBase/Gene Ontology annotation or the DBD Transcription Factor Database.
Genome Surveyor is a tool for discovery and analysis of cis-regulatory elements and transcription factors in Drosphila built on the GBrowse genome browser.
The GenomeRNAi database contains phenotypes from published cell-based RNA interference (RNAi) screens in Drosophila and Homo sapiens. The database connects observed phenotypes with annotations of targeted genes and information about the RNAi reagent used for the perturbation experiment. GenomeRNAi also enables reporting of data from microscopy experiments and curated phenotypes from published screens. In addition, the database provides an updated resource of RNAi reagents and their predicted quality that are available for the Drosophila and the human genome. GenomeRNAi also facilitates the integration with other genomic data sets and contains expression profiling (RNA-Seq) data for several cell lines commonly used in RNAi experiments.
iHOP (Information Hyperlinked over Proteins) allows researchers to explore a network of gene and protein interactions based on published scientific literature. For each gene search, iHOP reports sentences from abstracts associating it with other genes, links out to full abstracts, and reports experimental evidence for the interactions, if available. You can also select sentences to create and visualize your own gene model.
Database of protein orthologs that interact (interologs) and proteins with conserved regulatory relationships across species (regulogs). Contains data for C. elegans, Drosophila, Arabidopsis, and Yeast.