The ArrayExpress Archive is a public repository of functional genomics data supporting publications. It includes data generated by sequencing or array-based technologies. Data are submitted by users and imported directly from the NCBI Gene Expression Omnibus. The ArrayExpress Archive is closely integrated with the Gene Expression Atlas and the sequence databases at the European Bioinformatics Institute. Advanced queries provided via ontology enabled interfaces include queries based on technology and sample attributes such as disease, cell types and anatomy.
AUGUSTUS is a eukaryotic gene prediction tool. It can integrate evidence, e.g. from RNA-Seq, ESTs, proteomics, but can also predict genes ab initio. The PPX extension to AUGUSTUS can take a protein sequence multiple sequence alignment as input to find new members of the family in a genome. It can be run through a web interface, or downloaded and run locally.
Bgee is a dataBase for Gene Expression Evolution. It allows one to automatically compare gene expression patterns between species, by referencing expression data on anatomical ontologies, and designing homology relationships between them.
CancerResource, a database that integrates cancer-relevant relationships of compounds and targets from (i) literature mining and (ii) external resources complemented with (iii) essential experimental and supporting information on genes and cellular effects. CancerResource addresses the spectrum of research on compound-target interactions in natural sciences as well as in individualized medicine.
The Center for Information Biology gene EXpression database (CIBEX) is a public repository for gene expression experimental data. The database system is compliant with the MIAME standard.
ConsensusPathDB is a meta-database that integrates physical protein interactions, metabolic and signaling reactions and gene regulatory interactions in a seamless functional association network that simultaneously describes multiple functional aspects of genes, proteins, complexes, metabolites, etc. ConsensusPathDB offers different ways of utilizing these integrated interaction data, with tools for visualization, analysis and interpretation of high-throughput expression data in the light of functional interactions and biological pathways.
COXPRESdb (coexpressed gene database) represents the coexpression relationship for human and mouse. Upgrades include a new comparable coexpression measure, Mutual Rank, five other animal species, rat, chicken, zebrafish, fly and nematoda, and addition of different layers of omics data into the integrated network of genes.
DroID, the Drosophila Interactions Database, is a comprehensive public resource for Drosophila gene and protein interactions. DroID contains genetic interactions and experimentally detected protein-protein interactions curated from the literature and from external databases, and predicted protein interactions based on experiments in other species. Protein interactions are annotated with experimental details and periodically updated confidence scores. The database now includes transcription factor-gene and regulatory RNA-gene interactions. Orthologous gene mappings of Drosophila genes to other organisms are also available to facilitate finding interactions based on gene names and identifiers for a number of common model organisms and humans.
DyNAVacS is a tool for designing DNA vaccines that includes steps for chosing a suitable expression vector, ensuring optimal expression by codon optimization, engineering CpG motifs for enhancing immune responses, and providing additional sequence signals for efficient translation. It also allows includes restriction enzyme mapping and design of primers spanning user specified sequences.
EMAGE is a freely available online database of in situ gene expression patterns in the developing mouse embryo. Gene expression domains from raw images are extracted and integrated spatially into a set of standard 3D virtual mouse embryos at different stages of development, which allows data interrogation by spatial methods. An anatomy ontology is also used to describe sites of expression, which allows data to be queried using text-based methods. Data coverage has been increased by sourcing from a greater selection of journals.
Ensembl Genomes is a new portal offering integrated access to genome-scale data from non-vertebrate species of scientific interest, developed using the Ensembl genome annotation and visualisation platform. Ensembl Genomes consists of five sub-portals (for bacteria, protists, fungi, plants and invertebrate metazoa) designed to complement the availability of vertebrate genomes in Ensembl. Data types incorporated include annotation of (protein and non-protein coding) genes, cross references to external resources, and high throughput experimental data (e.g. data from large scale studies of gene expression and polymorphism visualised in their genomic context). Additionally, extensive comparative analysis has been performed, both within defined clades and across the wider taxonomy, and sequence alignments and gene trees resulting from this can be accessed through the site.
FlyBase is a database of genetic and molecular data for Drosophila. FlyBase includes data on all species from the family Drosophilidae; the primary species represented is Drosophila melanogaster.
The Gene Expression Database (GXD) is a community resource of mouse developmental expression information. GXD integrates different types of expression data at the transcript and protein level and captures expression information from many different mouse strains and mutants. GXD places these data in the larger biological context through integration with other Mouse Genome Informatics (MGI) resources and interconnections with many other databases. Web-based query forms support simple or complex searches. All data are annotated and reviewed by GXD curators.
Integrated system for data analysis with information about expression and gene networks, also contains transcriptional regulatory regions database (TRRD).