BioDrugScreen is a resource for ranking molecules docked against a large number of targets in the human proteome. Nearly 1600 molecules from the freely available NCI diversity set were docked onto 1926 cavities identified on 1589 human targets resulting in >3 million receptor-ligand complexes. The targets in BioDrugScreen originated from Human Cancer Protein Interaction Network, which we have updated, as well as the Human Druggable Proteome, which we have created for the purpose of this effort. This makes the BioDrugScreen resource highly valuable in drug discovery.
Collection of Anti-Microbial Peptides (CAMP) is a free online database that is manually curated and currently holds 3782 antimicrobial sequences. These sequences are divided into experimentally validated (patents and non-patents: 2766) and predicted (1016) datasets based on their reference literature. Information like source organism, activity (MIC values), reference literature, target and non-target organisms of AMPs are captured in the database. Prediction and sequence analysis tools, including BLAST, are integrated in the database.
CancerResource, a database that integrates cancer-relevant relationships of compounds and targets from (i) literature mining and (ii) external resources complemented with (iii) essential experimental and supporting information on genes and cellular effects. CancerResource addresses the spectrum of research on compound-target interactions in natural sciences as well as in individualized medicine.
This content is being maintained by robertpreissner.
Chemical Entities of Biological Interest (ChEBI) is a freely available dictionary of molecular entities focused on 'small' chemical compounds. The molecular entities in question are either natural products or synthetic products used to intervene in the processes of living organisms. In addition to molecular entities, ChEBI contains groups (parts of molecular entities) and classes of entities.
ChemProt, a disease chemical biology database, which is based on a compilation of multiple chemical-protein annotation resources, as well as disease-associated protein-protein interactions (PPIs). The PPI network layer allows for studying disease and tissue specificity through each protein complex. ChemProt can assist in the in silico evaluation of environmental chemicals, natural products and approved drugs, as well as the selection of new compounds based on their activity profile against most known biological targets, including those related to adverse drug events.
DrugBank is a richly annotated database of drug and drug target information. It contains extensive data on the nomenclature, ontology, chemistry, structure, function, action, pharmacology, pharmacokinetics, metabolism and pharmaceutical properties of both small molecule and large molecule (biotech) drugs. It also contains comprehensive information on the target diseases, proteins, genes and organisms on which these drugs act. New data field additions include illustrated drug-action pathways, drug transporter data, drug metabolite data, pharmacogenomic data, adverse drug response data, ADMET data, pharmacokinetic data, computed property data and chemical classification data. DrugBank 3.0 also offers expanded database links, improved search tools for drug-drug and food-drug interaction, new resources for querying and viewing drug pathways and hundreds of new drug entries with detailed patent, pricing and manufacturer data.
EDULISS (EDinburgh University Ligand Selection System) database stores structural, physicochemical and pharmacophoric properties of small molecules. For each compound a single 3D conformer is stored along with over 1600 calculated descriptor values (molecular properties). A very efficient method for unique compound recognition is demonstrated by making use of small subgroups of the descriptors.
EMDataBank.org is a resource for deposition and retrieval of cryoEM maps, models and associated metadata. The resource unifies public access to the two major archives containing EM-based structural data: EM Data Bank (EMDB) and Protein Data Bank (PDB), and facilitates use of EM structural data of macromolecules and macromolecular complexes.
FragmentStore, a resource for the comparison of fragments found in metabolites, drugs or toxic compounds. Starting from 13,000 metabolites, 16,000 drugs and 2200 toxic compounds we generated 35,000 different building blocks (fragments), which are not only relevant to their biosynthesis and degradation but also provide important information regarding side-effects and toxicity. The FragmentStore provides a variety of search options such as 2D structure, molecular weight, rotatable bonds, etc. Various analysis tools have been implemented including the calculation of amino acid preferences of fragments' binding sites, classification of fragments based on the enzyme classification class of the enzyme(s) they bind to and small molecule library generation via a fragment-assembler tool.
This content is being maintained by robertpreissner.
GlycomeDB integrates the structural and taxonomic data of all major public carbohydrate databases, as well as carbohydrates contained in the Protein Data Bank. The database is a unified resource for carbohydrate structures. GlycomeDB retains the links to the original databases and is updated at weekly intervals with the newest structures available from the source databases.
Gypsy Database of Mobile Genetic Elements (GyDB)is a database of viruses and transposable elements based on their phylogenetic classification (per lineage and protein domain). GyDB now includes long terminal repeats (LTR) retroelements and relatives, Ty3/Gypsy, Retroviridae, Ty1/Copia and Bel/Pao LTR retroelements and the Caulimoviridae pararetroviruses of plants. New features include: (i) a variety of descriptions and reviews; (ii) protein-based phylogenies, where phylogenetic levels are assigned to distinct classified elements; (iii) a collection of multiple alignments, lineage-specific hidden Markov models and consensus sequences, called GyDB collection; (iv) updated RefSeq databases and BLAST and HMM servers to facilitate sequence characterization of new LTR retroelement and caulimovirus queries; and (v) a bibliographic server.
A comprehensive and fully curated database for Herb Ingredients' Targets (HIT). Those herbal ingredients with protein target information were carefully curated. The molecular target information involves those proteins being directly/indirectly activated/inhibited, protein binders and enzymes whose substrates or products are those compounds. Those up/down regulated genes are also included under the treatment of individual ingredients. In addition, the experimental condition, observed bioactivity and various references are provided as well for user's reference. The database can be queried via keyword search or similarity search. Crosslinks have been made to TTD, DrugBank, KEGG, PDB, Uniprot, Pfam, NCBI, TCM-ID and other databases.
PROMISCUOUS is a database of drugs, including withdrawn or experimental drugs, annotated with drug-protein and protein-protein relationships compiled from public resources via text and data mining including manual curation. Measures of structural similarity for drugs as well as known side-effects can be easily connected to protein-protein interactions to establish and analyse networks responsible for multi-pharmacology.
Protegen is a web-based database and analysis system that curates, stores and analyzes protective antigens. Protegen includes basic antigen information and experimental evidence curated from peer-reviewed articles. It also includes detailed gene/protein information (e.g. DNA and protein sequences, and COG classification). Different antigen features, such as protein weight and pI, and subcellular localizations of bacterial proteins are precomputed.
This content is being maintained by yongqunh.