This page contains various resources for comparative protein structure modelling and analysis from the Sali Lab at University of California at San Francisco (UCSF).
AntigenDB contains 500 antigens to pathogenic species curated from the literature and other immunological resources. In AntigenDB, a database entry contains information regarding the sequence, structure, origin, etc. of an antigen with additional information such as B and T-cell epitopes, MHC binding, function, gene-expression and post translational modifications, where available. AntigenDB also provides links to major internal and external databases.
The Aspergillus Genome Database (AspGD) is an online genomics resource for researchers studying the genetics and molecular biology of the Aspergilli. AspGD combines high-quality manual curation of the experimental scientific literature examining the genetics and molecular biology of Aspergilli, cutting-edge comparative genomics approaches to iteratively refine and improve structural gene annotations across multiple Aspergillus species, and web-based research tools for accessing and exploring the data.
BeetleBase (http://www.beetlebase.org) has been updated to provide more comprehensive genomic information for the red flour beetle Tribolium castaneum. The database contains genomic sequence scaffolds mapped to 10 linkage groups (genome assembly release Tcas_3.0), genetic linkage maps, the official gene set, Reference Sequences from NCBI (RefSeq), predicted gene models, ESTs and whole-genome tiling array data representing several developmental stages. The database was reconstructed using the upgraded Generic Model Organism Database (GMOD) modules. Genome information is visualized as tracks in GBrowse. The updated genetic map is visualized using the comparative genetic map viewer CMAP. To enhance the database search capabilities, the BLAST and BLAT search tools have been integrated with the GMOD tools.
The Berkeley Phylogenomics Group provides a series of
web servers for phylogenomic analysis: classification of sequences to pre-computed families and subfamilies using the PhyloFacts Phylogenomic Encyclopedia, FlowerPower clustering of proteins sharing the same domain architecture, MUSCLE multiple sequence alignment, SATCHMO simultaneous alignment and tree construction, and SCI-PHY subfamily identification.
BIND SOAP is a web service providing end users with the ability to access functionality offered by the BIND Search Service through a remote Application Programming Interface (API).
The Candida Genome Database provides online access to genomic sequence data and manually curated functional information about genes and proteins of the human pathogen Candida albicans. Two tools have been added to the DB. The Biochemical Pathways tool provides visualization of metabolic pathways and analysis tools that facilitate interpretation of experimental data, including results of large-scale experiments, in the context of Candida metabolism. Textpresso for Candida allows searching through the full-text of Candida-specific literature, including clinical and epidemiological studies.
Cascade PSI-BLAST detects distant protein similarities using a cascade search protocol where PSI-BLAST searches are carried out on each hit, until no new hits are found in the selected database (SwissProt, SCOP, or Pfam).
Computed Atlas of Surface Topography of proteins (CASTp) locates and measures concave surface regions on 3D protein structures. This tool can be used to study surface features, binding sites, and functional regions of proteins.
CATH is a manually curated classification of protein domain structures. Each protein has been chopped into structural domains and assigned into homologous superfamilies (groups of domains that are related by evolution). This classification procedure uses a combination of automated and manual techniques which include computational algorithms, empirical and statistical evidence, literature review and expert analysis.
CD-Search allows the user to search the Conserved Domain Database (CDD) for conserved structural and functional domains in a sequence of interest; links to 3-D structures where applicable.