Document Detail

Comparison of the NCI open database with seven large chemical structural databases.
MedLine Citation:
PMID:  11410049     Owner:  NLM     Status:  MEDLINE    
Eight large chemical databases have been analyzed and compared to each other. Central to this comparison is the open National Cancer Institute (NCI) database, consisting of approximately 250 000 structures. The other databases analyzed are the Available Chemicals Directory ("ACD," from MDL, release 1.99, 3D-version); the ChemACX ("ACX," from CamSoft, Version 4.5); the Maybridge Catalog and the Asinex database (both as distributed by CamSoft as part of ChemInfo 4.5); the Sigma-Aldrich Catalog (CD-ROM, 1999 Version); the World Drug Index ("WDI," Derwent, version 1999.03); and the organic part of the Cambridge Crystallographic Database ("CSD," from Cambridge Crystallographic Data Center, 1999 Version 5.18). The database properties analyzed are internal duplication rates; compounds unique to each database; cumulative occurrence of compounds in an increasing number of databases; overlap of identical compounds between two databases; similarity overlap; diversity; and others. The crystallographic database CSD and the WDI show somewhat less overlap with the other databases than those with each other. In particular the collections of commercial compounds and compilations of vendor catalogs have a substantial degree of overlap among each other. Still, no database is completely a subset of any other, and each appears to have its own niche and thus "raison d'être". The NCI database has by far the highest number of compounds that are unique to it. Approximately 200 000 of the NCI structures were not found in any of the other analyzed databases.
J H Voigt; B Bienfait; S Wang; M C Nicklaus
Related Documents :
1113079 - The effect of cytochalasin b on hyphal morphogenesis in polyporus biennis.
16521839 - Photo-identification, site fidelity, and movement of female gray seals (halichoerus gry...
17689639 - Flat square knots: are 3 throws enough?
24453779 - Efficacy and safety of alemtuzumab in multiple sclerosis and impact on nursing role.
7918119 - Role of 21-deoxyaldosterone in human hypertension.
8558089 - Exchange of oxidized cholesteryl linoleate between ldl and hdl mediated by cholesteryl ...
16723939 - What makes a good nurse? views of patients and carers.
18806359 - Multiple outbreaks of gastroenteritis due to a single strain of genotype gii/4 noroviru...
8278279 - The use and meaning of touch in caring for patients with cancer.
Publication Detail:
Type:  Comparative Study; Journal Article    
Journal Detail:
Title:  Journal of chemical information and computer sciences     Volume:  41     ISSN:  0095-2338     ISO Abbreviation:  J Chem Inf Comput Sci     Publication Date:    2001 May-Jun
Date Detail:
Created Date:  2001-06-18     Completed Date:  2001-07-26     Revised Date:  2007-11-15    
Medline Journal Info:
Nlm Unique ID:  7505012     Medline TA:  J Chem Inf Comput Sci     Country:  United States    
Other Details:
Languages:  eng     Pagination:  702-12     Citation Subset:  IM    
Laboratory of Medicinal Chemistry, Center for Cancer Research, National Cancer Institute, National Institutes of Health, NCI at Frederick, 376 Boyles Street, Frederick, Maryland 21702, USA.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Cluster Analysis
Databases, Factual*
National Institutes of Health (U.S.)
Quantitative Structure-Activity Relationship
Terminology as Topic
United States

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Quantitative structure-property relationships (QSPRs) for the estimation of vapor pressure: a hierar...
Next Document:  Neural network based chemical structure indexing.