Document Detail


GOClonto: an ontological clustering approach for conceptualizing PubMed abstracts.
MedLine Citation:
PMID:  19635585     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
Concurrent with progress in biomedical sciences, an overwhelming of textual knowledge is accumulating in the biomedical literature. PubMed is the most comprehensive database collecting and managing biomedical literature. To help researchers easily understand collections of PubMed abstracts, numerous clustering methods have been proposed to group similar abstracts based on their shared features. However, most of these methods do not explore the semantic relationships among groupings of documents, which could help better illuminate the groupings of PubMed abstracts. To address this issue, we proposed an ontological clustering method called GOClonto for conceptualizing PubMed abstracts. GOClonto uses latent semantic analysis (LSA) and gene ontology (GO) to identify key gene-related concepts and their relationships as well as allocate PubMed abstracts based on these key gene-related concepts. Based on two PubMed abstract collections, the experimental results show that GOClonto is able to identify key gene-related concepts and outperforms the STC (suffix tree clustering) algorithm, the Lingo algorithm, the Fuzzy Ants algorithm, and the clustering based TRS (tolerance rough set) algorithm. Moreover, the two ontologies generated by GOClonto show significant informative conceptual structures.
Authors:
Hai-Tao Zheng; Charles Borchert; Hong-Gee Kim
Related Documents :
19689215 - Computation of median gene clusters.
14729645 - Molecular determinants of the cytotoxicity of platinum compounds: the contribution of i...
20591905 - Integration of pathway knowledge into a reweighted recursive feature elimination approa...
20619355 - Independent component analysis: mining microarray data for fundamental human gene expre...
16051435 - Increase of ampa receptor glutamate receptor 1 subunit and b-cell receptor-associated p...
8961255 - Osmotic and non-osmotic regulation of arginine vasopressin (avp) release, mrna, and pro...
Publication Detail:
Type:  Journal Article; Research Support, Non-U.S. Gov't     Date:  2009-07-25
Journal Detail:
Title:  Journal of biomedical informatics     Volume:  43     ISSN:  1532-0480     ISO Abbreviation:  J Biomed Inform     Publication Date:  2010 Feb 
Date Detail:
Created Date:  2010-02-01     Completed Date:  2010-04-02     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  100970413     Medline TA:  J Biomed Inform     Country:  United States    
Other Details:
Languages:  eng     Pagination:  31-40     Citation Subset:  IM    
Affiliation:
Biomedical Knowledge Engineering Laboratory, BK21 College of Dentistry, Seoul National University, 28 Yeongeon-dong, Jongro-gu, Seoul 110-749, Republic of Korea.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:
Algorithms
Animals
Artificial Intelligence
Cluster Analysis*
Cytoplasm / metabolism
Database Management Systems
Humans
Information Storage and Retrieval
Male
Models, Biological
Natural Language Processing
Pattern Recognition, Automated / methods*
PubMed*
Software
Vocabulary, Controlled

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Apolipoproteins A-I, A-II and E are independently distributed among intracellular and newly secreted...
Next Document:  Resorcylidene aminoguanidine induces antithrombotic action that is not dependent on its antiglycatio...