Document Detail

Towards a framework for developing semantic relatedness reference standards.
MedLine Citation:
PMID:  21044697     Owner:  NLM     Status:  MEDLINE    
Our objective is to develop a framework for creating reference standards for functional testing of computerized measures of semantic relatedness. Currently, research on computerized approaches to semantic relatedness between biomedical concepts relies on reference standards created for specific purposes using a variety of methods for their analysis. In most cases, these reference standards are not publicly available and the published information provided in manuscripts that evaluate computerized semantic relatedness measurement approaches is not sufficient to reproduce the results. Our proposed framework is based on the experiences of medical informatics and computational linguistics communities and addresses practical and theoretical issues with creating reference standards for semantic relatedness. We demonstrate the use of the framework on a pilot set of 101 medical term pairs rated for semantic relatedness by 13 medical coding experts. While the reliability of this particular reference standard is in the "moderate" range; we show that using clustering and factor analyses offers a data-driven approach to finding systematic differences among raters and identifying groups of potential outliers. We test two ontology-based measures of relatedness and provide both the reference standard containing individual ratings and the R program used to analyze the ratings as open-source. Currently, these resources are intended to be used to reproduce and compare results of studies involving computerized measures of semantic relatedness. Our framework may be extended to the development of reference standards in other research areas in medical informatics including automatic classification, information retrieval from medical records and vocabulary/ontology development.
Serguei V S Pakhomov; Ted Pedersen; Bridget McInnes; Genevieve B Melton; Alexander Ruggieri; Christopher G Chute
Related Documents :
11933757 - Medical informatics: searching for underlying components.
7882677 - Natural language processing of medical texts within the helios environment.
17238387 - Health information text characteristics.
4563537 - The way of the innovator: notes toward a prehistory of medlars.
9357687 - Linking a clinical system to heterogeneous information resources.
8748427 - The emerging role of cytochrome p450 3a in psychopharmacology.
Publication Detail:
Type:  Journal Article; Research Support, N.I.H., Extramural     Date:  2010-10-31
Journal Detail:
Title:  Journal of biomedical informatics     Volume:  44     ISSN:  1532-0480     ISO Abbreviation:  J Biomed Inform     Publication Date:  2011 Apr 
Date Detail:
Created Date:  2011-03-23     Completed Date:  2011-07-22     Revised Date:  2013-07-03    
Medline Journal Info:
Nlm Unique ID:  100970413     Medline TA:  J Biomed Inform     Country:  United States    
Other Details:
Languages:  eng     Pagination:  251-65     Citation Subset:  IM    
Copyright Information:
Copyright © 2010 Elsevier Inc. All rights reserved.
College of Pharmacy, University of Minnesota, Twin Cities, Minneapolis, MN 55455, USA.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Clinical Coding
Databases, Factual
Medical Informatics / methods*
Medical Records Systems, Computerized / standards*
Reference Standards
Grant Support
R01 LM009623-01A2/LM/NLM NIH HHS; R01 LM009623-01A2/LM/NLM NIH HHS; T15 LM07041-19/LM/NLM NIH HHS

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Digestive enzymes of two freshwater fishes (Limia vittata and Gambusia punctata) with different diet...
Next Document:  Real-time lesion assessment using a novel combined ultrasound and radiofrequency ablation catheter.