Document Detail


Automatic extraction of relations between medical concepts in clinical texts.
MedLine Citation:
PMID:  21846787     Owner:  NLM     Status:  In-Data-Review    
Abstract/OtherAbstract:
Objective A supervised machine learning approach to discover relations between medical problems, treatments, and tests mentioned in electronic medical records. Materials and methods A single support vector machine classifier was used to identify relations between concepts and to assign their semantic type. Several resources such as Wikipedia, WordNet, General Inquirer, and a relation similarity metric inform the classifier. Results The techniques reported in this paper were evaluated in the 2010 i2b2 Challenge and obtained the highest F1 score for the relation extraction task. When gold standard data for concepts and assertions were available, F1 was 73.7, precision was 72.0, and recall was 75.3. F1 is defined as 2*Precision*Recall/(Precision+Recall). Alternatively, when concepts and assertions were discovered automatically, F1 was 48.4, precision was 57.6, and recall was 41.7. Discussion Although a rich set of features was developed for the classifiers presented in this paper, little knowledge mining was performed from medical ontologies such as those found in UMLS. Future studies should incorporate features extracted from such knowledge sources, which we expect to further improve the results. Moreover, each relation discovery was treated independently. Joint classification of relations may further improve the quality of results. Also, joint learning of the discovery of concepts, assertions, and relations may also improve the results of automatic relation extraction. Conclusion Lexical and contextual features proved to be very important in relation extraction from medical texts. When they are not available to the classifier, the F1 score decreases by 3.7%. In addition, features based on similarity contribute to a decrease of 1.1% when they are not available.
Authors:
Bryan Rink; Sanda Harabagiu; Kirk Roberts
Related Documents :
15136677 - Measuring progression of cerebral white matter lesions on mri: visual rating and volume...
15471227 - Space flight effects on bacterial physiology.
845097 - Comparing treatment tactics with a hyperactive preschool child: stimulant medication an...
2891427 - Changes in the hydrophobic characteristics of clostridium perfringens spores and spore ...
10121197 - Licensing of low-power medical devices in the 450-470 mhz band--fcc. final rule.
15287147 - Inconceivable? deducting the costs of fertility treatment.
Publication Detail:
Type:  Journal Article    
Journal Detail:
Title:  Journal of the American Medical Informatics Association : JAMIA     Volume:  18     ISSN:  1527-974X     ISO Abbreviation:  J Am Med Inform Assoc     Publication Date:  2011 Sep 
Date Detail:
Created Date:  2011-08-17     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  9430800     Medline TA:  J Am Med Inform Assoc     Country:  United States    
Other Details:
Languages:  eng     Pagination:  594-600     Citation Subset:  IM    
Affiliation:
Human Language Technology Research Institute, University of Texas at Dallas, Richardson, Texas, USA.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Natural language processing: an introduction.
Next Document:  Health information exchange usage in emergency departments and clinics: the who, what, and why.