Document Detail


Automated non-alphanumeric symbol resolution in clinical texts.
MedLine Citation:
PMID:  22195157     Owner:  NLM     Status:  In-Data-Review    
Abstract/OtherAbstract:
Although clinical texts contain many symbols, relatively little attention has been given to symbol resolution by medical natural language processing (NLP) researchers. Interpreting the meaning of symbols may be viewed as a special case of Word Sense Disambiguation (WSD). One thousand instances of four common non-alphanumeric symbols ('+', '-', '/', and '#') were randomly extracted from a clinical document repository and annotated by experts. The symbols and their surrounding context, in addition to bag-of-Words (BoW), and heuristic rules were evaluated as features for the following classifiers: Naïve Bayes, Support Vector Machine, and Decision Tree, using 10-fold cross-validation. Accuracies for '+', '-', '/', and '#' were 80.11%, 80.22%, 90.44%, and 95.00% respectively, with Naïve Bayes. While symbol context contributed the most, BoW was also helpful for disambiguation of some symbols. Symbol disambiguation with supervised techniques can be implemented with reasonable accuracy as a module for medical NLP systems.
Authors:
Sungrim Moon; Serguei Pakhomov; James Ryan; Genevieve B Melton
Related Documents :
17608647 - Over-the-counter medication use for childhood fever: a cross-sectional study of austral...
14640927 - Aprepitant--a novel nk1-receptor antagonist.
2556517 - Nocturnal asthma: a study in general practice.
17458427 - Factors influencing patient decisions about the use of asthma controller medication.
18203877 - Unlocking the secrets of parkinson disease.
12199177 - Report of a rare case of trauma-induced thyroid storm.
Publication Detail:
Type:  Journal Article     Date:  2011-10-22
Journal Detail:
Title:  AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium     Volume:  2011     ISSN:  1942-597X     ISO Abbreviation:  AMIA Annu Symp Proc     Publication Date:  2011  
Date Detail:
Created Date:  2011-12-23     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  101209213     Medline TA:  AMIA Annu Symp Proc     Country:  United States    
Other Details:
Languages:  eng     Pagination:  979-86     Citation Subset:  IM    
Affiliation:
Institute for Health Informatics;
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Comparison of SNOMED CT versus Medcin Terminology Concept Coverage for Mild Traumatic Brain Injury.
Next Document:  Temporal Evolution of Biomedical Research Grant Collaborations across Multiple Scales - A CTSA Basel...