| Lancet: a high precision medication event extraction system for clinical text. | |
| | |
MedLine Citation:
|
PMID: 20819865 Owner: NLM Status: MEDLINE |
Abstract/OtherAbstract:
|
OBJECTIVE: This paper presents Lancet, a supervised machine-learning system that automatically extracts medication events consisting of medication names and information pertaining to their prescribed use (dosage, mode, frequency, duration and reason) from lists or narrative text in medical discharge summaries. DESIGN: Lancet incorporates three supervised machine-learning models: a conditional random fields model for tagging individual medication names and associated fields, an AdaBoost model with decision stump algorithm for determining which medication names and fields belong to a single medication event, and a support vector machines disambiguation model for identifying the context style (narrative or list). MEASUREMENTS: The authors, from the University of Wisconsin-Milwaukee, participated in the third i2b2 shared-task for challenges in natural language processing for clinical data: medication extraction challenge. With the performance metrics provided by the i2b2 challenge, the micro F1 (precision/recall) scores are reported for both the horizontal and vertical level. RESULTS: Among the top 10 teams, Lancet achieved the highest precision at 90.4% with an overall F1 score of 76.4% (horizontal system level with exact match), a gain of 11.2% and 12%, respectively, compared with the rule-based baseline system jMerki. By combining the two systems, the hybrid system further increased the F1 score by 3.4% from 76.4% to 79.0%. CONCLUSIONS: Supervised machine-learning systems with minimal external knowledge resources can achieve a high precision with a competitive overall F1 score.Lancet based on this learning framework does not rely on expensive manually curated rules. The system is available online at http://code.google.com/p/lancet/. |
| | |
Authors:
|
Zuofeng Li; Feifan Liu; Lamont Antieau; Yonggang Cao; Hong Yu |
Related Documents
:
|
19536005 - Medical homes: challenges in translating theory into practice. 1231485 - First experiences with a documentation system via display terminals. 9242005 - Supporting multi-level medical education with knowledge-based systems. 10151655 - Schedule for rating disabilities; hemic and lymphatic systems--va. final rule. 6131085 - Twenty-five years of progress in the medical treatment of pediatric and congenital hear... 19536005 - Medical homes: challenges in translating theory into practice. |
Publication Detail:
|
Type: Journal Article; Research Support, N.I.H., Extramural |
Journal Detail:
|
Title: Journal of the American Medical Informatics Association : JAMIA Volume: 17 ISSN: 1527-974X ISO Abbreviation: J Am Med Inform Assoc Publication Date: 2010 Sep-Oct |
Date Detail:
|
Created Date: 2010-09-07 Completed Date: 2010-11-15 Revised Date: 2011-09-13 |
Medline Journal Info:
|
Nlm Unique ID: 9430800 Medline TA: J Am Med Inform Assoc Country: United States |
Other Details:
|
Languages: eng Pagination: 563-7 Citation Subset: IM |
Affiliation:
|
College of Health Sciences, University of Wisconsin-Milwaukee, Wisconsin, USA. |
Export Citation:
|
APA/MLA Format Download EndNote Download BibTex |
| MeSH Terms | |
Descriptor/Qualifier:
|
Artificial Intelligence* Electronic Health Records* Humans Information Storage and Retrieval / methods* Natural Language Processing* |
| Grant Support | |
ID/Acronym/Agency:
|
5R01LM009836/LM/NLM NIH HHS; 5R21RR024933/RR/NCRR NIH HHS; 5U54DA021519/DA/NIDA NIH HHS; U54LM008748/LM/NLM NIH HHS |
| Comments/Corrections | |
From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine
Previous Document: Textractor: a hybrid system for medications and reason for their prescription extraction from clinic...
Next Document: Leveraging informatics for genetic studies: use of the electronic medical record to enable a genome-...