| Extracting noun phrases for all of MEDLINE. | |
| | |
MedLine Citation:
|
PMID: 10566444 Owner: NLM Status: MEDLINE |
Abstract/OtherAbstract:
|
A natural language parser that could extract noun phrases for all medical texts would be of great utility in analyzing content for information retrieval. We discuss the extraction of noun phrases from MEDLINE, using a general parser not tuned specifically for any medical domain. The noun phrase extractor is made up of three modules: tokenization; part-of-speech tagging; noun phrase identification. Using our program, we extracted noun phrases from the entire MEDLINE collection, encompassing 9.3 million abstracts. Over 270 million noun phrases were generated, of which 45 million were unique. The quality of these phrases was evaluated by examining all phrases from a sample collection of abstracts. The precision and recall of the phrases from our general parser compared favorably with those from three other parsers we had previously evaluated. We are continuing to improve our parser and evaluate our claim that a generic parser can effectively extract all the different phrases across the entire medical literature. |
| | |
Authors:
|
N A Bennett; Q He; K Powell; B R Schatz |
Related Documents
:
|
7557514 - Medical conditions differentially affect the development of iadl disability: implicatio... 19650704 - Oleander intoxication in new world camelids: 12 cases (1995-2006). 21774274 - The legal role of medical professionals in decisions to withhold or withdraw life-susta... 3735384 - Evaluation report: dialysis and ancillary equipment. 20677244 - Complaints, grievances, and claims against physicians: does tort reform make a difference? 14516524 - Chronic daily headache. |
Publication Detail:
|
Type: Journal Article; Research Support, U.S. Gov't, Non-P.H.S. |
Journal Detail:
|
Title: Proceedings / AMIA ... Annual Symposium. AMIA Symposium Volume: - ISSN: 1531-605X ISO Abbreviation: Proc AMIA Symp Publication Date: 1999 |
Date Detail:
|
Created Date: 2000-02-01 Completed Date: 2000-02-01 Revised Date: 2009-11-18 |
Medline Journal Info:
|
Nlm Unique ID: 100883449 Medline TA: Proc AMIA Symp Country: UNITED STATES |
Other Details:
|
Languages: eng Pagination: 671-5 Citation Subset: IM |
Affiliation:
|
CANIS-Community Architectures for Network Information Systems, Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign 61820, USA. nabennet@canis.uiuc.edu |
Export Citation:
|
APA/MLA Format Download EndNote Download BibTex |
| MeSH Terms | |
Descriptor/Qualifier:
|
Linguistics MEDLINE* Natural Language Processing* |
| Comments/Corrections | |
From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine
Previous Document: Assurance: the power behind PCASSO security.
Next Document: Knowledge requirements for automated inference of medical textbook markup.