| Comparing general and medical texts for information retrieval based on natural language processing: an inquiry into lexical disambiguation. | |
| | |
MedLine Citation:
|
PMID: 11604745 Owner: NLM Status: MEDLINE |
Abstract/OtherAbstract:
|
In this paper we compare two types of corpus, focusing on the lexical ambiguity of each of them. The first corpus consists mainly of general newspaper articles and literature excerpts, while the second belongs to the medical domain. To conduct the study, we have used two different disambiguation tools. First, each tool was validated in its respective application area. We then use these systems in order to assess and compare both the general ambiguity rate and the particularities of each domain. Quantitative results show that medical documents are lexically less ambiguous than unrestricted documents. Our conclusions emphasize the importance of the application area in the design of NLP tools. |
| | |
Authors:
|
P Ruch; R Baud; A Geissbühler; A M Rassinoux |
Related Documents
:
|
20380975 - Risk factors and medical management of vasospasm after subarachnoid hemorrhage. 8130505 - Recognizing new medical knowledge computationally. 10566325 - Modeling the umls using an oodb. 17822875 - Rapid development of auricular prosthesis using cad and rapid prototyping technologies. 11042425 - Ictal increased writing preceded by dysphasic seizures. 19366885 - Scheduled medications and falls in dementia patients utilizing a wander garden. |
Publication Detail:
|
Type: Comparative Study; Journal Article; Research Support, Non-U.S. Gov't |
Journal Detail:
|
Title: Studies in health technology and informatics Volume: 84 ISSN: 0926-9630 ISO Abbreviation: Stud Health Technol Inform Publication Date: 2001 |
Date Detail:
|
Created Date: 2001-10-17 Completed Date: 2002-01-08 Revised Date: 2008-07-10 |
Medline Journal Info:
|
Nlm Unique ID: 9214582 Medline TA: Stud Health Technol Inform Country: Netherlands |
Other Details:
|
Languages: eng Pagination: 261-5 Citation Subset: IM |
Affiliation:
|
Medical Informatics Division, University Hospital of Geneva,1211 Geneva, Switzerland. ruch@dim.hcuge.ch |
Export Citation:
|
APA/MLA Format Download EndNote Download BibTex |
| MeSH Terms | |
Descriptor/Qualifier:
|
Information Storage and Retrieval* Linguistics* Medical Records Systems, Computerized* Natural Language Processing* Newspapers Vocabulary, Controlled |
From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine
Previous Document: Development of a template model to represent the information content of chest radiology reports.
Next Document: Indexing medical WWW documents by morphemes.