| Information extraction approaches to unconventional data sources for "injury surveillance system": the case of newspapers clippings. | |
| | |
MedLine Citation:
|
PMID: 20703703 Owner: NLM Status: In-Data-Review |
Abstract/OtherAbstract:
|
Injury Surveillance Systems based on traditional hospital records or clinical data have the advantage of being a well established, highly reliable source of information for making an active surveillance on specific injuries, like choking in children. However, they suffer the drawback of delays in making data available to the analysis, due to inefficiencies in data collection procedures. In this sense, the integration of clinical based registries with unconventional data sources like newspaper articles has the advantage of making the system more useful for early alerting. Usage of such sources is difficult since information is only available in the form of free natural-language documents rather than structured databases as required by traditional data mining techniques. Information Extraction (IE) addresses the problem of transforming a corpus of textual documents into a more structured database. In this paper, on a corpora of Italian newspapers articles related to choking in children due to ingestion/inhalation of foreign body we compared the performance of three IE algorithms- (a) a classical rule based system which requires a manual annotation of the rules; (ii) a rule based system which allows for the automatic building of rules; (b) a machine learning method based on Support Vector Machine. Although some useful indications are extracted from the newspaper clippings, this approach is at the time far from being routinely implemented for injury surveillance purposes. |
| | |
Authors:
|
Paola Berchialla; Cecilia Scarinzi; Silvia Snidero; Yousif Rahim; Dario Gregori |
Related Documents
:
|
16943573 - Cataloging the relationships between proteins: a review of interaction databases. 20567993 - Customization of medical report data. 16730633 - Improved methods for fitting sedimentation coefficient distributions derived by time-de... |
Publication Detail:
|
Type: Journal Article Date: 2010-04-27 |
Journal Detail:
|
Title: Journal of medical systems Volume: 36 ISSN: 0148-5598 ISO Abbreviation: J Med Syst Publication Date: 2012 Apr |
Date Detail:
|
Created Date: 2012-03-27 Completed Date: - Revised Date: - |
Medline Journal Info:
|
Nlm Unique ID: 7806056 Medline TA: J Med Syst Country: United States |
Other Details:
|
Languages: eng Pagination: 475-81 Citation Subset: IM |
Affiliation:
|
Department of Public Health and Microbiology, University of Torino, Torino, Italy. |
Export Citation:
|
APA/MLA Format Download EndNote Download BibTex |
| MeSH Terms | |
Descriptor/Qualifier:
|
|
From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine
Previous Document: A novel strategy for load balancing of distributed medical applications.
Next Document: AMI Screening Using Linguistic Fuzzy Rules.