De-identifying an EHR Database - Anonymity, Correctness and Readability of the Medical Record.
MedLine Citation:
PMID: 21893869 Owner: NLM Status: In-Data-Review
Abstract/OtherAbstract:
Electronic health records (EHR) contain a large amount of structured data and free text. Exploring and sharing clinical data can improve healthcare and facilitate the development of medical software. However, revealing confidential information is against ethical principles and laws. We de-identified a Danish EHR database with 437,164 patients. The goal was to generate a version with real medical records, but related to artificial persons. We developed a de-identification algorithm that uses lists of named entities, simple language analysis, and special rules. Our algorithm consists of 3 steps: collect lists of identifiers from the database and external resources, define a replacement for each identifier, and replace identifiers in structured data and free text. Some patient records could not be safely de-identified, so the de-identified database has 323,122 patient records with an acceptable degree of anonymity, readability and correctness (F-measure of 95%). The algorithm has to be adjusted for each culture, language and database.
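The three steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record layout (`name`, `id_number`), the function names, and the list of artificial names are all hypothetical, and the language analysis and special rules mentioned in the abstract are not reproduced. It shows only the collect, define, replace flow using whole-word matching.

```python
import re

def collect_identifiers(patients, external_names=()):
    """Step 1: gather identifiers from structured fields and an external list.
    The 'name' and 'id_number' fields are assumed for illustration."""
    identifiers = set(external_names)
    for patient in patients:
        identifiers.update(patient["name"].split())
        identifiers.add(patient["id_number"])
    return identifiers

def define_replacements(identifiers, artificial_names):
    """Step 2: assign each identifier a fixed replacement.
    Numeric identifiers get a synthetic code; names get an artificial name."""
    mapping = {}
    for i, ident in enumerate(sorted(identifiers)):
        if ident.replace("-", "").isdigit():
            mapping[ident] = f"ID{i:06d}"
        else:
            mapping[ident] = artificial_names[i % len(artificial_names)]
    return mapping

def replace_identifiers(text, mapping):
    """Step 3: replace whole-word occurrences in one pass, so a replacement
    is never itself replaced again."""
    if not mapping:
        return text
    keys = sorted(mapping, key=len, reverse=True)  # prefer longer matches
    pattern = re.compile(
        r"(?<!\w)(" + "|".join(re.escape(k) for k in keys) + r")(?!\w)"
    )
    return pattern.sub(lambda m: mapping[m.group(1)], text)

# Hypothetical data, not taken from the paper.
patients = [{"name": "Anders Jensen", "id_number": "010203-1234"}]
mapping = define_replacements(
    collect_identifiers(patients), ["Alpha", "Bravo", "Charlie"]
)
print(replace_identifiers("Anders Jensen (010203-1234) was seen by Dr. Jensen.", mapping))
```

The same `replace_identifiers` call can be applied to structured fields and to free-text notes. Because the mapping is fixed per identifier, a name is replaced consistently wherever it occurs, which is one simple way to keep records readable. The cyclic name assignment here can map two real names to the same artificial one, which a real system would need to handle.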
Authors:
Kostas Pantazos; Soren Lauesen; Soren Lippert
Publication Detail:
Type: Journal Article
Journal Detail:
Title: Studies in health technology and informatics Volume: 169 ISSN: 0926-9630 ISO Abbreviation: Stud Health Technol Inform Publication Date: 2011
Date Detail:
Created Date: 2011-09-06 Completed Date: - Revised Date: -
Medline Journal Info:
Nlm Unique ID: 9214582 Medline TA: Stud Health Technol Inform Country: Netherlands
Other Details:
Languages: eng Pagination: 862-6 Citation Subset: T
Affiliation:
Software Development Group, IT-University of Copenhagen, Denmark.
From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine