Document Detail

The PSI semantic validator: a framework to check MIAPE compliance of proteomics data.
MedLine Citation:
PMID:  19834897     Owner:  NLM     Status:  MEDLINE    
The Human Proteome Organization's Proteomics Standards Initiative (PSI) promotes the development of exchange standards to improve data integration and interoperability. PSI specifies the suitable level of detail required when reporting a proteomics experiment (via the Minimum Information About a Proteomics Experiment), and provides extensible markup language (XML) exchange formats and dedicated controlled vocabularies (CVs) that must be combined to generate a standard compliant document. The framework presented here tackles the issue of checking that experimental data reported using a specific format, CVs and public bio-ontologies (e.g. Gene Ontology, NCBI taxonomy) are compliant with the Minimum Information About a Proteomics Experiment recommendations. The semantic validator not only checks the XML syntax but it also enforces rules regarding the use of an ontology class or CV terms by checking that the terms exist in the resource and that they are used in the correct location of a document. Moreover, this framework is extremely fast, even on sizable data files, and flexible, as it can be adapted to any standard by customizing the parameters it requires: an XML Schema Definition, one or more CVs or ontologies, and a mapping file describing in a formal way how the semantic resources and the format are interrelated. As such, the validator provides a general solution to the common problem in data exchange: how to validate the correct usage of a data standard beyond simple XML Schema Definition validation. The framework source code and its various applications can be found at
Luisa Montecchi-Palazzi; Samuel Kerrien; Florian Reisinger; Bruno Aranda; Andrew R Jones; Lennart Martens; Henning Hermjakob
Related Documents :
19834897 - The psi semantic validator: a framework to check miape compliance of proteomics data.
10563047 - Optimizing the balance between false positive and false negative error probabilities of...
20853417 - Fast and furious: effects of body size on strike performance in an arboreal viper trime...
18237837 - Novel technologies for the discovery and quantitation of biomarkers of toxicity.
23647867 - Estimating postmortem interval using rna degradation and morphological changes in tooth...
9390237 - An object-oriented data-driven migration model.
Publication Detail:
Type:  Journal Article; Research Support, Non-U.S. Gov't    
Journal Detail:
Title:  Proteomics     Volume:  9     ISSN:  1615-9861     ISO Abbreviation:  Proteomics     Publication Date:  2009 Nov 
Date Detail:
Created Date:  2009-11-25     Completed Date:  2010-02-22     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  101092707     Medline TA:  Proteomics     Country:  Germany    
Other Details:
Languages:  eng     Pagination:  5112-9     Citation Subset:  IM    
European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Computational Biology / methods*
Proteomics / standards*
Reproducibility of Results
Grant Support
WT085949MA//Wellcome Trust

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Native electrophoretic techniques to identify protein-protein interactions.
Next Document:  Comparative proteomics of a lycopene-accumulating mutant reveals the important role of oxidative str...