Document Detail

Statistical inference for exploratory data analysis and model diagnostics.
MedLine Citation:
PMID:  19805449     Owner:  NLM     Status:  MEDLINE    
We propose to furnish visual statistical methods with an inferential framework and protocol, modelled on confirmatory statistical testing. In this framework, plots take on the role of test statistics, and human cognition the role of statistical tests. Statistical significance of 'discoveries' is measured by having the human viewer compare the plot of the real dataset with collections of plots of simulated datasets. A simple but rigorous protocol that provides inferential validity is modelled after the 'lineup' popular from criminal legal procedures. Another protocol modelled after the 'Rorschach' inkblot test, well known from (pop-)psychology, will help analysts acclimatize to random variability before being exposed to the plot of the real data. The proposed protocols will be useful for exploratory data analysis, with reference datasets simulated by using a null assumption that structure is absent. The framework is also useful for model diagnostics in which case reference datasets are simulated from the model in question. This latter point follows up on previous proposals. Adopting the protocols will mean an adjustment in working procedures for data analysts, adding more rigour, and teachers might find that incorporating these protocols into the curriculum improves their students' statistical thinking.
Andreas Buja; Dianne Cook; Heike Hofmann; Michael Lawrence; Eun-Kyung Lee; Deborah F Swayne; Hadley Wickham
Related Documents :
15894379 - Applying resampling methods to neurophysiological data.
10466149 - Statistical limitations in functional neuroimaging. i. non-inferential methods and stat...
266339 - Statistical analysis of white blood cell data directed to the detection of malignancy i...
11004419 - Choice of effect measure for epidemiological data.
8530719 - An introduction to logistic regression with an application to the analysis of language ...
16558489 - Technical note: the initial stages of statistical data analysis.
24432199 - Extension of an iterative hybrid ordinal logistic regression/item response theory appro...
1787649 - Sodium modeling in hemodiafiltration.
17896599 - Detection and visualization of surface-pockets to enable phenotyping studies.
Publication Detail:
Type:  Journal Article; Research Support, U.S. Gov't, Non-P.H.S.    
Journal Detail:
Title:  Philosophical transactions. Series A, Mathematical, physical, and engineering sciences     Volume:  367     ISSN:  1364-503X     ISO Abbreviation:  Philos Trans A Math Phys Eng Sci     Publication Date:  2009 Nov 
Date Detail:
Created Date:  2009-10-06     Completed Date:  2010-01-29     Revised Date:  2013-04-24    
Medline Journal Info:
Nlm Unique ID:  101133385     Medline TA:  Philos Trans A Math Phys Eng Sci     Country:  England    
Other Details:
Languages:  eng     Pagination:  4361-83     Citation Subset:  IM    
Wharton School, University of Pennsylvania, Philadelphia, PA 19104, USA.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Data Interpretation, Statistical*
Housing / statistics & numerical data
Models, Theoretical*

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Cherry-picking for complex data: robust structure discovery.
Next Document:  Sufficient dimension reduction and prediction in regression.