The Coverage Properties of Confidence Regions After Model Selection |
| |
Authors: | Paul Kabaila |
| |
Affiliation: | Department of Mathematics and Statistics, La Trobe University, Victoria 3083, Australia E-mail: |
| |
Abstract: | It is very common in applied frequentist (classical) statistics to carry out a preliminary statistical (i.e. data-based) model selection by, for example, using preliminary hypothesis tests or minimizing AIC. This is usually followed by the inference of interest, using the same data, based on the assumption that the selected model had been given to us a priori . This assumption is false and it can lead to an inaccurate and misleading inference. We consider the important case that the inference of interest is a confidence region. We review the literature that shows that the resulting confidence regions typically have very poor coverage properties. We also briefly review the closely related literature that describes the coverage properties of prediction intervals after preliminary statistical model selection. A possible motivation for preliminary statistical model selection is a wish to utilize uncertain prior information in the inference of interest. We review the literature in which the aim is to utilize uncertain prior information directly in the construction of confidence regions, without requiring the intermediate step of a preliminary statistical model selection. We also point out this aim as a future direction for research. |
| |
Keywords: | Confidence interval prediction interval preliminary model selection uncertain prior information |
|
|