Sequential estimate for linear regression models with uncertain number of effective variables |
| |
Authors: | Zhanfeng Wang Yuan-chin Ivan Chang |
| |
Affiliation: | 1. Department of Statistics and Finance, University of Science and Technology of China, Hefei, 230026, China 2. Institute of Statistical Science, Academia Sinica, Taipei, 115, Taiwan
|
| |
Abstract: | As a result of novel data collection technologies, it is now common to encounter data in which the number of explanatory variables collected is large, while the number of variables that actually contribute to the model remains small. Thus, a method that can identify those variables with impact on the model without inferring other noneffective ones will make analysis much more efficient. Many methods are proposed to resolve the model selection problems under such circumstances, however, it is still unknown how large a sample size is sufficient to identify those “effective” variables. In this paper, we apply sequential sampling method so that the effective variables can be identified efficiently, and the sampling is stopped as soon as the “effective” variables are identified and their corresponding regression coefficients are estimated with satisfactory accuracy, which is new to sequential estimation. Both fixed and adaptive designs are considered. The asymptotic properties of estimates of the number of effective variables and their coefficients are established, and the proposed sequential estimation procedure is shown to be asymptotically optimal. Simulation studies are conducted to illustrate the performance of the proposed estimation method, and a diabetes data set is used as an example. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|