Artigo Acesso aberto Revisado por pares

Asymptotic optimality of generalized C, cross-validation, and generalized cross-validation in regression with heteroskedastic errors

1991; Elsevier BV; Volume: 47; Issue: 2-3 Linguagem: Inglês

10.1016/0304-4076(91)90107-o

ISSN

1872-6895

Autores

Donald W. K. Andrews,

Tópico(s)

Control Systems and Identification

Resumo

The problem considered here is that of using a data-driven procedure to select a good estimate from a class of linear estimates indexed by a discrete parameter. In contrast to other papers on this subject, we consider models with heteroskedastic errors. The results apply to model selection problems in linear regression and to nonparametric regression estimation via series estimators, nearest-neighbor estimators, and local regression estimators, among others. Generalized CL (GCL), cross-validation (CV), and generalized cross-validation (GCV) procedures are analyzed. The GCL and CV criteria are shown to be asymptotically optimal under general conditions. A feasible version of GCL, however, is available only in some applications. The GCV criterion is found to be asymptotically optimal only under a condition that is satisfied in some applications but not in others. For example, it is satisfied in the nearest-neighbor estimation context but not in the series estimation, local regression estimation, or model selection contexts. Thus, the CV criterion is the only feasible criterion of the three that is asymptotically optimal under general conditions. The proofs rely heavily on results of Li (1987).

Referência(s)
Altmetric
PlumX