Artigo Revisado por pares

Validation of Regression Models: Methods and Examples

1977; Taylor & Francis; Volume: 19; Issue: 4 Linguagem: Inglês

10.1080/00401706.1977.10489581

ISSN

1537-2723

Autores

Ronald D. Snee,

Tópico(s)

Statistical Methods and Applications

Resumo

Methods to determine the validity of regression models include comparison of model predictions and coefficients with theory, collection of new data to check model predictions. comparison of results with theoretical model calculations, and data splitting or cross-validation in which a portion of the data is used to estimate the model coefficients, and the remainder of the data is used to measure the prediction accuracy of the model. An expository review of these methods is presented. It is concluded that data splitting is an effective method of model validation when it is not practical to collect new data to test the model. The DUPLEX algorithm, developed by R. W. Kennard, is recommended for dividing the data into the estimation set and prediction set when there is no obvious variable such as time to use as a basis to split the data. Several examples are included to illustrate the various methods of model validation.

Referência(s)
Altmetric
PlumX