Artigo Acesso aberto Revisado por pares

QUASI-POISSON VS. NEGATIVE BINOMIAL REGRESSION: HOW SHOULD WE MODEL OVERDISPERSED COUNT DATA?

2007; Wiley; Volume: 88; Issue: 11 Linguagem: Inglês

10.1890/07-0043.1

ISSN

1939-9170

Autores

Jay M. Ver Hoef, Peter L. Boveng,

Tópico(s)

Avian ecology and behavior

Resumo

EcologyVolume 88, Issue 11 p. 2766-2772 Statistical Report QUASI-POISSON VS. NEGATIVE BINOMIAL REGRESSION: HOW SHOULD WE MODEL OVERDISPERSED COUNT DATA? Jay M. Ver Hoef, Corresponding Author Jay M. Ver Hoef [email protected] E-mail: [email protected]Search for more papers by this authorPeter L. Boveng, Peter L. Boveng National Marine Mammal Laboratory, Alaska Fisheries Science Center, National Marine Fisheries Service, 7600 Sand Point Way NE, Building 4, Seattle, Washington 98115-6349 USASearch for more papers by this author Jay M. Ver Hoef, Corresponding Author Jay M. Ver Hoef [email protected] E-mail: [email protected]Search for more papers by this authorPeter L. Boveng, Peter L. Boveng National Marine Mammal Laboratory, Alaska Fisheries Science Center, National Marine Fisheries Service, 7600 Sand Point Way NE, Building 4, Seattle, Washington 98115-6349 USASearch for more papers by this author First published: 01 November 2007 https://doi.org/10.1890/07-0043.1Citations: 687 Corresponding Editor: N. G. Yoccoz. Read the full textAboutPDF ToolsRequest permissionExport citationAdd to favoritesTrack citation ShareShare Give accessShare full text accessShare full-text accessPlease review our Terms and Conditions of Use and check box below to share full-text version of article.I have read and accept the Wiley Online Library Terms and Conditions of UseShareable LinkUse the link below to share a full-text version of this article with your friends and colleagues. Learn more.Copy URL Share a linkShare onEmailFacebookTwitterLinkedInRedditWechat Abstract Quasi-Poisson and negative binomial regression models have equal numbers of parameters, and either could be used for overdispersed count data. While they often give similar results, there can be striking differences in estimating the effects of covariates. We explain when and why such differences occur. The variance of a quasi-Poisson model is a linear function of the mean while the variance of a negative binomial model is a quadratic function of the mean. These variance relationships affect the weights in the iteratively weighted least-squares algorithm of fitting models to data. Because the variance is a function of the mean, large and small counts get weighted differently in quasi-Poisson and negative binomial regression. We provide an example using harbor seal counts from aerial surveys. These counts are affected by date, time of day, and time relative to low tide. We present results on a data set that showed a dramatic difference on estimating abundance of harbor seals when using quasi-Poisson vs. negative binomial regression. This difference is described and explained in light of the different weighting used in each regression method. A general understanding of weighting can help ecologists choose between these two methods. Supporting Information Filename Description https://dx.doi.org/10.6084/m9.figshare.c.3300023 Research data pertaining to this article is located at figshare.com: Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article. Literature Cited Akaike, H. 1973. Information theory and the extension of the maximum likelihood principle. Pages 267–281 in B. N. Petrov, and F. Czaki . Proceedings of the International Symposium on Information Theory. Akademia Kiadoo, Budapest, Hungary. Alexander, N., R. Moyeed, and J. Stander . 2000. Spatial modeling of individual-level parasite counts using the negative binomial distribution. Biostatistics 1: 453–463. Boveng, P. L., J. L. Bengtson, D. E. Withrow, J. C. Cesarone, M. A. Simpkins, K. J. Frost, and J. J. Burns . 2003. The abundance of harbor seals in the Gulf of Alaska. Marine Mammal Science 19: 111–127. Burnham, K. P., and D. R. Anderson . 2002. Model selection and multimodel inference: a practical information theoretic approach. Second edition. Springer-Verlag, Berlin, Germany. Consul, P. C. 1989. Generalized Poisson distribution: properties and applications. Marcel Dekker, New York, New York, USA. Eberhardt, L. L. 1978. Appraising variability in population studies. Journal of Wildlife Management 42: 207–238. Efron, B. 1986. Double exponential families and their use in generalized linear regression. Journal of the American Statistical Association 81: 709–721. Frost, K. J., L. F. Lowry, and J. M. Ver Hoef . 1999. Monitoring the trend of harbor seals in Prince William Sound, Alaska, after the Exxon Valdez oil spill. Marine Mammal Science 15: 494–506. Gardner, W., E. P. Mulvey, and E. C. Shaw . 1995. Regression analyses of counts and rates: Poisson, overdispersed Poisson, and negative binomial models. Psychological Bulletin 118: 392–404. Gelfand, A. E., and S. R. Dalal . 1990. A note on overdispersed exponential families. Biometrika 77: 55–64. Joe, H., and R. Zhu . 2005. Generalized Poisson distribution: the property of mixture of Poisson and comparison with negative binomial distribution. Biometrical Journal 47: 219–229. Lambert, D. 1992. Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics 34: 1–14. Lee, Y., and J. A. Nelder . 2000. The relationship between double-exponential families and extended quasi-likelihood families, with application to modeling Geissler's human sex ratio data. Applied Statistics 49: 413–419. Lord, D., S. Washington, and J. Ivan . 2005. Poisson, Poisson-gamma and zero-inflated regression models of motor vehicle crashes: balancing statistical fit and theory. Accident Analysis and Prevention 37: 35–46. Mathews, E. A., and G. W. Pendleton . 2006. Declines in harbor seal (Phoca vitulina) numbers in Glacier Bay National Park, Alaska, 1992–2002. Marine Mammal Science 22: 167–189. McCullagh, P., and J. A. Nelder . 1989. Generalized linear models. Second edition. Chapman and Hall, New York, New York, USA. O'Hara, R. B. 2005. Species richness estimators: How many species can dance on the head of a pin? Journal of Animal Ecology 74: 375–386. Potts, J., and J. Elith . 2006. Comparing species abundance models. Ecological Modelling 199: 153–163. Raftery, A. E. 1995. Bayesian model selection in social research (with discussion by Andrew Gelman, Donald B. Rubin, and Robert M. Hauser). Pages 111–195 in P. V. Marsden Sociological methodology 1995. Blackwell, Cambridge, Massachusetts, USA. Rissanen, J. 1978. Modeling by the shortest data description. Automatica 14: 465–471. SAS Institute. 2003. SAS version 9.1. SAS Institute, Cary, North Carolina, USA. Schwarz, G. 1978. Estimating the dimension of a model. Annals of Statistics 6: 461–464. Sileshi, G. 2006. Selecting the right statistical model for analysis of insect count data by using information theoretic measures. Bulletin of Entomological Research 96: 479–488. Small, R. J., G. W. Pendleton, and K. W. Pitcher . 2003. Trends in abundance of Alaska harbor seals, 1983-2002. Marine Mammal Science 19: 344–362. Terceiro, M. 2003. The statistical properties of recreational catch rate data for some fish stocks off the northeast US coast. Fishery Bulletin 101: 653–672. Vehtari, A., and J. Lampinen . 2003. Expected utility estimation via cross-validation. Pages 701–710 in J. M. Bernardo, M. J. Bayarri, J. O. Berger, A. P. Dawid, D. Heckerman, A. F. M. Smith, and M. West . Bayesian statistics 7. Oxford University Press, Oxford, UK. Ver Hoef, J. M., and K. J. Frost . 2003. A Bayesian hierarchical model for monitoring harbor seal changes in Prince William Sound, Alaska. Environmental and Ecological Statistics 10: 201–219. Wedderburn, R. W. M. 1974. Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method. Biometrika 61: 439–447. White, G. C., and R. E. Bennetts . 1996. Analysis of frequency count data using the negative binomial distribution. Ecology 77: 2549–2557. Citing Literature Volume88, Issue11November 2007Pages 2766-2772 ReferencesRelatedInformation

Referência(s)