Modelling count data distribution: the example of freshwater fish

L. Vaudor; Nicolas Lamouroux; J.M. Olivier

Conference paper, Year: 2009

Modélisation de la distribution de données de comptage: l'exemple des poissons d'eau douce

L. Vaudor

Role: Author
PersonId : 299
IdHAL : lise-vaudor
ORCID : 0000-0003-1844-6425
IdRef : 170544664

Biologie des écosystèmes aquatiques

Nicolas Lamouroux

Role: Author
PersonId : 736215
IdHAL : nicolas-lamouroux
ORCID : 0000-0002-9184-2558
IdRef : 127902422

Biologie des écosystèmes aquatiques

J.M. Olivier

Role: Author

Université Claude Bernard Lyon 1

Abstract

To study abundance and its environmental determinants, one generally needs to characterize the shape of its statistical distribution, for instance with a distribution model. Having an a priori idea of the data distribution is particularly helpful when inference is based on a limited amount of data. We seek a distribution model that can be applied generally to freshwater fish abundance data. This model should reflect the main properties of the data: discreteness, overdispersion, and a high proportion of zeros. We also study the influence of various factors (e.g. species, site, mean abundance, sample size) on the performance of distribution models, in order to discuss how environment and behaviour might influence the distribution of fish. Moreover, we illustrate a few consequences of using an appropriate model for abundance count data, rather than a normal approximation, by examining confidence intervals around the estimate of mean abundance. We study the distribution of 12 freshwater fish species of the Rhône basin, using a large dataset of repeated samples (each consisting of 20 to 180 counts) collected by electrofishing between 1985 and 2007. We fit four models to each of our 2258 samples: a zero-inflated Poisson, a negative binomial, a zero-inflated negative binomial, and a two-part Pareto distribution. For each sample, we fit these four models by maximum likelihood and select one according to the Bayesian information criterion (BIC). We carry out logistic ANOVAs to assess the influence of factors such as species, site, and mean abundance on which of the four models is selected. We calculate confidence intervals around mean abundance using either our best-performing distribution model or the normal approximation, and compare the results of the two methods. Overall, the negative binomial is the most frequently selected distribution model (46% of samples). However, goodness of fit depends strongly on sample features such as mean abundance: in particular, the zero-inflated Poisson model is selected in 56% of the samples whose mean abundance is lower than 0.6 individuals per point. Negative binomial-based confidence intervals are generally wider than those based on a Gaussian distributional assumption (92% of samples), particularly when there are very few non-zero counts. They are also asymmetric and much longer on their right side, reflecting that the true mean abundance might be considerably higher than the observed mean, given the shape of the distribution.
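
The per-sample procedure summarized in the abstract (maximum-likelihood fitting followed by BIC selection) can be illustrated with a minimal Python sketch. This is not the authors' code: it covers only two of the four candidate models (zero-inflated Poisson and negative binomial), all function names are hypothetical, and the example counts are invented for illustration. It assumes the sample contains at least one count greater than 1.

```python
import math

def nb_loglik(counts, mu, r):
    """Negative binomial log-likelihood, parameterised by mean mu and size r."""
    log_p, log_q = math.log(r / (r + mu)), math.log(mu / (r + mu))
    return sum(
        math.lgamma(x + r) - math.lgamma(r) - math.lgamma(x + 1)
        + r * log_p + x * log_q
        for x in counts
    )

def fit_nb(counts, lo=-10.0, hi=10.0, iters=100):
    """ML fit: mu is the sample mean; log(r) is found by golden-section search."""
    mu = sum(counts) / len(counts)
    f = lambda t: nb_loglik(counts, mu, math.exp(t))
    g = (math.sqrt(5) - 1) / 2
    a, b = lo, hi
    for _ in range(iters):
        c, d = b - g * (b - a), a + g * (b - a)
        if f(c) > f(d):
            b = d  # maximum lies in [a, d]
        else:
            a = c  # maximum lies in [c, b]
    r = math.exp((a + b) / 2)
    return {"mu": mu, "r": r}, nb_loglik(counts, mu, r)

def zip_loglik(counts, pi, lam):
    """Zero-inflated Poisson log-likelihood (pi = extra-zero probability)."""
    n0 = sum(1 for x in counts if x == 0)
    ll = n0 * math.log(pi + (1 - pi) * math.exp(-lam))
    for x in counts:
        if x > 0:
            ll += math.log(1 - pi) - lam + x * math.log(lam) - math.lgamma(x + 1)
    return ll

def fit_zip(counts, iters=200):
    """ML fit: solve lam / (1 - exp(-lam)) = mean of positive counts by bisection."""
    positives = [x for x in counts if x > 0]
    mean = sum(counts) / len(counts)
    m_pos = sum(positives) / len(positives)
    if m_pos <= 1:
        raise ValueError("need at least one count above 1")
    a, b = 1e-9, m_pos  # the root lies strictly between 0 and m_pos
    for _ in range(iters):
        lam = (a + b) / 2
        if lam / -math.expm1(-lam) < m_pos:
            a = lam
        else:
            b = lam
    lam = (a + b) / 2
    pi = 1 - mean / lam
    if pi < 0:  # fewer zeros than a plain Poisson: fall back to pi = 0
        pi, lam = 0.0, mean
    return {"pi": pi, "lam": lam}, zip_loglik(counts, pi, lam)

def bic(loglik, k, n):
    """Bayesian information criterion for k parameters and n observations."""
    return k * math.log(n) - 2 * loglik

def select_model(counts):
    """Return the name of the lowest-BIC model and the BIC of each candidate."""
    n = len(counts)
    fits = {"zero-inflated Poisson": fit_zip(counts),
            "negative binomial": fit_nb(counts)}
    scores = {name: bic(ll, 2, n) for name, (_, ll) in fits.items()}
    return min(scores, key=scores.get), scores

# Hypothetical sample of point counts (not data from the study).
counts = [0] * 40 + [1] * 4 + [2] * 2 + [3, 10, 30, 60]
best, scores = select_model(counts)
print(best, scores)
```

Both models here have two parameters, so the BIC penalty is identical and the comparison reduces to the log-likelihoods; the penalty would matter once a three-parameter model such as the zero-inflated negative binomial is added to the candidates.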

Domains

Environmental sciences

https://hal.inrae.fr/hal-02592444

Submitted on: Friday, 15 May 2020, 16:14:26

Last modified on: Friday, 31 May 2024, 10:16:44

Dates and versions

hal-02592444 , version 1 (15-05-2020)

Identifiers

HAL Id : hal-02592444 , version 1
IRSTEA : PUB00027423

Cite

L. Vaudor, Nicolas Lamouroux, J.M. Olivier. Modelling count data distribution: the example of freshwater fish. 94th ESA Annual Meeting, Aug 2009, Albuquerque, United States. pp.18. ⟨hal-02592444⟩

Collections

UNIV-LYON1 IRSTEA UDL INRAE ECOFLOWS
