Novel methods to correct for observer and sampling bias in presence-only species distribution models

Aim: Species distribution models (SDMs) are standard tools for predicting species distributions, but they can suffer from observer and sampling biases. This is particularly true of presence-only SDMs, which often rely on species observations from non-standardized sampling efforts. A common remedy is to sample background points with a target-group strategy, although more robust strategies and refinements could be implemented. Here, we used a dataset of plant species from the European Alps to propose and demonstrate efficient ways to correct for observer and sampling bias in presence-only models.

Innovation: Recent methods correct for observer bias by including covariates related to accessibility in model calibration (classic bias covariate correction, Classic-BCC). However, depending on how species are sampled, accessibility covariates may not sufficiently capture observer bias. Here, we introduced BCCs that are more directly related to sampling effort, as well as a novel corrective method based on stratified resampling of the observational dataset before model calibration (environmental bias correction, EBC). We compared the effects of EBC and different BCC strategies, individually and jointly, when modelling the distributions of 1,900 plant species. We evaluated model performance with spatial block split-sampling and independent test data, and assessed the accuracy of plant diversity predictions across the European Alps.
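
To make the idea of resampling before calibration concrete, here is a minimal Python sketch. It is not the R function provided with the dataset, and it rests on assumptions that are not stated in the abstract: each observation is a tuple `(x, y, env1, env2)`, strata are equal-width bins over two environmental covariates, and each stratum is capped at `per_stratum` records drawn without replacement. The names `stratified_resample`, `n_bins` and `per_stratum` are hypothetical, and the published EBC may define strata and sample sizes differently.

```python
import random
from collections import defaultdict


def stratified_resample(observations, n_bins=5, per_stratum=2, seed=0):
    """Subsample (x, y, env1, env2) records evenly across environmental strata.

    Assumption: strata are equal-width bins over two environmental
    covariates; this is an illustration, not the published EBC procedure.
    """
    rng = random.Random(seed)
    lo1, hi1 = min(o[2] for o in observations), max(o[2] for o in observations)
    lo2, hi2 = min(o[3] for o in observations), max(o[3] for o in observations)

    def bin_index(value, lo, hi):
        if hi == lo:
            return 0
        # Clamp the maximum value into the last bin.
        return min(int((value - lo) / (hi - lo) * n_bins), n_bins - 1)

    # Group observations by their position in environmental space.
    strata = defaultdict(list)
    for obs in observations:
        key = (bin_index(obs[2], lo1, hi1), bin_index(obs[3], lo2, hi2))
        strata[key].append(obs)

    # Keep at most `per_stratum` records from each stratum.
    resampled = []
    for key in sorted(strata):
        members = strata[key]
        resampled.extend(rng.sample(members, min(per_stratum, len(members))))
    return resampled
```

With such a cap, a heavily sampled environmental condition (for example, valley floors near roads) contributes no more records to calibration than a rarely visited one, which is the intuition behind correcting bias in environmental rather than geographic space.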

Main conclusions: Implementing EBC together with BCC gave the best results for every evaluation method. In particular, adding the observation density of a target group as a bias covariate (Target-BCC) produced the most realistic modelled species distributions, with a clear positive correlation (r≃0.5) between predicted and expert-based species richness. EBC must be implemented carefully and in a species-specific manner, but this limitation may be addressed via the automated diagnostics included in a provided R function. Implementing EBC and bias covariate correction together may allow future studies to address observer bias in presence-only models efficiently, and to overcome the standard need for an independent test dataset for model evaluation.
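
As an illustration of what a target-group bias covariate could look like, the sketch below counts target-group observations per square grid cell and looks that count up for any set of points. It is a simplified Python example under stated assumptions, not the authors' implementation: coordinates are planar `(x, y)` pairs, the raw count per cell stands in for "observation density", and the function names and the `cell_size` parameter are hypothetical.

```python
from collections import Counter


def target_group_density(target_points, cell_size):
    """Count target-group observations in each square grid cell."""
    counts = Counter()
    for x, y in target_points:
        # Floor division assigns each point to the cell that contains it.
        counts[(int(x // cell_size), int(y // cell_size))] += 1
    return counts


def bias_covariate(points, density, cell_size):
    """Look up the target-group count of the cell containing each point."""
    return [
        density.get((int(x // cell_size), int(y // cell_size)), 0)
        for x, y in points
    ]
```

The resulting values would be supplied to the model as one more covariate alongside the environmental predictors, so that variation in sampling effort is absorbed by that term rather than by the environmental response.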

Additional Info

Version: 1.0
Authors:
    Yohann Chauvier (WSL), https://orcid.org/0000-0001-9399-3192. Data credit: software, curation, collection, validation, publication
    Niklaus Zimmermann (WSL), https://orcid.org/0000-0003-3099-9604. Data credit: supervision, validation, publication
    Giovanni Poggiato (LECA). Data credit: software, validation, publication
    Daria Bystrova (LECA). Data credit: validation, publication, software
    Philipp Brun (WSL), https://orcid.org/0000-0002-2750-9793. Data credit: validation, publication
    Wilfried Thuiller (LECA), https://orcid.org/0000-0002-5388-5274. Data credit: supervision, software, validation, publication
Maintainer: Yohann Chauvier (WSL), https://orcid.org/0000-0001-9399-3192
Shared (this field will be removed in the future): Open
harvest_object_id: da90f7bf-9e92-46e9-9336-f70b3c13bae3
harvest_source_id: 8fc5dcf9-738c-468f-985c-d55347a92f88
harvest_source_title: EnviDat