Building a community-based open harmonised reference data repository for global crop mapping

Boogaard, H., Pratihast, A.K., Laso Bayas, J.. ORCID:, Karanam, S., Fritz, S. ORCID:, Van Tricht, K., Degerickx, J., & Gilliams, S. (2023). Building a community-based open harmonised reference data repository for global crop mapping. PLoS ONE 18 (7) e0287731. 10.1371/journal.pone.0287731.

[thumbnail of journal.pone.0287731.pdf]
journal.pone.0287731.pdf - Published Version
Available under License Creative Commons Attribution.

Download (1MB) | Preview


Reference data is key to produce reliable crop type and cropland maps. Although research projects, national and international programs as well as local initiatives constantly gather crop related reference data, finding, collecting, and harmonizing data from different sources is a challenging task. Furthermore, ethical, legal, and consent-related restrictions associated with data sharing represent a common dilemma faced by international research projects. We address these dilemmas by building a community-based, open, harmonised reference data repository at global extent, ready for model training or product validation. Our repository contains data from different sources such as the Group on Earth Observations Global Agricultural Monitoring Initiative (GEOGLAM) Joint Experiment for Crop Assessment and Monitoring (JECAM) sites, the Radiant MLHub, the Future Harvest (CGIAR) centers, the National Aeronautics and Space Administration Food Security and Agriculture Program (NASA Harvest), the International Institute for Applied Systems Analysis (IIASA) citizen science platforms (LACO-Wiki and Geo-Wiki), as well as from individual project contributions. Data of 2016 onwards were collected, harmonised, and annotated. The data sets spatial, temporal, and thematic quality were assessed applying rules developed in this research. Currently, the repository holds around 75 million harmonised observations with standardized metadata of which a large share is available to the public. The repository, funded by ESA through the WorldCereal project, can be used for either the calibration of image classification deep learning algorithms or the validation of Earth Observation generated products, such as global cropland extent and maize and wheat maps. We recommend continuing and institutionalizing this reference data initiative e.g. through GEOGLAM, and encouraging the community to publish land cover and crop type data following the open science and open data principles.

Item Type: Article
Research Programs: Advancing Systems Analysis (ASA)
Advancing Systems Analysis (ASA) > Novel Data Ecosystems for Sustainability (NODES)
Strategic Initiatives (SI)
Depositing User: Luke Kirwan
Date Deposited: 14 Jul 2023 11:35
Last Modified: 14 Jul 2023 11:35

Actions (login required)

View Item View Item