A Crowdsourced Global Data Set for Validating Built-up Surface Layers V.2

See, L. ORCID: https://orcid.org/0000-0002-2665-7065, Georgieva, I. ORCID: https://orcid.org/0000-0002-5556-794X, Dürauer, M., Kemper, T., Corbane, C., Maffenini, L., Gallego, J., Pesaresi, M., et al. (2021). A Crowdsourced Global Data Set for Validating Built-up Surface Layers V.2. 10.22022/asa/09-2021.128.

[thumbnail of DataforLukeV2.zip] Archive
DataforLukeV2.zip - Published Version
Available under License Creative Commons Attribution.

Download (445MB)

Abstract

This collection contains data that were collected during a crowdsourcing campaign using Geo-Wiki (https://www.geo-wiki.org/). The campaign involved visual interpretation of a sample that is designed for validating any existing global built-up surface product. A zipped shapefile (ValidationGrids.zip) contains the random stratified sample of 50K locations, which consist of 80x80m grids further sub-divided into 10m cells so there are 64 cells per grid. These locations were provided to the crowd, who used very high-resolution satellite images to label the grids as built-up (i.e., containing a building), non-built-up or unsure. The file (Geo-WikiBuilt-upCentroidsAll.csv) contains the data collected in the campaign summarized by the centroid (or central point of each 80m grid location). It also contains fields for quality control, one that indicates if the change information matches the control points (see below) or the majority answer from the crowd, and another that indicates whether the presence/absence of built-up matches the control points (see below) or the majority answer from the crowd. The data collected for all 64 cells per grid can be found in Geo-WikiBuilt-upCellsAll.csv. The Geo-Wiki campaign uses visually interpreted grid locations called control points as part of the scoring mechanism of Geo-Wiki for quality control. These control points are provided by centroid (Geo-WikiBuilt-upCentroidsControls.csv) and for all cells in the 80m grid (Geo-WikiBuilt-upCellsControls.csv). In addition to the raw data, two additional quality-controlled files have been produced. The first file (Geo-WikiBuilt-upCentroidsChangeQualityControlled.csv) provides a single record for each location on change in built-up (if built-up is present) that lists either the control point answer or the majority answer from the crowd. The second file (Geo-WikiBuilt-upCellsQualityControlled.csv) contains a single record for each of the 64 cells in each grid, listing either the control point answer or the majority answer from the crowd. Finally, the file Strata.csv contains the mapping between the grid location and the sampling stratum used in the design of the sample.

Item Type: Data
Research Programs: Advancing Systems Analysis (ASA)
Advancing Systems Analysis (ASA) > Novel Data Ecosystems for Sustainability (NODES)
Related URLs:
Depositing User: Luke Kirwan
Date Deposited: 09 Nov 2021 11:20
Last Modified: 27 Mar 2024 05:00
URI: https://pure.iiasa.ac.at/17534

Actions (login required)

View Item View Item