Enabling Country-Scale Land Cover Mapping with
Meter-Resolution Satellite Imagery

Xin-Yi Tong, Gui-Song Xia, Xiao Xiang Zhu

The Five-Billion-Pixels dataset is released!

The coordinate information of Five-Billion-Pixels is now available!

The test data for land cover mapping is released!

The code and model for land cover mapping is now available!

Introduction

High-resolution satellite images can provide abundant, detailed spatial information for land cover classification, which is particularly important for studying the complicated built environment. However, due to the complex land cover patterns, the costly training sample collections, and the severe distribution shifts of satellite imageries caused by, e.g., geographical differences or acquisition conditions, few studies have applied high-resolution images to land cover mapping in detailed categories at large scale. To fill this gap, we present a large-scale land cover dataset, Five-Billion-Pixels. It contains more than 5 billion labeled pixels of 150 high-resolution Gaofen-2 (4 m) satellite images, annotated in a 24-category system covering artificial-constructed, agricultural, and natural classes. In addition, we propose a deep-learning-based unsupervised domain adaptation approach that can transfer classification models trained on labeled dataset (referred to as the source domain) to unlabeled data (referred to as the target domain) for large-scale land cover mapping. Specifically, we introduce an end-to-end Siamese network employing dynamic pseudo-label assignment and class balancing strategy to perform adaptive domain joint learning. To validate the generalizability of our dataset and the proposed approach across different sensors and different geographical regions, we carry out land cover mapping on five megacities in China and six cities in other five Asian countries severally using: PlanetScope (3 m), Gaofen-1 (8 m), and Sentinel-2 (10 m) satellite images. Over a total study area of 60,000 square kilometers, the experiments show promising results even though the input images are entirely unlabeled. The proposed approach, trained with the Five-Billion-Pixels dataset, enables high-quality and detailed land cover mapping across the whole country of China and some other Asian countries at meter-resolution.

Five-Billion-Pixels

We present a large-scale land cover classification dataset, Five-Billion-Pixels. It has a spatial resolution of 4 m, covers areas more than 50,000 square kilometers in China, and contains more than 5 billion labeled pixels. It possesses the advantage of rich categories, large coverage, wide distribution, and high-spatial resolution, which well reflects the distributions of real-world ground objects and can benefit to different land cover related studies.

Five-Billion-Pixels extends GID and GID-15. Miscellaneous or unclear areas that are extremely difficult to annotate are considered as unlabeled. The categories of labeled pixels are double-checked and ensured to be correct.

The Five-Billion-Pixels dataset is released under the open source license:

  • Link: Onedrive (attention: OneDrive has a maximum download limit of 20GB for .zip files, please consider using the OneDrive client to synchronize the data, or download the data in separate folders or batches)
  • Link: Baidudrive (only available in China region, extraction code: FBPS)
  • It can be easily modified to extract specific categories according to the purpose of the application, for example, by merging categories or setting unwanted categories as background.

    Please use the GDAL library to read 16-bit data and use the PIL library to read 8-bit data (which has been processed as normal images): image = Image.open(imgname).convert('CMYK').

    Land Cover Mapping

    We propose an unsupervised domain adaptation approach for practical large-scale land cover mapping. Concretely, we introduce an end-to-end Siamese network consisting of two branches, which separately process images from the source and target domain. In the target domain branch, image pixels with high-confidence are assigned with pseudo-labels and then utilized for domain joint learning with the source domain branch. To mitigate negative adaptation, we adopt dynamic pseudo-label assignment and class balancing strategy in the network. Based on the proposed approach, we carry out land cover mapping on five megacities in China and six cities in other five Asian countries severally using unlabeled PlanetScope (PS), Gaofen-1 (GF-1), and Sentinel-2 (ST-2) satellite images. The encouraging results show the generality of our dataset and approach across different satellites and regions.

    The code for Dynamic Pseudo-label Assignment is available here:

  • Link: DPA
  • Test images covering densely labeled areas for Chinese megacities can be downloaded from:

  • Link: Onedrive
  • Link: Baidudrive (extraction code: MGCT)
  • Result Demonstration

    Due to the large size of resulting maps, only the central area of each city is shown.



    Beijing

    Chengdu

    Guangzhou

    Shanghai

    Wuhan

    Comparison with Google's Dynamic World and ESA's World Cover in Beijing and Guangzhou (all use ST-2).


    (a-b) Land cover map of central area of Beijing from Dynamic World and World Cover. (c) Our result of Beijing. (d-e) Land cover map of central area of Guangzhou from Dynamic World and World Cover. (f) Our result of Guangzhou.


    Citation

    @article{FBP2023,
      title={Enabling country-scale land cover mapping with meter-resolution satellite imagery},
      author={Tong, Xin-Yi and Xia, Gui-Song and Zhu, Xiao Xiang},
      journal={ISPRS Journal of Photogrammetry and Remote Sensing},
      volume={196},
      pages={178-196},
      year={2023}
    }
    

    Contact

    E-mail: xinyi.tong@tum.de

    Personal page: Xin-Yi Tong