Tropical Geography


Extraction Method of Island buildings in Remote Sensing Images Based on Improved DeepLabv3plus Algorithm

Lingxiao Wang(), Jing Jia   

  1. College of Engineering, Ocean University of China, Qingdao 266100, China
  • Received:2020-10-08 Revised:2020-11-06 Online:2021-08-03


As a typical land element, buildings play a dominant role in land use, urban planning, and disaster risk assessment, which is an important goal of remote-sensing-based surveying and mapping. Presently, with the rapid development of island economies, it is particularly important to extract island buildings accurately to avoid disorderly expansion. Machine learning is a common method for extracting terrain objects from high-resolution remote sensing images. However, the remote sensing features of buildings are complex: traditional machine learning methods suffer from poor robustness and have difficulty in fully mining deep-seated features. The overall accuracy and Mean Intersection over Union (mIoU) score of the deep learning architecture DeepLabv3plus is higher than those of other semantic segmentation models, making it suitable for remote sensing building extraction. In this study, the DeepLabv3plus segmentation architecture is adapted, based on the Xception network and hole convolution, to reduce the large number of pictures (batch size) and video memory required to train a deep learning model. Group Normalization (GN) is used to replace the original Batch Normalization (BN), to improve the feature extraction accuracy for a small batch size. Compared with the original DeepLabv3plus architecture, the DeepLabv3plus-G method can train models with a smaller batch size under limited Video Graphics Array (VGA) memory, and the segmentation accuracy is improved. DeepLabv3plus-G can effectively extract island buildings, which provides a new method for Chinese island construction supervision. As a result of limited island building data, this study designs an intelligent collection of pre-training sets and an annotation method based on multi-source data of domestic urban building datasets. Transfer learning strategy is also adopted to train the network model on the automatically annotated domestic urban building datasets. The final pre-training model was saved in the form of a network architecture and weights. The model size was 158 MB, the number of parameters was 22910480, and the depth was 65 layers. Remote sensing images of buildings from 29 temperate islands (including Chrysanthemum Island, etc.), 48 subtropical islands (including Chongming Island, etc.), and six tropical islands (including Haidian Island) in China from Google Earth, with a spatial resolution of 0.6 m, were employed. After automatic clipping, 891 images of island buildings with dimensions of 512 × 512 pixels were obtained. The open source labeling software Labelme was used for manual labeling before the algorithm experiments. Results showed that both the mIoU and mean Average Precision (mAP) of DeepLav3plus-Gare improved under the same batch size, compared to traditional methods. When the batch size was two, the mIoU and mAP improved by 8.95% and 10.45%, respectively, compared to the scores for the DeepLabv3plus architecture. Under the same network, the accuracy of DeepLabv3plus pixel segmentation improved with increasing patch size. The mIoU and mAP scores for a model with a batch size of eight were 10.51% and 10.26% higher than those of models with a batch size of two, respectively. However, the mAP of DeepLabv3plus-G pixel segmentation hardly changed with an increase in the batch size. In conclusion, when the batch size is small, the mAP and mIoU of the DeepLabv3plus building classification algorithm based on GN increase, and more accurate pixel-level island building extraction results are obtained.

Key words: DeepLabv3plus, automatic labeling of samples, island buildings, semantic segmentation, transfer learning

CLC Number: 

  • P237