UV-SAM: Adapting Segment Anything Model for Urban Village Identification

Authors

  • Xin Zhang, Shenzhen International Graduate School, Tsinghua University, Shenzhen, China
  • Yu Liu, Department of Electronic Engineering, Tsinghua University, Beijing, China
  • Yuming Lin, Department of Electronic Engineering, Tsinghua University, Beijing, China
  • Qingmin Liao, Shenzhen International Graduate School, Tsinghua University, Shenzhen, China
  • Yong Li, Department of Electronic Engineering, Tsinghua University, Beijing, China

DOI:

https://doi.org/10.1609/aaai.v38i20.30260

Keywords:

General

Abstract

Urban villages, defined as informal residential areas in or around urban centers, are characterized by inadequate infrastructure and poor living conditions, and are closely related to the Sustainable Development Goals (SDGs) on poverty, adequate housing, and sustainable cities. Traditionally, governments have depended heavily on field surveys to monitor urban villages, but such surveys are time-consuming and labor-intensive, and their results may be delayed. Thanks to widely available and frequently updated satellite images, recent studies have developed computer vision techniques to detect urban villages efficiently. However, existing studies either focus on simple urban village image classification or fail to provide accurate boundary information. To accurately identify urban village boundaries from satellite images, we harness the power of vision foundation models and adapt the Segment Anything Model (SAM) to urban village segmentation; we name the resulting method UV-SAM. Specifically, UV-SAM first uses a small semantic segmentation model to produce mixed prompts for urban villages, including mask, bounding box, and image representations, which are then fed into SAM for fine-grained boundary identification. Extensive experimental results on two datasets in China demonstrate that UV-SAM outperforms existing baselines, and identification results over multiple years show that both the number and area of urban villages are decreasing over time. These findings provide deeper insights into the development trends of urban villages and shed light on the use of vision foundation models for sustainable cities. The dataset and code for this study are available at https://github.com/tsinghua-fib-lab/UV-SAM.
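
The prompt-generation step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it only shows, under simplifying assumptions, how a coarse binary mask from a segmentation model might be turned into a bounding-box prompt and bundled with the mask itself. The names `mask_to_box` and `build_prompts` and the dictionary layout are hypothetical, and the image-representation prompt and the SAM call are omitted.

```python
from typing import List, Optional, Tuple

# Rows of 0/1 values; a stand-in for a coarse segmentation mask (assumption).
Mask = List[List[int]]
# (x_min, y_min, x_max, y_max) in inclusive pixel coordinates.
Box = Tuple[int, int, int, int]


def mask_to_box(mask: Mask) -> Optional[Box]:
    """Return the tight bounding box around all foreground pixels, or None if empty."""
    xs = [x for row in mask for x, v in enumerate(row) if v]
    ys = [y for y, row in enumerate(mask) if any(row)]
    if not xs:
        return None
    return (min(xs), min(ys), max(xs), max(ys))


def build_prompts(mask: Mask) -> dict:
    """Bundle the coarse mask and its box as prompts (hypothetical structure)."""
    return {"mask": mask, "box": mask_to_box(mask)}


coarse_mask = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
prompts = build_prompts(coarse_mask)
print(prompts["box"])  # (1, 1, 3, 2)
```

In a full pipeline, prompts of this kind would be passed to a promptable segmenter such as SAM to refine the boundary; the released repository is the reference for how UV-SAM actually does this.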

Published

2024-03-24

How to Cite

Zhang, X., Liu, Y., Lin, Y., Liao, Q., & Li, Y. (2024). UV-SAM: Adapting Segment Anything Model for Urban Village Identification. Proceedings of the AAAI Conference on Artificial Intelligence, 38(20), 22520-22528. https://doi.org/10.1609/aaai.v38i20.30260