Skip to main navigation Skip to search Skip to main content

PolyFootNet: Extracting Polygonal Building Footprints in Off-Nadir Remote Sensing Images

  • Kai Li
  • , Yupeng Deng
  • , Jingbo Chen
  • , Yu Meng*
  • , Zhihao Xi
  • , Junxian Ma
  • , Chenhao Wang
  • , Maolin Wang
  • , Xiangyu Zhao
  • *Corresponding author for this work
  • CAS - Aerospace Information Research Institute
  • University of Chinese Academy of Sciences
  • City University of Hong Kong

Research output: Contribution to journalArticlepeer-review

Abstract

Extracting polygonal building footprints from off-nadir imagery is crucial for diverse applications. Current deep-learning-based extraction approaches predominantly rely on semantic segmentation paradigms and postprocessing algorithms, limiting their boundary precision and applicability. However, existing polygonal extraction methodologies are inherently designed for near-nadir imagery and fail under the geometric complexities introduced by off-nadir viewing angles. To address these challenges, this article introduces the polygonal footprint network (PolyFootNet), a novel deep-learning framework that directly outputs polygonal building footprints without requiring external postprocessing steps. The PolyFootNet employs a high-quality mask prompter to generate precise roof masks, which guide polygonal vertex extraction in a unified model pipeline. A key contribution of the PolyFootNet is introducing the self-offset attention (SOFA) mechanism, grounded in Nadaraya–Watson regression, to effectively mitigate the accuracy discrepancy observed between low-rise and high-rise buildings. This approach allows low-rise building predictions to leverage angular corrections learned from high-rise building offsets, significantly enhancing overall extraction accuracy. Additionally, motivated by the inherent ambiguity of building footprint extraction (BFE) tasks, we systematically investigate alternative extraction paradigms and demonstrate that a combined approach of building masks and offsets achieves superior polygonal footprint results. Extensive experiments validate PolyFootNet’s effectiveness, illustrating its promising potential as a robust, generalizable, and precise polygonal BFE method from challenging off-nadir imagery.

Original languageEnglish
Article number5635016
JournalIEEE Transactions on Geoscience and Remote Sensing
Volume63
DOIs
StatePublished - 2025
Externally publishedYes

Keywords

  • Building detection
  • building footprint extraction (BFE)
  • Nadaraya–Watson regression
  • oblique monocular images
  • off-nadir aerial image
  • segment anything model (SAM)

Fingerprint

Dive into the research topics of 'PolyFootNet: Extracting Polygonal Building Footprints in Off-Nadir Remote Sensing Images'. Together they form a unique fingerprint.

Cite this