TY - JOUR
T1 - PolyFootNet
T2 - Extracting Polygonal Building Footprints in Off-Nadir Remote Sensing Images
AU - Li, Kai
AU - Deng, Yupeng
AU - Chen, Jingbo
AU - Meng, Yu
AU - Xi, Zhihao
AU - Ma, Junxian
AU - Wang, Chenhao
AU - Wang, Maolin
AU - Zhao, Xiangyu
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Extracting polygonal building footprints from off-nadir imagery is crucial for diverse applications. Current deep-learning-based extraction approaches predominantly rely on semantic segmentation paradigms and postprocessing algorithms, limiting their boundary precision and applicability. However, existing polygonal extraction methodologies are inherently designed for near-nadir imagery and fail under the geometric complexities introduced by off-nadir viewing angles. To address these challenges, this article introduces the polygonal footprint network (PolyFootNet), a novel deep-learning framework that directly outputs polygonal building footprints without requiring external postprocessing steps. The PolyFootNet employs a high-quality mask prompter to generate precise roof masks, which guide polygonal vertex extraction in a unified model pipeline. A key contribution of the PolyFootNet is introducing the self-offset attention (SOFA) mechanism, grounded in Nadaraya–Watson regression, to effectively mitigate the accuracy discrepancy observed between low-rise and high-rise buildings. This approach allows low-rise building predictions to leverage angular corrections learned from high-rise building offsets, significantly enhancing overall extraction accuracy. Additionally, motivated by the inherent ambiguity of building footprint extraction (BFE) tasks, we systematically investigate alternative extraction paradigms and demonstrate that a combined approach of building masks and offsets achieves superior polygonal footprint results. Extensive experiments validate PolyFootNet’s effectiveness, illustrating its promising potential as a robust, generalizable, and precise polygonal BFE method from challenging off-nadir imagery.
AB - Extracting polygonal building footprints from off-nadir imagery is crucial for diverse applications. Current deep-learning-based extraction approaches predominantly rely on semantic segmentation paradigms and postprocessing algorithms, limiting their boundary precision and applicability. However, existing polygonal extraction methodologies are inherently designed for near-nadir imagery and fail under the geometric complexities introduced by off-nadir viewing angles. To address these challenges, this article introduces the polygonal footprint network (PolyFootNet), a novel deep-learning framework that directly outputs polygonal building footprints without requiring external postprocessing steps. The PolyFootNet employs a high-quality mask prompter to generate precise roof masks, which guide polygonal vertex extraction in a unified model pipeline. A key contribution of the PolyFootNet is introducing the self-offset attention (SOFA) mechanism, grounded in Nadaraya–Watson regression, to effectively mitigate the accuracy discrepancy observed between low-rise and high-rise buildings. This approach allows low-rise building predictions to leverage angular corrections learned from high-rise building offsets, significantly enhancing overall extraction accuracy. Additionally, motivated by the inherent ambiguity of building footprint extraction (BFE) tasks, we systematically investigate alternative extraction paradigms and demonstrate that a combined approach of building masks and offsets achieves superior polygonal footprint results. Extensive experiments validate PolyFootNet’s effectiveness, illustrating its promising potential as a robust, generalizable, and precise polygonal BFE method from challenging off-nadir imagery.
KW - Building detection
KW - building footprint extraction (BFE)
KW - Nadaraya–Watson regression
KW - oblique monocular images
KW - off-nadir aerial image
KW - segment anything model (SAM)
UR - https://www.scopus.com/pages/publications/105011062347
U2 - 10.1109/TGRS.2025.3590054
DO - 10.1109/TGRS.2025.3590054
M3 - 文章
AN - SCOPUS:105011062347
SN - 0196-2892
VL - 63
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5635016
ER -