TY - GEN
T1 - QuadFormer
T2 - 21st International Conference on Ubiquitous Robots, UR 2024
AU - Rao, Pratyaksh Prabhav
AU - Qiao, Feng
AU - Zhang, Weide
AU - Yiliang, X.
AU - Deng, Yong
AU - Wu, Guangbin
AU - Zhang, Qiang
AU - Loianno, Giuseppe
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Accurately identifying Power Lines (PLs) is crucial for ensuring the safety of aerial vehicles. Despite the potential of recent deep learning approaches, obtaining high-quality ground truth annotations remains a challenging and labor-intensive task. Unsupervised Domain Adaptation (UDA) emerges as a promising solution, leveraging knowledge from labeled synthetic data to improve performance on unlabeled real images. However, existing UDA methods often suffer of huge computation costs, limiting their deployment on real-time embedded systems commonly utilized on aerial vehicles. To mitigate this problem, this paper introduces QuadFormer, a real-time framework designed for unsupervised semantic segmentation within the UDA paradigm. QuadFormer integrates a lightweight transformer-based segmentation model with a cross-attention mechanism to narrow the gap between a labelled synthetic domain and unlabelled real domain. Furthermore, we design a novel pseudo label scheme to enhance the segmentation accuracy of the unlabelled real data. To facilitate the evaluation of our framework and promote reserach in PL segemntation, we present two new datasets: AutelPL Synthetic and AutelPL Real. Experimental results demonstrate that QuadFormer achieves state-of-the-art performance on both AutelPL Synthetic → TTPLA and AutelPL Synthetic → AutelPL Real tasks. We will publicly release the dataset to the research community.
AB - Accurately identifying Power Lines (PLs) is crucial for ensuring the safety of aerial vehicles. Despite the potential of recent deep learning approaches, obtaining high-quality ground truth annotations remains a challenging and labor-intensive task. Unsupervised Domain Adaptation (UDA) emerges as a promising solution, leveraging knowledge from labeled synthetic data to improve performance on unlabeled real images. However, existing UDA methods often suffer of huge computation costs, limiting their deployment on real-time embedded systems commonly utilized on aerial vehicles. To mitigate this problem, this paper introduces QuadFormer, a real-time framework designed for unsupervised semantic segmentation within the UDA paradigm. QuadFormer integrates a lightweight transformer-based segmentation model with a cross-attention mechanism to narrow the gap between a labelled synthetic domain and unlabelled real domain. Furthermore, we design a novel pseudo label scheme to enhance the segmentation accuracy of the unlabelled real data. To facilitate the evaluation of our framework and promote reserach in PL segemntation, we present two new datasets: AutelPL Synthetic and AutelPL Real. Experimental results demonstrate that QuadFormer achieves state-of-the-art performance on both AutelPL Synthetic → TTPLA and AutelPL Synthetic → AutelPL Real tasks. We will publicly release the dataset to the research community.
KW - Supplementary Material Video
UR - http://www.scopus.com/inward/record.url?scp=85200707446&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85200707446&partnerID=8YFLogxK
U2 - 10.1109/UR61395.2024.10597474
DO - 10.1109/UR61395.2024.10597474
M3 - Conference contribution
AN - SCOPUS:85200707446
T3 - 2024 21st International Conference on Ubiquitous Robots, UR 2024
SP - 161
EP - 167
BT - 2024 21st International Conference on Ubiquitous Robots, UR 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 24 June 2024 through 27 June 2024
ER -