Abstract:The rapid development of low-altitude economy has driven the proliferation of Unmanned Aerial Vehicle (UAV) applications, including logistics, inspection, and emergency response. However, transmitting high-volume image data from UAVs to ground stations faces significant challenges due to limited bandwidth and stringent privacy requirements. To address these issues, a Semantic Communication (SC) framework based on Federated Learning (FL) is proposed for efficient and privacy-preserving image transmission. A Swin Transformer-based Semantic Communication (STSC) architecture is designed to extract multi-scale semantic features under constrained bandwidth conditions. Dedicated communication and computing nodes are deployed on UAVs to enhance real-time coverage and flexibility. Meanwhile, a FL mechanism enables global model training across distributed devices without sharing raw data, thus preserving user privacy. Simulation experiments conducted on the CIFAR-10 dataset demonstrate that the proposed STSC framework achieves at least 5.7 dB improvement in Peak Signal-to-Noise Ratio (PSNR) compared to DeepJSCC baselines, while also showing superior convergence and generalization performance. The framework effectively integrates UAV-assisted deployment with SC and privacy protection, offering a practical solution for bandwidth-constrained image transmission in low-altitude networks.
| Comments: | 13 pages, 10 figures, 2 tables |
| Subjects: | Image and Video Processing (eess.IV); Machine Learning (cs.LG) |
| Cite as: | arXiv:2605.12566 [eess.IV] |
| (or arXiv:2605.12566v1 [eess.IV] for this version) | |
| https://doi.org/10.48550/arXiv.2605.12566 arXiv-issued DOI via DataCite (pending registration) |
Submission history
From: Kexin Zhang [view email]
[v1]
Tue, 12 May 2026 09:18:53 UTC (3,112 KB)
