Xiao, Xuedou and Wang, Wei and He, Jianhua and Zhang, Qian (2024) Task-Oriented Video Compressive Streaming for Real-time Semantic Segmentation. IEEE Transactions on Mobile Computing, 23 (12). pp. 14396-14413. DOI https://doi.org/10.1109/tmc.2024.3446185 (In Press)
Xiao, Xuedou and Wang, Wei and He, Jianhua and Zhang, Qian (2024) Task-Oriented Video Compressive Streaming for Real-time Semantic Segmentation. IEEE Transactions on Mobile Computing, 23 (12). pp. 14396-14413. DOI https://doi.org/10.1109/tmc.2024.3446185 (In Press)
Xiao, Xuedou and Wang, Wei and He, Jianhua and Zhang, Qian (2024) Task-Oriented Video Compressive Streaming for Real-time Semantic Segmentation. IEEE Transactions on Mobile Computing, 23 (12). pp. 14396-14413. DOI https://doi.org/10.1109/tmc.2024.3446185 (In Press)
Abstract
Real-time semantic segmentation (SS) is a major task for various vision-based applications such as self-driving. Due to the limited computing resources and stringent performance requirements, streaming videos from camera-embedded mobile devices to edge servers for SS is a promising approach. While there are increasing efforts on task-oriented video compression, most SS-applicable algorithms apply more uniform compression, as the sensitive regions are less obvious and concentrated. Such processing results in low compression performance and significantly limits the capacity of edge servers supporting real-time SS. In this paper, we propose STAC, a novel task-oriented DNN-driven video compressive streaming algorithm tailed for SS, to strike accuracy-bitrate balance and adapt to time-varying bandwidth. It exploits DNN's gradients as sensitivity metrics for fine-grained spatial adaptive compression and includes a temporal adaptive scheme that integrates spatial adaptation with predictive coding. Furthermore, we design a new bandwidth-aware neural network, serving as a compatible configuration tuner to fit time-varying bandwidth and content. STAC is evaluated in a system with a commodity mobile device and an edge server with real-world network traces. Experiments show that STAC can save up to 63.7-75.2% of bandwidth or improve accuracy by 3.1-9.5% compared to state-of-the-art algorithms, while capable of adapting to time-varying bandwidth.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Image coding; Bandwidth; Streaming media; Semantic segmentation; Accuracy; Servers; Predictive coding; Adaptive streaming; DNN-driven compression; edge computing |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 20 Sep 2024 12:43 |
Last Modified: | 11 Dec 2024 18:29 |
URI: | http://repository.essex.ac.uk/id/eprint/39208 |
Available files
Filename: 20240918-STAC-TMC.pdf
Licence: Creative Commons: Attribution 4.0