Xu, Jing and Tang, Jiarun and Huang, Xiao and Gu, Xuan and Li, Lanhua and He, Jianhua and Liu, Wei (2025) Encoder-Decoder Based Deep Reinforcement Learning for Multi-AUV Assisted Data Collection in UWSNs. IEEE Transactions on Vehicular Technology. pp. 1-13. DOI https://doi.org/10.1109/tvt.2025.3642544
Xu, Jing and Tang, Jiarun and Huang, Xiao and Gu, Xuan and Li, Lanhua and He, Jianhua and Liu, Wei (2025) Encoder-Decoder Based Deep Reinforcement Learning for Multi-AUV Assisted Data Collection in UWSNs. IEEE Transactions on Vehicular Technology. pp. 1-13. DOI https://doi.org/10.1109/tvt.2025.3642544
Xu, Jing and Tang, Jiarun and Huang, Xiao and Gu, Xuan and Li, Lanhua and He, Jianhua and Liu, Wei (2025) Encoder-Decoder Based Deep Reinforcement Learning for Multi-AUV Assisted Data Collection in UWSNs. IEEE Transactions on Vehicular Technology. pp. 1-13. DOI https://doi.org/10.1109/tvt.2025.3642544
Abstract
Reliable and timely data collection poses a significant challenge for underwater wireless sensor networks (UWSNs), primarily due to the extremely low data rate of underwater communication and the difficulty in supplying energy to the sensor nodes. In this paper, we explore the use of multiple autonomous underwater vehicles (AUVs) for sequential data collection from sensor nodes distributed over a large area. To facilitate efficient data collection, appropriate cluster heads (CHs) are selected in each region to collect and aggregate data from nearby energy-limited sensor nodes, and then forward the data to the mobile AUVs. Specifically, we investigate how to maximize the value of information (VoI) of the collected data through the joint optimization of CH selection and multi-AUV path planning, while accounting for the energy constraints of sensor nodes. To tackle the formulated VoI maximization problem, we propose a deep reinforcement learning algorithm based on an encoder-decoder architecture. The whole UWSN system is fed into the encoder network, followed by a composite decoder consisting of an AUV selection decoder and a cluster access sequence decoder to generate the cluster access sequence for each AUV. Based on the determined sequences, we further utilize the dynamic programming algorithm to achieve optimal CH selection. Finally, we obtain the sequence of AUVs that access the selected CHs. Simulation results demonstrate that the proposed learning-based approach converges and achieves a higher VoI than existing benchmark algorithms.
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | Autonomous underwater vehicle (AUV); underwater wireless sensor networks (UWSNs); data collection; value of information (VoI); path planning |
| Subjects: | Z Bibliography. Library Science. Information Resources > ZR Rights Retention |
| Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
| SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
| Depositing User: | Unnamed user with email elements@essex.ac.uk |
| Date Deposited: | 31 Mar 2026 10:07 |
| Last Modified: | 31 Mar 2026 10:07 |
| URI: | http://repository.essex.ac.uk/id/eprint/42346 |
Available files
Filename: 20251208-Encoder_Decoder_Based_Deep_Reinforcement_Learning_for_Multi_AUV_Assisted_Data_Collection_in_UWSNs.pdf
Licence: Creative Commons: Attribution 4.0