Gu, Xuan and Tang, Jiarun and Huang, Xiao and He, Jianhua and Xu, Jing (2024) Exploiting Deep Reinforcement Learning for Multi-AUV Assisted VoI-Maximum Data Collection in UWSNs. In: IEEE Global Communications Conference (GLOBECOM) 2024, 2024-12-08 - 2024-12-12, Cape Town, South Africa. (In Press)
Gu, Xuan and Tang, Jiarun and Huang, Xiao and He, Jianhua and Xu, Jing (2024) Exploiting Deep Reinforcement Learning for Multi-AUV Assisted VoI-Maximum Data Collection in UWSNs. In: IEEE Global Communications Conference (GLOBECOM) 2024, 2024-12-08 - 2024-12-12, Cape Town, South Africa. (In Press)
Gu, Xuan and Tang, Jiarun and Huang, Xiao and He, Jianhua and Xu, Jing (2024) Exploiting Deep Reinforcement Learning for Multi-AUV Assisted VoI-Maximum Data Collection in UWSNs. In: IEEE Global Communications Conference (GLOBECOM) 2024, 2024-12-08 - 2024-12-12, Cape Town, South Africa. (In Press)
Abstract
Reliable and timely data collection is an important and challenging problem for underwater wireless sensor networks (UWSNs), partly due to the very slow underwater communication and the difficulty in recharging the sensors. In this paper, we exploit the use of autonomous underwater vehicles (AUVs) for UWSN data collection task. Specifically, we investigate how to maximize the value of information (VoI) for data collection through joint optimization of cluster head (CH) selection and multi-AUV path planning. We formulate the joint optimization problem for the task, taking into account the energy constraints of sensor nodes. To solve the problem, we propose a deep reinforcement learning algorithm based on an encoder-decoder architecture. The entire UWSN system is fed into the encoder network, followed by a composite decoder consisting of an AUV selection decoder and a cluster access sequence decoder to obtain the cluster access sequence for each AUV. Based on the determined sequences, we further utilize the dynamic programming algorithm to achieve optimal CH selection. Finally, we obtain the sequence of AUVs accessing the selected CHs. Simulation results demonstrate that the proposed learning-based approach converges and achieves a higher VoI than the existing benchmark algorithms.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | Published proceedings: _not provided_ |
Uncontrolled Keywords: | AUV, data collection, VoI, path planning |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 02 Oct 2024 08:18 |
Last Modified: | 02 Oct 2024 08:19 |
URI: | http://repository.essex.ac.uk/id/eprint/39078 |
Available files
Filename: m19308-gu final.pdf