Nadeem, Muhammad Shahroz and Franqueira, Virginia NL and Zhai, Xiaojun and Kurugollu, Fatih (2019) A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis. IEEE Access, 7. pp. 84003-84019. DOI https://doi.org/10.1109/access.2019.2924733
Nadeem, Muhammad Shahroz and Franqueira, Virginia NL and Zhai, Xiaojun and Kurugollu, Fatih (2019) A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis. IEEE Access, 7. pp. 84003-84019. DOI https://doi.org/10.1109/access.2019.2924733
Nadeem, Muhammad Shahroz and Franqueira, Virginia NL and Zhai, Xiaojun and Kurugollu, Fatih (2019) A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis. IEEE Access, 7. pp. 84003-84019. DOI https://doi.org/10.1109/access.2019.2924733
Abstract
The increasing use of social media networks on handheld devices, especially smartphones with powerful built-in cameras, and the widespread availability of fast and high bandwidth broadband connections, added to the popularity of cloud storage, is enabling the generation and distribution of massive volumes of digital media, including images and videos. Such media is full of visual information and holds immense value in today’s world. The volume of data involved calls for automated visual content analysis systems able to meet the demands of practice in terms of efficiency and effectiveness. Deep Learning (DL) has recently emerged as a prominent technique for visual content analysis. It is data-driven in nature and provides automatic end-to-end learning solutions without the need to rely explicitly on predefined handcrafted feature extractors. Another appealing characteristic of DL solutions is the performance they can achieve, once the network is trained, under practical constraints. This paper identifies eight problem domains which require analysis of visual artefacts in multimedia. It surveys the recent, authoritative, and best performing DL solutions and lists the datasets used in the development of these deep methods for the identified types of visual analysis problems. The paper also discusses the challenges that DL solutions face which can compromise their reliability, robustness, and accuracy for visual content analysis.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Visual content analysis; deep learning; machine learning; dataset |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 26 Jun 2019 11:21 |
Last Modified: | 30 Oct 2024 17:29 |
URI: | http://repository.essex.ac.uk/id/eprint/24887 |
Available files
Filename: 08744516.pdf
Licence: Creative Commons: Attribution 3.0