Jameel, Shoaib and Lam, Wai and Bing, Lidong (2015) Supervised topic models with word order structure for document classification and retrieval learning. Information Retrieval Journal, 18 (4). pp. 283-330. DOI https://doi.org/10.1007/s10791-015-9254-2
Jameel, Shoaib and Lam, Wai and Bing, Lidong (2015) Supervised topic models with word order structure for document classification and retrieval learning. Information Retrieval Journal, 18 (4). pp. 283-330. DOI https://doi.org/10.1007/s10791-015-9254-2
Jameel, Shoaib and Lam, Wai and Bing, Lidong (2015) Supervised topic models with word order structure for document classification and retrieval learning. Information Retrieval Journal, 18 (4). pp. 283-330. DOI https://doi.org/10.1007/s10791-015-9254-2
Abstract
One limitation of most existing probabilistic latent topic models for document classification is that the topic model itself does not consider useful side-information, namely, class labels of documents. Topic models, which in turn consider the side-information, popularly known as supervised topic models, do not consider the word order structure in documents. One of the motivations behind considering the word order structure is to capture the semantic fabric of the document. We investigate a low-dimensional latent topic model for document classification. Class label information and word order structure are integrated into a supervised topic model enabling a more effective interaction among such information for solving document classification. We derive a collapsed Gibbs sampler for our model. Likewise, supervised topic models with word order structure have not been explored in document retrieval learning. We propose a novel supervised topic model for document retrieval learning which can be regarded as a pointwise model for tackling the learning-to-rank task. Available relevance assessments and word order structure are integrated into the topic model itself. We conduct extensive experiments on several publicly available benchmark datasets, and show that our model improves upon the state-of-the-art models.
Item Type: | Article |
---|---|
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 17 Aug 2020 16:42 |
Last Modified: | 15 Jan 2022 01:31 |
URI: | http://repository.essex.ac.uk/id/eprint/26534 |
Available files
Filename: inrt_springer_submission.pdf