Cai, Zijie and Fang, Hui and Liu, Jianhua and Xu, Ge and Long, Yunfei and Guan, Yin and Ke, Tianci (2025) Improving unified information extraction in Chinese mental health domain with instruction-tuned LLMs and type-verification component. Artificial Intelligence in Medicine, 162. p. 103087. DOI https://doi.org/10.1016/j.artmed.2025.103087
Cai, Zijie and Fang, Hui and Liu, Jianhua and Xu, Ge and Long, Yunfei and Guan, Yin and Ke, Tianci (2025) Improving unified information extraction in Chinese mental health domain with instruction-tuned LLMs and type-verification component. Artificial Intelligence in Medicine, 162. p. 103087. DOI https://doi.org/10.1016/j.artmed.2025.103087
Cai, Zijie and Fang, Hui and Liu, Jianhua and Xu, Ge and Long, Yunfei and Guan, Yin and Ke, Tianci (2025) Improving unified information extraction in Chinese mental health domain with instruction-tuned LLMs and type-verification component. Artificial Intelligence in Medicine, 162. p. 103087. DOI https://doi.org/10.1016/j.artmed.2025.103087
Abstract
Background: Extracting psychological counseling help-seeker information from unstructured text is crucial for providing effective mental health support. This task involves identifying personal emotions, psychological states, and underlying psychological issues but faces significant challenges. These challenges include the sensitivity of mental health data, the lack of Chinese instruction datasets, and the difficulties large language models (LLMs) encounter with complex natural language understanding tasks. Objective: This study aims to address these challenges by developing a unified information extraction framework for Chinese mental health texts. Specifically, it leverages instruction-tuned LLMs and incorporates a novel type-verification (TV) component to improve performance while minimizing computational demands. Methods: We first constructed a Chinese mental health domain instruction dataset for mental health information extraction using synthetic data generated by ChatGPT, guided by psychology experts. This dataset includes self-reported statements from psychological counseling help-seekers, capturing their personal situations, emotions, thoughts, and experiences. Subsequently, we fine-tuned open-source LLMs on this dataset to perform named entity recognition, relation extraction, and event extraction. To address errors and omissions in the extracted information, we introduced a type-verification component. This component employs a lightweight model with significantly fewer parameters to verify the extracted types. The verification results were then fed back into LLMs for further refinement. Results: Experimental results demonstrate that our framework achieves outstanding performance in mental health information extraction. The type-verification component significantly enhances extraction accuracy while reducing computational resource requirements through the use of a lightweight model. By combining robust instruction-tuned LLMs with an efficient type-verification component, our approach delivers exceptional results. Conclusion: This study presents a novel and efficient framework for tackling the challenges of mental health information extraction in Chinese texts. By integrating instruction-tuned LLMs with a lightweight type-verification component, our approach significantly improves extraction accuracy and computational efficiency. This framework holds promise for supporting scalable, automated mental health support systems, advancing both research and practical applications in the mental health domain.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Information extraction; Large language models; Mental health; Type-verification |
Divisions: | Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 01 May 2025 14:35 |
Last Modified: | 01 May 2025 14:44 |
URI: | http://repository.essex.ac.uk/id/eprint/40454 |
Available files
Filename: Improving unified information extraction in Chinese mental health domain with instruction-tuned LLMs and type verification component.pdf
Licence: Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0
Embargo Date: 22 February 2026