Enhancing Skin Lesion Classification: A Self-Attention Fusion Approach with Vision Transformer

Izwan Heroza, Rahmat and Gan, John Q and Raza, Haider (2024) Enhancing Skin Lesion Classification: A Self-Attention Fusion Approach with Vision Transformer. In: Medical Image Understanding and Analysis (MIUA), 2024-07-24 - 2024-07-26, Manchester, UK.

Abstract

Automated skin lesion classification is pivotal in modern dermatology, and recent strides in deep learning have shown immense potential in this field. This paper introduces a novel attention mechanism amalgamating various self-attention variants with the Vision Transformer (ViT) architecture to enhance skin lesion classification performance. By integrating Scaled Dot-Product Attention, Multiplicative Attention, and Additive Attention, a unified framework is devised for capturing diverse contextual cues within dermatological images. Extensive experiments are conducted on skin lesion datasets (ISIC 2017) to assess different loss functions, attention mechanisms, and fusion strategies. Results demonstrate that the proposed method significantly enhances classification performance across all metrics, exhibiting a remarkable improvement of over 12% in F1 score compared to the baseline. This approach not only showcases the efficacy of attention mechanisms in dermatological image analysis but also underscores the potential of ViT architecture in advancing automated skin lesion classification, thereby offering promising prospects for improving diagnostic accuracy and patient care in dermatology

Item Metadata

Item Type:	Conference or Workshop Item (Paper)
Uncontrolled Keywords:	Vision Transformer; Self-Attention Fusion; Skin Lesion Classification
Divisions:	Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of
SWORD Depositor:	Unnamed user with email elements@essex.ac.uk
Depositing User:	Unnamed user with email elements@essex.ac.uk
Date Deposited:	02 Oct 2024 15:34
Last Modified:	16 Aug 2025 06:13
URI:	http://repository.essex.ac.uk/id/eprint/38780

Available files

Accepted Version

Filename: MIUA2024_RAHMAT_LNCS.pdf

Download

Enhancing Skin Lesion Classification: A Self-Attention Fusion Approach with Vision Transformer

Abstract

Item Metadata

Share and export

Available files

Accepted Version

Statistics

Altmetrics

Downloads