Multi-resolution vision transformer model for skin cancer subtype classification using histopathology slides

Abstract

Introduction Digital pathology has significantly advanced cancer diagnosis by enabling high-resolution visualisation and assessment of tissue specimens. However, the manual analysis of these images remains labour-intensive and susceptible to human error, resulting in inconsistencies in diagnosis and treatment decisions. Herein, we developed and externally validated a multi-resolution model for classifying subtypes of skin cancer from whole slide images ( WSIs). Methods We constructed a dataset comprising approximately 1.13 million histological patches (dividing the WSIs into non-overlapping tiles) from the Non-Melanoma Skin Cancer Segmentation (NMSCS) and Heidelberg datasets. All the patches were normalised using the Macenko method before training a self-supervised vision transformer-based model to classify the most common subtypes of skin cancer: basal cell carcinoma (BCC), squamous cell carcinoma (SCC), intraepidermal carcinoma (IEC), Melanoma, Naevi, and Non-cancerous. Our multi-resolution model was designed to classify melanoma and non-melanoma skin cancer subtypes, incorporating multi-resolution data at 10x, 20x, 40x, and 400x magnifications. The model was externally validated on 5,147 slides from 4,066 patients for non-melanoma cancer subtypes. The model's performance was evaluated using classification metrics, and the quadratic weighted Cohen's Kappa (?) score was used to measure the agreement between the model's predictions and the actual labels with a 95% confidence interval (CI). Results Our multi-resolution model demonstrated strong classification performance across six classes, achieving an overall ? score of 0.859 (95% CI: 0.851, 0.866) and 0.898 (95% CI: 0.892, 0.904) on the validation and testing sets, respectively, reflecting robust performance across diverse skin cancer subtypes. The multi-resolution model for non-melanoma skin cancer exhibited superior performance, achieving an overall ? score of 0.919 (95% CI: 0.914, 0.924) on the validation set. On the testing set, the ? score ranged from 0.996 to 0.889 with magnifications of 10x, 20x, 40x, and 400x. The attention maps highlighted clinically relevant features for cancerous tissue at different magnifications. Additionally, the model obtained a ? score of 0.791 (95% CI: 0.774, 0.808) on the external data at the slide level, indicating substantial agreement between the model's prediction and the actual label of WSIs. Conclusion Our multi-resolution model has the potential to assist anatomical pathologists in automatically detecting, highlighting, and classifying subtypes of melanoma and non-melanoma skin cancer subtypes directly from WSIs. This capability could ultimately improve patient outcomes and more effective clinical decision-making in digital pathology. For non-melanoma cancer, our model could be deployed in regions with limited access to experienced dermatopathologists and a high incidence of the disease, particularly in low-resource settings.

Competing Interest Statement

A.K.C., P.W.T., and A.W.H. are co-founders of Pandani Solutions Pty Ltd in Australia, which develops AI-driven tools to assist anatomical pathologists.

Funding Statement

This work was supported by an Australian National Health and Medical Research Council Leadership Award (A.W.H.).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2022.1022967/full https://www.sciencedirect.com/science/article/pii/S2352340921008623 https://aws.amazon.com/marketplace/pp/prodview-h4lwgyj4myxh6

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

All data are available online.

留言 (0)

沒有登入
gif