ViT-ESA: Enhanced Spatial Attention in ViT for Breast Cancer Histopathological Image Classification

Amrutanshu Panigrahi; Subrata Chowdhury

Home
Proceedings
Vol. 1 No. 3 (2026): LGPR Batch 2 Conference 3
Paper

ViT-ESA: Enhanced Spatial Attention in ViT for Breast Cancer Histopathological Image Classification

Date Published : 26 May 2026

Contributors

Amrutanshu Panigrahi

Postdoctoral Researcher, Lincoln University College, Malaysia

Author

Subrata Chowdhury

Author

Keywords

Breast Cancer Vision Transformer Elephant Search Algorithm Feature optimization

Proceeding

Vol. 1 No. 3 (2026): LGPR Batch 2 Conference 3

Track

Engineering and Sciences

License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Abstract

Breast cancer is one of the most common causes of death due to cancer among women in the world requiring accurate and early diagnosis. Accurate diagnosis of histopathological images is crucial for final diagnosis. Moreover, many computer diagnosis systems do not capture contextual information globally or at several scales of tissue pattern. Recent studies showed that ViT's models which use deep learning show a better performance in modeling long-range dependencies than CNN’s. Even though high-dimensional deep features can improve performance through improved feature representation, they also introduce redundancy and computation overloads. These can hinder clinical deployment. To address these concerns, the proposed method integrates deep feature extraction based on Vision Transformers with feature selection using the Elephant Search Algorithm (ESA). Throughout the BreakHis breast histopathology dataset when tested at various magnification factors, the proposed ViT–ESA framework improves classification performance at lower feature dimensions. A comparative evaluation of multiple machine learning classifiers demonstrates that the XGBoost classifier performs the best validating the effectiveness of this method with ESA-based transformer features for reliable breast cancer diagnosis.

References

No References

Downloads

PDF

How to Cite

Panigrahi, A., & Chowdhury, S. . (2026). ViT-ESA: Enhanced Spatial Attention in ViT for Breast Cancer Histopathological Image Classification. Sustainable Global Societies Initiative, 1(3). https://vectmag.com/sgsi/paper/view/252