AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image Classification

June 18, 2024 · 2406.15303

Authors

Honglin Li, Jingxiong Li, Chenglu Zhu, Lin Yang, Yunlong Zhang

Abstract

Multiple Instance Learning (MIL) is effective for analyzing whole slide images but is prone to overfitting caused by attention over-concentration. While existing solutions rely on complex architectural modifications or additional processing steps, we introduce Attention Entropy Maximization (AEM), a simple yet effective regularization technique.

Our investigation reveals a positive correlation between attention entropy and model performance. Building on this insight, we integrate AEM regularization into the MIL framework to penalize excessive attention concentration.
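To make the idea of an entropy regularizer concrete, here is a minimal sketch in plain Python. It is not taken from the authors' repository. It assumes that attention weights come from a softmax over per-instance scores and that the regularizer subtracts a weighted Shannon entropy from the task loss, so that minimizing the total loss pushes attention entropy up. The names `softmax`, `attention_entropy`, `aem_loss`, and the weight `lam` are hypothetical.

```python
import math


def softmax(scores):
    """Turn raw per-instance attention scores into weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_entropy(attn, eps=1e-12):
    """Shannon entropy of an attention distribution over the instances in a bag.

    High entropy means attention is spread over many instances;
    low entropy means it is concentrated on a few.
    """
    return -sum(a * math.log(a + eps) for a in attn)


def aem_loss(task_loss, attn, lam):
    """Assumed form of the regularized objective: task loss minus lam * entropy.

    Subtracting the entropy means that minimizing this value rewards
    higher attention entropy, i.e. penalizes over-concentration.
    """
    return task_loss - lam * attention_entropy(attn)


if __name__ == "__main__":
    spread = softmax([0.0, 0.0, 0.0, 0.0])   # uniform attention
    peaked = softmax([10.0, 0.0, 0.0, 0.0])  # attention on one instance
    print(attention_entropy(spread), attention_entropy(peaked))
    print(aem_loss(1.0, spread, 0.1), aem_loss(1.0, peaked, 0.1))
```

With the same task loss, the uniform bag gets a lower total loss than the peaked one, which is the behavior an entropy-maximizing regularizer is meant to encourage.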

To address sensitivity to the AEM weight parameter, we implement Cosine Weight Annealing, which reduces dependence on the choice of this weight. Extensive evaluations demonstrate AEM's superior performance across diverse feature extractors, MIL frameworks, attention mechanisms, and augmentation techniques.
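The abstract does not give the annealing schedule, so the following is only an illustrative sketch of one common cosine schedule. It assumes the regularization weight starts at an initial value and decays to a final value (zero by default) over training. The direction of the schedule, the function name `cosine_annealed_weight`, and all parameter names are assumptions, not details confirmed by the source.

```python
import math


def cosine_annealed_weight(step, total_steps, initial_weight, final_weight=0.0):
    """Cosine schedule for a regularization weight (assumed decaying form).

    Returns initial_weight at step 0 and final_weight at step total_steps,
    following half a cosine period in between. total_steps must be > 0.
    """
    progress = min(max(step / total_steps, 0.0), 1.0)  # clamp to [0, 1]
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))  # goes from 1 down to 0
    return final_weight + (initial_weight - final_weight) * cosine


if __name__ == "__main__":
    # Print the weight at a few points of a hypothetical 100-step run.
    for step in (0, 25, 50, 75, 100):
        print(step, cosine_annealed_weight(step, 100, initial_weight=0.1))
```

Under this assumed schedule, the weight is strongest early in training and fades smoothly, so the final result depends less on the exact initial value than a fixed weight would.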

Here is our anonymous code: https://github.com/dazhangyu123/AEM.
