Accurate detection of histopathological cancer subtypes is crucial for personalized treatment, and deep learning methods based on histopathology images have become an effective solution to this problem. However, existing deep learning methods for histopathology image classification often suffer from high computational complexity, fail to account for the variability of different tissue regions, and struggle to attend to local and global information simultaneously. To address these issues, we propose a coarse-to-fine inference based vision transformer (ViT) network (CFI-ViT) for pathological image detection of gastric cancer subtypes. CFI-ViT combines global attention with a discriminative, differentiable region-selection module to achieve two-stage inference. In the coarse inference stage, a ViT model with relative position embedding extracts global information from the input image. If the critical information is not sufficiently identified, the differentiable module extracts discriminative local image regions for fine-grained screening in the fine inference stage. The effectiveness and superiority of the proposed CFI-ViT method are validated on three gastric cancer pathological image datasets: one private dataset clinically collected from Yunnan Cancer Hospital in China and two publicly available datasets, HE-GHI-DS and TCGA-STAD. Experimental results demonstrate that CFI-ViT achieves superior recognition accuracy and generalization performance compared with traditional methods, while using only 80% of the computational resources required by a standard ViT model.
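The two-stage control flow described above can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the function and parameter names (`coarse_to_fine_infer`, `CONFIDENCE_THRESHOLD`, `top_k`, `score_fn`) are hypothetical, the models are stand-in callables returning class logits, and the paper's learned differentiable region-selection module is replaced here by a generic per-patch scoring function.

```python
import math

# Hypothetical early-exit threshold; the paper's actual criterion for
# "critical information not sufficiently identified" may differ.
CONFIDENCE_THRESHOLD = 0.9

def softmax(logits):
    """Convert raw class logits to probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def coarse_to_fine_infer(patches, coarse_model, fine_model, score_fn, top_k=4):
    """Two-stage inference: exit early if the coarse (global) model is
    confident; otherwise re-classify only the most discriminative patches.

    patches      -- iterable of image patches (any representation)
    coarse_model -- callable: patches -> class logits (global ViT stand-in)
    fine_model   -- callable: patches -> class logits (fine-stage stand-in)
    score_fn     -- callable: patch -> discriminativeness score
                    (stand-in for the differentiable selection module)
    """
    coarse_probs = softmax(coarse_model(patches))
    if max(coarse_probs) >= CONFIDENCE_THRESHOLD:
        return coarse_probs, "coarse"  # early exit saves the fine stage
    # Keep only the top-k most discriminative patches for fine screening.
    ranked = sorted(patches, key=score_fn, reverse=True)[:top_k]
    fine_probs = softmax(fine_model(ranked))
    return fine_probs, "fine"
```

Skipping the fine stage whenever the coarse pass is already confident is what yields the computational savings the abstract reports: only ambiguous slides pay for the second, region-level pass.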