RUS  ENG
Full version
JOURNALS // Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia // Archive

Dokl. RAN. Math. Inf. Proc. Upr., 2025 Volume 527, Pages 378–387 (Mi danma695)

SPECIAL ISSUE: ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING TECHNOLOGIES

Where semantics meets space: a layer-wise analysis of frozen encoders for efficient polyp segmentation

A. Taha, R. Lukmanov

Innopolis University, Innopolis, Russia

Abstract: While deep encoder-decoder dominate endoscopic segmentation, their reliance on full fine-tuning or training from scratch is computationally expensive and data-intensive. This paper challenges this by demonstrating that an extremely efficient model – a frozen foundation model encoder with a shallow decoder – can achieve state-of-the-art performance. Our core contribution is a systematic, layer-wise analysis to identify the single most effective feature source within the encoder’s hierarchy, challenging the common practice of using the final, most abstract layer. We identify a distinct performance peak at an intermediate layer (Layer 12), with an optimal trade-off between high-level semantic understanding and the high-resolution spatial fidelity crucial for segmentation. Despite training only 65k parameters, our method surpasses existing benchmarks on the challenging multi-center PolypGen dataset with a Dice score of 0.972. This work provides an evidence-based methodology for efficient feature extraction, significantly lowering the computational and data barriers for developing high-performance clinical AI tools.

Keywords: endoscopic image segmentation, polyp segmentation, deep learning, transfer learning, self-supervised learning, feature extraction, computational efficiency, foundation models, computer vision, layer-wise analysis.

UDC: 004.032

Received: 21.08.2025
Accepted: 20.09.2025

DOI: 10.7868/S2686954325070331



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2026