🤖 AI Summary
This work addresses a performance bottleneck in low-parameter (<1.5M) deep vision models caused by feature map interference—specifically, the aberrant encoding of multiple semantic concepts by individual neurons. We identify a previously unrecognized interference mechanism: excessive neuronal responses induced by the synergistic effect of bottleneck architectures and superlinear activation functions. To mitigate this, we propose the NoDepth Bottleneck architecture, which decouples channel compression from nonlinear transformation and incorporates a gradient-aware interference suppression module to explicitly constrain feature superposition. Systematic evaluation on ImageNet demonstrates that our approach significantly improves scaling efficiency and accuracy for compact models—for instance, boosting Top-1 accuracy by 2.3% over a ResNet-18 variant. Notably, this is the first method to enable controllable modeling of interference under superlinear activations, establishing a novel design paradigm for ultra-lightweight vision models.
📝 Abstract
The paper investigates the performance of state-of-the-art low-parameter deep neural networks for computer vision, focusing on bottleneck architectures and their behavior using superlinear activation functions. We address interference in feature maps, a phenomenon associated with superposition, where neurons simultaneously encode multiple characteristics. Our research suggests that limiting interference can enhance scaling and accuracy in very low-scaled networks (under 1.5M parameters). We identify key design elements that reduce interference by examining various bottleneck architectures, leading to a more efficient neural network. Consequently, we propose a proof-of-concept architecture named NoDepth Bottleneck built on mechanistic insights from our experiments, demonstrating robust scaling accuracy on the ImageNet dataset. These findings contribute to more efficient and scalable neural networks for the low-parameter range and advance the understanding of bottlenecks in computer vision. https://caiac.pubpub.org/pub/3dh6rsel