Section 01
[Main Floor/Introduction] Convolutional Nearest Neighbors (ConvNN): A New Framework Unifying Convolution and Attention Mechanisms
This article introduces a new neural network architecture called Convolutional Nearest Neighbors (ConvNN), whose core innovation lies in unifying convolution and self-attention mechanisms through a k-nearest neighbor aggregation framework, providing a new theoretical perspective for computer vision model design. ConvNN treats both as special cases of neighbor selection and aggregation (convolution based on spatial proximity, attention based on feature similarity) and reveals a continuous spectrum between them. Experiments show that ConvNN outperforms pure convolution or pure attention schemes on the CIFAR dataset and can be integrated into existing architectures as a plug-and-play module.