GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs
CVPR 2024(2024)
摘要
Vision graph neural networks (ViG) offer a new avenue for exploration in
computer vision. A major bottleneck in ViGs is the inefficient k-nearest
neighbor (KNN) operation used for graph construction. To solve this issue, we
propose a new method for designing ViGs, Dynamic Axial Graph Construction
(DAGC), which is more efficient than KNN as it limits the number of considered
graph connections made within an image. Additionally, we propose a novel
CNN-GNN architecture, GreedyViG, which uses DAGC. Extensive experiments show
that GreedyViG beats existing ViG, CNN, and ViT architectures in terms of
accuracy, GMACs, and parameters on image classification, object detection,
instance segmentation, and semantic segmentation tasks. Our smallest model,
GreedyViG-S, achieves 81.1
Vision GNN and 2.2
less GMACs and a similar number of parameters. Our largest model, GreedyViG-B
obtains 83.9
decrease in parameters and a 69
the same accuracy as ViHGNN with a 67.3
decrease in GMACs. Our work shows that hybrid CNN-GNN architectures not only
provide a new avenue for designing efficient models, but that they can also
exceed the performance of current state-of-the-art models.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要