Multi-column Deep Neural Networks for Image Classification

computer vision and pattern recognition(2012)

引用 6597|浏览924
暂无评分
摘要
Traditional methods of computer vision and machine learning cannot match human performance on tasks such as the recognition of handwritten digits or traffic signs. Our biologically plausible deep artificial neural network architectures can. Small (often minimal) receptive fields of convolutional winner-take-all neurons yield large network depth, resulting in roughly as many sparsely connected neural layers as found in mammals between retina and visual cortex. Only winner neurons are trained. Several deep neural columns become experts on inputs preprocessed in different ways; their predictions are averaged. Graphics cards allow for fast training. On the very competitive MNIST handwriting benchmark, our method is the first to achieve near-human performance. On a traffic sign recognition benchmark it outperforms humans by a factor of two. We also improve the state-of-the-art on a plethora of common image classification benchmarks.
更多
查看译文
关键词
graphics processing units,handwritten character recognition,image classification,image recognition,learning (artificial intelligence),neural nets,MNIST handwriting benchmark,artificial neural network architectures,computer vision,convolutional winner-take-all neurons,fast training,graphics cards,handwritten digits recognition,human performance,image classification,machine learning,multicolumn deep neural networks,retina,sparsely connected neural layers,traffic sign recognition benchmark,traffic signs,visual cortex
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要