Shape-DNA: Effective Character Restoration and Enhancement for Arabic Text Documents

Pattern Recognition(2010)

引用 20|浏览5
暂无评分
摘要
We present a novel learning-based image restoration and enhancement technique for improving character recognition performance of OCR products for degraded documents or documents/text captured with mobile devices such as camera-phones. The proposed technique is language independent and can simultaneously increase the effective resolution and restore broken characters with artifacts due to image capturing device such as a low quality/low resolution camera, or due to previous pre-processing such as extracting text region from the document image. The proposed technique develops a predictive relationship between high-resolution training images and their low-resolution/degraded counterparts, and exploits this relationship in a probabilistic scheme to generate a high resolution image from a low quality, low-resolution text image. We present a fast and scalable implementation of the proposed character restoration algorithm to improve the text recognition for document/text images captured by mobile phones. Experimental results demonstrate that the system effectively increases OCR performance for documents captured by mobile imaging devices, from levels of 50% to levels of over 80% for non-latin document/scene text images at 120dpi.
更多
查看译文
关键词
effective character restoration,scene text image,high-resolution training image,high resolution image,text region,text image,proposed technique,low quality,arabic text documents,text recognition,low-resolution text image,document image,optical character recognition,camera phones,shape,learning artificial intelligence,low resolution,degradation,high resolution,computational modeling,image restoration,mobile devices,mobile device,image resolution,feature extraction,camera phone,text analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要