基于语音和笔的手写数学公式纠错方法

姜映映 (Jiang Yingying); 敖翔 (Ao Xiang); 田丰 (Tian Feng); 王绪刚 (Wang Xugang); 戴国忠 (Dai Guozhong)

Error Correction for Handwritten Mathematical Expression Recognition by Pen and Speech

Abstract

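Summary: User interfaces built on recognition technology are error prone because recognition accuracy is limited, so providing natural and efficient error correction for them is important. Handwritten mathematical expressions have a two-dimensional structure, which makes them hard to recognize and to correct. This paper proposes a multimodal technique for correcting recognition errors in handwritten mathematical expressions. It lets users correct segmentation errors with the pen, and correct symbol recognition errors and expression structure analysis errors with pen and speech. The core of the technique is a multimodal fusion algorithm. The algorithm takes the pen-selected symbols and the speech as input, chooses a fusion method according to whether the speech input is a mathematical term or a mathematical symbol, and then revises the handwritten expression and outputs the most likely recognition result. Experimental results show that the technique corrects errors in handwritten mathematical expression recognition effectively and is more efficient than unimodal pen-based error correction.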

Abstract: Because recognition-based interfaces are error prone, it is important to provide them with a natural and efficient error correction method. Handwritten mathematical expressions have a 2D structure, which makes them challenging both to recognize and to correct. This paper introduces a multimodal error correction technique for handwritten mathematical expression recognition that lets users correct errors with pen and speech. Symbol segmentation errors can be corrected with the pen. Symbol recognition errors and structure analysis errors can be corrected with the pen alone or with pen and speech together: the user first selects the error with the pen and then speaks the corresponding mathematical term or mathematical symbol. The core of the technique is a multimodal fusion algorithm that fuses the handwriting and speech recognition results. Its input is the speech and the symbols selected by pen. Depending on whether the speech input is a mathematical term or the name of a mathematical symbol, the algorithm chooses a specific fusion method, adjusts the handwritten expression, and outputs the most likely result. Evaluation shows that the proposed technique is effective and that it helps users correct errors in mathematical expression recognition more efficiently than unimodal pen-based error correction.
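The branching described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not specify the fusion rule, so the names `fuse`, `best_joint`, `Candidate`, and `MATH_TERMS`, the relation labels, and the product-of-scores rule are all assumptions made for illustration. Only the overall shape comes from the text: the input is the pen-selected part of the expression plus the speech, the method depends on whether the speech is a mathematical term or a symbol name, and the output is the most likely result.

```python
from dataclasses import dataclass

# Hypothetical vocabulary: spoken mathematical terms mapped to the structural
# relation they imply. Assumed for illustration, not taken from the paper.
MATH_TERMS = {"superscript": "SUP", "subscript": "SUB", "fraction": "FRAC"}


@dataclass
class Candidate:
    label: str    # symbol name or structural relation
    score: float  # recognizer confidence in [0, 1]


def best_joint(handwriting, speech):
    """Return the label with the highest product of the two recognizers' scores.

    Labels that appear in only one list are ignored (an assumed simplification).
    Returns None when the two lists share no label.
    """
    speech_scores = {c.label: c.score for c in speech}
    joint = {
        c.label: c.score * speech_scores[c.label]
        for c in handwriting
        if c.label in speech_scores
    }
    if not joint:
        return None
    return max(joint, key=joint.get)


def fuse(selected, speech):
    """Pick a correction for the pen-selected part of the expression.

    selected: dict holding handwriting candidates under "symbols" and "relations".
    speech:   non-empty list of Candidate from the speech recognizer.
    """
    top = max(speech, key=lambda c: c.score)
    if top.label in MATH_TERMS:
        # Speech named a mathematical term: correct the structure.
        spoken_relations = [
            Candidate(MATH_TERMS[c.label], c.score)
            for c in speech
            if c.label in MATH_TERMS
        ]
        return ("relation", best_joint(selected["relations"], spoken_relations))
    # Otherwise treat the speech as a symbol name: correct the symbol.
    return ("symbol", best_joint(selected["symbols"], speech))
```

In this sketch, a handwritten symbol ranked as "z" (0.5) over "2" (0.4) would be corrected to "2" if the speech recognizer scores "2" at 0.7 and "z" at 0.2, since the joint score 0.28 exceeds 0.10. Multiplying scores is only one simple way to combine two recognizers; any rule that ranks the shared candidates would fit the same structure.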
