Research on Pruning Algorithm for Speech Recognition

【Author】 Lu Yao (陆遥)

【Author information】 Beijing University of Posts and Telecommunications, Information and Signal Processing, 2012, Master's thesis

【Abstract】 Speech is the most efficient means of communication between people. Replacing traditional keyboard and touch-screen input with speech input makes devices more convenient to operate, and applying speech technology to human-computer interaction has long been a frontier research topic. With the rapid development of mobile terminals and the mobile Internet, speech recognition on mobile terminals has become a new research hotspot.

Compared with mature speech recognition technology, recognition on mobile terminals faces more technical challenges, mainly because the computing power of mobile terminals is limited. First, recognizers built on traditional algorithms cannot meet the real-time requirements of speech input. Second, high-accuracy speech recognition needs the support of a sufficiently expressive language model, generally a statistical n-gram model, whose large storage requirement makes it difficult to deploy fully on a mobile terminal. Current research on mobile-terminal speech recognition therefore concentrates on two aspects: reducing the recognizer's memory footprint and speeding up decoding.

To address these problems, this thesis proposes and implements a pruning algorithm based on the user's location information, which effectively improves recognizer performance. The main work is as follows:

1. Pruning algorithm for finite-state grammar recognition systems. Isolated-word and fixed-sentence recognition systems are the foundation of speech recognition and have simpler recognition networks than large-vocabulary continuous speech recognition systems. The thesis first studies pruning on these simple recognition networks, proposes an algorithm that prunes the recognition network according to the user's location information, and demonstrates its effectiveness with a large number of experiments.

2. Pruning algorithm for continuous speech recognition systems. Building on the finite-state recognition network pruning work, the thesis proposes an algorithm that uses the language model look-ahead tree structure to prune during decoding, cutting down the decoder's search space according to the user's location information. Comparative experiments show that the algorithm improves both the recognition accuracy and the speed of the recognizer.

【Keywords】 voice-based local search; recognition network; pruning; language model look-ahead tree

【Online publication submitter】 Beijing University of Posts and Telecommunications

【Classification number】 TN912.34
【Citation count】 1
【Download count】 257