Research and Implementation on Speaker Recognition System Based on GMM

【Author】 Chen Qiang

【Author information】 Wuhan University of Technology, Signal and Information Processing, 2010, Master's thesis

【Abstract】 Speaker recognition, also known as voiceprint recognition, aims to identify or verify a speaker from the characteristics of his or her voice. With the rapid development of network information technology, digital, unobtrusive, and convenient identity authentication has become increasingly important. As a biometric authentication technology, speaker recognition has broad application prospects in surveillance, identity authentication, forensic investigation, and financial security, and it has become a research hotspot in speech signal processing. The key problems in speaker recognition are feature extraction from the speech signal and pattern matching. Building on a study of the main current speaker recognition algorithms, this thesis examines cepstral feature extraction methods based on acoustic characteristics and pattern matching methods based on template matching and probability statistics. It implements a speaker recognition system based on vector quantization (VQ) and, as its main focus, designs a text-independent speaker recognition system based on the Gaussian mixture model (GMM). The main contents are as follows:
(1) The thesis summarizes the development, research hotspots, and difficulties of speaker recognition technology, and analyzes and discusses the main existing speaker recognition algorithms.
(2) It studies speech preprocessing for speaker recognition, with emphasis on an improved spectral subtraction algorithm for speech enhancement. Experiments analyze the enhancement effect and show improved robustness of the speaker recognition system in noisy environments. It also studies the principles and methods of feature extraction, and implements in simulation the extraction of pitch features, LPCC and MFCC parameters, and differential cepstral parameters.
(3) Based on an analysis of the basic principle of VQ, the LBG algorithm, and VQ codebook initialization, it designs and implements a VQ-based speaker recognition system, including model parameter training and matching-based recognition. Experiments analyze recognition performance under different model parameters and different speech sample durations.
(4) To improve the recognition rate and stability of the system, it studies the expectation-maximization (EM) algorithm for GMM parameter estimation, model parameter initialization, and the training and recognition procedures. On this basis it designs a GMM-based speaker recognition system and completes simulation experiments, analyzing recognition performance under different model parameters, feature extraction methods, speech sample durations, and signal-to-noise ratios (SNR).
(5) It analyzes open-set speaker recognition methods and threshold selection for speaker verification. It studies an open-set method that performs identification first and verification second, addresses the "rejection problem" for out-of-set impostors, and compares the performance of open-set speaker recognition systems based on VQ and on GMM.
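The identification-then-verification decision for open-set recognition mentioned in item (5) can be illustrated with a minimal sketch. It assumes diagonal-covariance GMMs that have already been trained (for example by EM); the speaker names, model parameters, and threshold below are hypothetical values chosen for illustration and do not come from the thesis.

```python
import numpy as np

def gmm_avg_loglik(frames, weights, means, variances):
    """Average per-frame log-likelihood of frames (T, D) under a
    diagonal-covariance GMM with M components.
    weights: (M,), means: (M, D), variances: (M, D)."""
    diff = frames[:, None, :] - means[None, :, :]                      # (T, M, D)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)  # (M,)
    log_comp = log_norm - 0.5 * np.sum(diff ** 2 / variances, axis=2)  # (T, M)
    log_mix = np.log(weights) + log_comp                               # (T, M)
    # Numerically stable log-sum-exp over the mixture components.
    peak = log_mix.max(axis=1, keepdims=True)
    frame_ll = peak[:, 0] + np.log(np.exp(log_mix - peak).sum(axis=1))
    return float(frame_ll.mean())

def open_set_identify(frames, models, threshold):
    """Step 1 (identification): pick the enrolled speaker whose GMM scores highest.
    Step 2 (verification): accept only if that score reaches the threshold;
    otherwise reject the utterance as coming from an out-of-set speaker."""
    scores = {name: gmm_avg_loglik(frames, *params)
              for name, params in models.items()}
    best = max(scores, key=scores.get)
    decision = best if scores[best] >= threshold else None
    return decision, scores

# Hypothetical two-speaker "enrollment": (weights, means, variances) per speaker.
models = {
    "alice": (np.array([0.5, 0.5]),
              np.array([[0.0, 0.0], [1.0, 1.0]]),
              np.ones((2, 2))),
    "bob":   (np.array([0.5, 0.5]),
              np.array([[5.0, 5.0], [6.0, 6.0]]),
              np.ones((2, 2))),
}

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    in_set = rng.normal(0.5, 0.5, size=(50, 2))     # frames near "alice"
    out_set = rng.normal(20.0, 0.5, size=(50, 2))   # frames far from every model
    print(open_set_identify(in_set, models, threshold=-10.0)[0])
    print(open_set_identify(out_set, models, threshold=-10.0)[0])
```

In a real system the frames would be feature vectors such as MFCCs, and the threshold would be tuned on held-out data to balance false acceptances against false rejections; a fixed value is used here only to keep the sketch short.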

【Key words】 Speech Processing; Speaker Recognition; Gaussian Mixture Model (GMM); Vector Quantization (VQ); Feature Extraction

【Online publication contributor】 Wuhan University of Technology

【Classification number】 TN912.34
【Times cited】 21
【Downloads】 911