基于可重構(gòu)的語音識別片上系統(tǒng)的設(shè)計

發(fā)布時間：2019-03-21 20:22

【摘要】：近年來,嵌入式系統(tǒng)的語音識別系統(tǒng)已經(jīng)廣泛應(yīng)用到智能家居、工業(yè)控制、移動終端等領(lǐng)域,正改變著人們的生活。由于語言交流是人們之間最自然的交流方式,基于語音識別的人機交互的嵌入式系統(tǒng)越來越成為研究的熱點。然而,現(xiàn)有的語音識別系統(tǒng)或具有很高的CPU使用率,不能完成其它任務(wù);或具有很大的體積,難以在嵌入式系統(tǒng)使用;或網(wǎng)絡(luò)依賴性太高,在無網(wǎng)絡(luò)條件下僅能完成有限詞匯量的識別。為了解決這些問題,在嵌入式語音識別方面還需要對系統(tǒng)結(jié)構(gòu)進行深入的研究。本文提出基于可重構(gòu)的片上語音識別系統(tǒng),在一定程度上有效緩解了上述矛盾。所作的主要工作如下:首先,本文研究了語音信號的信號處理。從信號處理的角度,討論了在語音識別過程中用到關(guān)鍵技術(shù)的原理。這包括預(yù)加重、端點檢測、特征提取等技術(shù)。其次,本文介紹了隱馬爾可夫模型的基本原理以及高斯混合模型的基本原理。通過對隱馬爾可夫模型的三個問題的論述,特別是高斯混合模型表示的隱馬爾可夫模型的B參數(shù)的詳細論述,解決了語音識別系統(tǒng)的訓(xùn)練及識別的原理問題。再次,本文以ZYNQ7000作為SOC設(shè)計平臺,構(gòu)建了嵌入式非特定人孤立詞語音識別系統(tǒng)。在對ZYNQ7000的可重構(gòu)性研究的基礎(chǔ)上,本文一方面在前有的PC端訓(xùn)練軟件的基礎(chǔ)上,進一步將識別模型改進為基于高斯混合模型的隱馬爾可夫模型(GMM-HMM),形成系統(tǒng)驗證平臺,為識別系統(tǒng)提供識別模板和硬件測試數(shù)據(jù)。這包括對訓(xùn)練和識別算法的研究及實現(xiàn)。還包括將系統(tǒng)中間數(shù)據(jù)轉(zhuǎn)換成易于硬件測試的格式。另一方面,將識別算法移植到ZYNQ7000平臺,實現(xiàn)了片上語音識別系統(tǒng)的構(gòu)建。這包括通過對識別流程的評估,完成對識別系統(tǒng)進行了軟硬件劃分,并且完成對語音識別的關(guān)鍵算法作了適合硬件特性的改進。這還包括對關(guān)鍵計算單元的硬件重構(gòu),通過硬件邏輯實現(xiàn)數(shù)字信號處理中的常見算法。在本文中,主要研究了MFCC計算單元的重構(gòu)。最后,通過對系統(tǒng)的識別率和實時性的測試,闡述了采用可重構(gòu)片上語音識別系統(tǒng)優(yōu)勢以及對將來工作的展望。
[Abstract]:In recent years, embedded speech recognition system has been widely used in smart home, industrial control, mobile terminals and other fields, is changing people's lives. Because language communication is the most natural way of communication between people, the embedded system based on speech recognition has become more and more popular in the field of human-computer interaction. However, the existing speech recognition system either has a high CPU usage rate, can not accomplish other tasks, or has a large size, so it is difficult to use in embedded system. Or the network dependence is too high, can only complete the limited vocabulary identification under the condition of no network. In order to solve these problems, embedded speech recognition needs to be deeply studied. In this paper, a reconfigurable on-chip speech recognition system is proposed, which effectively alleviates the above contradictions to a certain extent. The main work is as follows: firstly, this paper studies the signal processing of speech signal. From the point of view of signal processing, the principle of key techniques used in speech recognition is discussed. This includes pre-weighting, endpoint detection, feature extraction and other techniques. Secondly, this paper introduces the basic principle of hidden Markov model and Gao Si mixed model. The training and recognition principle of speech recognition system is solved by discussing three problems of Hidden Markov Model, especially the B parameter of Hidden Markov Model represented by Gao Si's mixed model. Thirdly, using ZYNQ7000 as the design platform of SOC, the embedded speech recognition system for isolated words is constructed. On the basis of the research on the reconfiguration of ZYNQ7000, on the one hand, based on the previous PC training software, the recognition model is further improved to the hidden Markov model (GMM-HMM) based on Gao Si's mixed model to form a system verification platform. Provide identification template and hardware test data for identification system. This includes the research and implementation of training and recognition algorithms. It also includes converting the system intermediate data into a format that is easy to test with hardware. On the other hand, the recognition algorithm is transplanted to ZYNQ7000 platform to realize the construction of on-chip speech recognition system. Through the evaluation of the recognition process, the hardware and software partition of the recognition system is completed, and the improvement of the key algorithm of speech recognition is made suitable for the hardware characteristics. It also includes hardware reconfiguration of key computing units and implementation of common algorithms in digital signal processing through hardware logic. In this paper, the reconstruction of MFCC computing unit is studied. Finally, by testing the recognition rate and real-time performance of the system, the advantages of the reconfigurable on-chip speech recognition system and the prospect of future work are discussed.
【學(xué)位授予單位】：電子科技大學(xué)
【學(xué)位級別】：碩士
【學(xué)位授予年份】：2014
【分類號】：TN912.34

【參考文獻】

相關(guān)期刊論文前1條

1 賁俊,萬旺根,余小清;基于置信度的非特定人語音識別拒識算法的研究[J];計算機應(yīng)用研究;2003年07期

，

本文編號：2445288

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會員下載

Download by Member

本文鏈接：http://www.sikaile.net/kejilunwen/wltx/2445288.html

上一篇：基于人工磁導(dǎo)體的低剖面天線及最優(yōu)結(jié)構(gòu)的研究
下一篇：基于自適應(yīng)閾值的小波包在松動部件信噪分離中的研究

論文發(fā)表

·知網(wǎng)|萬方|維普|龍源|省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于可重構(gòu)的語音識別片上系統(tǒng)的設(shè)計