基于嵌入式聲紋識(shí)別系統(tǒng)的研究與實(shí)現(xiàn)
發(fā)布時(shí)間:2018-03-28 10:45
本文選題:聲紋識(shí)別 切入點(diǎn):特征提取 出處:《廣東工業(yè)大學(xué)》2012年碩士論文
【摘要】:近年來(lái),聲紋識(shí)別技術(shù)在逐漸的成熟,聲紋識(shí)別作為一種生物認(rèn)證技術(shù),有其獨(dú)特的優(yōu)點(diǎn),如聲音是非接觸式的,自然的,用戶容易接受。因?yàn)檎Z(yǔ)言這一媒介的優(yōu)勢(shì),通過(guò)語(yǔ)音身份認(rèn)證技術(shù),聲紋識(shí)別迅速應(yīng)用到實(shí)際,突出了巨大的市場(chǎng)潛力,聲紋識(shí)別技術(shù)已成為一個(gè)新興的高技術(shù)產(chǎn)業(yè)。隨著計(jì)算機(jī)硬件和軟件技術(shù),半導(dǎo)體技術(shù),電子技術(shù),通信技術(shù)和網(wǎng)絡(luò)技術(shù)的發(fā)展,以及嵌入式技術(shù)的不斷發(fā)展和更新,其性能和便攜性大大提高。實(shí)時(shí)數(shù)據(jù)采集,濾波處理,可以在低功耗,體積小的嵌入式設(shè)備完成。今天,處理器因?yàn)槠涮厥獾慕Y(jié)構(gòu)和高的編譯效率使其能夠快速的實(shí)現(xiàn)聲紋識(shí)別算法,滿足今天的數(shù)字信號(hào)處理和高實(shí)時(shí)性的要求。高性能嵌入式聲紋識(shí)別系統(tǒng)的聲紋識(shí)別技術(shù),因?yàn)榉奖?經(jīng)濟(jì)性,準(zhǔn)確性和嵌入式系統(tǒng)的便攜性,移動(dòng)性等優(yōu)點(diǎn),被廣泛應(yīng)用于人們的日常生活,擁有廣闊的發(fā)展前景。 本文在分析聲紋識(shí)別的相關(guān)理論與技術(shù)的基礎(chǔ)上,重點(diǎn)研究了基于Mel倒譜系數(shù)(MFCC)的特征參數(shù)的提取和DTW算法進(jìn)行改進(jìn),對(duì)一些不足之處進(jìn)行相應(yīng)的改進(jìn)。最后,它被應(yīng)用在基于ARM11與WinCE嵌入式平臺(tái)下實(shí)現(xiàn)的一個(gè)小容量的嵌入式聲紋識(shí)別系統(tǒng)。在前人工作的基礎(chǔ)上,本文改進(jìn)工作主要包括以下三個(gè)方面: 1.特征提取方面:對(duì)標(biāo)準(zhǔn)的MFCC中存在的不足,提出了相應(yīng)的改進(jìn),加權(quán)差分結(jié)合MFCC語(yǔ)音特征參數(shù)。使用短時(shí)幀能量和短時(shí)加權(quán)過(guò)零率替代MFCC中有負(fù)識(shí)別作用的第1,2階分量,并根據(jù)語(yǔ)音成分的不同貢獻(xiàn)率進(jìn)行加權(quán),然后進(jìn)行一階差分,最終會(huì)合并成一個(gè)新的特征參數(shù)。 2.DTW算法方面:使用改進(jìn)的DTW算法,替代標(biāo)準(zhǔn)的DTW算法,采用整體路徑約束,該算法具有很好的魯棒性,從而提高了算法的效率和代碼質(zhì)量。 3.嵌入式系統(tǒng)實(shí)現(xiàn)方面:在基于ok6410的arm11嵌入式系統(tǒng)中的資源相對(duì)有限的條件下,進(jìn)行了一些優(yōu)化處理。包括操作系統(tǒng)的優(yōu)化定制和移植,通過(guò)跨平臺(tái)的軟件開(kāi)發(fā),成功在搭建好的嵌入式開(kāi)發(fā)平臺(tái)上實(shí)現(xiàn)了聲紋識(shí)別系統(tǒng)。并研究分析了改進(jìn)的DTW算法和傳統(tǒng)DTW算法之間的性能差異,對(duì)在嵌入式中的運(yùn)行情況進(jìn)行了分析。 該系統(tǒng)相關(guān)的實(shí)驗(yàn),實(shí)驗(yàn)結(jié)果表明,對(duì)同一文本的內(nèi)容,識(shí)別系統(tǒng)的識(shí)別率比較高,對(duì)文本無(wú)關(guān)的內(nèi)容,識(shí)別率應(yīng)該改進(jìn);用改進(jìn)后的算法和特征參數(shù),系統(tǒng)的平均識(shí)別率提高4%左右。
[Abstract]:In recent years, voicerecognition technology has gradually matured. As a biometric authentication technology, voicerecognition has its unique advantages, such as sound is contactless, natural and easy to accept by users, because of the advantage of language as a medium. Through the voice identification technology, voicerecognition is applied to practice rapidly, which highlights the huge market potential. Voicerecognition technology has become a new high-tech industry. With the computer hardware and software technology, semiconductor technology, With the development of electronic technology, communication technology and network technology, as well as the continuous development and update of embedded technology, its performance and portability are greatly improved. Today, because of its special structure and high compilation efficiency, the processor can quickly realize the voiceprint recognition algorithm. To meet the requirements of today's digital signal processing and high real-time. High performance embedded voice recognition system voiceprint recognition technology, because of the advantages of convenience, economy, accuracy and embedded system portability, mobility, and other advantages, Widely used in people's daily life, has a broad development prospects. On the basis of analyzing the theory and technology of voiceprint recognition, this paper focuses on the feature parameter extraction based on Mel cepstrum coefficient and the improvement of DTW algorithm. It is applied to a small capacity embedded voiceprint recognition system based on ARM11 and WinCE embedded platform. Based on the previous work, the improvement work in this paper mainly includes the following three aspects:. 1. Feature extraction: for the shortcomings of standard MFCC, a corresponding improvement is put forward. The weighted difference is combined with MFCC speech feature parameters. The second order component with negative recognition in MFCC is replaced by short-time frame energy and short-time weighted zero-crossing rate. The speech components are weighted according to different contribution rates, and then the first order difference is carried out, which will be merged into a new feature parameter. In the aspect of 2.DTW algorithm, the improved DTW algorithm is used instead of the standard DTW algorithm and the global path constraint is adopted. The algorithm has good robustness and improves the efficiency and code quality of the algorithm. 3. The realization of embedded system: under the condition of limited resources in arm11 embedded system based on ok6410, some optimization processes are carried out, including the optimized customization and transplantation of operating system, and the development of cross-platform software. The voiceprint recognition system is successfully implemented on a well built embedded development platform, and the performance difference between the improved DTW algorithm and the traditional DTW algorithm is analyzed, and the running situation in the embedded system is analyzed. The experimental results show that the recognition rate of the system is high for the content of the same text, the recognition rate should be improved for the text-independent content, and the improved algorithm and feature parameters should be used. The average recognition rate of the system is increased by about 4%.
【學(xué)位授予單位】:廣東工業(yè)大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2012
【分類號(hào)】:TP368.1;TN912.34
【引證文獻(xiàn)】
相關(guān)碩士學(xué)位論文 前1條
1 鄒節(jié)凱;基于SOPC技術(shù)的噪聲環(huán)境下聲紋識(shí)別系統(tǒng)的研究[D];武漢理工大學(xué);2013年
,本文編號(hào):1675992
本文鏈接:http://www.sikaile.net/kejilunwen/jisuanjikexuelunwen/1675992.html
最近更新
教材專著