CNN-Based Speaker Voiceprint Recognition for Continuous Speech

Published: 2019-02-13 07:28

[Abstract]: In recent years, as living standards have risen, the demands placed on machine recognition of human voices have grown steadily. The Gaussian mixture-hidden Markov model (GMM-HMM) is the most important model in speaker recognition research. However, the model handles large speech datasets poorly and is not robust to noise, so its development has hit a bottleneck. To address this problem, researchers have turned to deep learning. This paper introduces a CNN deep learning model to study speaker recognition in continuous speech and proposes the CNN continuous speaker recognition (continuous speaker recognition of convolutional neural network, CSR-CNN) algorithm. The model extracts fixed-length speech segments in their original order, forms a time-ordered sequence of spectrograms, extracts a feature sequence with a CNN, and applies a reward-penalty function to measure combinations of the feature sequence continuously. Experimental results show that CSR-CNN achieves better recognition performance than GMM-HMM in continuous-segment speaker recognition.
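The front end described in the abstract (fixed-length segments kept in time order, each turned into a spectrogram) can be sketched roughly as below. This is only an illustration under stated assumptions, not the paper's implementation: the sample rate, segment length, frame length, and hop size are arbitrary choices, the input is random noise standing in for speech, and `reward_penalty_score` is a hypothetical stand-in, since the abstract does not give the actual form of the reward-penalty function. The CNN itself is omitted.

```python
import numpy as np

def split_segments(signal, seg_len):
    """Cut a 1-D signal into consecutive fixed-length segments, kept in time order."""
    n = len(signal) // seg_len              # the incomplete tail is dropped
    return signal[: n * seg_len].reshape(n, seg_len)

def log_spectrogram(segment, frame_len=256, hop=128):
    """Log-magnitude short-time spectrum of one segment: (frames, frame_len // 2 + 1)."""
    n_frames = 1 + (len(segment) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [segment[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.log1p(np.abs(np.fft.rfft(frames, axis=1)))

def reward_penalty_score(segment_preds, speaker, reward=1.0, penalty=0.5):
    """Hypothetical scoring rule (an assumption, not from the paper): add `reward`
    for each segment predicted as `speaker`, subtract `penalty` otherwise."""
    score = 0.0
    for pred in segment_preds:
        score += reward if pred == speaker else -penalty
    return score

sr = 16000                                   # assumed sample rate
rng = np.random.default_rng(0)
signal = rng.standard_normal(2 * sr + 100)   # placeholder for about 2 s of speech
segments = split_segments(signal, seg_len=sr // 2)            # 0.5 s segments
spectrograms = np.stack([log_spectrogram(s) for s in segments])
print(segments.shape, spectrograms.shape)

# Per-segment speaker labels as a CNN classifier might emit them (made-up values).
score = reward_penalty_score([1, 1, 2, 1], speaker=1)
print(score)
```

Each row of `spectrograms` would be one CNN input, and the order of the rows preserves the order of the segments in the utterance, which is what lets a sequence-level score be accumulated over consecutive segments.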
[Affiliation]: Hangzhou Dianzi University
[Classification number]: TP393

