A Method for Recognizing Complex Sounds in Noisy Environments

Published: 2018-10-26 19:27

[Abstract]: Society has entered the era of artificial intelligence, and speech recognition technology is now quite mature. Recognition of the complex sounds found in everyday life is far less mature: the sound sources are complex and diverse, background noise interferes, and many problems and shortcomings remain. Research on recognizing complex sounds in noisy environments therefore has substantial practical and theoretical value.

A complex sound is a sound signal that contains several types of sound whose boundaries are difficult to separate. Current detection methods for such sounds largely reuse traditional speech recognition techniques. Speech, however, is produced in a relatively fixed way and has stable energy, whereas complex sounds come in many varieties, are produced by different mechanisms, have high instantaneous energy, and are further disturbed by environmental noise. Traditional speech recognition techniques alone therefore do not transfer well to complex sound recognition.

To address the low recognition accuracy for this class of sounds in noisy environments, this thesis carries out the following work:
(1) It introduces several time-domain and frequency-domain features commonly used in sound recognition. By extracting and analyzing the feature parameters of complex sound samples, it proposes describing complex sounds with a combination of time-domain and frequency-domain features, and reports comparison experiments on several mixed features.
(2) Because manually selecting training samples is difficult, it proposes a training sample selection algorithm based on clustering annotation, which selects a representative set of training samples more quickly and accurately, and reports comparison experiments on different clustering methods.
(3) It proposes a complex sound recognition framework based on the hidden Markov model (Hidden Markov Model, HMM), and carries out training and recognition with it.

Simulation experiments were run on two different types of complex sound, train sounds and bird calls. The results show that recognition accuracy and efficiency in noisy environments improve significantly when complex sounds are represented by a mixed feature that combines the time-domain short-time autocorrelation function with frequency-domain Mel-frequency cepstral coefficients (MFCC), when training samples are chosen with the proposed selection algorithm based on affinity propagation clustering annotation, and when modeling is done with the HMM recognition framework.
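As a rough illustration of the mixed feature named in the abstract, the sketch below computes a normalized short-time autocorrelation function per frame and concatenates it with an MFCC vector. It is not the thesis's implementation: the frame length, hop size, maximum lag, and the helper names `frame_signal`, `short_time_autocorrelation`, and `mixed_features` are all assumptions made for this example, and the MFCC vector is assumed to be computed elsewhere and passed in.

```python
import math


def frame_signal(signal, frame_len, hop):
    """Split samples into overlapping frames; an incomplete tail is dropped."""
    return [signal[start:start + frame_len]
            for start in range(0, len(signal) - frame_len + 1, hop)]


def short_time_autocorrelation(frame, max_lag):
    """Return R(k) / R(0) for k = 0..max_lag, where R(k) = sum_n x[n] * x[n + k]."""
    energy = sum(x * x for x in frame)
    if energy == 0.0:
        return [0.0] * (max_lag + 1)  # silent frame: nothing to correlate
    return [sum(frame[n] * frame[n + k] for n in range(len(frame) - k)) / energy
            for k in range(max_lag + 1)]


def mixed_features(frame, mfcc_vector, max_lag):
    """Concatenate autocorrelation values with a precomputed MFCC vector.

    mfcc_vector is a hypothetical input: this sketch does not compute MFCCs.
    """
    return short_time_autocorrelation(frame, max_lag) + list(mfcc_vector)


# Toy input (not data from the thesis): a sine wave with a 20-sample period.
signal = [math.sin(2 * math.pi * n / 20) for n in range(400)]
frames = frame_signal(signal, frame_len=200, hop=100)
acf = short_time_autocorrelation(frames[0], max_lag=30)

# Among lags 10..30, the strongest correlation should sit at the signal period.
peak_lag = max(range(10, 31), key=lambda k: acf[k])
```

For a periodic frame, the autocorrelation peaks at multiples of the period, which is why a time-domain feature of this kind can complement spectral features such as MFCC. The resulting per-frame vectors would then serve as the observation sequence for a model such as an HMM.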
[Degree-granting institution]: Hefei University of Technology
[Degree level]: Master's
[Year degree awarded]: 2017
[Classification number]: TN912.34


[Cite as]: 樊鵬 (Fan Peng); A Method for Recognizing Complex Sounds in Noisy Environments [D]; Hefei University of Technology; 2017


Article ID: 2296724


Article link: http://www.sikaile.net/kejilunwen/xinxigongchenglunwen/2296724.html
