天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁 > 科技論文 > 搜索引擎論文 >

基于情感分類的產(chǎn)品評論垂直搜索引擎的研究

發(fā)布時間:2018-08-05 14:18
【摘要】:隨著互聯(lián)網(wǎng)技術(shù)的不斷發(fā)展,電子商務(wù)的不斷興起,BBS、博客、微博的不斷涌現(xiàn),商家與購買者的網(wǎng)上交互日趨頻繁。越來越多的購買者在使用產(chǎn)品后,將產(chǎn)品的評論發(fā)表在網(wǎng)上,評論的數(shù)量與日俱增,評論的本身口語化較多并呈非結(jié)構(gòu)化。商家在決策市場供求關(guān)系、潛在購買者在購買產(chǎn)品時,若從海量的產(chǎn)品評論中人工的挑選出自己關(guān)心的信息,是耗時和費力的,并帶有片面性和滯后性。因此搜索引擎在當(dāng)今互聯(lián)網(wǎng)中扮演著重要的角色,,像百度、谷歌這樣強大的搜索引擎是針對不同領(lǐng)域、不同種類的通用搜索引擎。在特定的產(chǎn)品評論領(lǐng)域中,卻顯得力不從心。所以,在這樣的背景下,一款具有情感分類的產(chǎn)品評論垂直搜索引擎的研究與開發(fā)是很有必要的。 在國內(nèi)外研究現(xiàn)狀的基礎(chǔ)上,針對中文產(chǎn)品評論文本中評價對象的識別、評價短語的識別、評價對象與評價短語的搭配識別及評價短語的情感傾向性分析,做了進一步研究。主要工作如下: (1)在識別評價對象方法上,利用詞性序列獲取評價對象候選集,并提出了評價對象的完整性和穩(wěn)定性的概念及算法,用來過濾評價對象的噪聲。利用評價對象與評價短語的同現(xiàn)規(guī)則及評價對象在整篇評論文本中或整個語料集中出現(xiàn)的頻率,進行評價對象的置信度排序,最終抽取出評價對象。 (2)對連詞詞典、情感詞詞典、程度詞詞典及否定詞詞典進行了完善,用以識別評價短語及分析評價短語的情感傾向性。并通過評價對象與評價短語之間關(guān)系的8個特征,利用支持向量機來識別評價對象與評價短語的搭配關(guān)系,最終判斷整篇評論文本的情感傾向性。 (3)以中文產(chǎn)品評論文本的情感傾向為基礎(chǔ),利用目前流行的SSH框架、mysql數(shù)據(jù)庫及開源軟件包lucene,構(gòu)建了一個垂直搜索引擎,用戶可以方便、快捷的查詢自己感興趣的相關(guān)信息。 通過上述的研究所構(gòu)建的具有情感分類的垂直搜索引擎,使得商家和潛在客戶可以從浩如煙海的評論文章中快速而準(zhǔn)確的找到對自己有用的信息,具有一定的商業(yè)價值。提出的中文文本情感分類的研究方法,具有一定的學(xué)術(shù)價值。
[Abstract]:With the continuous development of Internet technology and the rising of e-commerce, BBSs, blogs, Weibo constantly emerge, the online interaction between merchants and buyers is becoming more and more frequent. More and more buyers post product reviews on the Internet after using the products, the number of comments is increasing, the comments themselves are more colloquial and unstructured. It is time-consuming and laborious for potential buyers to pick out the information they care about from a large number of product reviews when they make decisions on the supply and demand relationship in the market, and it is one-sided and lagging. So search engines play an important role in the Internet today. Powerful search engines like Baidu and Google are aimed at different fields and different kinds of general search engines. In a particular area of product review, however, appears to be inadequate. Therefore, it is necessary to research and develop a vertical search engine with emotion classification for product reviews. Based on the current research situation at home and abroad, this paper makes a further study on the identification of evaluation objects, the identification of evaluation phrases, the collocation identification between evaluation objects and evaluation phrases, and the emotional orientation of evaluation phrases in Chinese product review texts. The main work is as follows: (1) the candidate set of evaluation object is obtained by using part of speech sequence in the method of identifying evaluation object, and the concept and algorithm of integrity and stability of evaluation object are put forward to filter the noise of evaluation object. Using the cooccurrence rule of evaluation object and evaluation phrase and the frequency of the evaluation object appearing in the whole comment text or the whole corpus, the confidence degree of the evaluation object is sorted, and the evaluation object is extracted. (2) the conjunctive dictionary is selected. The dictionary of affective words, the dictionary of degree words and the dictionary of negative words are perfected to identify the evaluation phrases and analyze the affective tendency of the evaluation phrases. Through the eight features of the relationship between the evaluation object and the evaluation phrase, support vector machine is used to identify the collocation relationship between the evaluation object and the evaluation phrase. Finally, the emotional tendency of the whole review text is judged. (3) based on the emotional tendency of the Chinese product review text, a vertical search engine is constructed by using the popular SSH framework MySQL database and open source software package Lucene. Users can easily and quickly query their own interested information. A vertical search engine with emotion classification is constructed through the above research, which enables merchants and potential customers to quickly and accurately find useful information for themselves from a vast number of review articles, which has certain commercial value. The research method of emotion classification of Chinese text has certain academic value.
【學(xué)位授予單位】:湖南工業(yè)大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2012
【分類號】:TP391.3

【相似文獻】

相關(guān)期刊論文 前10條

1 顧鵬堯;;讓搜索引擎更好地服務(wù)于教育教學(xué)[J];科學(xué)24小時;2003年Z1期

2 陳新顏;垂直搜索引擎辨析[J];現(xiàn)代情報;2004年09期

3 胡文勝;;垂直搜索助號碼百事通與商務(wù)領(lǐng)航[J];每周電腦報;2006年32期

4 胡潔;丁寧;關(guān)靜;曹福年;張磊;;基于“PUBMED+PDF”的醫(yī)學(xué)垂直搜索引擎的實踐[J];信息系統(tǒng)工程;2009年05期

5 一林;;垂直搜索:前進路上的喜與憂[J];互聯(lián)網(wǎng)天地;2010年02期

6 牟思;;基于垂直搜索引擎的學(xué)校網(wǎng)站的研究與建設(shè)[J];中國教育技術(shù)裝備;2011年21期

7 田野;垂直搜索火熱為哪般[J];中國計算機用戶;2005年37期

8 胡文勝;;垂直搜索助號碼百事通與商務(wù)領(lǐng)航[J];每周電腦報;2006年31期

9 邊凱;;你會搜索嗎?[J];中國計算機用戶;2007年23期

10 宿建光;;指點通:移動垂直搜索的創(chuàng)新者[J];通信世界;2007年03期

相關(guān)會議論文 前10條

1 王上;于海;王鉦旋;;Deep Web垂直搜索引擎設(shè)計與實現(xiàn)[A];第26屆中國數(shù)據(jù)庫學(xué)術(shù)會議論文集(B輯)[C];2009年

2 林歡歡;王文杰;史忠植;;移動環(huán)境下垂直搜索引擎[A];第三屆全國信息檢索與內(nèi)容安全學(xué)術(shù)會議論文集[C];2007年

3 王旭;杜軍平;;質(zhì)檢總局互聯(lián)網(wǎng)輿情監(jiān)控系統(tǒng)中聚焦爬蟲的研究[A];中國電子學(xué)會第十七屆信息論學(xué)術(shù)年會論文集[C];2010年

4 趙[

本文編號:2166049


資料下載
論文發(fā)表

本文鏈接:http://www.sikaile.net/kejilunwen/sousuoyinqinglunwen/2166049.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶c251c***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com