微博輿情分析系統(tǒng)信息處理模塊的設(shè)計(jì)與實(shí)現(xiàn)
發(fā)布時(shí)間:2019-01-07 17:54
【摘要】:微博作為迅速崛起的新興社交網(wǎng)絡(luò),因?yàn)樾畔⒌膩碓戳驾积R,以及民眾的盲從性,利用微博傳播謠言,擾亂社會(huì)治安的事件時(shí)有發(fā)生。如今,平均每小時(shí)的微博發(fā)布量高達(dá)數(shù)百萬條,僅僅通過人工手段來對(duì)如此多條目進(jìn)行監(jiān)控和分析幾乎是不可能完成的任務(wù),因此依靠現(xiàn)代文本自動(dòng)分析技術(shù)來開發(fā)一款微博輿情分析預(yù)警系統(tǒng)迫在眉睫。 本論文的工作是為一款微博輿情信息分析系統(tǒng)設(shè)計(jì)和開發(fā)其中的信息處理模塊。論文首先介紹了微博輿情分析系統(tǒng)的整體框架設(shè)計(jì),概括描述了該系統(tǒng)底層的信息采集、索引和分詞模塊和其所涉及的相關(guān)開源軟件和技術(shù)。本系統(tǒng)的分析手段是通過微博關(guān)鍵詞來進(jìn)行的,對(duì)其所使用的潛在語義分析(LSA)也做出了相應(yīng)的介紹。 論文的后幾部分主要介紹信息處理模塊,給出其整體設(shè)計(jì)架構(gòu),以及實(shí)時(shí)統(tǒng)計(jì)、自定義統(tǒng)計(jì)、同類詞歸并、微博影響力分析等功能點(diǎn)的具體設(shè)計(jì)和編碼實(shí)現(xiàn)。系統(tǒng)完成的主要功能包括: 1)新浪微博的實(shí)時(shí)統(tǒng)計(jì)和預(yù)警; 2)各類復(fù)雜且精確地自定義統(tǒng)計(jì) 3)同類詞歸并; 4)微博分析以及用戶分析; 5)提供分析功能API; 該系統(tǒng)在今后還將進(jìn)一步發(fā)揮重要作用,為大政工平臺(tái)的應(yīng)用系統(tǒng)提供信息支持和數(shù)據(jù)共享。
[Abstract]:Weibo as a rapidly rising social network, because of the mixed sources of information, as well as the blindness of the public, the use of Weibo to spread rumors, disturbing social order incidents occur from time to time. Today, Weibo publishes millions of posts per hour on average. It is almost impossible to monitor and analyze so many items by manual means alone. Therefore, it is urgent to develop an early warning system for Weibo's public opinion analysis based on modern text automatic analysis technology. The work of this paper is to design and develop the information processing module of Weibo Public opinion Information Analysis system. Firstly, the paper introduces the whole frame design of Weibo public opinion analysis system, and describes the information collection, index and word segmentation module of the system and the related open source software and technology. The analysis method of this system is carried out by Weibo keyword, and the potential semantic analysis (LSA) used by Weibo is also introduced. The last part of the paper mainly introduces the information processing module, and gives its overall design framework, as well as the real-time statistics, custom statistics, congener word merging, Weibo influence analysis and other functional points of the specific design and coding implementation. The main functions of the system include: (1) real-time statistics and early warning of Sina Weibo; (2) complicated and accurate statistics (3) merging of similar words; (4) Weibo analysis and user analysis; 5) providing analysis function API; this system will further play an important role in the future, providing information support and data sharing for the application system of large political platform.
【學(xué)位授予單位】:東華大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2014
【分類號(hào)】:TP311.52;TP393.092
本文編號(hào):2403958
[Abstract]:Weibo as a rapidly rising social network, because of the mixed sources of information, as well as the blindness of the public, the use of Weibo to spread rumors, disturbing social order incidents occur from time to time. Today, Weibo publishes millions of posts per hour on average. It is almost impossible to monitor and analyze so many items by manual means alone. Therefore, it is urgent to develop an early warning system for Weibo's public opinion analysis based on modern text automatic analysis technology. The work of this paper is to design and develop the information processing module of Weibo Public opinion Information Analysis system. Firstly, the paper introduces the whole frame design of Weibo public opinion analysis system, and describes the information collection, index and word segmentation module of the system and the related open source software and technology. The analysis method of this system is carried out by Weibo keyword, and the potential semantic analysis (LSA) used by Weibo is also introduced. The last part of the paper mainly introduces the information processing module, and gives its overall design framework, as well as the real-time statistics, custom statistics, congener word merging, Weibo influence analysis and other functional points of the specific design and coding implementation. The main functions of the system include: (1) real-time statistics and early warning of Sina Weibo; (2) complicated and accurate statistics (3) merging of similar words; (4) Weibo analysis and user analysis; 5) providing analysis function API; this system will further play an important role in the future, providing information support and data sharing for the application system of large political platform.
【學(xué)位授予單位】:東華大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2014
【分類號(hào)】:TP311.52;TP393.092
【參考文獻(xiàn)】
相關(guān)期刊論文 前9條
1 王娟;;網(wǎng)絡(luò)輿情監(jiān)控分析系統(tǒng)構(gòu)建[J];長春理工大學(xué)學(xué)報(bào)(高教版);2007年04期
2 李林容;黎薇;;微博的文化特性及傳播價(jià)值[J];當(dāng)代傳播;2011年01期
3 李紅雷;巴明杰;;當(dāng)前涉檢網(wǎng)絡(luò)輿情應(yīng)對(duì)措施探析[J];中國檢察官;2010年17期
4 朱顥東;鐘勇;;結(jié)合優(yōu)化的文檔頻和LSA的特征選擇方法[J];計(jì)算機(jī)工程與應(yīng)用;2009年34期
5 姚清耘;劉功申;李翔;;基于向量空間模型的文本聚類算法[J];計(jì)算機(jī)工程;2008年18期
6 劉克強(qiáng);;2009共享版ICTCLAS的分析與使用[J];科教文匯(上旬刊);2009年08期
7 歐陽宏基;葛萌;趙薔;;基于JDBC與設(shè)計(jì)模式的數(shù)據(jù)庫連接池實(shí)現(xiàn)方法[J];計(jì)算機(jī)技術(shù)與發(fā)展;2011年01期
8 楊麗華;戴齊;郭艷軍;;KNN文本分類算法研究[J];微計(jì)算機(jī)信息;2006年21期
9 程春蕊;劉萬軍;;高內(nèi)聚低耦合軟件架構(gòu)的構(gòu)建[J];計(jì)算機(jī)系統(tǒng)應(yīng)用;2009年07期
,本文編號(hào):2403958
本文鏈接:http://www.sikaile.net/guanlilunwen/ydhl/2403958.html
最近更新
教材專著