Mining Hierarchical Relations in Chinese Text Based on Hybrid Cosine Similarity
[Abstract]: Hierarchical relations are among the most important relations between concepts in Chinese text, and judging them correctly is fundamental to automatic domain ontology construction, text data mining, and other information-processing tasks. First, the candidate hierarchical relations among concepts are enumerated, and a kernel-function classifier is constructed that combines the semantic cosine similarity of part-of-speech sequences with the cosine similarity of relation words; the problem of mining hierarchical relations between concepts is thereby transformed into a classification problem. Next, the classifier is trained on text data annotated by templates. Finally, pre-processed Chinese text is given as input and each candidate hierarchical relation is judged by the kernel-function classifier. With Chinese text from the air force weapons and equipment domain as test data, the experimental results show that the method is simple and reliable and achieves good accuracy and recall.
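[Author affiliation]: School of Computer Science, Northwestern Polytechnical University
[Funding]: National Ministry Fund, Intelligent Information Processing Supporting Technology Project (513150703); Natural Science Foundation of Shaanxi Province (2015JM6290)
[Classification number]: TP391.1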
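
The abstract describes a kernel that mixes two cosine similarities and a classifier built on it, but gives no formulas. The following is a minimal sketch of that idea, not the authors' implementation. It rests on several assumptions that do not come from the paper: part-of-speech sequences are compared as bags of tags, the two similarities are combined with a hypothetical weight `alpha`, and classification is a simple signed kernel vote over labelled examples. The names `cosine`, `hybrid_kernel`, `classify`, and the toy data are all illustrative.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two bags of items (Counters)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def hybrid_kernel(x, y, alpha=0.5):
    """Weighted mix of POS-tag and relation-word cosine similarities.

    alpha is an assumed mixing weight; the paper's actual combination
    rule is not stated in the abstract.
    """
    pos_sim = cosine(Counter(x["pos"]), Counter(y["pos"]))
    rel_sim = cosine(Counter(x["rel"]), Counter(y["rel"]))
    return alpha * pos_sim + (1 - alpha) * rel_sim


def classify(candidate, labelled, alpha=0.5):
    """Return True if the candidate looks hierarchical.

    Assumed decision rule: sum the kernel values against every labelled
    example, signed by its label (+1 hierarchical, -1 not), and test
    whether the total is positive.
    """
    score = sum(label * hybrid_kernel(candidate, example, alpha)
                for example, label in labelled)
    return score > 0


# Toy template-annotated examples (hypothetical, for illustration only).
TRAINING = [
    ({"pos": ["n", "v", "n"], "rel": ["包括"]}, +1),  # "A includes B"
    ({"pos": ["n", "v", "n"], "rel": ["属于"]}, +1),  # "B belongs to A"
    ({"pos": ["n", "c", "n"], "rel": ["和"]}, -1),    # "A and B"
]

if __name__ == "__main__":
    candidate = {"pos": ["n", "v", "n"], "rel": ["包括"]}
    print(classify(candidate, TRAINING))
```

A convex combination of the two similarities keeps the kernel value in [0, 1] for non-negative count vectors, which makes the weight `alpha` easy to interpret; a trained kernel classifier such as an SVM could replace the signed vote without changing `hybrid_kernel`.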