天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁 > 科技論文 > 軟件論文 >

基于壓縮位圖索引的RDF數(shù)據(jù)存儲(chǔ)與管理

發(fā)布時(shí)間:2018-06-17 23:27

  本文選題:RDF + 數(shù)據(jù)存儲(chǔ); 參考:《北京交通大學(xué)》2017年碩士論文


【摘要】:隨著資源描述框架(Resource Description Framework,RDF)在各個(gè)領(lǐng)域的廣泛應(yīng)用,如何對(duì)海量RDF數(shù)據(jù)的存儲(chǔ)與管理成為近年來的研究熱點(diǎn),F(xiàn)有的RDF數(shù)據(jù)管理系統(tǒng)大都采用傳統(tǒng)的關(guān)系型數(shù)據(jù)庫來存儲(chǔ)數(shù)據(jù),這種方式已難以高效地管理海量數(shù)據(jù)。如何設(shè)計(jì)一種高性能、可擴(kuò)展為分布式的RDF數(shù)據(jù)存儲(chǔ)和管理系統(tǒng)具有重要意義。本文設(shè)計(jì)了一種基于位圖索引的RDF數(shù)據(jù)存儲(chǔ)方案,并實(shí)現(xiàn)了基于該存儲(chǔ)方案的RDF管理系統(tǒng),最后通過系統(tǒng)測試驗(yàn)證了該方案的可行性與有效性。本文研究工作主要包括以下幾個(gè)方面。(1)總結(jié)了現(xiàn)有的RDF數(shù)據(jù)存儲(chǔ)方案。分析了當(dāng)前主流的數(shù)據(jù)存儲(chǔ)技術(shù)及RDF數(shù)據(jù)存儲(chǔ)模型的優(yōu)缺點(diǎn),并對(duì)其進(jìn)行了簡單的分析與總結(jié)。(2)提出了一種基于位圖索引的高擴(kuò)展性底層存儲(chǔ)方案。該方案在持久層將RDF數(shù)據(jù)文件分塊進(jìn)行順序存儲(chǔ),實(shí)現(xiàn)了系統(tǒng)的可擴(kuò)展性;同時(shí)為RDF關(guān)鍵詞構(gòu)建基于壓縮位圖的查詢索引,降低了運(yùn)行時(shí)內(nèi)存資源消耗。(3)設(shè)計(jì)了基于本方案的數(shù)據(jù)查詢算法。該算法能夠充分利用位圖索引邏輯計(jì)算的性能優(yōu)勢,保證了高效的查詢效率。(4)實(shí)現(xiàn)了基于本方案的RDF數(shù)據(jù)存儲(chǔ)和查詢系統(tǒng)fishdb,并采用測試數(shù)據(jù)集在單機(jī)偽分布式系統(tǒng)環(huán)境下對(duì)該系統(tǒng)進(jìn)行了性能測試。與開源RDF管理系統(tǒng)Google Cayley的相比,fishdb能夠以較小的內(nèi)存資源消耗為代價(jià)換取較高的查詢性能提升,驗(yàn)證了本方案的可行性和有效性。
[Abstract]:With the wide application of Resource description Framework (RDF) in various fields, how to store and manage massive RDF data has become a hot topic in recent years. Most of the existing RDF data management systems use traditional relational databases to store data, which is difficult to manage mass data efficiently. How to design a high performance and extensible RDF data storage and management system is of great significance. In this paper, a RDF data storage scheme based on bitmap index is designed, and the RDF management system based on this storage scheme is implemented. Finally, the feasibility and effectiveness of the scheme are verified by system test. The main work of this paper includes the following aspects: 1) summarize the existing RDF data storage scheme. This paper analyzes the advantages and disadvantages of the current mainstream data storage technology and RDF data storage model, and gives a simple analysis and summary of the RDF data storage model. In the persistence layer, the RDF data file is stored sequentially, and the system scalability is realized. At the same time, the query index based on compressed bitmap is constructed for the RDF keyword. The data query algorithm based on this scheme is designed. This algorithm can make full use of the performance advantage of bitmap index logic computing. The RDF data storage and query system fishdbbased on this scheme is implemented, and the performance of the system is tested by using the test data set in the single machine pseudo-distributed system environment. Compared with the open source RDF management system Google Cayley, fishdb can improve the query performance at the cost of less memory resource consumption, which verifies the feasibility and effectiveness of this scheme.
【學(xué)位授予單位】:北京交通大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2017
【分類號(hào)】:TP333;TP315

【參考文獻(xiàn)】

相關(guān)碩士學(xué)位論文 前2條

1 朱敏;基于HBase的RDF數(shù)據(jù)存儲(chǔ)與查詢研究[D];南京大學(xué);2013年

2 金強(qiáng);基于HBase的RDF存儲(chǔ)系統(tǒng)的研究與設(shè)計(jì)[D];浙江大學(xué);2011年

,

本文編號(hào):2032927

資料下載
論文發(fā)表

本文鏈接:http://www.sikaile.net/kejilunwen/ruanjiangongchenglunwen/2032927.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶5a644***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com