并行文件存儲系統(tǒng)關(guān)鍵技術(shù)的研究
[Abstract]:With the development of the Internet and the constant improvement of the electronic level of information, the data also present an explosive growth trend. Although the capacity and performance of the traditional single-machine storage technology have been greatly developed in the past few decades, the single-machine storage technology is still unable to cope with the huge amount of data. Therefore, how to build a high performance, large capacity, high reliability and high scalability data storage system has become an important problem. Under this background, distributed parallel file storage system came into being. Distributed parallel file storage system is a hot research topic in computer academic and business circles at present, and many research institutions and enterprises have also made a lot of achievements. However, most of the products introduced by these research institutions and enterprises are designed according to their own business requirements, which have considerable limitations and shortcomings, and there is still a lot of room for research and improvement. The main work of this paper is as follows: (1) the main distributed file storage systems, such as GFS,Global File System, are compared and analyzed, and their advantages and disadvantages are summarized. A new distributed file system architecture and flat file organization are proposed. (2) an index structure based on Hash table and an extension mechanism based on consistent Hash algorithm are designed. The simulation results show that the consistent Hash algorithm is more scalable than the traditional Hash algorithm. (3) by analyzing the implementation principle and details of the Linux file system, this paper reveals its shortcomings in mass file storage. On this basis, a storage node data storage scheme based on merge mechanism is designed and described in detail. Finally, the experimental results show that the proposed scheme has better read and write performance than the direct file system-based storage. (4) the two causes of the system load imbalance are analyzed: the problem of uneven access to the client and the hot data problem. For the former reason, this paper proposes a load balancing strategy based on the combination of server load model and node static performance to balance the access load of the client. In this paper, a replica quantity management strategy based on data heat statistics is proposed, which can dynamically increase the number of replicas of thermal data and achieve the purpose of distributing the load to multiple nodes.
【學位授予單位】:華南理工大學
【學位級別】:碩士
【學位授予年份】:2012
【分類號】:TP333
【參考文獻】
相關(guān)期刊論文 前6條
1 熊勁,范志華,馬捷,唐榮鋒,李暉,孟丹;DCFS2的元數(shù)據(jù)一致性策略[J];計算機研究與發(fā)展;2005年06期
2 吳偉;謝長生;韓德志;黃建忠;;海量存儲系統(tǒng)中高可擴展性元數(shù)據(jù)服務(wù)器集群設(shè)計[J];計算機科學;2007年07期
3 龐麗萍,何飛躍,徐婕,岳建輝;PVFS寄生式元數(shù)據(jù)管理的設(shè)計與實現(xiàn)[J];計算機工程;2004年20期
4 楊德志;許魯;張建剛;;藍鯨分布式文件系統(tǒng)元數(shù)據(jù)服務(wù)[J];計算機工程;2008年07期
5 趙旺;曹強;;分布式并行文件系統(tǒng)中鎖管理的研究[J];計算機應(yīng)用研究;2007年09期
6 張曉春;劉引;;淺談分布式文件系統(tǒng)關(guān)鍵技術(shù)[J];科學咨詢(決策管理);2009年04期
相關(guān)博士學位論文 前2條
1 王建勇;可擴展的單一映象文件系統(tǒng)[D];中國科學院研究生院(計算技術(shù)研究所);1999年
2 吳思寧;機群文件系統(tǒng)服務(wù)器關(guān)鍵技術(shù)研究[D];中國科學院研究生院(計算技術(shù)研究所);2004年
相關(guān)碩士學位論文 前1條
1 田穎;分布式文件系統(tǒng)中的負載平衡技術(shù)研究[D];中國科學院研究生院(計算技術(shù)研究所);2003年
本文編號:2293800
本文鏈接:http://www.sikaile.net/kejilunwen/jisuanjikexuelunwen/2293800.html