
A Multi-Dimensional Joint Partitioning Strategy for Multi-Level Caches on Multicore Processors

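Published: 2018-03-02 09:14

Keywords: cache partitioning; IPC; fairness; prefetching; multicore cache block predictor. Source: Jilin University, doctoral dissertation, 2013. Document type: degree thesis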


[Abstract]: With the development of high-performance processor technology, the memory wall has become one of the main factors limiting processor system performance. Processors typically run about two orders of magnitude faster than memory can be accessed. Modern multicore processors widely adopt a large-capacity shared last-level cache (LLC) to narrow this gap. However, traditional management policies designed for small private caches are not suitable for a large shared LLC: they can increase the number of cache misses and trigger many costly off-chip memory accesses. The main approaches to these problems include partitioning the shared cache, improving the cache replacement policy, and adding cache block prefetchers.

This thesis proposes two strategies for a three-level cache hierarchy: a joint partitioning strategy and a predictor partitioning strategy. The joint partitioning strategy consists of, first, a hardware design that shares and consolidates the private L2 cache resources and, second, a partitioning algorithm for the shared LLC that takes both miss rate and fairness into account. The predictor partitioning strategy consists of a multicore cache block predictor designed for the private L1 and L2 caches, combined with the shared LLC partitioning algorithm. Experimental results show that the joint partitioning strategy improves throughput by 17.56% on average over the traditional LRU replacement policy and by 15.69% on average over a fairness-based partitioning algorithm. Its fairness is on average 3.8 times that of traditional LRU and 3.9 times that of the miss-rate-based UCP algorithm. The predictor partitioning strategy achieves still larger gains in throughput and fairness, and its prediction accuracy and coverage in the L1 and L2 caches also improve significantly.
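The abstract does not spell out the partitioning algorithm itself, so the following is only a minimal sketch of the general idea of dividing shared-cache ways among cores while weighing both miss reduction and fairness. It is not the thesis's method. The function name `partition_ways`, the `miss_curves` input (per-core miss counts for each possible way allocation), the `fairness_weight` parameter, and the scoring formula are all assumptions made for illustration.

```python
from typing import List


def partition_ways(miss_curves: List[List[int]], total_ways: int,
                   fairness_weight: float = 0.5) -> List[int]:
    """Greedily split `total_ways` shared-cache ways among cores.

    miss_curves[c][w] is a hypothetical profiled miss count for core c
    when it owns w ways; each curve must have total_ways + 1 entries.
    """
    n = len(miss_curves)
    if total_ways < n:
        raise ValueError("need at least one way per core")
    alloc = [1] * n  # every core starts with one way
    for _ in range(total_ways - n):
        best_core, best_score = 0, float("-inf")
        for c in range(n):
            curve = miss_curves[c]
            w = alloc[c]
            scale = max(curve[0], 1)
            # Utility term: share of misses removed by granting one more way.
            gain = (curve[w] - curve[w + 1]) / scale
            # Fairness stand-in: how far the core is from its best case,
            # i.e. its miss count if it owned every way.
            lag = (curve[w] - curve[total_ways]) / scale
            score = (1 - fairness_weight) * gain + fairness_weight * lag
            if score > best_score:
                best_core, best_score = c, score
        alloc[best_core] += 1
    return alloc


if __name__ == "__main__":
    # Made-up curves: core 0 benefits from extra ways, core 1 streams.
    curves = [[100, 60, 30, 10, 5], [100, 98, 97, 96, 96]]
    print(partition_ways(curves, total_ways=4))
```

Setting `fairness_weight` to 0 reduces the sketch to a purely utility-driven greedy allocation, which is the simplified intuition behind miss-rate-based schemes such as UCP. Larger values favor cores that are still far from their best-case miss count.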
[Degree-granting institution]: Jilin University
[Degree level]: Doctoral
[Year degree conferred]: 2013
[Classification number]: TP332



