運(yùn)行時(shí)緩存缺失率曲線生成與共享緩存劃分
發(fā)布時(shí)間:2018-04-05 18:44
本文選題:共享緩存 切入點(diǎn):緩存缺失率曲線 出處:《華中科技大學(xué)》2012年碩士論文
【摘要】:共享緩存結(jié)構(gòu)加速了核與核之間的通訊速度,在多核處理器中有著重要作用。然而,多個(gè)核競爭使用共享緩存,互相污染對方的緩存數(shù)據(jù),降低了系統(tǒng)的整體性能。為了解決這個(gè)問題,研究者提出了基于緩存缺失率曲線劃分緩存的方法劃分共享緩存,從而消除共享緩存競爭。但是,運(yùn)行時(shí)生成缺失率曲線需要特殊的硬件支持,而用純軟件的方法在運(yùn)行時(shí)生成缺失率曲線開銷較高。 針對上述問題,提出了一種純軟件、低開銷和高準(zhǔn)確度的緩存缺失率曲線生成方法,基于生成的缺失率曲線劃分緩存。該方法根據(jù)程序訪問共享緩存的行為分形特點(diǎn),基于分形模型在運(yùn)行時(shí)預(yù)測程序的緩存缺失率曲線,降低了曲線生成的開銷;利用現(xiàn)代處理器上普遍存在的硬件計(jì)數(shù)器單元記錄程序的運(yùn)行時(shí)信息,極大地減少了記錄運(yùn)行時(shí)信息的開銷,并且不需要修改應(yīng)用程序,也不需添加額外的硬件;利用運(yùn)行時(shí)信息計(jì)算預(yù)測模型的參數(shù),,將參數(shù)代入分形預(yù)測模型計(jì)算程序的緩存缺失率曲線,該模型簡單,計(jì)算量小;基于生成的缺失率曲線有效地指導(dǎo)操作系統(tǒng)制定緩存劃分策略,在操作系統(tǒng)層實(shí)現(xiàn)了共享緩存劃分,提升了系統(tǒng)的整體性能。 為了驗(yàn)證該預(yù)測模型的準(zhǔn)確度與該預(yù)測方法的開銷,選用了SPEC CPU2006基準(zhǔn)測試集中的26個(gè)程序進(jìn)行了測試。預(yù)測的缺失率曲線與真實(shí)的缺失率曲線之間的平均誤差為3.13%。使用該預(yù)測方法生成缺失率曲線所需的時(shí)間為420ms;谏傻娜笔是劃分共享緩存后,系統(tǒng)的平均IPC最高提升了20%。
[Abstract]:Shared buffer architecture speeds up the communication between cores and plays an important role in multi-core processors.However, multiple core competition uses shared cache, which pollutes each other's cache data and reduces the overall performance of the system.In order to solve this problem, the researchers proposed a method to partition the shared cache based on the cache missing rate curve, so as to eliminate the competition of the shared cache.However, it needs special hardware support to generate the missing rate curve at run time, and it is expensive to generate the missing rate curve at run time by using pure software.In order to solve the above problems, a pure software, low overhead and high accuracy method for generating cache loss rate curves is proposed, which is based on the generated loss rate curves to partition the cache.According to the behavior fractal characteristic of program accessing shared cache, this method predicts the missing rate curve of program at run time based on fractal model, and reduces the cost of curve generation.The runtime information of the program is recorded by using the common hardware counter unit on the modern processor, which greatly reduces the overhead of recording the run-time information, and does not need to modify the application program or add additional hardware.The parameters of the prediction model are calculated by using the run-time information, and the parameters are substituted into the cache missing rate curve of the calculation program of the fractal prediction model. The model is simple and the calculation amount is small.Based on the generated loss rate curve, the cache partition strategy is effectively directed to the operating system, and the shared cache partition is implemented in the operating system layer, which improves the overall performance of the system.In order to verify the accuracy of the prediction model and the cost of the prediction method, 26 programs in the SPEC CPU2006 benchmark set were selected for testing.The average error between the predicted loss rate curve and the true loss rate curve is 3.13.The time required to generate the defect rate curve using this prediction method is 420 Ms.After the shared cache is partitioned based on the generated loss rate curve, the average IPC of the system increases by 20%.
【學(xué)位授予單位】:華中科技大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2012
【分類號】:TP333
【參考文獻(xiàn)】
相關(guān)期刊論文 前3條
1 李劍慧;馬湘寧;朱傳琪;;動態(tài)二進(jìn)制翻譯與優(yōu)化技術(shù)研究[J];計(jì)算機(jī)研究與發(fā)展;2007年01期
2 官孝峰;梁阿磊;;動態(tài)二進(jìn)制翻譯器CrossBit的性能分析與評估[J];計(jì)算機(jī)工程與應(yīng)用;2008年27期
3 胡成臣,劉斌;基于RED算法的共享緩存管理策略[J];清華大學(xué)學(xué)報(bào)(自然科學(xué)版);2004年07期
相關(guān)碩士學(xué)位論文 前1條
1 袁征;基于頁著色技術(shù)的多核實(shí)時(shí)系統(tǒng)中Cache共享策略的研究[D];東北大學(xué);2009年
本文編號:1715991
本文鏈接:http://www.sikaile.net/kejilunwen/jisuanjikexuelunwen/1715991.html
最近更新
教材專著