基于大象流兩級識別的SDN負載均衡研究
發(fā)布時間:2019-06-20 09:18
【摘要】:隨著云計算、大數(shù)據(jù)等業(yè)務(wù)的興起,數(shù)據(jù)中心網(wǎng)絡(luò)規(guī)模及用戶數(shù)量快速增長,網(wǎng)絡(luò)流量呈爆發(fā)式增加,以TCP/IP架構(gòu)為核心的傳統(tǒng)數(shù)據(jù)中心網(wǎng)絡(luò)難以高效管控海量的數(shù)據(jù)流量,網(wǎng)絡(luò)阻塞問題加劇,網(wǎng)絡(luò)資源浪費嚴重,如何均衡數(shù)據(jù)中心網(wǎng)絡(luò)負載以提高數(shù)據(jù)傳輸效率成為當前亟待解決的問題。近幾年,以控制與轉(zhuǎn)發(fā)分離為特征的軟件定義網(wǎng)絡(luò)(Software Defined Networking,SDN)技術(shù)憑借其強大的流量管控能力得到了IT界的廣泛關(guān)注。邏輯集中的SDN控制器基于全局網(wǎng)絡(luò)視圖可實現(xiàn)高效、細粒度的網(wǎng)絡(luò)流量調(diào)度,這使其在網(wǎng)絡(luò)流量管控方面較以TCP/IP架構(gòu)為核心的傳統(tǒng)網(wǎng)絡(luò)具有較大的優(yōu)勢。本文從大象流識別的角度,對SDN數(shù)據(jù)中心網(wǎng)絡(luò)負載不均衡的問題進行研究。首先,針對現(xiàn)有大象流識別方法識別開銷大的問題,提出一種大象流兩級識別方法。依據(jù)大象流數(shù)據(jù)量大的特點,該方法在識別第一階段提出基于TCP發(fā)送隊列的可疑大象流識別算法(Suspicious Elephant Detection based on Write Queue,SED-WQ),通過監(jiān)測主機端發(fā)送隊列緩存中的數(shù)據(jù)量特征以識別可疑大象流,剔除數(shù)據(jù)量較小的老鼠流以降低第二階段控制器的處理開銷;依據(jù)大象流持續(xù)時間長的特點,該方法在識別第二階段提出基于流持續(xù)時間的真實大象流識別算法(Real Elephant Detection based on Duration Time,RED-DT),通過監(jiān)測網(wǎng)絡(luò)端可疑大象流的持續(xù)時間特征以識別真實大象流,剔除不滿足條件的大象流以提高大象流識別準確性。其次,針對SDN數(shù)據(jù)中心鏈路負載不均衡的問題,提出一種基于大象流兩級識別的SDN網(wǎng)絡(luò)負載均衡策略(Elephant Load Balancing,ELB)。針對網(wǎng)絡(luò)中的大象流,采用基于均勻分布的大象流調(diào)度算法,利用SDN控制器動態(tài)精細地規(guī)劃大象流的最佳轉(zhuǎn)發(fā)路徑以保證ELB策略的管控效率;針對網(wǎng)絡(luò)中的老鼠流,采用基于隨機選路的老鼠流調(diào)度算法,利用SDN控制器靜態(tài)粗放地選取老鼠流的最佳轉(zhuǎn)發(fā)路徑以降低ELB策略的控制器處理開銷。最后,利用Mininet軟件對所提出的大象流兩級識別方法和負載均衡策略ELB進行仿真分析。實驗分析表明,在保證大象流識別的高準確性前提下,大象流兩級識別方法較基于采樣的大象流識別方法可以降低約85%的控制器識別開銷;在保證流量識別開銷較低的前提下,ELB策略較傳統(tǒng)基于等價路由(Equal Cost Multipath Routing,ECMP)的網(wǎng)絡(luò)負載均衡策略降低約10%的平均傳輸時延,提升約5%的鏈路平均利用率。
[Abstract]:With the rise of cloud computing, large data and other services, the network size and the number of users of the data center are rapidly increasing, the network traffic is increasing, and the traditional data center network with the TCP/ IP architecture as the core is difficult to control the mass data traffic efficiently and the network blocking problem is exacerbated, The network resource waste is serious, how to balance the data center network load to improve the data transmission efficiency becomes the current problem to be solved. In recent years, the Software Defined Networking (SDN) technology, which is characterized by the control and forwarding separation, has been widely concerned by the IT community with its powerful flow control ability. The SDN controller in the logic set can realize the high-efficiency and fine-grained network flow scheduling based on the global network view, which has the advantage that the traditional network with the TCP/ IP architecture as the core in the aspect of network flow control. In this paper, the problem of unbalanced network load of SDN data center is studied from the perspective of elephant flow identification. Firstly, aiming at the problem of large identification cost of the existing elephant flow identification method, an elephant flow two-level identification method is proposed. according to the characteristics of large amount of data of the elephant flow, the method provides a suspicious elephant flow identification algorithm (SED-WQ) based on the TCP transmission queue in the first stage, and the data volume characteristic in the queue cache is transmitted through the monitoring host end to identify the suspicious elephant flow, The method comprises the following steps of: removing a mouse stream with a small data volume to reduce the processing cost of the second-stage controller; and according to the characteristic of long duration of the elephant flow, the method provides a real-Elephant Detection based on Duration (RED-DT) based on the flow duration in the second stage. And monitoring the duration characteristic of the suspicious elephant flow at the network end to identify the real elephant flow, and removing the elephant flow which does not meet the condition to improve the identification accuracy of the elephant flow. Secondly, aiming at the problem of unbalanced load of the SDN data center, an SDN network load balancing strategy (ELB) based on two-level identification of the elephant flow is proposed. aiming at the elephant flow in the network, the optimal forwarding path of the elephant flow is dynamically and finely divided by the SDN controller to ensure the control efficiency of the ELB strategy by using an elephant flow scheduling algorithm which is evenly distributed, the rat flow scheduling algorithm based on the random routing is adopted for the rat flow in the network, The optimal forwarding path of the rat stream is selected statically and loosely by the SDN controller to reduce the controller processing overhead of the ELB policy. Finally, using the Minet software, the proposed two-level identification method and load balance strategy ELB are simulated and analyzed. The experimental analysis shows that at the premise of ensuring the high accuracy of the identification of the elephant flow, the method of the two-level identification method of the elephant flow can reduce the controller identification cost of about 85% based on the sampling of the elephant flow identification method, The ELB strategy reduces the average transmission time delay of about 10% compared with the traditional equivalent-based routing (ECMP) network load balancing strategy, and increases the link average utilization rate of about 5%.
【學位授予單位】:杭州電子科技大學
【學位級別】:碩士
【學位授予年份】:2017
【分類號】:TP393.02
[Abstract]:With the rise of cloud computing, large data and other services, the network size and the number of users of the data center are rapidly increasing, the network traffic is increasing, and the traditional data center network with the TCP/ IP architecture as the core is difficult to control the mass data traffic efficiently and the network blocking problem is exacerbated, The network resource waste is serious, how to balance the data center network load to improve the data transmission efficiency becomes the current problem to be solved. In recent years, the Software Defined Networking (SDN) technology, which is characterized by the control and forwarding separation, has been widely concerned by the IT community with its powerful flow control ability. The SDN controller in the logic set can realize the high-efficiency and fine-grained network flow scheduling based on the global network view, which has the advantage that the traditional network with the TCP/ IP architecture as the core in the aspect of network flow control. In this paper, the problem of unbalanced network load of SDN data center is studied from the perspective of elephant flow identification. Firstly, aiming at the problem of large identification cost of the existing elephant flow identification method, an elephant flow two-level identification method is proposed. according to the characteristics of large amount of data of the elephant flow, the method provides a suspicious elephant flow identification algorithm (SED-WQ) based on the TCP transmission queue in the first stage, and the data volume characteristic in the queue cache is transmitted through the monitoring host end to identify the suspicious elephant flow, The method comprises the following steps of: removing a mouse stream with a small data volume to reduce the processing cost of the second-stage controller; and according to the characteristic of long duration of the elephant flow, the method provides a real-Elephant Detection based on Duration (RED-DT) based on the flow duration in the second stage. And monitoring the duration characteristic of the suspicious elephant flow at the network end to identify the real elephant flow, and removing the elephant flow which does not meet the condition to improve the identification accuracy of the elephant flow. Secondly, aiming at the problem of unbalanced load of the SDN data center, an SDN network load balancing strategy (ELB) based on two-level identification of the elephant flow is proposed. aiming at the elephant flow in the network, the optimal forwarding path of the elephant flow is dynamically and finely divided by the SDN controller to ensure the control efficiency of the ELB strategy by using an elephant flow scheduling algorithm which is evenly distributed, the rat flow scheduling algorithm based on the random routing is adopted for the rat flow in the network, The optimal forwarding path of the rat stream is selected statically and loosely by the SDN controller to reduce the controller processing overhead of the ELB policy. Finally, using the Minet software, the proposed two-level identification method and load balance strategy ELB are simulated and analyzed. The experimental analysis shows that at the premise of ensuring the high accuracy of the identification of the elephant flow, the method of the two-level identification method of the elephant flow can reduce the controller identification cost of about 85% based on the sampling of the elephant flow identification method, The ELB strategy reduces the average transmission time delay of about 10% compared with the traditional equivalent-based routing (ECMP) network load balancing strategy, and increases the link average utilization rate of about 5%.
【學位授予單位】:杭州電子科技大學
【學位級別】:碩士
【學位授予年份】:2017
【分類號】:TP393.02
【參考文獻】
相關(guān)期刊論文 前10條
1 白磊;田立勤;陳超;;基于流抽樣和LRU的高速網(wǎng)絡(luò)大流檢測算法[J];計算機應(yīng)用與軟件;2016年04期
2 趙輝;丁鳴;程青松;盧凌;孔晨晟;;SDN與NFV技術(shù)在云數(shù)據(jù)中心的規(guī)模應(yīng)用[J];電信科學;2016年01期
3 周桐慶;蔡志平;夏竟;徐明;;基于軟件定義網(wǎng)絡(luò)的流量工程[J];軟件學報;2016年02期
4 劉曉陸;劉淵;王春龍;;一種基于FEFS與CBF的網(wǎng)絡(luò)大流識別算法[J];計算機工程;2015年09期
5 張朝昆;崔勇;唐,
本文編號:2503106
本文鏈接:http://www.sikaile.net/guanlilunwen/ydhl/2503106.html
最近更新
教材專著