基于Zabbix的云監(jiān)控系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)
本文選題:Zabbix + 云計(jì)算 ; 參考:《電子科技大學(xué)》2017年碩士論文
【摘要】:隨著云計(jì)算時(shí)代的快速到來(lái),現(xiàn)有的IT環(huán)境中存在著多種多樣的設(shè)備,且其系統(tǒng)的構(gòu)成也非常復(fù)雜,架構(gòu)通常由上層應(yīng)用、系統(tǒng)架構(gòu)、底層支持構(gòu)成,因此一款設(shè)備的健壯性、可靠性以及出現(xiàn)問(wèn)題的及時(shí)響應(yīng)都變得越發(fā)的重要。在設(shè)計(jì)與維護(hù)過(guò)程中,為了及時(shí)掌握整個(gè)系統(tǒng)的基礎(chǔ)環(huán)境、業(yè)務(wù)應(yīng)用流程、系統(tǒng)的可用性、通過(guò)獲得的監(jiān)控?cái)?shù)據(jù)分析集群趨勢(shì),需要獲取其中各個(gè)設(shè)備的運(yùn)行狀態(tài),這一切都是離不開監(jiān)控系統(tǒng)。目前市場(chǎng)上有較多優(yōu)秀的開源系統(tǒng),但是綜合發(fā)現(xiàn)僅僅使用監(jiān)控系統(tǒng)而不進(jìn)行架構(gòu)的重新設(shè)計(jì)以及添加相應(yīng)所需功能,則無(wú)法跟上時(shí)代的潮流。監(jiān)控系統(tǒng)的目的是監(jiān)控集群里環(huán)境是否出現(xiàn)異常,但是目前市場(chǎng)上的開源產(chǎn)品沒有保證其具有高可靠性以及監(jiān)控系統(tǒng)本身出現(xiàn)問(wèn)題的自動(dòng)化處理。同時(shí)對(duì)所產(chǎn)生的日志沒有進(jìn)行很好的分析處理,大多數(shù)僅僅是將所監(jiān)控?cái)?shù)據(jù)從數(shù)據(jù)庫(kù)中拿出顯示在web頁(yè)面上。如何設(shè)計(jì)出一款具有高可靠性、架構(gòu)設(shè)計(jì)合理、可對(duì)監(jiān)控日志進(jìn)行分析的監(jiān)控系統(tǒng)是本文研究的主要課題。本論文以基于Zabbix的云監(jiān)控系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)為主要內(nèi)容,設(shè)計(jì)出高可靠性、結(jié)合Splunk日志分析處理的監(jiān)控系統(tǒng)框架并實(shí)現(xiàn)。該監(jiān)控系統(tǒng)的框架結(jié)合目前市場(chǎng)上三大主流開源監(jiān)控產(chǎn)品(Ganglia、host-sflow、Nagios)的優(yōu)點(diǎn),以及吸取仿生物神經(jīng)系統(tǒng)(BANS)的分層模式優(yōu)點(diǎn),仿生形成三層兩域,三層自上而下為:服務(wù)端(類中樞神經(jīng)層)、代理端(類周圍神經(jīng)層)、客戶端(類神經(jīng)元層);兩域包括:類中樞神經(jīng)域和類周圍神經(jīng)域。本云監(jiān)控系統(tǒng)在此分層模式下,分別在程序、架構(gòu)、數(shù)據(jù)庫(kù)中提高本云監(jiān)控系統(tǒng)的可靠性,并結(jié)合Splunk進(jìn)行監(jiān)控日志分析處理,與此同時(shí),可基于Zabbix設(shè)計(jì)出具有防誤報(bào)的、梯度的報(bào)警功能。根據(jù)主流監(jiān)控系統(tǒng)的優(yōu)化思想,在通信協(xié)議的選擇中,與實(shí)際生產(chǎn)環(huán)境相結(jié)合,在同一個(gè)IDC中通信的客戶端插件間通信協(xié)議采用UDP,不同的IDC機(jī)房的客戶端插件間通信協(xié)議采取TCP,這樣在保證可靠性的同時(shí)優(yōu)化了資源的占用;在數(shù)據(jù)庫(kù)的選擇與處理上,采用mysql數(shù)據(jù)庫(kù)并采用Mysql Cluster技術(shù),且對(duì)數(shù)據(jù)庫(kù)進(jìn)行優(yōu)化,以解決監(jiān)控?cái)?shù)據(jù)過(guò)多而造成的數(shù)據(jù)庫(kù)瓶頸和保證數(shù)據(jù)庫(kù)的可靠性。最終實(shí)現(xiàn)基于Zabbix的CloudMonitor云監(jiān)控系統(tǒng)。
[Abstract]:With the rapid arrival of cloud computing era, there are a variety of devices in the existing IT environment, and the composition of its system is very complex. The architecture is usually composed of upper application, system architecture, and bottom support, so a device is robust. Reliability and timely response to problems have become increasingly important. In the process of design and maintenance, in order to grasp the basic environment of the whole system, the business application flow, the usability of the system in time, we need to obtain the running state of each equipment through analyzing the cluster trend through the monitoring data obtained. All this is inseparable from the monitoring system. At present, there are many excellent open source systems in the market, but it can not keep up with the trend of the times if we only use the monitoring system and do not redesign the architecture and add the corresponding functions. The purpose of the monitoring system is to monitor whether there are any anomalies in the environment of the cluster, but the open source products in the market do not guarantee its high reliability and the automatic handling of the problems in the monitoring system itself. At the same time, the generated log is not well analyzed, most of the monitoring data is only taken out from the database and displayed on the web page. How to design a monitoring system with high reliability, reasonable architecture and analysis of monitoring log is the main subject of this paper. In this paper, the design and implementation of the cloud monitoring system based on Zabbix is the main content, and a high reliability monitoring system framework is designed and implemented in combination with Splunk log analysis and processing. The framework of the monitoring system combines the advantages of the three major open source surveillance products (Gangliahost-sflow Nagios) in the market at present, as well as the advantages of the hierarchical model of the biomimetic nervous system (Bans), and bionics forms three layers and two domains. The three layers are: serous (central nervous layer), agent (peripheral nerve layer), client (neuron-like layer), and the two domains include: central nervous region and peripheral nerve domain. In this hierarchical mode, the cloud monitoring system improves the reliability of the cloud monitoring system in the program, structure and database, and carries out monitoring log analysis and processing with Splunk. At the same time, it can be designed based on Zabbix to prevent false positives. Gradient alarm function. According to the optimization idea of mainstream monitoring system, in the choice of communication protocol, it combines with the actual production environment. In the same IDC, the communication protocol between clients and plug-ins adopts UDP.The communication protocol between clients in different IDC rooms adopts TCP, which not only ensures reliability, but also optimizes the usage of resources. Mysql database and MySQL Cluster technology are adopted to solve the bottleneck of database caused by too much monitoring data and to ensure the reliability of database. Finally, the cloud monitor system based on Zabbix is implemented.
【學(xué)位授予單位】:電子科技大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2017
【分類號(hào)】:TP277
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 何金獅;;一種基于epoll模型的高效代理服務(wù)架構(gòu)[J];網(wǎng)絡(luò)安全技術(shù)與應(yīng)用;2016年12期
2 李仕偉;周坤;劉新蕊;李寶林;;MySQL數(shù)據(jù)庫(kù)優(yōu)化技術(shù)[J];信息與電腦(理論版);2016年12期
3 田關(guān)偉;;MySQL復(fù)制技術(shù)分析研究[J];哈爾濱師范大學(xué)自然科學(xué)學(xué)報(bào);2015年04期
4 王寧;胡大偉;;基于多態(tài)多值決策圖的多態(tài)故障樹重要度計(jì)算方法[J];計(jì)算機(jī)集成制造系統(tǒng);2015年05期
5 韓杰;黃寬;陳浩男;李海峰;魏巍;劉釗;;智能建筑環(huán)境CPS的分布式設(shè)計(jì)研究[J];現(xiàn)代建筑電氣;2013年12期
6 郭曉慧;李潤(rùn)知;張茜;王宗敏;;基于Zabbix的分布式服務(wù)器監(jiān)控應(yīng)用研究[J];通信學(xué)報(bào);2013年S2期
7 沈青;董波;肖德寶;;基于服務(wù)器集群的云監(jiān)控系統(tǒng)設(shè)計(jì)與實(shí)現(xiàn)[J];計(jì)算機(jī)工程與科學(xué);2012年10期
8 屈志毅;王濤;李建旭;;DB2大型數(shù)據(jù)庫(kù)容災(zāi)備份實(shí)時(shí)復(fù)制系統(tǒng)[J];計(jì)算機(jī)系統(tǒng)應(yīng)用;2012年03期
9 龍門;夏靖波;張子陽(yáng);;基于概要數(shù)據(jù)結(jié)構(gòu)的網(wǎng)絡(luò)異常檢測(cè)方法[J];計(jì)算機(jī)應(yīng)用與軟件;2011年04期
10 王德政;申山宏;周寧寧;;云計(jì)算環(huán)境下的數(shù)據(jù)存儲(chǔ)[J];計(jì)算機(jī)技術(shù)與發(fā)展;2011年04期
相關(guān)碩士學(xué)位論文 前8條
1 曹旭東;基于BANS的云平臺(tái)監(jiān)控系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)[D];電子科技大學(xué);2015年
2 張宇;基于C/S模式的數(shù)字電視前端設(shè)備綜合網(wǎng)管系統(tǒng)中郵件短信告警的設(shè)計(jì)與實(shí)現(xiàn)[D];北京郵電大學(xué);2015年
3 尤海鵬;基于Ganglia的數(shù)據(jù)中心監(jiān)控平臺(tái)設(shè)計(jì)[D];山東大學(xué);2014年
4 常建華;云資源監(jiān)控系統(tǒng)研究與實(shí)現(xiàn)[D];電子科技大學(xué);2014年
5 艾勇;基于Nagios的局域網(wǎng)監(jiān)控平臺(tái)關(guān)鍵技術(shù)研究[D];電子科技大學(xué);2013年
6 朱振;基于MySQL復(fù)制改進(jìn)的多主復(fù)制數(shù)據(jù)庫(kù)擴(kuò)展實(shí)現(xiàn)[D];上海交通大學(xué);2013年
7 郭小林;基于主動(dòng)和被動(dòng)模式的分布式代理緩存服務(wù)器的設(shè)計(jì)與實(shí)現(xiàn)[D];電子科技大學(xué);2010年
8 楊楚華;分布式防火墻體系結(jié)構(gòu)及協(xié)同防御架構(gòu)研究[D];湖北工業(yè)大學(xué);2008年
,本文編號(hào):2069404
本文鏈接:http://www.sikaile.net/kejilunwen/zidonghuakongzhilunwen/2069404.html