分布式數(shù)據(jù)管理平臺(tái)的設(shè)計(jì)與實(shí)現(xiàn)
發(fā)布時(shí)間:2018-01-18 01:11
本文關(guān)鍵詞:分布式數(shù)據(jù)管理平臺(tái)的設(shè)計(jì)與實(shí)現(xiàn) 出處:《中山大學(xué)》2015年碩士論文 論文類型:學(xué)位論文
更多相關(guān)文章: 分布式 數(shù)據(jù)管理平臺(tái) 標(biāo)簽 數(shù)據(jù)挖掘 用戶畫像
【摘要】:伴隨著web2.0的到來,以及移動(dòng)互聯(lián)網(wǎng)的急速發(fā)展,網(wǎng)絡(luò)上的數(shù)據(jù)也朝多元化的方向發(fā)生了爆炸性的增長。隨著云計(jì)算與大數(shù)據(jù)的興起,企業(yè)開始收集用戶相關(guān)的數(shù)據(jù),希望這些數(shù)據(jù)能夠幫助他們贏得更多的市場(chǎng)。但是,他們低效的數(shù)據(jù)管理方式,使得這些數(shù)據(jù)的價(jià)值并未真正的發(fā)揮出來。在市場(chǎng)營銷過程中,作為核心的用戶數(shù)據(jù)是必須管理好的。本文主要研究的是如何幫助企業(yè)管理海量的用戶數(shù)據(jù),其主要目的是提供一套完整的能夠處理海量用戶數(shù)據(jù)的分布式數(shù)據(jù)管理平臺(tái),該平臺(tái)可以對(duì)用戶進(jìn)行多維度的深度的數(shù)據(jù)挖掘與分析,并最終為每個(gè)用戶打上標(biāo)簽生成一個(gè)可以全面描述用戶的用戶畫像,然后將這些用戶畫像供給客戶進(jìn)行檢索和數(shù)據(jù)應(yīng)用的開發(fā)。由于數(shù)據(jù)管理平臺(tái)在國內(nèi)是個(gè)非常新的商業(yè)領(lǐng)域,因此國內(nèi)幾乎沒有完整的論文對(duì)數(shù)據(jù)管理平臺(tái)進(jìn)行詳細(xì)的介紹,因此本文的設(shè)計(jì)和實(shí)現(xiàn)工作都是基于現(xiàn)實(shí)中的客戶需求來進(jìn)行的。本方案實(shí)現(xiàn)后,提供了一個(gè)高效、穩(wěn)定、可伸縮的分布式數(shù)據(jù)管理平臺(tái),其出眾的數(shù)據(jù)分析和管理能力受到了肯定。分布式數(shù)據(jù)管理平臺(tái)自部署以來,3個(gè)月的穩(wěn)定運(yùn)行,幫助企業(yè)管理了近千萬用戶的上億條數(shù)據(jù),并為這些用戶生成用戶畫像,同時(shí)提供用戶畫像的檢索功能和OpenAPI。滿足了客戶的數(shù)據(jù)管理需求,并為他們的數(shù)據(jù)增值提供了強(qiáng)有力的支撐。
[Abstract]:With the arrival of web2.0 and the rapid development of the mobile Internet, the data on the network has also explosive growth in the direction of diversification. With the rise of cloud computing and big data. Companies are starting to collect user-related data in the hope that it will help them win more markets. However, they have inefficient data management methods. Make the value of these data has not really played out. In the marketing process, as the core of user data must be managed well. This paper mainly studies how to help enterprises to manage a large amount of user data. The main purpose of the platform is to provide a complete set of distributed data management platform which can deal with massive user data. Finally, each user is tagged to generate a user portrait that can describe the user comprehensively. Then these user portraits will be supplied to customers for retrieval and data application development, because the data management platform is a very new business field in China. Therefore, there is almost no complete paper on the data management platform for detailed introduction, so the design and implementation of this paper are based on the reality of customer requirements. Provides an efficient, stable, scalable distributed data management platform, whose outstanding data analysis and management capabilities have been recognized. The distributed data management platform has been running steadily for 3 months since its deployment. It helps the enterprise manage hundreds of millions of data of nearly ten million users, and generates the user portrait for these users, at the same time, it provides the retrieval function of the user portrait and OpenAPI. meets the customer's data management needs. And for their data added to provide a strong support.
【學(xué)位授予單位】:中山大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2015
【分類號(hào)】:TP311.52
,
本文編號(hào):1438775
本文鏈接:http://www.sikaile.net/guanlilunwen/yingxiaoguanlilunwen/1438775.html
最近更新
教材專著