Design and Implementation of a Distributed Search Engine
Published: 2018-03-26 19:23
Topic: distributed; Focus: search engine; Source: East China Normal University, 2008 master's thesis
[Abstract]: Many resource-sharing servers now exist on the network. Each typically stores a certain amount of resources and exposes them to users and to other servers as web services. As these servers become more widely distributed, however, the volume of information grows and the ways in which different servers organize it become more varied, so users find it hard to retrieve the resources they need quickly and accurately. A well-designed distributed search engine is therefore a key factor in whether search engines can meet future needs.

In this thesis, we first review the current state of research on distributed search engines, introduce related technologies such as Ajax and XML in depth, and analyze the development feasibility and application prospects of a distributed search engine. Based on this analysis, we produce a high-level design of the system and divide it into ten functional modules: (1) client-side validation; (2) system retrieval agent; (3) resource preview; (4) resource retrieval from XML-type servers; (5) resource retrieval from SQL-type local servers; (6) advanced search, which processes the user's query information; (7) retrieval precision (any-keyword search); (8) back-end administration login; (9) back-end administration for adding resource information; and (10) back-end administration for server registration and deregistration. The detailed design describes each module's function, advantages, and related algorithms. The thesis closes with a detailed account of how the system is used and tested.

Overall, the thesis presents a design method for a distributed search engine. Testing confirmed that the implemented engine has good usability and addresses the problem that, as the amount of information on servers grows and its organization becomes more varied, users struggle to retrieve the resources they need quickly and accurately.
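The abstract names a retrieval agent that sits in front of XML-type servers and SQL-type local servers, plus an any-keyword search mode. The following is a minimal sketch of how such an agent might fan a query out to both kinds of source and merge the hits. It is not the thesis's implementation: the catalogue layout, the `resources` table, and the function names `search_xml`, `search_sql`, and `retrieval_agent` are all assumptions made for illustration.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical catalogue, standing in for what an XML-type server might return.
XML_CATALOGUE = """<resources>
  <resource><title>Distributed indexing notes</title><url>http://a.example/1</url></resource>
  <resource><title>Cooking basics</title><url>http://a.example/2</url></resource>
</resources>"""


def search_xml(catalogue_xml, keywords):
    """Return resources whose title contains ANY keyword (case-insensitive)."""
    root = ET.fromstring(catalogue_xml)
    hits = []
    for node in root.iter("resource"):
        title = node.findtext("title", default="")
        if any(k.lower() in title.lower() for k in keywords):
            hits.append({"title": title,
                         "url": node.findtext("url", default=""),
                         "source": "xml"})
    return hits


def make_local_db():
    """Build an in-memory SQLite table standing in for a SQL-type local server."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE resources (title TEXT, url TEXT)")
    conn.executemany("INSERT INTO resources VALUES (?, ?)", [
        ("Search engine architecture", "http://b.example/1"),
        ("Distributed indexing notes", "http://b.example/2"),
    ])
    return conn


def search_sql(conn, keywords):
    """Any-keyword match via OR-ed LIKE clauses; values are bound as parameters.

    LIKE wildcards inside a keyword (% or _) are not escaped in this sketch.
    """
    if not keywords:
        return []
    where = " OR ".join("LOWER(title) LIKE ?" for _ in keywords)
    params = [f"%{k.lower()}%" for k in keywords]
    rows = conn.execute(
        f"SELECT title, url FROM resources WHERE {where}", params).fetchall()
    return [{"title": t, "url": u, "source": "sql"} for t, u in rows]


def retrieval_agent(keywords, xml_catalogues, conn):
    """Query every XML catalogue, then the local database, and merge the hits."""
    results = []
    for catalogue in xml_catalogues:
        results.extend(search_xml(catalogue, keywords))
    results.extend(search_sql(conn, keywords))
    return results
```

A call such as `retrieval_agent(["distributed"], [XML_CATALOGUE], make_local_db())` returns one merged list in which each hit is tagged with the kind of source it came from. A real deployment would fetch the catalogues over HTTP and would likely rank or deduplicate the merged hits; both steps are left out here.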
[Degree-granting institution]: East China Normal University
[Degree level]: Master's
[Year degree granted]: 2008
[Classification number]: TP391.3
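[Citing literature]
Related journal articles (top 1)
1 Wang Junsheng; Shi Yunmei; Zhang Yangsen; Key Technologies of a Hadoop-Based Distributed Search Engine [J]; Journal of Beijing Information Science and Technology University (Natural Science Edition); 2011, No. 04
Related master's theses (top 1)
1 Gong Qiuyan; Design and Implementation of a Parallel Web Crawler [D]; East China Normal University; 2010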
Article ID: 1669260
Article link: http://www.sikaile.net/kejilunwen/sousuoyinqinglunwen/1669260.html