Design and Implementation of a Lucene-Based Full-Text Retrieval System
Published: 2018-05-11 22:29
Topic: Lucene + full-text search. Source: Xiamen University, master's thesis, 2014
[Abstract]: Since the 1990s, computer and Internet technology have developed enormously. As both became widely adopted, the amount of information people encounter has grown exponentially, which forces them to find ways of quickly obtaining the information they actually need. Many information lookup techniques have been invented for this purpose, but how to store and search information quickly and efficiently remains a topic well worth study by researchers at home and abroad.
Search engines are now among the most mainstream technologies of the networked information age. Full-text retrieval, the core technology of a search engine, is a retrieval technique that accepts natural-language queries, relies on a full-text index, and takes text data as its main processing object. It differs from ordinary database retrieval: full-text retrieval has to handle both structured and unstructured data, whereas ordinary database retrieval handles only structured data. Full-text retrieval is therefore more capable and better suited to users' needs.
This thesis discusses the full-text retrieval module of the office system of the College of Art. The basic requirement of the module is content search over text such as official documents, notices and announcements, and internal news. The system is developed on the J2EE architecture, uses the SSH2 development framework, and stores its data in a MySQL database.
The thesis first reviews the related technology, analyzing in detail the principles, components, data structures, and workflow of search engines. Then, based on the actual requirements of the project, it designs and implements a site-internal search engine built on the Lucene library and based on full-text retrieval, giving users a more convenient search function.
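The "full-text index" the abstract refers to is usually an inverted index: a mapping from each term to the documents that contain it. The sketch below is a minimal plain-Python illustration of that idea, not Lucene's API and not the thesis's implementation. The function names (`tokenize`, `build_index`, `search`) and the sample records are hypothetical, and the tokenizer assumes space-delimited English text; Chinese text would need a word-segmentation step instead.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each term to the set of document ids whose text contains it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return the sorted ids of documents that contain every query term."""
    terms = tokenize(query)
    if not terms:
        return []
    # Use .get so that looking up an unknown term does not grow the index.
    postings = [index.get(term, set()) for term in terms]
    return sorted(set.intersection(*postings))


# Hypothetical sample records standing in for documents, notices, and news.
docs = {
    1: "Notice: the library will close early on Friday",
    2: "Internal news: new library opening hours announced",
    3: "Official document on exam scheduling",
}
index = build_index(docs)
print(search(index, "library"))        # [1, 2]
print(search(index, "library hours"))  # [2]
```

A query only touches the posting sets of its own terms, which is why an index of this kind answers content searches without scanning every stored document.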
[Degree-granting institution]: Xiamen University
[Degree level]: Master's
[Year degree granted]: 2014
[Classification number]: TP391.3