Fuzzy Cluster-Based Query Expansion

by Tai, Chia-Hung

Abstract (Summary)
Advances in information and network technologies have fostered the creation and availability of a vast amount of online information, typically in the form of text documents. Information retrieval (IR) pertains to determining the relevance between a user query and documents in the target collection, then returning those documents that are likely to satisfy the user¡¦s information needs. One challenging issue in IR is word mismatch, which occurs when concepts can be described by different words in the user queries and/or documents. Query expansion is a promising approach for dealing with word mismatch in IR. In this thesis, we develop a fuzzy cluster-based query expansion technique to solve the word mismatch problem. Using existing expansion techniques (i.e., global analysis and non-fuzzy cluster-based query expansion) as performance benchmarks, our empirical results suggest that the fuzzy cluster-based query expansion technique can provide a more accurate query result than the benchmark techniques can.
Bibliographical Information:

Advisor:Chih-ping Wei; Paul J. Hu; Hsing K. Cheng

School:National Sun Yat-Sen University

School Location:China - Taiwan

Source Type:Master's Thesis

Keywords:fuzzy clustering cluster based query expansion term association information retrieval word mismatch document thesaurus text mining


Date of Publication:07/29/2004

© 2009 All Rights Reserved.