Solving the word mismatch problem through automatic text analysis

by Xu, Jinxi

Abstract (Summary)
Information Retrieval (IR) is concerned with locating documents that are relevant for a user's information need or query from a large collection of documents. A fundamental problem for information retrieval is word mismatch. A query is usually a short and incomplete description of the underlying information need. The users of IR systems and the authors of the documents often use different words to refer to the same concepts. This thesis addresses the word mismatch problem through automatic text analysis. We investigate two text analysis techniques, corpus analysis and local context analysis, and apply them in two domains of word mismatch, stemming and general query expansion. Experimental results show that these techniques can result in more effective retrieval.
Bibliographical Information:


School:University of Massachusetts Amherst

School Location:USA - Massachusetts

Source Type:Master's Thesis



Date of Publication:01/01/1997

© 2009 All Rights Reserved.