An Evaluation of Existing Light Stemming Algorithms for Arabic Keyword Searches

by Rogerson, Brittany E.

Abstract (Summary)
The field of Information Retrieval recognizes the importance of stemming in improving retrieval effectiveness. When applied to searches conducted in the Arabic language, stemming increases the relevance of the documents returned and expands searches to encompass the general meaning of a word instead of only the word itself. Because Arabic relies mainly on triconsonantal roots for verb forms and derives nouns by adding affixes, words that share the same consonants are closely related in meaning. Stemming lets a search focus more on the meaning of a term and its closely related terms and less on exact character matches. This paper discusses the strengths of light stemming and the best techniques and components for algorithmic affix-based stemmers used in Arabic keyword searching.
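To make the idea of an algorithmic affix-based light stemmer concrete, the following is a minimal sketch in Python. The affix lists, the minimum stem length, and the function name `light_stem` are illustrative assumptions, not taken from the thesis; they only resemble the kind of short prefix and suffix lists that light stemmers for Arabic typically use.

```python
# Minimal sketch of affix-based light stemming for Arabic.
# ASSUMPTION: these affix lists and the length threshold are illustrative
# only; they are not the lists evaluated in the thesis.
PREFIXES = ["وال", "بال", "كال", "فال", "ال", "لل", "و"]
SUFFIXES = ["ها", "ان", "ات", "ون", "ين", "ية", "ه", "ة", "ي"]
MIN_STEM_LEN = 3  # assumed: never shorten a word below three characters


def light_stem(word: str) -> str:
    """Strip at most one prefix and one suffix, trying longer affixes first."""
    for prefix in sorted(PREFIXES, key=len, reverse=True):
        if word.startswith(prefix) and len(word) - len(prefix) >= MIN_STEM_LEN:
            word = word[len(prefix):]
            break
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM_LEN:
            word = word[:-len(suffix)]
            break
    return word


if __name__ == "__main__":
    for term in ["الكتاب", "والكتاب", "كتابها"]:
        print(term, "->", light_stem(term))
```

Under these assumed lists, the three sample terms ("the book", "and the book", "her book") all reduce to the same stem, so a keyword search on any one of them could match documents containing the others. Unlike root extraction, the sketch does not try to recover the triconsonantal root; it only removes a small set of common affixes.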
Bibliographical Information:

Advisor: Ronald E. Bergquist

School: University of North Carolina at Chapel Hill

School Location: USA - North Carolina

Source Type: Master's Thesis

Keywords: information retrieval, stemming, Arabic language, morphology, evaluation, relevance


Date of Publication: 11/17/2008

© 2009 All Rights Reserved.