Stemming Algorithms




GRAS: An effective and efficient stemming algorithm for information retrieval
JH Paik, M Mitra, SK Parui... - ACM Transactions on ..., 2011 - dl.acm.org
Abstract A novel graph-based language-independent stemming algorithm suitable for 
information retrieval is proposed in this article. The main features of the algorithm are 
retrieval effectiveness, generality, and computational efficiency. We test our approach on ...
Cited by 1


The Porter Stemming Algorithm
M Porter - 2009 - citeulike.org
Cited by 135


[CITATION] Snowball: A language for stemming algorithms, 2001
M Porter - URL http://snowball.tartarus.org/texts/introduction. ..., 2009
Cited by 20


[CITATION] Stemming Algorithm
M Porter - 2010
Cited by 2


A novel corpus-based stemming algorithm using co-occurrence statistics
JH Paik, D Pal... - Proceedings of the 34th international ACM ..., 2011 - dl.acm.org
Abstract We present a stemming algorithm for text retrieval. The algorithm uses the statistics 
collected on the basis of certain corpus analysis based on the co-occurrence between two 
word variants. We use a very simple co-occurrence measure that reflects how often a pair ...


[CITATION] The 'Official' home page for distribution of the Porter Stemming Algorithm
M Porter - Website http://www.tartarus.org/~martin/ ..., 2008
Cited by 2


The Porter stemming algorithm: then and now
P Willett - Program: electronic library and information systems, 2006 - emeraldinsight.com
Purpose: In 1980, Porter presented a simple algorithm for stemming English language
words. This paper summarises the main features of the algorithm, and highlights its role not 
just in modern information retrieval research, but also in a range of related subject ...
Cited by 21


WCI 02 Improvements on the Porter's Stemming Algorithm for Portuguese
MVB Soares, RC Prati... - Latin America Transactions ..., 2009 - ieeexplore.ieee.org
Abstract The amount of textual information digitally stored is growing every day. However, 
our capability of processing and analyzing that information is not growing at the same pace. 
To overcome this limitation, it is important to develop semi-automatic processes to extract ...
Cited by 3


[PDF] A rule-based Arabic stemming algorithm
TMT Sembok, BMA Ata... - ... of the 5th European conference on ..., 2011 - wseas.us
Abstract: Stemming is used in information retrieval systems to reduce variant word forms to
common roots in order to improve retrieval effectiveness. As in other languages, there is a 
need for an effective stemming algorithm for the indexing and retrieval of Arabic ...


Evaluation of perstem: a simple and efficient stemming algorithm for Persian
A Jadidinejad, F Mahmoudi... - Multilingual Information Access ..., 2011 - Springer
Persian is a challenging language in the field of NLP. Right-to-left orthography, complex 
morphology, complicated grammatical rules, and different forms of letters make it an 
interesting language for NLP research. In this paper we measure the effectiveness of a ...


A stemming algorithm for the farsi language
K Taghva, R Beckley... - ... Technology: Coding and ..., 2005 - ieeexplore.ieee.org
Abstract In this paper, we report on the design and implementation of a stemmer for the Farsi 
language. The results of our evaluation on a small Farsi document collection shows a 
significant improvement in precision/recall over not stemming.
Cited by 26


Strength and similarity of affix removal stemming algorithms
WB Frakes... - ACM SIGIR Forum, 2003 - dl.acm.org
Abstract This study evaluated the strength of, and similarity among, four affix removal 
stemming algorithms. Strength and similarity were evaluated in different ways, including new 
metrics based on the Hamming distance measure. Data was collected on stemmer outputs ...
Cited by 45


Using Stemming Algorithms on a Grid Environment
V Roncero, M Costa... - High Performance Computing for ..., 2008 - Springer
Stemming algorithms are commonly used in Information Retrieval with the goal of reducing 
the number of the words which are in the same morphological variant in a common
representation. Stemming analysis is one of the tasks of the pre-processing phase on text ...


Two Algorithms for Probabilistic Stemming
M Melucci, N Orio - Information Access through Search Engines and ..., 2008 - Springer
This chapter describes two algorithms for probabilistic stemming. A probabilistic stemmer 
aims at detecting word stems by using a probabilistic or statistical model with no or very little 
knowledge about the language for which the stemmer has been built. While illustrating ...


Analysis and Algorithms for Stemming Inversion
I Feinerer - Information Retrieval Technology, 2010 - Springer
Stemming is a fundamental technique for processing large amounts of data in information 
retrieval and text mining. However, after processing the reversal of this process is often 
desirable, e.g., for human interpretation, or methods which operate on sequences of ...


[PDF] Overview of Stemming Algorithms
I Smirnov - DePaul University, 2008 - the-smirnovs.org
This paper is an overview of the state-of-the-art in the area of stemming and lemmatization 
algorithms. It covers basic ideas of "classical" (affix removal) techniques as well as some
recent approaches like stochastic algorithms. The paper scope is restricted by techniques ...
Cited by 4


Stemming Algorithm to Classify Arabic Documents
MAH Omer... - 2010 - Citeseer
Abstract Text classification is the problem of assigning predefined class labels to incoming 
unclassified documents. Many algorithms and researches have been implemented for 
English, Chinese and other languages, while there is few researches introduced for ...


[PDF] A new Arabic stemming algorithm
ET AlShammari... - Experimental Linguistics ExLing 2008, 2008 - users.uoa.gr
Abstract Text processing is a vital step in the information retrieval process, text mining, and 
natural language processing. It includes several stages, such as normalization, stop word 
removal, and stemming. Stemming is the process of reducing the lexicon to its root. Due to ...


[CITATION] Evaluation of Lovins Stemming Algorithm in Large Database Systems
JLK Serrano - 2008


[CITATION] Python Implementation of Porter Stemming Algorithm
V Gupta - Retrieved from: http://tartarus.org/martin/PorterStemmer/ ..., 2008
Cited by 2


STEMBR: A stemming algorithm for the brazilian portuguese language
R Alvares, A Garcia... - Progress in Artificial Intelligence, 2005 - Springer
Stemming algorithms have traditionally been utilized in information retrieval systems as they 
generate a more concise word representation. However, the efficiency of these algorithms 
varies according to the language they are used with. This paper presents STEMBR, a ...
Cited by 8


Improving query expansion with stemming terms: A new genetic algorithm approach
L Araujo... - Evolutionary Computation in Combinatorial ..., 2008 - Springer
Nowadays, searching information in the web or in any kind of document collection has 
become one of the most frequent activities. However, user queries can be formulated in a 
way that hinder the recovery of the requested information. The objective of automatic ...
Cited by 8


The Effectiveness of a Graph-Based Algorithm for Stemming
M Bacchin, N Ferro... - Digital Libraries: People, Knowledge, ..., 2002 - Springer
In Information Retrieval (IR), stemming enables a matching of query and document terms 
which are related to a same meaning but which can appear in different morphological 
variants. In this paper we will propose and evaluate a statistical graph-based algorithm for ...
Cited by 17


Is paice method suitable for evaluating Arabic stemming algorithms?
HM AlSerhan, S Alqrainy... - Computer Engineering & ..., 2008 - ieeexplore.ieee.org
Abstract There are many measurement methodologies used to measure the quality of 
stemming algorithms and to evaluate their effectiveness. All of these measurement are 
designed for English language. In this study we trying to check the viability of the Paice ...


FindStem: Analysis and evaluation of a Turkish stemming algorithm
H Sever... - String Processing and Information Retrieval, 2003 - Springer
In this paper, we evaluate the effectiveness of a new stemming algorithm, FINDSTEM, for 
use with Turkish documents and queries, and compare the use of this algorithm with the 
other two previously defined Turkish stemmers, namely "AF" and "LM" algorithms. Of them, ...
Cited by 13


[CITATION] Paice/Husk Stemming Algorithm Implemented Over SP Search Index
CJE Bacani - University of the Philippines Los Banos, Laguna. ..., 2006
Cited by 4


[CITATION] A Stemming Algorithm for Tagalog Words
E Bonus - Philippines: De La Salle University, 2003
Cited by 8


[PDF] University of Padua at CLEF 2002: Experiments to evaluate a statistical stemming algorithm
M Bacchin, N Ferro... - Proceedings of CLEF, 2002 - Citeseer
Abstract In Information Retrieval (IR), stemming is used to reduce variant word forms to 
common root. The assumption is that if two words have the same root, then they represent 
the same concept. Hence stemming permits a IR system to match query and document ...
Cited by 12


[CITATION] A stemming algorithm for Malay language
MT Abdullah, F Ahmad, R Mahmod... - Proceedings of the 4th ..., 2005
Cited by 4


A generalization of the method for evaluation of stemming algorithms based on error counting
R de Madariaga, J del Castillo... - String Processing and ..., 2005 - Springer
Until the introduction of the method for evaluation of stemming algorithms based on error 
counting, the effectiveness of these algorithms was compared by determining their retrieval 
performance for various experimental test collections. With this method, the performance ...
Cited by 4


Overcoming stiffness in stochastic simulation stemming from partial equilibrium: A multiscale Monte Carlo algorithm
A Samant... - The Journal of chemical physics, 2005 - link.aip.org
In this paper the problem of stiffness in stochastic simulation of singularly perturbed systems 
is discussed. Such stiffness arises often from partial equilibrium or quasi-steady-state type of 
conditions. A multiscale Monte Carlo method is discussed that first assesses whether ...
Cited by 57


[CITATION] Online Song Search Engine Using Porter Stemming Algorithm for Keyword Matching
EC Magallanes - 2008
Cited by 1


[CITATION] Rstem: Interface to Snowball implementation of Porter's word stemming algorithm
D Temple Lang - R package version 0.3-1, 2006
Cited by 3


A Prospective Study of Stemming Algorithms for Web Text Mining
GN Shakarad - Ganpat University Journal of ..., 2011 - gnujet.ganpatuniversity.ac.in
Abstract Information Retrieval (IR) is essentially a matter of deciding which documents in a 
collection should be retrieved to satisfy a user's need for information. The user's need for 
information is represented by a query or profile, and contains one or more search terms, ...


Study of stemming algorithms
S Kodimala - 2010 - digitalcommons.library.unlv.edu
Kodimala, Savitha, "Study of stemming algorithms" (2010). UNLV Theses/Dissertations/Professional Papers/Capstones. Paper 754.


[CITATION] The Porter Stemming Algorithm: Available at: http://www.tartarus.org/martin
M Porter - 2006 - PorterStemmer
Cited by 2


[CITATION] The Porter Stemming Algorithm official home page
MF Porter - 2006
Cited by 2


[CITATION] The Lovins stemming algorithm
M Porter... - 2004
Cited by 4


[PDF] Word Stemming Algorithms and Retrieval Effectiveness in Malay and Arabic Documents Retrieval Systems
TMT Sembok - 2005 - Citeseer
Systems (IRS) is generally about understanding of information in the documents concern. 
The more the system able to understand the contents of documents the more effective will be 
the retrieval outcomes. But understanding of the contents is a very complex task. ...
Cited by 3


[CITATION] Evaluation of Paice/Husk Stemming Algorithm in Large Database Systems
C Atienza - 2008


An Evaluation of Existing Light Stemming Algorithms for Arabic Keyword Searches
BE Rogerson - 2008 - etd.ils.unc.edu
Abstract: The field of Information Retrieval recognizes the importance of stemming in 
improving retrieval effectiveness. This same tool, when applied to searches conducted in the 
Arabic language, increases the relevancy of documents returned and expands searches ...


[CITATION] The Lancaster stemming algorithm
R Hooper... - 2005
Cited by 2


[CITATION] A Generalization of the Method for Evaluation of Stemming Algorithms Based on Error Counting
RM Sanchez, JR Fernández... - Proceedings of SPIRE, 2005
Cited by 2


[CITATION] The tagalog stemming algorithm
B Bonus - 1st National Natural Language Processing Research ..., 2004
Cited by 3


[CITATION] The english (porter2) stemming algorithm
MF Porter, R Boulton... - Retrieved, 2002
Cited by 6


[PDF] Further Enhancement to the Porter's Stemming Algorithm
F Yamout, R Demachkieh, G Hamdan... - Ulm, September 21, ..., 2004 - uni-weimar.de
Abstract. Stemming algorithms are used to transform the words in texts into their grammatical 
root form, and are mainly used to improve the Information Retrieval System's efficiency. 
Several algorithms exist with different techniques. The most widely used is the Porter ...
Cited by 2


[PDF] A Spanish Stemming Algorithm Implementation in PROLOG and C#
DDP Barrenechea - 2006 - ai.uga.edu
Abstract This paper presents two implementations of a spanish stemming algorithm in 
Prolog and C#. The basis for the implementations is a Porter-like algorithm published by the 
Snowball Project. Some additions to the original algorithm are proposed and included in ...


[PDF] ST ANS Algorithm for Root Word Stemming
S Srinivasan... - Information Technology Journal, 2006 - 198.170.104.138
Abstract: Information Retrieval (IR) is essentially a matter of deciding which documents in a 
collection should be retrieved to satisfy a user's need for information. In most cases,
morphological variants of words have similar semantic interpretations and can be ...


A new stemming algorithm to extract quadri-literal Arabic roots
G Kanaan, R Al-Shalabi, JM Jaam... - ... : From Theory to ..., 2004 - ieeexplore.ieee.org
Abstract Summary form only given. We present a new stemming algorithm to extract quadri-
literal Arabic roots. The algorithm starts by excluding the prefixes and checks then the word 
characters starting from the last letter backward to the first one. A temporary matrix is used ...
Cited by 1


[CITATION] Searching malay text using stemming algorithm
R Saian... - 2004 - JICT
Cited by 1


[CITATION] Analysis and evaluation of a Turkish stemming algorithm
H Serer... - 10th International Symposium SPIRE, 2003
Cited by 2


[CITATION] Strength and similarity of affix removal stemming algorithms
WFC Fox... - SIGIR Forum, 2003
Cited by 2


[CITATION] A Stemming Algorithm for Tagalog Words. Manila: De La Salle University
DE Bonus - 2003 - MS Thesis
Cited by 2


Stemming Algorithm to Classify Arabic Documents
MAH Omer, Shilong Ma - 2010 - cqvip.com
Stemming Algorithm to Classify Arabic Documents. Marwan Ali H. Omer,
Shilong Ma. School of Computer Science and Engineering ...


[PDF] Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word
G Edatul Muliana - 2005 - eprints.ptar.uitm.edu.my


Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word/Edatul Muliana binti Ghazalli
G Edatul Muliana - 2005 - eprints.ptar.uitm.edu.my
Abstract Stemming is important thing to improve retrieval effectiveness. Stemming is used to 
reduce the size of indexing file for relevancy of document retrieval. Stemming is technique to 
truncate the word into the root word that will reduce vocabulary size and improve recall. ...


[CITATION] A language-independent Stemming Algorithm
M Bacchin - 2002 - Ph. D. Thesis, Department of ...
Cited by 2


[CITATION] The analysis and evaluation of stemming algorithms for Turkish
H Sever... - 10th International Symposium on String Processing ..., 2003
Cited by 1


Development of stemming algorithm for wolaytta text
L LESSA - 2003 - etd.aau.edu.et
Abstract: This study describes the design of a stemming algorithm for Wolaytta language. To 
give a solid background for the thesis, literatures on conflation in general and stemming 
algorithms in particular were reviewed. Since it is the nature and characteristics of ...
Cited by 1


[CITATION] Improved porter's algorithm for root word stemming
M Saravanan, PCR Raj, VS Murthy... - Proc. of International Conference on ..., 2002
Cited by 4


DEVELOPMENT OF STEMMING ALGORITHM FOR WOLAYTTA TEXT
D Amogne - 2003 - etd.aau.edu.et
Abstract: This study describes the design of a stemming algorithm for Wolaytta language. To 
give a solid background for the thesis, literatures on conflation in general and stemming 
algorithms in particular were reviewed. Since it is the nature and characteristics of ...


[CITATION] Developing a word-stemming program using Porter's Algorithm
KV Lakshmi - NCSI minor project report, 2002
Cited by 2


[CITATION] Stemming for Complex Medical Spanish Words: Algorithm for multipurpose languages
PE Jesus - 2002