NLP - (Word Stem|Stemming)

1 - About

To stem words is to remove word endings like -s and -ing.

Stemming is replacing words with their stems.

Stemming is the process of reducing search tokens to their root (or “stem”)

A search for different type of a word will still yield results.

3 - Example

For instance

  • “bikes” is replaced with “bike”; now query “bike” can find both documents containing “bike” and those containing “bikes”.
  • “search”, “searching” and “searched” can all be reduced to the stem “search”.

4 - Algorithm

SnowBall is a good stemming algorithm.

  • See the attribute filter StringsToWordVector in Weka

5 - Documentation / Reference


Data Science
Data Analysis
Statistics
Data Science
Linear Algebra Mathematics
Trigonometry

Powered by ComboStrap