KinoSearch::Analysis::Stemmer - reduce related words to a shared root |
KinoSearch::Analysis::Stemmer - reduce related words to a shared root
my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' ); my $polyanalyzer = KinoSearch::Analysis::PolyAnalyzer->new( analyzers => [ $lc_normalizer, $tokenizer, $stemmer ], );
Stemming reduces words to a root form. For instance, ``horse'', ``horses'', and ``horsing'' all become ``hors'' -- so that a search for 'horse' will also match documents containing 'horses' and 'horsing'.
This class is a wrapper around Lingua::Stem::Snowball, so it supports the same languages.
Create a new stemmer. Takes a single named parameter, language
, which must
be an ISO two-letter code that Lingua::Stem::Snowball understands.
Copyright 2005-2006 Marvin Humphrey
See KinoSearch version 0.15.
KinoSearch::Analysis::Stemmer - reduce related words to a shared root |