author | miwi <miwi@FreeBSD.org> | 2009-03-17 05:47:30 +0800 |
---|---|---|
committer | miwi <miwi@FreeBSD.org> | 2009-03-17 05:47:30 +0800 |
commit | abe1eb4848ae791628e40557fd6505a015ddb821 (patch) | |
tree | 17d20c56c015dee51b4224eebc53c03c3cd333d5 /polish | |
parent | bf2d86820f5b72bf97b92d11f68658a0e8ce380d (diff) | |
PyStemmer provides access to efficient algorithms for calculating a
"stemmed" form of a word. This is a form with most of the common
morphological endings removed, ideally leaving a common linguistic
base form. This is most useful in building search engines
and information retrieval software; for example, a search with stemming
enabled should be able to find a document containing "cycling" given the
query "cycles".
PyStemmer provides algorithms for several (mainly European) languages,
by wrapping the libstemmer library from the Snowball project in a Python
module. It also provides access to the classic Porter stemming algorithm
for English: although this has been superseded by an improved algorithm,
the original algorithm may be of interest to information retrieval
researchers wishing to reproduce results of earlier experiments.
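
The "cycling"/"cycles" case described above can be sketched roughly as
follows. This is a minimal illustration, not part of the port: it assumes
the module is importable as Stemmer and uses its Stemmer("english")
constructor and stemWord() method. The matches() helper is hypothetical,
and the fallback suffix stripper is a crude stand-in (not the Snowball
algorithm) that exists only so the sketch runs without PyStemmer.

```python
# Sketch: match a query against a document by comparing stemmed words.
try:
    import Stemmer  # module installed by PyStemmer

    _snowball = Stemmer.Stemmer("english")

    def stem(word):
        """Stem one word with the Snowball English algorithm."""
        return _snowball.stemWord(word.lower())

except ImportError:
    # Illustration-only fallback: naive suffix stripping, NOT Snowball.
    def stem(word):
        word = word.lower()
        for suffix in ("ing", "es", "s"):
            # Keep at least three characters so short words stay intact.
            if word.endswith(suffix) and len(word) - len(suffix) >= 3:
                return word[: -len(suffix)]
        return word


def matches(query, document):
    """Hypothetical helper: True if any query word shares a stem
    with any word in the document."""
    doc_stems = {stem(w) for w in document.split()}
    return any(stem(w) in doc_stems for w in query.split())


if __name__ == "__main__":
    print(matches("cycles", "a document about cycling"))
```

Comparing stems rather than raw words is what lets the query and the
document agree even though neither contains the other's exact word form.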
WWW: http://pypi.python.org/pypi/PyStemmer/
PR: ports/132695
Submitted by: Wen Heping <wenheping at gmail.com>
Diffstat (limited to 'polish')
0 files changed, 0 insertions, 0 deletions