Hi,
* N-gram based indexing; No dictionaries are needed
* Support many types of documents; e.g. HTML, MS Word
* Includes library for some programming languages
* Add text incrementally
Could you please explain what is N-gram and what is this package useful
for?
N-gram is
On Sun, 2005-06-05 at 18:21 +0900, Junichi Uekawa wrote:
Hi,
* N-gram based indexing; No dictionaries are needed
* Support many types of documents; e.g. HTML, MS Word
* Includes library for some programming languages
* Add text incrementally
Could you please explain
Could you please explain what is N-gram and what is this package useful
for?
N-gram is when you use n-characters as a 'word'.
In some languages including Japanese, it is impossible to
determine a 'word', and N-gram is a method that defines a
'word' as n-characters.
Do you
At Mon, 06 Jun 2005 09:12:28 +0900,
Junichi Uekawa wrote:
Instead of picking up keywords through whitespace-separated 'word's,
it uses pre-set N number of characters for indexing.
I had a presentation about full-text search engines in Debian, see the
following, and it describe about N-gram
NOKUBI Takatsugu writes:
BTW, there is the n-gram word in libdigest-nilsimsa-perl package's
description without explain of the word.
Where it helps to render the description incomprehensible.
--
John Hasler
--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe.
Le vendredi 03 juin 2005 à 21:14 +0900, Taku YASUI a écrit :
Package: wnpp
Severity: wishlist
* Package name: rast
Version : 0.1.1
Upstream Author : Name [EMAIL PROTECTED]
* URL : http://www.netlab.jp/rast/
* License : GPL
Description : N-gram
Le vendredi 03 juin 2005 à 21:14 +0900, Taku YASUI a écrit :
Package: wnpp
Severity: wishlist
* Package name: rast
Version : 0.1.1
Upstream Author : Name [EMAIL PROTECTED]
* URL : http://www.netlab.jp/rast/
* License : GPL
Description : N-gram
Le vendredi 03 juin 2005 à 21:14 +0900, Taku YASUI a écrit :
Package: wnpp
Severity: wishlist
* Package name: rast
Version : 0.1.1
Upstream Author : Name [EMAIL PROTECTED]
* URL : http://www.netlab.jp/rast/
* License : GPL
Description : N-gram
Package: wnpp
Severity: wishlist
* Package name: rast
Version : 0.1.1
Upstream Author : Name [EMAIL PROTECTED]
* URL : http://www.netlab.jp/rast/
* License : GPL
Description : N-gram based full text search system library
Rast is N-gram based full text
9 matches
Mail list logo