[ https://issues.apache.org/jira/browse/LUCENE-1287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Robert Muir updated LUCENE-1287:
--------------------------------
Attachment: LUCENE-1287.patch
I think this is a nice feature.
I see some interesting hyphenation-only results presented here:
http://lwa09.informatik.tu-darmstadt.de/pub/IR/WebHome/wir2009_leveling.pdf
I updated your patch to trunk. I would like to commit it to trunk/3x in a few
days if no one objects.
> Allow usage of HyphenationCompoundWordTokenFilter without dictionary
> --------------------------------------------------------------------
>
> Key: LUCENE-1287
> URL: https://issues.apache.org/jira/browse/LUCENE-1287
> Project: Lucene - Java
> Issue Type: New Feature
> Components: contrib/analyzers
> Reporter: Thomas Peuss
> Assignee: Robert Muir
> Priority: Minor
> Fix For: 3.1
>
> Attachments: LUCENE-1287.patch, LUCENE-1287.patch
>
>
> We should allow using the HyphenationCompoundWordTokenFilter without a
> dictionary. This produces a lot of "nonword" tokens but might be useful
> sometimes.
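
To illustrate why dropping the dictionary yields "nonword" tokens, here is a
minimal, self-contained sketch in plain Java. It is not Lucene's implementation
and does not use the Lucene API: the class SubwordSketch, the decompose method,
and the hard-coded split offsets are all hypothetical and stand in for whatever
a hyphenation grammar would return. The assumption is that every span between
two hyphenation points is a candidate subword, and that a dictionary, when
present, only filters those candidates.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; not the Lucene filter.
class SubwordSketch {

    /**
     * Emits every span between two split points whose length lies within
     * [minSubwordSize, maxSubwordSize].
     *
     * @param points     ascending split offsets, including 0 and word.length()
     *                   (assumed to come from a hyphenator)
     * @param dictionary accepted subwords, or null to accept every candidate
     */
    static List<String> decompose(String word, int[] points, Set<String> dictionary,
                                  int minSubwordSize, int maxSubwordSize) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i < points.length; i++) {
            for (int j = i + 1; j < points.length; j++) {
                int len = points[j] - points[i];
                if (len < minSubwordSize) {
                    continue; // too short, try a longer span
                }
                if (len > maxSubwordSize) {
                    break;    // spans only grow from here
                }
                String part = word.substring(points[i], points[j]);
                // Without a dictionary, nothing is filtered out.
                if (dictionary == null || dictionary.contains(part)) {
                    out.add(part);
                }
            }
        }
        return out;
    }
}
```

With the made-up split offsets {0, 4, 8, 11, 13} for "fussballpumpe" and a null
dictionary, the sketch emits spans such as "ballpum" and "pe" alongside "fuss",
"ball" and "pumpe". Passing a dictionary containing only those three words
keeps just them.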