[
https://issues.apache.org/jira/browse/LUCENE-1166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12591936#action_12591936
]
Thomas Peuss commented on LUCENE-1166:
--------------------------------------
The error is
{code}
[junit] Testsuite:
org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter
[junit] Tests run: 4, Failures: 0, Errors: 2, Time elapsed: 2,139 sec
[junit]
[junit] Testcase:
testHyphenationCompoundWordsDE(org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter):
Caused an ERROR
[junit] File not found:
/home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such file or
directory)
[junit]
org.apache.lucene.analysis.compound.hyphenation.HyphenationException: File not
found: /home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such
file or directory)
[junit] at
org.apache.lucene.analysis.compound.hyphenation.PatternParser.parse(PatternParser.java:123)
[junit] at
org.apache.lucene.analysis.compound.hyphenation.HyphenationTree.loadPatterns(HyphenationTree.java:138)
[junit] at
org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter.getHyphenationTree(HyphenationCompoundWordTokenFilter.java:142)
[junit] at
org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter.testHyphenationCompoundWordsDE(TestCompoundWordTokenFilter.java:70)
[junit]
[junit]
[junit] Testcase:
testHyphenationCompoundWordsDELongestMatch(org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter):
Caused an ERROR
[junit] File not found:
/home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such file or
directory)
[junit]
org.apache.lucene.analysis.compound.hyphenation.HyphenationException: File not
found: /home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such
file or directory)
[junit] at
org.apache.lucene.analysis.compound.hyphenation.PatternParser.parse(PatternParser.java:123)
[junit] at
org.apache.lucene.analysis.compound.hyphenation.HyphenationTree.loadPatterns(HyphenationTree.java:138)
[junit] at
org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter.getHyphenationTree(HyphenationCompoundWordTokenFilter.java:142)
[junit] at
org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter.testHyphenationCompoundWordsDELongestMatch(TestCompoundWordTokenFilter.java:96)
[junit]
[junit]
[junit] Test
org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter FAILED
{code}
So it does not find the hyphenation.dtd. I have to investigate how I can make
that DTD know to the parser without copying the hyphenation.dtd to Lucene's
base directory.
> A tokenfilter to decompose compound words
> -----------------------------------------
>
> Key: LUCENE-1166
> URL: https://issues.apache.org/jira/browse/LUCENE-1166
> Project: Lucene - Java
> Issue Type: New Feature
> Components: Analysis
> Reporter: Thomas Peuss
> Assignee: Grant Ingersoll
> Priority: Minor
> Attachments: CompoundTokenFilter.patch, CompoundTokenFilter.patch,
> CompoundTokenFilter.patch, CompoundTokenFilter.patch,
> CompoundTokenFilter.patch, CompoundTokenFilter.patch,
> CompoundTokenFilter.patch, de.xml, hyphenation.dtd
>
>
> A tokenfilter to decompose compound words you find in many germanic languages
> (like German, Swedish, ...) into single tokens.
> An example: Donaudampfschiff would be decomposed to Donau, dampf, schiff so
> that you can find the word even when you only enter "Schiff".
> I use the hyphenation code from the Apache XML project FOP
> (http://xmlgraphics.apache.org/fop/) to do the first step of decomposition.
> Currently I use the FOP jars directly. I only use a handful of classes from
> the FOP project.
> My question now:
> Would it be OK to copy this classes over to the Lucene project (renaming the
> packages of course) or should I stick with the dependency to the FOP jars?
> The FOP code uses the ASF V2 license as well.
> What do you think?
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]