Hi,

while playing with the various stemmers of Lucene(-1.9.1), I got an
index out of bounds exception:

lucene-1.9.1>java -cp
build/contrib/snowball/lucene-snowball-1.9.2-dev.jar
net.sf.snowball.TestApp Kp bla.txt
Exception in thread "main" java.lang.reflect.InvocationTargetException
       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
       at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:64)
       at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
       at java.lang.reflect.Method.invoke(Method.java:615)
       at net.sf.snowball.TestApp.main(TestApp.java:56)
Caused by: java.lang.StringIndexOutOfBoundsException: String index out
of range: 11
       at java.lang.StringBuffer.charAt(StringBuffer.java:303)
       at net.sf.snowball.SnowballProgram.find_among_b(SnowballProgram.java:270)
       at net.sf.snowball.ext.KpStemmer.r_Step_4(KpStemmer.java:1122)
       at net.sf.snowball.ext.KpStemmer.stem(KpStemmer.java:1997)

This happens when executing
lucene-1.9.1>java -cp
build/contrib/snowball/lucene-snowball-1.9.2-dev.jar
net.sf.snowball.TestApp Kp bla.txt

bla.txt contains just this word: 'spijsvertering'.

After some debugging, and some tests with the original snowball
distribution from snowball.tartarus.org, it seems that the attached
change is needed to avoid the exception.
(The change comes from tartarus' SnowballProgram.java)
Can anyone confirm? Should I open a bug? Do you need more details?

Regards,
Andreas

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to