Hi, Peter!
I got that. I restricted MARKFAST on segments. It works just nearly perfect.
How does MARKFAST match things? Using
Document{-MARKFAST(MyType, { a, b, a b });
on
a b
yields
a b and b but not a.
I would like to have a as well. Can this be done?
Buy the way: I love Ruta.apply().
would be appreciated.
Thanks,
Armin
-Ursprüngliche Nachricht-
Von: Peter Klügl [mailto:pklu...@uni-wuerzburg.de]
Gesendet: Mittwoch, 22. Mai 2013 15:09
An: user@uima.apache.org
Betreff: Re: AW: Ruta - MARKFAST
Hi,
yes this example won't work without changes, because the word list
]
Gesendet: Mittwoch, 22. Mai 2013 15:09
An: user@uima.apache.org
Betreff: Re: AW: Ruta - MARKFAST
Hi,
yes this example won't work without changes, because the word list is
sensitive to white spaces, e.g., you distinguish between n.C. and n.
C.. I know this sound like a bug, but it is rather
Hello Jörn,
absolutely right. But for now I'm still a nooby. That's why I'm asking so much.
Cheers,
Armin
-Ursprüngliche Nachricht-
Von: Jörn Kottmann [mailto:kottm...@gmail.com]
Gesendet: Donnerstag, 23. Mai 2013 14:24
An: user@uima.apache.org
Betreff: Re: Ruta - MARKFAST
On
Hi Peter,
your example does work perfectly fine. But try this as word list and input
document:
nach Christus
nach der Zeitenwende
n. C.
n.C.
nC.
n. Chr.
n. d. Z.
n.d.Z.
unserer Zeit
unserer Zeitrechnung
u. Z.
u.Z.
v. C.
v.C.
vC.
v. Chr.
v. d. Z.
v.d.Z.
vor Christus
vor der Zeitenwende
vor
Hi,
yes this example won't work without changes, because the word list is
sensitive to white spaces, e.g., you distinguish between n.C. and n.
C.. I know this sound like a bug, but it is rather a feature.
In order to solve your problem you could either remove all spaces in
your word list, you