On 7/13/06, Otis Gospodnetic <[EMAIL PROTECTED]> wrote:
Hi Tomi,
What do you mean by "terms are misrepresented"? What should they be, and what
are you seeing?
I mean that 3 of the 5 accented characters appear in the index with accents
correctly displayed, but the remaining 2 accented characters appear
To: java-user@lucene.apache.org
Sent: Thursday, July 13, 2006 8:19:31 AM
Subject: accented characters, wildcards and other problems
I've done a bit of testing with accented characters (Croatian, to be
specific) and can't really explain what I see when I explore the index
with Luke.
I've used accented characters in directory names, file names and file contents.
Now, in the list of terms (in "Top ranking terms", "Overview" tab)