Hi,
in Lucene 2.4.1 (as such in Lucene 2.2) the term enumeration is sorted
case-sensitive: first capital and then small;
For example, this sort on TermEnum is wrong for
human consumption:
Annales
Cafè
Zucche
cafe
this is the correct sort in this case :
Annales
cafe
Cafè
Zucche
In Lucene
, this sort on TermEnum is wrong for human consumption:
Annales
Cafè
Zucche
cafe
this is the correct sort in this case :
Annales
cafe
Cafè
Zucche
In Lucene 2.2 i have made modification on Term.java, TermBuffer.java
(see below) in order to have Term enumerations sorted case-insensitive:
TermEnum
Data Management S.p.A
ffal...@datamanagement.it wrote:
Hi,
in Lucene 2.4.1 (as such in Lucene 2.2) the term enumeration is sorted
case-sensitive: first capital and then small;
For example, this sort on TermEnum is wrong for human consumption:
Annales
Cafè
Zucche
cafe
this is the correct
uot;"));
while ("myFieldNotTokenized".equals(terms.term().field())) {
System.out.println( " " + terms.term());
if (!terms.next()) break;
}
For example, instead to obtain this sort on TermEnum:
Annales
Caf
Zucche
cafe
i need to obtain this :
Annales
cafe
Caf
Zucche
Now i
quot;));
while ("myFieldNotTokenized".equals(terms.term().field())) {
System.out.println( " " + terms.term());
if (!terms.next()) break;
}
For example, instead to obtain this sort on TermEnum:
Annales
Cafè
Zucche
cafe
i need to obtain this :
Annales
ca
( + terms.term());
if (!terms.next()) break;
}
For example, instead to obtain this sort on TermEnum:
Annales
Cafè
Zucche
cafe
i need to obtain this :
Annales
cafe
Cafè
Zucche
Now in Lucene 2.4 i find it difficult because the package index is changed
a lot; can i have
is not-tokenized):
TermEnum terms = reader.terms(new Term(myFieldNotTokenized, ));
while (myFieldNotTokenized.equals(terms.term().field())) {
System.out.println( + terms.term());
if (!terms.next()) break;
}
For example, instead to obtain this sort on TermEnum:
Annales
, ));
while (myFieldNotTokenized.equals(terms.term().field())) {
System.out.println( + terms.term());
if (!terms.next()) break;
}
For example, instead to obtain this sort on TermEnum:
Annales
Cafè
Zucche
cafe
i need to obtain this :
Annales
cafe