Hi,
For a Chinese token like ABCD (where A, B, C and D are Chinese characters),
CJKAnalyzer will generate the following overlapping bigrams: AB BC CD.
Thus a query containing a single Chinese character will not retrieve any
documents. To overcome this, you have to index Chinese characters as single
toke
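To make the bigram behaviour described above concrete, here is a minimal plain-Java sketch. It is not Lucene code and not how CJKAnalyzer is implemented; the class and method names are made up for the illustration, and it assumes every character fits in a single Java char.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, for illustration only (not part of Lucene).
class CjkBigramSketch {

    // Overlapping bigrams: "ABCD" -> AB, BC, CD
    static List<String> bigrams(String token) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i + 1 < token.length(); i++) {
            out.add(token.substring(i, i + 2));
        }
        return out;
    }

    // One term per character: "ABCD" -> A, B, C, D
    static List<String> unigrams(String token) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i < token.length(); i++) {
            out.add(token.substring(i, i + 1));
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> bi = bigrams("ABCD");
        List<String> uni = unigrams("ABCD");
        // A single-character query term "A" is not among the bigram terms,
        // but it is among the single-character terms.
        System.out.println(bi + " contains A? " + bi.contains("A"));
        System.out.println(uni + " contains A? " + uni.contains("A"));
    }
}
```

The query term has to match an indexed term exactly, which is why a one-character query finds nothing in a bigram index.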
Hi,
You should extend the SegmentReader (or IndexReader) class to implement the
following method:
public long termID(Term t) throws IOException {
    return tis.getPosition(t);
}
which gives you a way to get the ID of a given term. This ID is simply
the position of that term within ".
Hi,
Here's an example of the analyzer's main method:
public TokenStream tokenStream(String fieldName, Reader reader) {
    TokenStream result = new StandardTokenizer(reader);
    result = new StandardFilter(result);
    result = new LowerCaseFilter(result);
    result = new StopFilter(result, s
Hi,
To handle such a problem, you should use an analyzer with a stemmer (the
Porter stemmer, for example). You just have to add the stemmer filter to
your analyzer.
Hope this helps,
Samir
-Original Message-
From: spinergywmy [mailto:[EMAIL PROTECTED]
Sent: Monday, 27 August 2007 05:55
To:
Hi,
Take a look at the class ISOLatin1AccentFilter. Add it to your analyzer
and it should work.
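To show the kind of folding meant here, below is a minimal plain-Java sketch that strips accents with java.text.Normalizer. It is not the code of ISOLatin1AccentFilter, only an approximation of the idea; the class and method names are made up for the illustration, and ligatures are left unchanged because NFD decomposition does not split them.

```java
import java.text.Normalizer;

// Hypothetical helper, for illustration only (not part of Lucene).
class AccentFoldingSketch {

    // Decompose accented characters into base letter + combining mark,
    // then drop the combining marks.
    static String stripAccents(String text) {
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}", "");
    }

    public static void main(String[] args) {
        System.out.println(stripAccents("élève"));
        System.out.println(stripAccents("français"));
    }
}
```

Applying the same folding at index time and at query time means an accented and an unaccented spelling of a word end up as the same term.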
Hope this will help,
Samir
-Original Message-
From: Chris Lu [mailto:[EMAIL PROTECTED]
Sent: Monday, 30 July 2007 20:06
To: java-user@lucene.apache.org
Subject: a question for frenc
Hi,
Take a look at http://www.unine.ch/info/clef where you'll find valuable
resources for many languages including French.
Samir
-Original Message-
From: Renaud Paquay [mailto:[EMAIL PROTECTED]
Sent: Friday, 22 December 2006 10:54
To: java-user@lucene.apache.org
Subject: Frenc
ing 1.9 or 2? I want to make
sure I successfully removed the old version
Samir Abdou wrote:
>
> If you're trying to use the 1.9 version it's OK! The 'Keyword' static
> method is not available in the 2.0 version.
>
> Field field = new Field("url
>existing field object?
This is creating a new object.
Samir
Samir Abdou wrote:
>
> Field field = new Field("url","http://localhost:8500/cfdocs/dochome.htm",
> Field.Store.YES, Field.Index.UN_TOKENIZED);
>
> You need to translate this to CFScript :-)
>
add documents in the new version instead?
Thanks,
Aaron
Samir Abdou wrote:
>
> Hi,
>
> The method Field.Keyword is no longer available in the latest version of Lucene!
>
> Which version are you using?
>
> Regards,
> Samir
>
> -Original Message-
>
Hi,
The method Field.Keyword is no longer available in the latest version of Lucene!
Which version are you using?
Regards,
Samir
-Original Message-
From: Aaron Shaw [mailto:[EMAIL PROTECTED]
Sent: Monday, 4 December 2006 11:08
To: java-user@lucene.apache.org
Subject: Problem: "The selected
Hi,
You indexed without storing term vectors! This is why the term vector is null.
Samir
-Original Message-
From: Paz Belmonte [mailto:[EMAIL PROTECTED]
Sent: Tuesday, 24 October 2006 12:30
To: java-user
Subject: Re: number of term occurrences
Hi,
I have tried these options too and the Te