Re: Is there bug in CJKAnalyzer?

2007-10-22 Thread Samir Abdou
Hi, For a chinese token like ABCD (where A,B,C and D are chinese signs), CJKAnalyzer will generate the following overlapping bigrams: AB BC CD. Thus issuing a query containing one chinese sign will not retrieve any documents. To overcome this, you have to index chinese characters as single toke

Re: Is there a Term ID for each distinctive term indexed in Lucene?

2007-09-01 Thread Samir Abdou
Hi, You should extend the SegmentReader (or IndexReader) class to implement the following method: *public* *long* termID(Term t) *throws* IOException { *return* tis.getPosition(t); } which will give you a mean to get the ID of a given term. This ID is simply the position of that term within ".

RE: Get results from partial search keyword

2007-08-27 Thread Samir Abdou
Hi, Here's an example of the analyzer main's method: public TokenStream tokenStream(String fieldName, Reader reader) { TokenStream result = new StandardTokenizer(reader); result = new StandardFilter(result); result = new LowerCaseFilter(result); result = new StopFilter(result, s

RE: Get results from partial search keyword

2007-08-27 Thread Samir Abdou
Hi, To handle such a problem, you should use an analyzer with a stemmer (Porter stemmer for example). You have just to add the stemmer filter to your analyser. Hope this help, Samir -Message d'origine- De : spinergywmy [mailto:[EMAIL PROTECTED] Envoyé : lundi, 27. août 2007 05:55 À :

RE: a question for french analyzer

2007-07-30 Thread Samir Abdou
Hi, Take a look to the class ISOLatin1AccentFilter ! Add this to your analyzer and it should work ! Hope this will help, Samir -Message d'origine- De : Chris Lu [mailto:[EMAIL PROTECTED] Envoyé : lundi, 30. juillet 2007 20:06 À : java-user@lucene.apache.org Objet : a question for frenc

RE: French stemmer problem

2006-12-22 Thread Samir Abdou
Hi, Take a look to http://www.unine.ch/info/clef where you'll find valuable resources for many languages including French. Samir -Message d'origine- De : Renaud Paquay [mailto:[EMAIL PROTECTED] Envoyé : vendredi, 22. décembre 2006 10:54 À : java-user@lucene.apache.org Objet : Frenc

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Samir Abdou
ing 1.9 or 2? I want to make sure I successfully removed the old version Samir Abdou wrote: > > If you're trying to use the 1.9 version it's OK! The 'Keyword' static > method > is not available in the 2.0 version. > > Field field = new Field("url

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Samir Abdou
>existing field object? This is creating a new object. Samir Samir Abdou wrote: > > Field field = new Field("url","http://localhost:8500/cfdocs/dochome.htm";, > Field.Store.YES, Field.Index.UN_TOKENIZED); > > You need tot translate this to cf script :-) &g

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Samir Abdou
add documents in the new version instead? Thanks, Aaron Samir Abdou wrote: > > Hi, > > The method Field.Keyword is not longer available in Lucene's last version! > > Which version are you using? > > Regards, > Samir > > -Message d'origine- &g

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Samir Abdou
Hi, The method Field.Keyword is not longer available in Lucene's last version! Which version are you using? Regards, Samir -Message d'origine- De : Aaron Shaw [mailto:[EMAIL PROTECTED] Envoyé : lundi, 4. décembre 2006 11:08 À : java-user@lucene.apache.org Objet : Problem: "The selected

RE: number of term occurrences

2006-10-24 Thread Samir Abdou
Hi, You indexed without storing vectors! This is why the term vector is null. Samir -Message d'origine- De : Paz Belmonte [mailto:[EMAIL PROTECTED] Envoyé : mardi, 24. octobre 2006 12:30 À : java-user Objet : Re: number of term occurrences Hi, I have tried this options too and the Te