You are most likely using an Analyzer that throws out non-alphanumeric characters (probably StandardAnalyzer), so the '-' is dropped before the number becomes a token. You can create your own Analyzer to do exactly what you want. A sample Analyzer is in the Lucene FAQ at http://jguru.com/.
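To make the idea concrete, here is a rough sketch of the token rule such a custom Analyzer could apply. It is plain java.util.regex, not Lucene code: the class name SignedNumberTokenizer and the rule itself (a '-' is kept only when it directly precedes digits and does not follow a letter or digit) are my own assumptions for illustration. In a real Analyzer you would put equivalent logic into a Tokenizer written against whatever Lucene version you run.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, NOT part of Lucene: shows a token rule that keeps
// the sign on negative numbers instead of treating '-' as a separator.
class SignedNumberTokenizer {

    // First alternative: '-' followed by digits, allowed only when the
    // previous character is not a letter or digit (so "item-42" still splits).
    // Second alternative: any run of letters and digits.
    private static final Pattern TOKEN =
        Pattern.compile("(?<![\\p{L}\\p{N}])-\\d+|[\\p{L}\\p{N}]+");

    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        Matcher m = TOKEN.matcher(text);
        while (m.find()) {
            tokens.add(m.group());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [balance, -1234, on, item, 42]
        System.out.println(tokenize("balance -1234 on item-42"));
    }
}
```

Whatever rule you settle on, use the same Analyzer for indexing and for parsing queries, otherwise a search for -1234 will not line up with the indexed token.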
Otis

--- Lixin Meng <[EMAIL PROTECTED]> wrote:
> Browsing through some of previous discussion, but I have to say that I
> couldn't find a solution for this. Would you mind provide more clue on this?
>
> Regards,
> Lixin
>
> -----Original Message-----
> From: Terry Steichen [mailto:[EMAIL PROTECTED]
> Sent: Tuesday, March 25, 2003 7:14 PM
> To: Lucene Users List; [EMAIL PROTECTED]
> Subject: Re: Tokenize negative number
>
> Probably tokenized 1234 as a string and treated '-' as a separator. See
> previous discussion on "query".
>
> Regards,
>
> Terry
>
> ----- Original Message -----
> From: "Lixin Meng" <[EMAIL PROTECTED]>
> To: "'Lucene Users List'" <[EMAIL PROTECTED]>
> Sent: Tuesday, March 25, 2003 9:16 PM
> Subject: Tokenize negative number
>
> > I have a document with content '.... -1234 ....'. However, after calling the
> > StandardTokenizer, the token only has '1234' (missed the '-') as tokeText.
> >
> > Did anyone experience the similar problem and is there a work around?
> >
> > Regards,
> > Lixin

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]