Re: Standard Tokenizer Question

John Paul Sondag Tue, 26 Jun 2007 19:04:43 -0700

That solves getting the actual text but Token.  My other problem is that
Token also has "startOffset" and "endOffset" fields.    Standard Token has
"startColumn/Line" and "endColumn/Line" but I was not exactly sure how to
use these.  Could you possibly give me a small example of using these?  I
think I would be able to if I a line is always the same length
(startLine-1(line_length) + startColumn)


Thanks!

--JP

On 6/26/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:

Token.termText() perhaps is the same as st.getToken(y).image

Andy
-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of John
Paul Sondag
Sent: Wednesday, June 27, 2007 9:32 AM
To: java-user@lucene.apache.org
Subject: Standard Tokenizer Question

Hey,

I Think this is where I ask this.

I'm pretty new to this so this is probably a dumb question.

I'm using the StandardTokenizer class to turn a file into tokens.  I
then
need to be able to later skip to a specific token in the file sent to me
from another source.  So say my StandardTokenizer is called st and I'm
told
to get token y.  At first I was using st.getToken(y).  But this was
returning an object of type  "StandardToken", I really would like to
just
have it return type Token (because it has more useful functions for me
like
termText()).  Right now I just call st.next()  y times to get the y'th
token, which is horribly inefficient but I can't find any way to

a) go right to the token I want
b)  Have it be type Token and not Standard Token.

--JP

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Standard Tokenizer Question

Reply via email to