Re: Untokenized URL

2008-07-04 Thread Shai Erera
Hi Regarding the contentLength, when you add it to the document, do you use *store* it as well (i.e., passing Store.YES or Store.COMPRESS)? Regarding the URL, how do you add it to the document? For example, if you do doc.add(new Field("url", "http://www.cnn.com";, Store.NO, Index.UN_TOKENIZED), i

maven snapshot repository

2008-07-04 Thread Karl Wettin
The snapshots seems to be built every day, but I seems to be producing some jars of a non-trunk revision or branch. Perhaps 2.3.2? It's been like that for at least a month. I just thought it was a fluke the last time I checked. karl ---

Re: TokenStream#reset():boolean?

2008-07-04 Thread Karl Wettin
4 jul 2008 kl. 21.02 skrev Michael McCandless: But what would cause reset() to not actually work? Extentions of TokenStream does not have handle reset(): /** Resets this stream to the beginning. This is an * optional operation, so subclasses may or may not * implement this method.

Re: TokenStream#reset():boolean?

2008-07-04 Thread Michael McCandless
But what would cause reset() to not actually work? And what is a composite stream? I'm just a little confused on the use case here I think... Mike Karl Wettin wrote: I just want to know if my token stream managed to reset or not. Especially that parts of composite streams. karl

Re: TokenStream#reset():boolean?

2008-07-04 Thread Karl Wettin
I just want to know if my token stream managed to reset or not. Especially that parts of composite streams. karl 4 jul 2008 kl. 12.13 skrev Michael McCandless: Karl, I'm sort of confused by this proposal. What is the driver here? It seems like the overall goal is to have rese

Re: TokenStream#reset():boolean?

2008-07-04 Thread Michael McCandless
Karl, I'm sort of confused by this proposal. What is the driver here? It seems like the overall goal is to have reset() return a boolean stating whether it was actually implemented by the subclass of TokenStream? Mike Karl Wettin wrote: How about depricating > public void reset()

[jira] Resolved: (LUCENE-1324) TokenFilter should implement reset()

2008-07-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1324. Resolution: Fixed Fix Version/s: 2.4 Lucene Fields: [New, Patch Availabl

[jira] Commented: (LUCENE-1314) IndexReader.reopen(boolean force)

2008-07-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12610510#action_12610510 ] Michael McCandless commented on LUCENE-1314: Why would you ever need to make a

[jira] Resolved: (LUCENE-1325) add IndexCommit.isOptimized method

2008-07-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1325. Resolution: Fixed > add IndexCommit.isOptimized method > -

Re: changing index format

2008-07-04 Thread Michael McCandless
Well there really is no concrete plan/timeline at this point -- that is the nature of open source. But there are some issues in flight that take us on the first few steps towards flexible indexing. I think LUCENE-1301 (which I'm working on and should be done soon, in 2.4 I think) i

[jira] Resolved: (LUCENE-1326) Inflater.end() method not always called in FieldsReader

2008-07-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1326. Resolution: Fixed Fix Version/s: 2.4 > Inflater.end() method not always cal

[jira] Commented: (LUCENE-1326) Inflater.end() method not always called in FieldsReader

2008-07-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12610489#action_12610489 ] Michael McCandless commented on LUCENE-1326: I'll commit a try/finally for bot

[jira] Assigned: (LUCENE-1326) Inflater.end() method not always called in FieldsReader

2008-07-04 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-1326: -- Assignee: Michael McCandless > Inflater.end() method not always called in Fiel

Untokenized URL

2008-07-04 Thread blazingwolf7
Hi, I am currently working on retrieving url and contentLength of each document found during the search. I want to retrieve it during the calculation of score so that I can influence the score in some other way. I used the methods from TermDocs and TermEnum to get the information. However, the u