Re: TermVectorOffsetStrategy producing Passages with matches out of order? (causing IndexOutOfBoundsException)

2023-07-04 Thread Chris Hostetter
I hacked up the test a bit so it would compile against 9.0 and confirmed the problem existed there as well. So going back a little farther with some manual bisection (to account for the transition from ant to gradle) lead me to the following... # first bad commit: [2719cf6630eb2bd7cb37d0e8462

Re: TermVectorOffsetStrategy producing Passages with matches out of order? (causing IndexOutOfBoundsException)

2023-06-29 Thread Chris Hostetter
With some trial and error I realized two things... 1) the order of the terms in the BooleanQuery seems to matter - but in terms of their "natural order", not the order in the doc (which is why i was so confused trying to reproduce it) 2) the problem happens when using termVectors but

TermVectorOffsetStrategy producing Passages with matches out of order? (causing IndexOutOfBoundsException)

2023-06-29 Thread Chris Hostetter
I've got a user getting java.lang.IndexOutOfBoundsException from the UnifiedHighlighter in Solr 9.1.0 w/Lucene 9.3.0 (And FWIW, this same data, w/same configs, in 8.11.1, purportedtly didn't have this problem) I don't really understand the highlighter code very well, but AFAICT: - Defaul

Re: What does "out of order" mean?

2009-12-01 Thread Stefan Trcek
On Monday 30 November 2009 18:51:34 Nick Burch wrote: > On Mon, Nov 30, 2009 at 12:22 PM, Stefan Trcek wrote: > > I'd do, but was not successful to get the svn repo some months ago. > > I have to claim the sys admin for any svn repo to open a door > > through the firewall. Gave up due to > > > >

Re: What does "out of order" mean?

2009-12-01 Thread Michael McCandless
OK I committed this, plus further removes of "expert" from TopDocs, to trunk (future 3.1), 2.9 and 3.0 branches. Thanks! Mike On Tue, Dec 1, 2009 at 5:31 AM, Stefan Trcek wrote: > On Tuesday 01 December 2009 11:07:41 Michael McCandless wrote: >> OK -- none of IndexSearcher's search methods need

Re: What does "out of order" mean?

2009-12-01 Thread Michael McCandless
Super, thanks. I'll commit your patch, fixing javadocs for TopDocs/TopFieldDocs. Mike On Tue, Dec 1, 2009 at 5:31 AM, Stefan Trcek wrote: > On Tuesday 01 December 2009 11:07:41 Michael McCandless wrote: >> OK -- none of IndexSearcher's search methods needed tweaking?  Just >> TopDocs/TopFieldDo

Re: What does "out of order" mean?

2009-12-01 Thread Stefan Trcek
On Tuesday 01 December 2009 11:07:41 Michael McCandless wrote: > OK -- none of IndexSearcher's search methods needed tweaking? Just > TopDocs/TopFieldDocs? Yes, you can use these methods in Searcher, they are sufficient: TopDocs Searcher.search(Query query, Filter filter, int n) TopFieldDocs Sea

Re: What does "out of order" mean?

2009-12-01 Thread Michael McCandless
OK -- none of IndexSearcher's search methods needed tweaking? Just TopDocs/TopFieldDocs? Mike On Tue, Dec 1, 2009 at 4:52 AM, Stefan Trcek wrote: > On Monday 30 November 2009 18:42:50 Michael McCandless wrote: >> I was able to apply that git patch just fine -- so I think it'll >> work? > > Good

Re: What does "out of order" mean?

2009-12-01 Thread Stefan Trcek
On Monday 30 November 2009 18:42:50 Michael McCandless wrote: > I was able to apply that git patch just fine -- so I think it'll > work? Good to hear it works that simple. This patch completes the task. It is a "two file" patch, so if this will work too, I'm confident. Stefan > On Mon, Nov 30, 2

Re: What does "out of order" mean?

2009-11-30 Thread Nick Burch
On Mon, Nov 30, 2009 at 12:22 PM, Stefan Trcek wrote: I'd do, but was not successful to get the svn repo some months ago. I have to claim the sys admin for any svn repo to open a door through the firewall. Gave up due to $ nmap -p3690 svn.apache.org     PORT     STATE    SERVICE     3690/tcp fi

Re: What does "out of order" mean?

2009-11-30 Thread Michael McCandless
I was able to apply that git patch just fine -- so I think it'll work? Thanks! Mike On Mon, Nov 30, 2009 at 12:22 PM, Stefan Trcek wrote: > On Monday 30 November 2009 14:24:20 Michael McCandless wrote: >> I agree, it's silly we label things like TopDocs/TopFieldDocs as >> expert -- they are no

Re: What does "out of order" mean?

2009-11-30 Thread Stefan Trcek
On Monday 30 November 2009 14:24:20 Michael McCandless wrote: > I agree, it's silly we label things like TopDocs/TopFieldDocs as > expert -- they are no longer for "low level" APIs (or, perhaps since > we've removed the "high level" API (= Hits), what remains should no > longer be considered low le

Re: What does "out of order" mean?

2009-11-30 Thread Michael McCandless
I agree, it's silly we label things like TopDocs/TopFieldDocs as expert -- they are no longer for "low level" APIs (or, perhaps since we've removed the "high level" API (= Hits), what remains should no longer be considered low level). Do you wanna cough up a patch to correct these? Mike On Mon,

Re: What does "out of order" mean?

2009-11-30 Thread Stefan Trcek
On Friday 27 November 2009 14:49:07 Michael McCandless wrote: > So the "don't care" equivalent here is to use IndexSearcher's normal > search APIs (ie, we don't use Version to switch this on or off). Hmm - Searcher/IndexSearchers search methods are "Low level", "Expert", "Expert + low level" or r

Re: What does "out of order" mean?

2009-11-27 Thread Stefan Trcek
On Friday 27 November 2009 14:49:07 Michael McCandless wrote: > > So the "don't care" equivalent here is to use IndexSearcher's normal > search APIs (ie, we don't use Version to switch this on or off). Thanks for the hint. For an unknown reason I once fell into the "search(query, filter, collecto

Re: What does "out of order" mean?

2009-11-27 Thread Michael McCandless
On Fri, Nov 27, 2009 at 8:13 AM, Stefan Trcek wrote: > On Friday 27 November 2009 12:07:07 Michael McCandless wrote: >> Re: What does "out of order" mean? >> >> It refers to the order in which the docIDs are delivered to your >> Collector. >> >> &

Re: What does "out of order" mean?

2009-11-27 Thread Stefan Trcek
On Friday 27 November 2009 12:07:07 Michael McCandless wrote: > Re: What does "out of order" mean? > > It refers to the order in which the docIDs are delivered to your > Collector. > > "Normally" they are always delivered in increasing order. > > However

Re: What does "out of order" mean?

2009-11-27 Thread Michael McCandless
Ds to your collector out of order. In this case, docs are processed in batches (chunks of 1024 docIDs at once), and within a batch you may receive docIDs out of order. Many collectors don't mind getting docIDs out of order, and so it's important to return "true" from your acceptDocsO

What does "out of order" mean?

2009-11-27 Thread Alexander Veit
Hi, The documentation of org.apache.lucene.search.Collector uses the obscure term "out of order". What does "order" mean? The natural order of document IDs, a scoring order, or some other order? -- Cheers, Alex

Re: out of order

2007-08-16 Thread Michael McCandless
> (i'm all thumbs when it comes to index internals and the file formats, but > > i'm just tossig it out there as an idea) > > > > > > > > -Hoss > > > > > > - > >

Re: out of order

2007-08-16 Thread testn
, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > > -- View this message in context: http://www.nabble.com/out-of-order-tf4276385.html#a12187972 Sent from the Lucene - Java Users mailing list archive at Nabble.com. -

Re: out of order

2007-08-16 Thread Chris Hostetter
: After you close that IndexWriter, can you list the files in your : directory (that's a RAMDirectory right?)? Something like this: The OP said this was a fairly small RAMDirectory index right? would it be worth while to just write the whole thing to disk and post it onlin so people could see ev

Re: out of order

2007-08-16 Thread Michael McCandless
> > Closing/ > > opening writers at different times? Any changes to the default parameters > > (mergeFactor, maxBufferedDocs, etc.)? > > > > Mike > > > > "testn" <[EMAIL PROTECTED]> wrote: > >> > >> Here you go > &

Re: out of order

2007-08-16 Thread testn
etc.)? > > Mike > > "testn" <[EMAIL PROTECTED]> wrote: >> >> Here you go >> >> -> Error during the indexing : docs out of order (0 <= 0 ) >> org.apache.lucene.index.CorruptIndexException: docs out of order (0 <=

Re: out of order

2007-08-16 Thread Michael McCandless
this RAMDirectory? Closing/ opening writers at different times? Any changes to the default parameters (mergeFactor, maxBufferedDocs, etc.)? Mike "testn" <[EMAIL PROTECTED]> wrote: > > Here you go > > -> Error during the index

Re: out of order

2007-08-16 Thread testn
Here you go -> Error during the indexing : docs out of order (0 <= 0 ) org.apache.lucene.index.CorruptIndexException: docs out of order (0 <= 0 ) at org.apache.lucene.index.SegmentMerger.appendPostings(SegmentMerger.java:368)

Re: out of order

2007-08-16 Thread Michael McCandless
t happens, I will try to capture > the stacktrace. > > > > Michael McCandless-2 wrote: > > > > > > "testn" <[EMAIL PROTECTED]> wrote: > >> > >> Using Lucene 2.2.0, I still sporadically got doc out of order error. I > >

Re: out of order

2007-08-15 Thread testn
in theory lead to > this too. > > Are you able to easily reproduce it? Can you post the full exception? > > Mike > > - > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail:

Re: out of order

2007-08-15 Thread Michael McCandless
"testn" <[EMAIL PROTECTED]> wrote: > > Using Lucene 2.2.0, I still sporadically got doc out of order error. I > indexed all of my stuff in one thread. Do you have any idea why it > happens? Hm, that is not good. I thought we had finally fixed this with LUCENE-14

out of order

2007-08-15 Thread testn
Using Lucene 2.2.0, I still sporadically got doc out of order error. I indexed all of my stuff in one thread. Do you have any idea why it happens? Thanks! -- View this message in context: http://www.nabble.com/out-of-order-tf4276385.html#a12172277 Sent from the Lucene - Java Users mailing list

"docs out of order" IllegalStateException during merge (LUCENE-140)

2007-01-11 Thread Michael McCandless
Hi all, I would like to draw your attention to an open and rather devious long-standing index corruption issue that we've only now finally gotten to the bottom of: https://issues.apache.org/jira/browse/LUCENE-140 If you hit this, you will typically see a "docs ou

java.io.IOException: term out of order --> HELP

2006-10-03 Thread Michael J. Prichard
We get this when trying to optimize index: Exception in thread "main" java.io.IOException: term out of order at org.apache.lucene.index.TermInfosWriter.add(TermInfosWriter.java:95) at org.apache.lucene.index.SegmentMerger.mergeTermInfo(SegmentMerger.ja

Re: Out-of-order distinct

2006-06-14 Thread Grant Ingersoll
You could implement your own HitCollector interface and remove lower scoring duplicates as you come across them by using a Map or something to keep track as you go. Ken Kinder wrote: I've poked around on google and the archives quite a bite, but I can't find exactly what I need. Say I have a q

Out-of-order distinct

2006-06-14 Thread Ken Kinder
I've poked around on google and the archives quite a bite, but I can't find exactly what I need. Say I have a query that would normally return a set of documents: 1 002 (text...) 2 001 (text...) 3 001 (text...) 4 002 (text...) 5 004 (text...) I'd like that modified to be: 1 002 (text...) 2 001

"docs out of order" while merging

2006-03-15 Thread Paulo Silveira
I ve just get a "docs out of order". I have a database that is indexed everytime an update occurs. The index was ok for the last 3 weeks, and now, after the system throwed an exception because of a write lock that was not released (and I deleted it) I am recebing this: Can anyone

java.io.IOException: term out of order

2005-08-18 Thread Dan Quaroni
out this in the faq or the archives that I looked at. java.io.IOException: term out of order at org.apache.lucene.index.TermInfosWriter.add(TermInfosWriter.java:93) at org.apache.lucene.index.SegmentMerger.mergeTermInfo(SegmentMerger.java:

Re: "docs out of order exception" in Lucene. Please help.

2005-05-18 Thread Matt Magoffin
can't get the error to occur. For me this is occuring via IndexWriter.close(), however: java.lang.IllegalStateException: docs out of order at org.apache.lucene.index.SegmentMerger.appendPostings(SegmentMerger.java:353) at org.apache.lucene.index.SegmentMerger.mergeTermInfo(SegmentMerge

Re: "docs out of order exception" in Lucene. Please help.

2005-03-23 Thread Matt Quail
Writers, either. java.lang.IllegalStateException: docs out of order at org.apache.lucene.index.SegmentMerger.appendPostings(SegmentMerger.java: 353) at org.apache.lucene.index.SegmentMerger.mergeTermInfo(SegmentMerger.java: 316) at org.apache.lucene.index.SegmentMerger.mergeTermInfos(SegmentMerger.

"docs out of order exception" in Lucene. Please help.

2005-03-23 Thread Alexis Rodriguez
aders is closed, we assign a new MultiReader to the variable holding it. So, the problem is that randomly, but oftenly, when we are doing a close on the IndexWriter (after adding a document), we get an "docs out of order" exception. Problem seems to be with some SegmentMergeInfos in