Re: 2.0 release

2006-05-06 Thread Tatu Saloranta
--- Maxim Patramanskij <[EMAIL PROTECTED]> wrote: > Currently, buffer sizes for BufferedIndexInput and > BufferedIndexOutput are equals and have constant > size of 1024 bytes. > > When using a database for index persistence, it > slowdowns performance much > because of relatively small buffer si

Re: Vector

2006-05-06 Thread Tatu Saloranta
--- karl wettin <[EMAIL PROTECTED]> wrote: ... > Even Sun recommends Collection.synchronizedList over > Vector when thread > safty is an issue. > > I belive that replaced with Linked- and ArrayLists > it could save a whole > bunch of ticks at heavy load. Changing Vectors systematically to ArrayL

Re: bytecount as prefix

2006-05-06 Thread Marvin Humphrey
No progress yet. I think my next move is to do what I did when trying to get KinoSearch to write Lucene-compatible indexes: 1) Generate an optimized split-file format Lucene index from a pathological test corpus. 2) Hack KinoSearch so that it ought to produce an index which is identical

[jira] Updated: (LUCENE-564) Class DisjunctionSumScorer does not need to be public.

2006-05-06 Thread paul.elschot (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-564?page=all ] paul.elschot updated LUCENE-564: Attachment: DisjunctionSumScorerPatch6.txt Patch also updates copyright year. > Class DisjunctionSumScorer does not need to be public. > --

[jira] Created: (LUCENE-564) Class DisjunctionSumScorer does not need to be public.

2006-05-06 Thread paul.elschot (JIRA)
Class DisjunctionSumScorer does not need to be public. -- Key: LUCENE-564 URL: http://issues.apache.org/jira/browse/LUCENE-564 Project: Lucene - Java Type: Bug Components: Search Reporter: paul.elschot Se

Re: Vector

2006-05-06 Thread karl wettin
On Sat, 2006-05-06 at 08:55 -0700, Otis Gospodnetic wrote: > I doubt we'd be able to get much juice out of move > unsynchronized Java Collections I might be the only one here that counts every wasted tick? :) But it is not the synchronization I think is the big thief. A LinkedList could do the j

Re: bytecount as prefix

2006-05-06 Thread Marvin Humphrey
On Sat, May 06, 2006 at 05:11:02PM +0900, David Balmain wrote: > Hi Marvin, > > Where are you with this? I also have a vested interest in seeing > Lucene move to using byte counts. I was wondering if I could help out. > Is the patch you pasted here the latest you have? All I've added since then i

Re: Vector

2006-05-06 Thread Otis Gospodnetic
I think most of Vector (or Hashtable) references are leftovers from the pre-Java Collections era, that's all. I doubt we'd be able to get much juice out of move unsynchronized Java Collections, although I'd like to see them for the same reason as Yonik. Otis - Original Message From: Yo

Re: Vector

2006-05-06 Thread Yonik Seeley
On 5/6/06, karl wettin <[EMAIL PROTECTED]> wrote: There are a couple of Vector:s in the code. Is it really necessary to use this expensive thread safe artifact from the dark ages? I've wondered that myself ... seeing "Vector" in the code does hurt my eyes a little :-) It's just one of those thi

Re: 2.0 release

2006-05-06 Thread Maxim Patramanskij
Currently, buffer sizes for BufferedIndexInput and BufferedIndexOutput are equals and have constant size of 1024 bytes. When using a database for index persistence, it slowdowns performance much because of relatively small buffer size. With JDBCDirectory and buffer size increased from 1Kb to 16K

[jira] Commented: (LUCENE-563) IndexReader currently has javadoc errors

2006-05-06 Thread paul.elschot (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-563?page=comments#action_12378196 ] paul.elschot commented on LUCENE-563: - Here, the javadoc output (after the patch, thanks) has warnings on missing packages/classes: jtidy in HtmlDocument.java in contrib/a

Re: bytecount as prefix

2006-05-06 Thread David Balmain
Hi Marvin, Where are you with this? I also have a vested interest in seeing Lucene move to using byte counts. I was wondering if I could help out. Is the patch you pasted here the latest you have? Cheers, Dave On 4/12/06, Marvin Humphrey <[EMAIL PROTECTED]> wrote: Greets, I'm back working on

Re: Vector

2006-05-06 Thread Erik Hatcher
On May 6, 2006, at 3:40 AM, karl wettin wrote: On Sat, 2006-05-06 at 03:28 -0400, Erik Hatcher wrote: On May 6, 2006, at 2:29 AM, karl wettin wrote: There are a couple of Vector:s in the code. Is it really necessary to use this expensive thread safe artifact from the dark ages? +1 Does an

Re: Vector

2006-05-06 Thread karl wettin
On Sat, 2006-05-06 at 03:28 -0400, Erik Hatcher wrote: On May 6, 2006, at 2:29 AM, karl wettin wrote: > > > There are a couple of Vector:s in the code. Is it really necessary to > > use this expensive thread safe artifact from the dark ages? > > +1 > > Does anyone have any numbers on the perform

Re: Vector

2006-05-06 Thread Erik Hatcher
+1 Does anyone have any numbers on the performance differences on such a refactoring? I reckon it wouldn't be that hard to put together a reasonably representative dataset and test before/after. Who's game? Erik - the new dad (again :)! On May 6, 2006, at 2:29 AM, karl wettin wr