Re: Benchmarking results

2006-04-04 Thread Igor Bolotin
For faster Hotspot warm-up you can use Hotspot VM option: -XX:CompileThreshold=NN This option controls number of method invocations/branches before (re-)compiling. Defaults are: 10,000 -server, 1,500 -client. See documentation here: http://java.sun.com/docs/hotspot/VMOptions.html In one of my pre

Re: Test corpus

2006-04-01 Thread Igor Bolotin
Take a look at Project Guttenberg: http://www.gutenberg.org/ Igor On 4/1/06, Pasha Bizhan <[EMAIL PROTECTED]> wrote: > > Hi, > > > From: Marvin Humphrey [mailto:[EMAIL PROTECTED] > > > I'm looking for a test corpus to use for some benchmarking > > and parsing tests. I can whip one up myself, but

[jira] Updated: (LUCENE-532) [PATCH] Indexing on Hadoop distributed file system

2006-03-27 Thread Igor Bolotin (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-532?page=all ] Igor Bolotin updated LUCENE-532: Attachment: indexOnDFS.patch Attached is new patch which is using format number to determine where to read the size as discussed. Thanks! > [PA

[jira] Updated: (LUCENE-532) [PATCH] Indexing on Hadoop distributed file system

2006-03-27 Thread Igor Bolotin (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-532?page=all ] Igor Bolotin updated LUCENE-532: Attachment: TermInfosWriter.patch SegmentTermEnum.patch > [PATCH] Indexing on Hadoop distributed file sys

[jira] Updated: (LUCENE-532) [PATCH] Indexing on Hadoop distributed file system

2006-03-27 Thread Igor Bolotin (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-532?page=all ] Igor Bolotin updated LUCENE-532: Attachment: TermInfosWriter.java SegmentTermEnum.java Two patch files are attached > [PATCH] Indexing on Hadoop distributed file sys

[jira] Created: (LUCENE-532) [PATCH] Indexing on Hadoop distributed file system

2006-03-27 Thread Igor Bolotin (JIRA)
Reporter: Igor Bolotin Priority: Minor Attachments: SegmentTermEnum.java, TermInfosWriter.java In my current project we needed a way to create very large Lucene indexes on Hadoop distributed file system. When we tried to do it directly on DFS using Nutch FsDirectory class - we immediately found