[jira] Created: (LUCENE-2575) Concurrent byte and int block implementations

2010-07-28 Thread Jason Rutherglen (JIRA)
Concurrent byte and int block implementations - Key: LUCENE-2575 URL: https://issues.apache.org/jira/browse/LUCENE-2575 Project: Lucene - Java Issue Type: Improvement Components: Index

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-07-28 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893490#action_12893490 ] Jason Rutherglen commented on LUCENE-2312: -- I think we can change the *BlockPool

Lucene Test Case Failure: org.apache.lucene.index.TestIndexWriterMergePolicy.testMaxBufferedDocsChange (from TestIndexWriterMergePolicy)

2010-07-28 Thread Mark Miller
Error Message maxMergeDocs=2147483647; numSegments=11; upperBound=10; mergeFactor=10; segs=_65:c5950 _5t:c10->_32 _5u:c10->_32 _5v:c10->_32 _5w:c10->_32 _5x:c10->_32 _5y:c10->_32 _5z:c10->_32 _60:c10->_32 _61:c10->_32 _62:c6->_32 _64:c4->_62 Stacktrace junit.framework.AssertionFailedError: maxMe

Re: SOLR > SolrJ : SolrServer using Http Components

2010-07-28 Thread Chris Hostetter
: is there already someone working or using an implementation of : SolrServer that uses Http Components (http://hc.apache.org/) ? I think the lack of reply is the answer to your question. : someone is already working on it - maybe we can create a Jira issue und : synchronize the work? (The migra

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893456#action_12893456 ] Yonik Seeley commented on LUCENE-1799: -- Hmmm, interesting. I'm sure my JVM is 64 bit

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-07-28 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893450#action_12893450 ] Jason Rutherglen commented on LUCENE-2312: -- We need to figure out a way to concur

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893413#action_12893413 ] Robert Muir commented on LUCENE-1799: - yeah it did (it didnt seem 'stable' but the fir

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893403#action_12893403 ] Yonik Seeley commented on LUCENE-1799: -- bq. I think your benchmark isnt very reliable

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893398#action_12893398 ] Robert Muir commented on LUCENE-1799: - {quote} But looking at the benchmark, it looks

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893391#action_12893391 ] Robert Muir commented on LUCENE-1799: - by the way, to explain your results on french a

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893386#action_12893386 ] Robert Muir commented on LUCENE-1799: - bq. I was genuinely surprised when you reported

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893381#action_12893381 ] Yonik Seeley commented on LUCENE-1799: -- bq. You havent really been measuring performa

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893368#action_12893368 ] Robert Muir commented on LUCENE-1799: - bq. I have only been measuring performance at t

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893362#action_12893362 ] Yonik Seeley commented on LUCENE-1799: -- bq. so if you want to improve indexing speed,

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893357#action_12893357 ] Robert Muir commented on LUCENE-1799: - I dont think its measurable. 100 million string

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893353#action_12893353 ] Yonik Seeley commented on LUCENE-1799: -- Ummm, so you're against actually measuring an

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893343#action_12893343 ] Robert Muir commented on LUCENE-1799: - {quote} Well... hopefully it's not an issue. Th

Lucene Test Case Failure: org.apache.lucene.index.TestIndexWriter.testCommitThreadSafety (from TestIndexWriter)

2010-07-28 Thread Mark Miller
Error Message MockRAMDirectory: cannot close: there are still open files: {_1m.cfs=1, _1k.cfs=1, _14.cfs=1, _1g.cfs=1, _1h.cfs=1, _1f.cfs=1, _1n.cfs=1, _1i.cfs=1, _1j.cfs=1, _1l.cfs=1} Stacktrace java.lang.RuntimeException: MockRAMDirectory: cannot close: there are still open files: {_1m.cfs=1,

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893339#action_12893339 ] Yonik Seeley commented on LUCENE-1799: -- bq. Thats good news, so we can encode 100 mil

[jira] Updated: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley updated LUCENE-1799: - Attachment: Benchmark.java OK, hopefully the right Benchmark.java this time ;-) > Unicode compr

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893329#action_12893329 ] Robert Muir commented on LUCENE-1799: - Thats good news, so we can encode 100 million s

[jira] Commented: (SOLR-1902) Tika no longer properly extracts content in Solr

2010-07-28 Thread David Thibault (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893328#action_12893328 ] David Thibault commented on SOLR-1902: -- OK, I tried Tommaso's patch and it worked great

[jira] Updated: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley updated LUCENE-1799: - Attachment: Benchmark.java > Unicode compression > --- > > Key:

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893324#action_12893324 ] Yonik Seeley commented on LUCENE-1799: -- OK, I just tried Robert's Benchmark.java (i.e

[jira] Created: (LUCENE-2574) Optimize copies between IndexInput and Output

2010-07-28 Thread Shai Erera (JIRA)
Optimize copies between IndexInput and Output - Key: LUCENE-2574 URL: https://issues.apache.org/jira/browse/LUCENE-2574 Project: Lucene - Java Issue Type: Improvement Components: Store

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893317#action_12893317 ] Robert Muir commented on LUCENE-1799: - I just insist there is no real difference betwe

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893312#action_12893312 ] Michael McCandless commented on LUCENE-1799: The char[] -> byte[] encode time

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893309#action_12893309 ] Robert Muir commented on LUCENE-1799: - Yonik, please see my issue. the fact we can en

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893306#action_12893306 ] Yonik Seeley commented on LUCENE-1799: -- bq. in general you wont get much compression

[jira] Updated: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1799: Attachment: Benchmark.java attached is my benchmark for english text. UTF-8: 15530ms BOCU-1: 1568

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893297#action_12893297 ] Yonik Seeley commented on LUCENE-1799: -- bq. Yonik can you give more details about how

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893292#action_12893292 ] Michael Busch commented on LUCENE-1799: --- Yonik can you give more details about how y

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893288#action_12893288 ] Robert Muir commented on LUCENE-1799: - yonik, what were you benchmarking? I think you

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-28 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893280#action_12893280 ] Yonik Seeley commented on LUCENE-1799: -- I took a stab at benchmarking encoding speed

[jira] Commented: (LUCENE-2573) Tiered flushing of DWPTs by RAM with low/high water marks

2010-07-28 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893265#action_12893265 ] Jason Rutherglen commented on LUCENE-2573: -- Users probably won't customize to tha

[jira] Commented: (SOLR-1352) DIH: MultiThreaded

2010-07-28 Thread Russell Teabeault (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893264#action_12893264 ] Russell Teabeault commented on SOLR-1352: - I believe there have been some incompatib

[jira] Commented: (LUCENE-2573) Tiered flushing of DWPTs by RAM with low/high water marks

2010-07-28 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893256#action_12893256 ] Michael Busch commented on LUCENE-2573: --- Yeah I like that better too. Will implemen

[jira] Commented: (LUCENE-2573) Tiered flushing of DWPTs by RAM with low/high water marks

2010-07-28 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893248#action_12893248 ] Michael McCandless commented on LUCENE-2573: I think just keep it simple? Use

[jira] Created: (LUCENE-2573) Tiered flushing of DWPTs by RAM with low/high water marks

2010-07-28 Thread Michael Busch (JIRA)
Tiered flushing of DWPTs by RAM with low/high water marks - Key: LUCENE-2573 URL: https://issues.apache.org/jira/browse/LUCENE-2573 Project: Lucene - Java Issue Type: Improvement

[jira] Resolved: (LUCENE-2561) Fix exception handling and thread safety in realtime branch

2010-07-28 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch resolved LUCENE-2561. --- Resolution: Fixed TestStressIndexing2 is not failing because of concurrency problems, so I'm

[jira] Commented: (SOLR-2019) Jetty sometimes randomly takes a long time to start

2010-07-28 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893226#action_12893226 ] Uwe Schindler commented on SOLR-2019: - bq. That jetty runner is not a test class. Ah t

[jira] Commented: (SOLR-2019) Jetty sometimes randomly takes a long time to start

2010-07-28 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893224#action_12893224 ] Michael McCandless commented on SOLR-2019: -- bq. patch that checks sysprop, set from

[jira] Commented: (SOLR-2019) Jetty sometimes randomly takes a long time to start

2010-07-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893211#action_12893211 ] Mark Miller commented on SOLR-2019: --- That jetty runner is not a test class. > Jetty somet

[jira] Commented: (SOLR-2019) Jetty sometimes randomly takes a long time to start

2010-07-28 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893210#action_12893210 ] Uwe Schindler commented on SOLR-2019: - I see no problem in havin an insecure hash genera

[jira] Updated: (SOLR-2019) Jetty sometimes randomly takes a long time to start

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-2019: -- Attachment: SOLR-2019.patch patch that checks sysprop, set from 'ant test' > Jetty sometimes randomly t

[jira] Commented: (SOLR-2019) Jetty sometimes randomly takes a long time to start

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893200#action_12893200 ] Robert Muir commented on SOLR-2019: --- seems slightly hackish, but is it ok to check a syspr

[jira] Commented: (SOLR-2019) Jetty sometimes randomly takes a long time to start

2010-07-28 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893199#action_12893199 ] Michael McCandless commented on SOLR-2019: -- That patch works for me!! > Jetty some

[jira] Updated: (SOLR-2019) Jetty sometimes randomly takes a long time to start

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-2019: -- Attachment: SOLR-2019_insecure.patch here is a patch (not for committing) to see if it resolves it. if s

[jira] Commented: (SOLR-2019) Jetty sometimes randomly takes a long time to start

2010-07-28 Thread Chris Male (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893196#action_12893196 ] Chris Male commented on SOLR-2019: -- It is possible to set Random in the SessionIDManager pr

[jira] Created: (SOLR-2019) Jetty sometimes randomly takes a long time to start

2010-07-28 Thread Michael McCandless (JIRA)
Jetty sometimes randomly takes a long time to start --- Key: SOLR-2019 URL: https://issues.apache.org/jira/browse/SOLR-2019 Project: Solr Issue Type: Bug Reporter: Michael McCandles

[jira] Updated: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1799: Attachment: LUCENE-1799.patch here it is with first stab at decoder (its correct against random ic

[jira] Created: (LUCENE-2572) Maven artifacts for Lucene 4 are not stored in the correct path

2010-07-28 Thread Anthony Signoret (JIRA)
Maven artifacts for Lucene 4 are not stored in the correct path --- Key: LUCENE-2572 URL: https://issues.apache.org/jira/browse/LUCENE-2572 Project: Lucene - Java Issue Type: Bug

[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-28 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanislaw Osinski updated SOLR-1804: Attachment: SOLR-1804-carrot2-3.4.0-dev-trunk.patch A patch against solr trunk, the libs are

[jira] Updated: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1799: Attachment: LUCENE-1799.patch oops, forgot a check in the surrogate case. > Unicode compression >

[jira] Updated: (LUCENE-1799) Unicode compression

2010-07-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1799: Attachment: LUCENE-1799.patch i optimized the surrogate case here, moving it into the 'prev' calcu

RE: busywait hang using extracting update handler on trunk

2010-07-28 Thread karl.wright
One of the characters that causes trouble is unicode character 243. Karl --- original message --- From: "Wright Karl (Nokia-MS/Cambridge)" Subject: RE: busywait hang using extracting update handler on trunk Date: July 28, 2010 Time: 6:0:4 AM It appears that whenever I see a merge failure, I a

RE: busywait hang using extracting update handler on trunk

2010-07-28 Thread karl.wright
It appears that whenever I see a merge failure, I also apparently have a corrupt index (I get arrayindexoutofbounds exceptions when searching for certain things). So that may be the underlying cause of the merge infinite loop. I've blown away the indexes repeatedly and tried to rebuild. I am

[jira] Commented: (SOLR-1352) DIH: MultiThreaded

2010-07-28 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893093#action_12893093 ] Noble Paul commented on SOLR-1352: -- y do you need to apply a patch? DIH is a separate .jar