instantiated contrib

2010-08-26 Thread Li Li
I have about 70k document, the total indexed size is about 15MB(the orginal text files' size). dir=new RAMDirectory(); IndexWriter write=new IndexWriter(dir,...; for(loop){ writer.addDocument(doc); }

Re: instantiated contrib

2010-08-26 Thread Earwin Burrfoot
IMHO, Instantiated sucks GC-wise. Put more docs in it, do enough queries, and RAMDir eventually outperforms it. On Thu, Aug 26, 2010 at 11:24, Li Li fancye...@gmail.com wrote: I have about 70k document, the total indexed size is about 15MB(the orginal text files' size).                dir=new

[jira] Commented: (LUCENE-2095) Document not guaranteed to be found after write and commit

2010-08-26 Thread vijaykumarraja.grandhi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902829#action_12902829 ] vijaykumarraja.grandhi commented on LUCENE-2095: Thank you for guiding me

[jira] Commented: (LUCENE-2611) IntelliJ IDEA setup

2010-08-26 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902853#action_12902853 ] Steven Rowe commented on LUCENE-2611: - bq. Can you provide any information on tests

[jira] Updated: (SOLR-1873) Commit Solr Cloud to trunk

2010-08-26 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated SOLR-1873: -- Attachment: SOLR-1873.patch Latest patch - I think the funny extra line spaces are fixed - appears to

[jira] Commented: (LUCENE-2611) IntelliJ IDEA setup

2010-08-26 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902858#action_12902858 ] Robert Muir commented on LUCENE-2611: - Thanks Steven. sounds like we should re-open

[jira] Commented: (LUCENE-2611) IntelliJ IDEA setup

2010-08-26 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902860#action_12902860 ] Steven Rowe commented on LUCENE-2611: - bq. sounds like we should re-open LUCENE-2398

[jira] Updated: (SOLR-2046) add a function to set curl_url in scripts-util

2010-08-26 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated SOLR-2046: - Attachment: SOLR-2046.patch A new patch attached. In this patch, I'm going to remove readercycle

[jira] Commented: (LUCENE-2611) IntelliJ IDEA setup

2010-08-26 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902863#action_12902863 ] Robert Muir commented on LUCENE-2611: - bq. Yeah, I thought of that, but e.g. DIH's

[jira] Updated: (SOLR-2046) add a function to set curl_url in scripts-util

2010-08-26 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated SOLR-2046: - Attachment: SOLR-2046.patch add a function to set curl_url in scripts-util

[jira] Updated: (SOLR-2052) Allow for a list of filter queries and a single docset filter in QueryComponent

2010-08-26 Thread Stephen Green (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephen Green updated SOLR-2052: Affects Version/s: 4.0 (was: 1.4.1) Allow for a list of filter queries

[jira] Created: (LUCENENET-373) MultiThreading failed in IndexWritter

2010-08-26 Thread Thirumaran.Kasi (JIRA)
MultiThreading failed in IndexWritter - Key: LUCENENET-373 URL: https://issues.apache.org/jira/browse/LUCENENET-373 Project: Lucene.Net Issue Type: Test Environment: WindowsXP(SP3), .net

Possible problem with SimpleFacetsTest in Solr trunk?

2010-08-26 Thread Mark Miller
The latest solr tests are spitting out the exception below. Expected? Looking at the test, it doesn't look expected - though I'm not sure why the test doesn't fail: [junit] SEVERE: Exception during facet counts:java.lang.NegativeArraySizeException [junit] at

[jira] Updated: (LUCENENET-373) MultiThreading failed in IndexWritter

2010-08-26 Thread Thirumaran.Kasi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thirumaran.Kasi updated LUCENENET-373: -- Description: Hi, We are upgraded Lucene version from 1.9.0 to 2.9.2. When we

Re: Possible problem with SimpleFacetsTest in Solr trunk?

2010-08-26 Thread Yonik Seeley
On Thu, Aug 26, 2010 at 10:21 AM, Mark Miller mark.mil...@lucidimagination.com wrote:  The latest solr tests are spitting out the exception below. Expected? Looking at the test, it doesn't look expected - though I'm not sure why the test doesn't fail: I just checked in a faceting optimization

Getting facets for a field from within a SearchComponent

2010-08-26 Thread Luke Venediger
Hi Folks, I'm writing my first SearchComponent to do custom calculations on search results. Is it possible to get the facet values for a field from within a SearchComponent? I've thought of adapting the StatsComponent and FieldFacetStats classes to try and accomplish this. But before I try that,

[jira] Updated: (SOLR-1665) Add debugTimings param so that timings for components can be retrieved without having to do explains(), as in debugQuery

2010-08-26 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated SOLR-1665: -- Attachment: SOLR-1665.patch Here's a patch for this. Adds support for debug=timing,

DIH nested entity and multiple columns

2010-08-26 Thread cmd
my confile xml like this. dataConfig dataSource driver=org.hsqldb.jdbcDriver url=jdbc:hsqldb:/temp/example/ex user=sa / document name=products entity name=item query=select * from item field column=ID name=id / field column=NAME name=name /

[jira] Updated: (LUCENE-2239) Revise NIOFSDirectory and its usage due to NIO limitations on Thread.interrupt

2010-08-26 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2239: Attachment: LUCENE-2239.patch This patch adds a heads-up to FSDirectory to make uses

[jira] Commented: (SOLR-1665) Add debugTimings param so that timings for components can be retrieved without having to do explains(), as in debugQuery

2010-08-26 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902909#action_12902909 ] Grant Ingersoll commented on SOLR-1665: --- I intend to commit in a day or two. Add

SolrPerformanceFactors wiki page says contradictory things...

2010-08-26 Thread Eric Pugh
Under Factors affecting memory usage there is this text: When processing an add command for a document, the standard XML update handler has two limitations: • All of the document's fields must simultaneously fit into memory. (Technically, it's actually the sum of min(the actual field

Re: SolrPerformanceFactors wiki page says contradictory things...

2010-08-26 Thread Yonik Seeley
On Thu, Aug 26, 2010 at 12:06 PM, Eric Pugh ep...@opensourceconnections.com wrote: Under Factors affecting memory usage there is this text: When processing an add command for a document, the standard XML update handler has two limitations:        • All of the document's fields must

[jira] Updated: (LUCENE-2623) Random Test Failure org.apache.lucene.index.TestIndexWriter.testAddIndexesWithThreads (from TestIndexWriter)

2010-08-26 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-2623: Component/s: Tests Random Test Failure

[jira] Updated: (LUCENE-2622) Random Test Failure org.apache.lucene.TestExternalCodecs.testPerFieldCodec (from TestExternalCodecs)

2010-08-26 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-2622: Component/s: Tests Random Test Failure org.apache.lucene.TestExternalCodecs.testPerFieldCodec

[jira] Commented: (SOLR-1665) Add debugTimings param so that timings for components can be retrieved without having to do explains(), as in debugQuery

2010-08-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902936#action_12902936 ] Yonik Seeley commented on SOLR-1665: This seems a bit over-engineered? What's the point

[jira] Updated: (SOLR-1316) Create autosuggest component

2010-08-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley updated SOLR-1316: --- Attachment: SOLR-1316.patch Here's a patch to the tests: - adds ASL header - removes @Override from

[jira] Commented: (SOLR-1316) Create autosuggest component

2010-08-26 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902993#action_12902993 ] Mark Miller commented on SOLR-1316: --- bq. removes @Override from interface methods whys

[jira] Commented: (SOLR-1316) Create autosuggest component

2010-08-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902996#action_12902996 ] Yonik Seeley commented on SOLR-1316: Makes it easier to backport to 3x later if desired.

[jira] Commented: (SOLR-1665) Add debugTimings param so that timings for components can be retrieved without having to do explains(), as in debugQuery

2010-08-26 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903010#action_12903010 ] Grant Ingersoll commented on SOLR-1665: --- I was just going to do a enum type, but then

[jira] Closed: (SOLR-2091) wt=json is served as text/plain

2010-08-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley closed SOLR-2091. -- Resolution: Duplicate closing as duplicate of SOLR-1123 wt=json is served as text/plain

[jira] Commented: (SOLR-1123) Change the JSONResponseWriter content type

2010-08-26 Thread Lorrin Nelson (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903015#action_12903015 ] Lorrin Nelson commented on SOLR-1123: - I propose treating this as two issues: A

[jira] Commented: (SOLR-1665) Add debugTimings param so that timings for components can be retrieved without having to do explains(), as in debugQuery

2010-08-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903016#action_12903016 ] Yonik Seeley commented on SOLR-1665: bq. ResponseBuilder.addDebugInfo() w/o the interest

Re: instantiated contrib

2010-08-26 Thread Earwin Burrfoot
I meant exactly what I said. My opinion is that Instantiated puts more strain on GC compared to RAMDir. If that aspect is negligible for you (less docs, smaller heap) it quite probably runs faster, if not - after some point GC costs will dwarf any gains. 2010/8/26 Li Li fancye...@gmail.com: do

[jira] Commented: (SOLR-236) Field collapsing

2010-08-26 Thread Peter Kieltyka (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903028#action_12903028 ] Peter Kieltyka commented on SOLR-236: - Hey guys, How difficult would it be to add the

[jira] Commented: (SOLR-2091) wt=json is served as text/plain

2010-08-26 Thread Paul R. Brown (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903100#action_12903100 ] Paul R. Brown commented on SOLR-2091: - Can we bump the other issue from improvement to

[jira] Commented: (SOLR-2026) Need infrastructure support in Solr for requests that perform multiple sequential queries

2010-08-26 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903111#action_12903111 ] Hoss Man commented on SOLR-2026: bq. I understand those use-cases but I still have a hard

Re: Should analysis.jsp honor maxFieldLength

2010-08-26 Thread Robert Muir
On Thu, Aug 26, 2010 at 6:06 PM, Chris Hostetter hossman_luc...@fucit.orgwrote: we could concievable support having LimitTokenCountFilter added implicitly even if that option isn't used, via some syntax like you suggest -- but honestly i think it's better to just let the user declare it like

[jira] Reopened: (LUCENE-2295) Create a MaxFieldLengthAnalyzer to wrap any other Analyzer and provide the same functionality as MaxFieldLength provided on IndexWriter

2010-08-26 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir reopened LUCENE-2295: - I'm reopening this for discussion (both because it came up on the mailing list and Mike's question

[jira] Updated: (LUCENE-2625) IndexReader.termDocs() retrieves no documents

2010-08-26 Thread Adriano Crestani (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adriano Crestani updated LUCENE-2625: - Attachment: TestTermDocs.java This test case reproduces the problem. It fails using

[jira] Commented: (SOLR-2091) wt=json is served as text/plain

2010-08-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903161#action_12903161 ] Yonik Seeley commented on SOLR-2091: oops - first of all, apologies for reading your

[jira] Created: (SOLR-2092) Use a native priority queue to order facet results

2010-08-26 Thread Yonik Seeley (JIRA)
Use a native priority queue to order facet results -- Key: SOLR-2092 URL: https://issues.apache.org/jira/browse/SOLR-2092 Project: Solr Issue Type: Improvement Reporter: Yonik

[jira] Updated: (SOLR-2092) Use a native priority queue to order facet results

2010-08-26 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley updated SOLR-2092: --- Attachment: SOLR-2092.patch Here's a draft patch with only UnInvertedField converted from using

[jira] Commented: (LUCENE-2625) IndexReader.termDocs() retrieves no documents

2010-08-26 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903208#action_12903208 ] Shai Erera commented on LUCENE-2625: You add the field as NOT_ANALYZED, therefore

[jira] Commented: (LUCENE-2625) IndexReader.termDocs() retrieves no documents

2010-08-26 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903210#action_12903210 ] Robert Muir commented on LUCENE-2625: - Hi, I'm not sure about this one being a bug: #

[jira] Commented: (LUCENE-2625) IndexReader.termDocs() retrieves no documents

2010-08-26 Thread Adriano Crestani (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903211#action_12903211 ] Adriano Crestani commented on LUCENE-2625: -- Hi Shai, Sorry, but what do you mean

[jira] Commented: (LUCENE-2625) IndexReader.termDocs() retrieves no documents

2010-08-26 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903219#action_12903219 ] Shai Erera commented on LUCENE-2625: Sorry, I've misread NOT_ANALYZED w/ NO. Too early

[jira] Commented: (LUCENE-2625) IndexReader.termDocs() retrieves no documents

2010-08-26 Thread Adriano Crestani (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903221#action_12903221 ] Adriano Crestani commented on LUCENE-2625: -- Just confirmed here, invoking seek

[jira] Commented: (LUCENE-2625) IndexReader.termDocs() retrieves no documents

2010-08-26 Thread Adriano Crestani (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903220#action_12903220 ] Adriano Crestani commented on LUCENE-2625: -- Thanks for the quick reply Shai and

[jira] Commented: (LUCENE-2625) IndexReader.termDocs() retrieves no documents

2010-08-26 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12903227#action_12903227 ] Robert Muir commented on LUCENE-2625: - bq. In my opinion, being unpositioned means the