Re: Incremental Field Updates

2010-05-10 Thread Shai Erera
> > but because of the cost of preparing the inputs (i.e. text > extraction) to Lucene. > You're right ! That and also the cost of fetching the document, in systems where the content lives on other servers/systems. Reindexing is usually (depends on your analysis chain) the cheapest step. Shai On

Re: Incremental Field Updates

2010-05-10 Thread Babak Farhang
>> My take on it is that if someone wants to update the catch-all field, then >> reindexing the document may not be such a bad idea anyway. The purpose of >> those incremental updates is to cope w/ high frequency of updates, which >> usually happen on metadata fields, and not title. > > I agree. I

[jira] Created: (LUCENE-2455) Some house cleaning in addIndexes*

2010-05-10 Thread Shai Erera (JIRA)
Some house cleaning in addIndexes* -- Key: LUCENE-2455 URL: https://issues.apache.org/jira/browse/LUCENE-2455 Project: Lucene - Java Issue Type: Improvement Components: Index Reporter: Sh

Re: When to use addIndexes and when addIndexesNoOptimize

2010-05-10 Thread Shai Erera
> > Hmm addDirectories feels a bit too low level > I don't mind calling it addIndexes(Directory...), but I don't think it's too low level - whoever executes the method passes Directory... and that's exactly what the method does :). Two addIndexes force you to go read the jdoc, but so will addDirec

[jira] Commented: (SOLR-1782) stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields

2010-05-10 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12866012#action_12866012 ] Hoss Man commented on SOLR-1782: Crux of the problem is in FieldFacetStats's dependency on u

[jira] Updated: (SOLR-1782) stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields

2010-05-10 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-1782: --- Attachment: SOLR-1782.test.patch patch containing two new test methods that demonstrate this bug -- one create

[jira] Updated: (SOLR-1782) stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields

2010-05-10 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-1782: --- Summary: stats.facet assumes FieldCache.StringIndex - fails horribly on multivalued fields (was: unexpect

Re: Build failed in Hudson: Lucene-3.x #9

2010-05-10 Thread Robert Muir
NOTE: random seed of testcase 'testIndexingThenDeleting' was: 16958521939581297 On Mon, May 10, 2010 at 7:13 PM, Apache Hudson Server < hud...@hudson.zones.apache.org> wrote: > See > > Changes: > > [rmuir] increase random testing f

Build failed in Hudson: Lucene-3.x #9

2010-05-10 Thread Apache Hudson Server
See Changes: [rmuir] increase random testing for hudson [rmuir] support cranking up the memory index test randomness [mikemccand] LUCENE-2449: add 'knob' (-Drandom.multiplier=N, defaults to 1) to tune up how much random testing i

[jira] Commented: (SOLR-1782) unexpected statscomponent values

2010-05-10 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865973#action_12865973 ] Hoss Man commented on SOLR-1782: I'm poking around the attached RAR file now, and two intere

[jira] Commented: (SOLR-1568) Implement Spatial Filter

2010-05-10 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865942#action_12865942 ] Grant Ingersoll commented on SOLR-1568: --- Sorry for the confusion. The tests for this

[jira] Updated: (SOLR-1841) Unregistering of Searcher MBean doesn't work in Websphere

2010-05-10 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-1841: --- Attachment: SOLR-1841.patch Updated patch to work against trunk ... also tweaked the variable naming conventi

[jira] Commented: (SOLR-1568) Implement Spatial Filter

2010-05-10 Thread David Smiley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865938#action_12865938 ] David Smiley commented on SOLR-1568: I commented too soon before. The results seemed er

Re: Hudson build is back to normal : Lucene-trunk #1184

2010-05-10 Thread Chris Hostetter
: Subject: Hudson build is back to normal : Lucene-trunk #1184 : : See the problem with the earlier builds was a misconfiguration in where hudson was looking for test results -- and since hudson is configured to fail the bui

[jira] Commented: (LUCENE-1585) Allow to control how payloads are merged

2010-05-10 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865932#action_12865932 ] Michael McCandless commented on LUCENE-1585: Patch looks good Shai! I like th

Re: When to use addIndexes and when addIndexesNoOptimize

2010-05-10 Thread Michael McCandless
On Mon, May 10, 2010 at 3:08 PM, Shai Erera wrote: > That's still weird Mike - we call optimize in addIndexes to reduce the > number of SRs, that's fair. So why don't we do that in addIndexesNoOpt? I agree it's weird and inconsistent and all that :) > There, we get a SR per SI. And name of the m

Re: Tomcat 6 & Solr With BASIC Authentication

2010-05-10 Thread Matthew Mauriello
Even after updating the web.xml file in the solr.war file and restarting tomcat as in the example, I am never prompted to input my username and password and I am able to access solr just fine. I checked to make sure the code was copied over correctly after the reset and it was. So I have followed t

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865921#action_12865921 ] Steven Rowe commented on LUCENE-2167: - {quote} bq. A filter that breaks URL type token

Re: Tomcat 6 & Solr With BASIC Authentication

2010-05-10 Thread Matthew Mauriello
Gregor, Thanks for the quick reply. The problem I seem to be having with that example (tried it multiple times) seems to be perhaps with the files I am editing. I have a solr.war file in the webapps folder, do I need to modify that war file or the web.xml file in the solr directory? ~Matt > On 0

Re: Tomcat 6 & Solr With BASIC Authentication

2010-05-10 Thread Gregor Kaczor
On 05/10/2010 08:53 PM, Matthew Mauriello wrote: I am having a problem password protecting my single solr instance in Tomcat 6, does anyone know of a tutorial/example that works in Tomcat 6? Thanks, ~Matt - To unsubscribe, e-m

[jira] Commented: (SOLR-1163) Solr Explorer - A generic GWT client for Solr

2010-05-10 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865905#action_12865905 ] Lance Norskog commented on SOLR-1163: - No, that's fine, I'll try it! > Solr Explorer -

Hudson build is back to normal : Lucene-trunk #1184

2010-05-10 Thread Apache Hudson Server
See - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Commented: (LUCENE-2393) Utility to output total term frequency and df from a lucene index

2010-05-10 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865901#action_12865901 ] Michael McCandless commented on LUCENE-2393: Tom, I agree, we should make it o

[jira] Commented: (LUCENE-2450) Explore write-once attr bindings in the analysis chain

2010-05-10 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865896#action_12865896 ] Michael McCandless commented on LUCENE-2450: Another benefit of the stage mode

[jira] Commented: (LUCENE-2393) Utility to output total term frequency and df from a lucene index

2010-05-10 Thread Tom Burton-West (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865894#action_12865894 ] Tom Burton-West commented on LUCENE-2393: - Hi Mike, Thanks for all your help. I

[jira] Updated: (LUCENE-1585) Allow to control how payloads are merged

2010-05-10 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-1585: --- Attachment: LUCENE-1585_3x.patch Patch includes: * PayloadProcessorProvider which returns DirPayload

[jira] Updated: (LUCENE-2450) Explore write-once attr bindings in the analysis chain

2010-05-10 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2450: --- Attachment: LUCENE-2450.patch OK I ported it (roughly) to Java -- gonna need some se

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865879#action_12865879 ] Robert Muir commented on LUCENE-2167: - bq. A filter that breaks URL type tokens into t

Re: When to use addIndexes and when addIndexesNoOptimize

2010-05-10 Thread Shai Erera
That's still weird Mike - we call optimize in addIndexes to reduce the number of SRs, that's fair. So why don't we do that in addIndexesNoOpt? There, we get a SR per SI. And name of the method suggests optimize() is avoided on purpose ... it's as if addIndexesNoOpt should be called addDirectories,

Tomcat 6 & Solr With BASIC Authentication

2010-05-10 Thread Matthew Mauriello
I am having a problem password protecting my single solr instance in Tomcat 6, does anyone know of a tutorial/example that works in Tomcat 6? Thanks, ~Matt

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865863#action_12865863 ] Steven Rowe commented on LUCENE-2167: - Good point, Marvin - indexing URLs makes no sen

[jira] Commented: (LUCENE-2393) Utility to output total term frequency and df from a lucene index

2010-05-10 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865858#action_12865858 ] Michael McCandless commented on LUCENE-2393: I think we should just replace th

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865857#action_12865857 ] Marvin Humphrey commented on LUCENE-2167: - I find that it works well to parse URLs

Re: When to use addIndexes and when addIndexesNoOptimize

2010-05-10 Thread Michael McCandless
On Mon, May 10, 2010 at 2:18 PM, Shai Erera wrote: > Ahh, I see. Didn't think of IndexReader extensions. Why do we call > optimize() on the local dir in addIndexes then? What's the benefits? I really don't know! Maybe to handle the case where local index has many segments? Ie, reduce the net nu

Re: When to use addIndexes and when addIndexesNoOptimize

2010-05-10 Thread Shai Erera
Ahh, I see. Didn't think of IndexReader extensions. Why do we call optimize() on the local dir in addIndexes then? What's the benefits? We don't do the same on the incoming readers, so why does it matter if e.g. the local dir has 2 segments and the incoming ones have 100? We insist on optimizing th

[jira] Commented: (LUCENE-2413) Consolidate all (Solr's & Lucene's) analyzers into modules/analysis

2010-05-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865838#action_12865838 ] Robert Muir commented on LUCENE-2413: - Committed LUCENE-2413_synonym.patch revision 94

[jira] Commented: (SOLR-1897) The data dir from the core descriptor should override the data dir from the solrconfig.xml rather than the other way round

2010-05-10 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865829#action_12865829 ] Mark Miller commented on SOLR-1897: --- I'm going to do something about this soon - I think i

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865822#action_12865822 ] Robert Muir commented on LUCENE-2167: - {quote} Yup, I would only accept strictly corre

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865819#action_12865819 ] Robert Muir commented on LUCENE-2167: - {quote} I assume you don't mean to say that Eng

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865818#action_12865818 ] Steven Rowe commented on LUCENE-2167: - {quote} bq. One other thing, Robert: what do yo

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865817#action_12865817 ] Steven Rowe commented on LUCENE-2167: - {quote} bq. What do you think about adding tail

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865815#action_12865815 ] Steven Rowe commented on LUCENE-2167: - {quote} bq. Naming will require some thought, t

Re: Lucene.Net 2.9.3

2010-05-10 Thread Leonardo Azize Martins
Hi DIGY, Explain me how can I help. Where are the java codes that needs to be ported to .NET? Leo 2010/5/10 Digy > Hi All, > 2.9.2 tag is created. > > DIGY > > -Original Message- > From: Michael Garski [mailto:mgar...@myspace-inc.com] > Sent: Saturday, May 08, 2010 3:23 AM > To: lucene

Re: When to use addIndexes and when addIndexesNoOptimize

2010-05-10 Thread Michael McCandless
addIndexes accepts IndexReaders, so eg "foreign" IndexReader impls can be passed in (eg FilterIndexReader). While addIndexesNoOptimize accepts Directory, ie it gets a reader using IR.open. Mike On Mon, May 10, 2010 at 5:58 AM, Shai Erera wrote: > Hi > > As I was working on LUCENE-1585 and under
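The distinction Mike draws here can be sketched with a toy model. This is not Lucene code: `Reader`, `Dir`, `FilteredReader`, and `ToyWriter` are hypothetical stand-ins for IndexReader, Directory, FilterIndexReader, and IndexWriter, and only the shape of the two methods follows the thread. The point it tries to show is that a reader-based method lets the caller wrap or filter the input first, while a directory-based method opens its own reader and so always sees the raw contents.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class AddIndexesSketch {
    /** Hypothetical stand-in for IndexReader: anything that can list documents. */
    interface Reader {
        List<String> docs();
    }

    /** Hypothetical stand-in for Directory: storage that the writer opens itself. */
    static class Dir {
        final List<String> stored;

        Dir(List<String> stored) {
            this.stored = stored;
        }

        /** Plays the role of "IR.open" in the thread: a plain, unfiltered reader. */
        Reader open() {
            return () -> stored;
        }
    }

    /** A "foreign" reader in the spirit of FilterIndexReader: wraps another reader and hides some docs. */
    static class FilteredReader implements Reader {
        private final Reader in;
        private final String hiddenPrefix;

        FilteredReader(Reader in, String hiddenPrefix) {
            this.in = in;
            this.hiddenPrefix = hiddenPrefix;
        }

        @Override
        public List<String> docs() {
            List<String> out = new ArrayList<>();
            for (String d : in.docs()) {
                if (!d.startsWith(hiddenPrefix)) {
                    out.add(d);
                }
            }
            return out;
        }
    }

    /** Hypothetical stand-in for IndexWriter, keeping documents in a list. */
    static class ToyWriter {
        final List<String> index = new ArrayList<>();

        /** Reader-based variant: the caller decides which reader implementation is used. */
        void addIndexes(Reader... readers) {
            for (Reader r : readers) {
                index.addAll(r.docs());
            }
        }

        /** Directory-based variant: the writer opens the reader, so no wrapper can be interposed. */
        void addIndexesNoOptimize(Dir... dirs) {
            for (Dir d : dirs) {
                index.addAll(d.open().docs());
            }
        }
    }

    public static void main(String[] args) {
        Dir dir = new Dir(Arrays.asList("tmp-1", "doc-2", "doc-3"));

        ToyWriter viaReader = new ToyWriter();
        viaReader.addIndexes(new FilteredReader(dir.open(), "tmp-"));
        System.out.println("via reader:    " + viaReader.index);

        ToyWriter viaDirectory = new ToyWriter();
        viaDirectory.addIndexesNoOptimize(dir);
        System.out.println("via directory: " + viaDirectory.index);
    }
}
```

The sketch leaves out everything about segments and optimize(), which is what the rest of the thread debates; it only covers the difference in input types.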

Re: Build failed in Hudson: Lucene-trunk #1183

2010-05-10 Thread Michael McCandless
Ahhh yes that "No test report files found. Configuration error?" -- hmm so what's misconfigured? Any Clover experts...? Mike On Mon, May 10, 2010 at 8:47 AM, Robert Muir wrote: > On Mon, May 10, 2010 at 4:41 AM, Michael McCandless > wrote: >> >> I'm also baffled on what went wrong here! >> >>

RE: Lucene.Net 2.9.3

2010-05-10 Thread Digy
Hi All, 2.9.2 tag is created. DIGY -Original Message- From: Michael Garski [mailto:mgar...@myspace-inc.com] Sent: Saturday, May 08, 2010 3:23 AM To: lucene-net-...@lucene.apache.org Subject: RE: Lucene.Net 2.9.3 +1 for the tag Michael -Original Message- From: Digy [mailto:digy

Re: svn commit: r942676 - in /lucene/dev/trunk/lucene: ./ src/java/org/apache/lucene/index/codecs/preflex/ src/java/org/apache/lucene/index/codecs/standard/ src/java/org/apache/lucene/search/ src/ja

2010-05-10 Thread Michael McCandless
+1 We can start w/ 3 and see how long Hudson takes? We also have to figure out why our builds are mysteriously failing! Mike On Mon, May 10, 2010 at 8:26 AM, Robert Muir wrote: > > > On Mon, May 10, 2010 at 3:47 AM, wrote: >> >> Author: mikemccand >> Date: Mon May 10 07:47:12 2010 >> New Revi

Re: Adding another dimension to Lucene searches

2010-05-10 Thread mark harwood
Having implemented this code on a few projects I find that the major challenge shifts from the back end to the problem of the front end and how to get end users to articulate the questions Lucene can answer with this. Certainly an interesting challenge but that's another topic... - Origi

Re: Adding another dimension to Lucene searches

2010-05-10 Thread J. Delgado
Hierarchical documents is a key concept towards a unified structured+unstructured search. It should allow us to fully implement things such as XQuery + Full-Text (http://www.w3.org/TR/xquery-full-text/) Additionally it solves a century old problem: how to deal with section/sub-sections in very large

[jira] Resolved: (LUCENENET-358) CloseableThreadLocal memory leak in LocalDataStoreSlot (with workaround)

2010-05-10 Thread Digy (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Digy resolved LUCENENET-358. Assignee: Digy Resolution: Fixed Patch committed. DIGY > CloseableThreadLocal memory leak in Loc

[jira] Updated: (LUCENE-2454) Nested Document query support

2010-05-10 Thread Mark Harwood (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Harwood updated LUCENE-2454: - Attachment: LuceneNestedDocumentSupport-1.zip Initial attachment is code plus illustrative data/

[jira] Created: (LUCENE-2454) Nested Document query support

2010-05-10 Thread Mark Harwood (JIRA)
Nested Document query support - Key: LUCENE-2454 URL: https://issues.apache.org/jira/browse/LUCENE-2454 Project: Lucene - Java Issue Type: New Feature Components: Search Affects Versions: 3.0.2

solr-...@l.a.o

2010-05-10 Thread Grant Ingersoll
Just a reminder that solr-...@lucene.apache.org is deprecated. Please use d...@lucene.a.o. -Grant

Re: Adding another dimension to Lucene searches

2010-05-10 Thread Grant Ingersoll
Very cool stuff, Mark. Can you just open a JIRA and attach there? On May 10, 2010, at 8:38 AM, mark harwood wrote: > I've put up code, example data and tests for the Nested Document feature > here: http://www.inperspective.com/lucene/LuceneNestedDocumentSupport.zip > > The data used in the uni

Re: svn commit: r942766 - /lucene/mahout/trunk/examples/bin/build-reuters.sh

2010-05-10 Thread Robin Anil
Remove the k-means part. Otherwise it will just launch clustering after reuters is done. On Mon, May 10, 2010 at 8:25 PM, wrote: > Author: jeastman > Date: Mon May 10 14:55:19 2010 > New Revision: 942766 > > URL: http://svn.apache.org/viewvc?rev=942766&view=rev > Log: > Replacing obsolete build

[jira] Commented: (LUCENENET-358) CloseableThreadLocal memory leak in LocalDataStoreSlot (with workaround)

2010-05-10 Thread Ben West (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865785#action_12865785 ] Ben West commented on LUCENENET-358: wow. Searching fell from low hundred millisecon

[jira] Resolved: (SOLR-1902) Tika no longer properly extracts content in Solr

2010-05-10 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved SOLR-1902. --- Resolution: Fixed Upgraded to Tika 0.8-SNAPSHOT and added class loading capabilities. > Tika

Indexing a Reader instead of a String to a field value

2010-05-10 Thread David Smiley (@MITRE.org)
I have a DIH setup in which I obtain a java.io.Reader for a field's value. It's a reader because I'm getting it from a source that may store a lot of text. I traced the value of a field, stored for quite some time as an Object, through Solr until it got to Solr's DocumentBuilder line ~272 which

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865765#action_12865765 ] Robert Muir commented on LUCENE-2167: - bq. One other thing, Robert: what do you think

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865763#action_12865763 ] Robert Muir commented on LUCENE-2167: - {quote} But as you mention in a code comment in

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865759#action_12865759 ] Steven Rowe commented on LUCENE-2167: - One other thing, Robert: what do you think of a

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865757#action_12865757 ] Steven Rowe commented on LUCENE-2167: - bq. should we look at any tailorings to this? T

[jira] Commented: (SOLR-1163) Solr Explorer - A generic GWT client for Solr

2010-05-10 Thread David Smiley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865749#action_12865749 ] David Smiley commented on SOLR-1163: Uri, your comment about the downside of developin

Re: Build failed in Hudson: Lucene-trunk #1183

2010-05-10 Thread Robert Muir
On Mon, May 10, 2010 at 4:41 AM, Michael McCandless wrote: > > I'm also baffled on what went wrong here! > > Mike > It looks like it might have something to do with test report generation? The 3x branch succeeds, without any complaints. BUILD SUCCESSFUL Total time: 5 minutes 7 seconds + exit 0 P

Re: Adding another dimension to Lucene searches

2010-05-10 Thread mark harwood
I've put up code, example data and tests for the Nested Document feature here: http://www.inperspective.com/lucene/LuceneNestedDocumentSupport.zip The data used in the unit tests is chosen to illustrate practical use of real-world content. The final unit tests will work on more abstract data for

Re: svn commit: r942676 - in /lucene/dev/trunk/lucene: ./ src/java/org/apache/lucene/index/codecs/preflex/ src/java/org/apache/lucene/index/codecs/standard/ src/java/org/apache/lucene/search/ src/ja

2010-05-10 Thread Robert Muir
On Mon, May 10, 2010 at 3:47 AM, wrote: > Author: mikemccand > Date: Mon May 10 07:47:12 2010 > New Revision: 942676 > > URL: http://svn.apache.org/viewvc?rev=942676&view=rev > Log: > LUCENE-2449: add 'knob' (-Drandom.multiplier=N, defaults to 1) to tune up > how much random testing is done; fix
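A knob of this kind is usually a JVM system property that scales iteration counts in randomized tests. The following is a minimal sketch of that idea, assuming the property name `random.multiplier` from the commit log; how Lucene's test framework actually reads it is not shown in this thread, and the class name, method names, and the clamp to a minimum of 1 are assumptions made for illustration.

```java
public class RandomMultiplierSketch {
    /**
     * Reads -Drandom.multiplier=N via the JDK's Integer.getInteger, which returns
     * the default (1) when the property is unset or not a valid integer.
     * Clamping to at least 1 is an assumption of this sketch, not documented behavior.
     */
    static int multiplier() {
        return Math.max(1, Integer.getInteger("random.multiplier", 1));
    }

    /** Scales a test's base iteration count by the multiplier. */
    static int scaledIterations(int baseIterations) {
        return baseIterations * multiplier();
    }

    public static void main(String[] args) {
        // Run with e.g. `java -Drandom.multiplier=3 RandomMultiplierSketch`
        System.out.println("iterations: " + scaledIterations(100));
    }
}
```

With a scheme like this, a continuous-integration server can pass a larger N than developers use locally, which appears to be what "start w/ 3 and see how long Hudson takes" refers to elsewhere in the thread.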

[jira] Commented: (LUCENE-2167) Implement StandardTokenizer with the UAX#29 Standard

2010-05-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865743#action_12865743 ] Robert Muir commented on LUCENE-2167: - Hi Steven: this is impressive progress! What d

[jira] Updated: (LUCENE-2413) Consolidate all (Solr's & Lucene's) analyzers into modules/analysis

2010-05-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2413: Attachment: LUCENE-2413_synonym.patch attached is a patch to move synonymfilter/synonymmap into th

[jira] Commented: (LUCENE-1585) Allow to control how payloads are merged

2010-05-10 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865719#action_12865719 ] Shai Erera commented on LUCENE-1585: bq. Make sure you fix the whitespace - some inden

When to use addIndexes and when addIndexesNoOptimize

2010-05-10 Thread Shai Erera
Hi As I was working on LUCENE-1585 and understanding more of the differences between addIndexes and addIndexesNoOptimize, I was wondering why we have those two methods. It seems like addIndexes's usage is discouraged, no? Can someone please explain to me why it isn't deprecated, and addIndexesNoOpt b

[jira] Commented: (LUCENE-1585) Allow to control how payloads are merged

2010-05-10 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865707#action_12865707 ] Michael McCandless commented on LUCENE-1585: Make sure you fix the whitespace

Re: Build failed in Hudson: Lucene-trunk #1183

2010-05-10 Thread Michael McCandless
I'm also baffled on what went wrong here! Mike On Mon, May 10, 2010 at 12:17 AM, Shai Erera wrote: > I don't see any tests that failed. Anybody got an idea why the build > reports a failure? > > On Mon, May 10, 2010 at 6:38 AM, Apache Hudson Server > wrote: > See

Re: Incremental Field Updates

2010-05-10 Thread Michael McCandless
On Mon, May 10, 2010 at 4:05 AM, Shai Erera wrote: > That's an interesting scenario Mike. > > Previously, I only handled boolean-like terms, as the scenarios we were > asked to support involved just those types of terms. Obviously, when the > approach allows for more, more scenarios pop to mind :)

[jira] Resolved: (LUCENE-2449) Improve random testing

2010-05-10 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-2449. Resolution: Fixed > Improve random testing > -- > >

Re: svn commit: r942410 - in /lucene/dev/trunk/lucene/src: java/org/apache/lucene/index/codecs/preflex/ java/org/apache/lucene/index/codecs/standard/ java/org/apache/lucene/util/ test/org/apache/luc

2010-05-10 Thread Michael McCandless
Thanks Mark! Mike On Sat, May 8, 2010 at 12:17 PM, wrote: > Author: markrmiller > Date: Sat May  8 16:17:27 2010 > New Revision: 942410 > > URL: http://svn.apache.org/viewvc?rev=942410&view=rev > Log: > fix package oddities > > Added: >     > lucene/dev/trunk/lucene/src/test/org/apache/lucene/u

[jira] Commented: (LUCENE-1585) Allow to control how payloads are merged

2010-05-10 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865692#action_12865692 ] Shai Erera commented on LUCENE-1585: Small correction - the comment I've made about co

Re: svn commit: r942519 - in /lucene/dev/branches/branch_3x/lucene/src: java/org/apache/lucene/index/ java/org/apache/lucene/util/ test/org/apache/lucene/util/ test/org/apache/lucene/util/cache/

2010-05-10 Thread Michael McCandless
Thanks Shai! Mike On Sun, May 9, 2010 at 6:44 AM, wrote: > Author: shaie > Date: Sun May  9 10:44:17 2010 > New Revision: 942519 > > URL: http://svn.apache.org/viewvc?rev=942519&view=rev > Log: > Wrong import references for o.a.l.*.cache of DBLRUCache > > Added: >     > lucene/dev/branches/bran

Re: Incremental Field Updates

2010-05-10 Thread Michael McCandless
I think another example would be the catch-all field. EG say my app concatenates the title, abstract and body of a document into the catch-all field. But now I want to change just the title. I think in theory (assuming we can work out an intuitive user-level API exposure of this...), on changing

[jira] Commented: (LUCENE-2453) Make Index Output Buffer Size Configurable

2010-05-10 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12865688#action_12865688 ] Shai Erera commented on LUCENE-2453: Patch looks good ! Few comments: * buffer can sti