Build failed in Hudson: Solr-trunk #702

2009-02-02 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Solr-trunk/702/changes -- [...truncated 6009 lines...] [junit] Running org.apache.solr.analysis.DoubleMetaphoneFilterFactoryTest [junit] Tests run: 2, Failures: 0, Errors: 0, Time elapsed: 0.445 sec

[jira] Commented: (SOLR-1000) DIH FileListEntityProcessor fileName filters directory names and stops recursion

2009-02-02 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12669662#action_12669662 ] Shalin Shekhar Mangar commented on SOLR-1000: - First the ClassCastException was

[jira] Updated: (SOLR-820) replicate After startup for new replication

2009-02-02 Thread Akshay K. Ukey (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akshay K. Ukey updated SOLR-820: Attachment: SOLR-820.patch Patch in sync with trunk. replicate After startup for new replication

[jira] Commented: (SOLR-1001) using invariant request values from solrconfig.xml inside a data-config.xml regexp

2009-02-02 Thread Fergus McMenemie (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12669705#action_12669705 ] Fergus McMenemie commented on SOLR-1001: I could probably hack around this myself

[jira] Commented: (SOLR-1001) using invariant request values from solrconfig.xml inside a data-config.xml regexp

2009-02-02 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12669714#action_12669714 ] Shalin Shekhar Mangar commented on SOLR-1001: - bq. Is it possible that an

[jira] Updated: (SOLR-799) Add support for hash based exact/near duplicate document handling

2009-02-02 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated SOLR-799: - Attachment: SOLR-799.patch This patch fixes some oddness with how the enabled setting worked and removes

StatsComponent Multivalued Fields

2009-02-02 Thread smock
Hello, I'm curious if there are plans to extend the StatsComponent to cover multivalued fields? Is it a performance issue? Could I get some advice on how I'd incorporate that functionality into the component myself? I'm not sure how to access multivalued fields using the FieldCache...

Re: Leading Wildcard Search

2009-02-02 Thread Mark Miller
smock wrote: Hi, I'm curious about the performance issues around leading wildcard search - is there any way to get around it? Could someone explain to me the nature of the issue? Thanks! Harish A lucene/solr index is much like the index in the back of a book. Imagine I ask you to look up

Re: Leading Wildcard Search

2009-02-02 Thread smock
Hi Mark, Thanks! That clears things up quite a bit. Are there plans to incorporate a solr 'wildcard index' to contain infix terms, or alternately contain a backwards index to get around this term? I'll plan on using the workaround in the meantime. -Harish markrmiller wrote: smock wrote:

[jira] Updated: (SOLR-943) Make it possible to specify dataDir in solr.xml

2009-02-02 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-943: Attachment: SOLR-943.patch updated to trunk Make it possible to specify dataDir in solr.xml

Re: StatsComponent Multivalued Fields

2009-02-02 Thread Ryan McKinley
On Feb 2, 2009, at 4:50 PM, smock wrote: Hello, I'm curious if there are plans to extend the StatsComponent to cover multivalued fields? I don't think anyone is activly working on that no. Like most things, the reason for the absense is not that it would not be a good idea rather no

[jira] Resolved: (SOLR-999) XPathRecordReader fails on XMLs with nodes mixed with CDATA content

2009-02-02 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar resolved SOLR-999. Resolution: Fixed Committed revision 739962. Thanks Fergus and Noble!

[jira] Assigned: (SOLR-994) EnumeratedEntityTransformer

2009-02-02 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar reassigned SOLR-994: -- Assignee: Shalin Shekhar Mangar EnumeratedEntityTransformer

Re: StatsComponent Multivalued Fields

2009-02-02 Thread smock
Thanks - I really appreciate your response. Any advice on how I can use the UnInvertedField to access multivalued attributes? I'm assuming using the TermEnum logic in SimpleFacets will suffer performance issues... I'd be happy to contribute the code back if I can get this working. ryantxu

[jira] Commented: (SOLR-921) SolrResourceLoader must cache name vs class

2009-02-02 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12669795#action_12669795 ] Hoss Man commented on SOLR-921: --- Looking at the latest patch, I now see the restriction that

[jira] Commented: (SOLR-999) XPathRecordReader fails on XMLs with nodes mixed with CDATA content

2009-02-02 Thread Fergus McMenemie (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12669620#action_12669620 ] Fergus McMenemie commented on SOLR-999: --- Tested this morning. Works for me!

[jira] Created: (SOLR-1001) using invariant request values from solrconfig.xml inside a data-config.xml regexp

2009-02-02 Thread Fergus McMenemie (JIRA)
using invariant request values from solrconfig.xml inside a data-config.xml regexp -- Key: SOLR-1001 URL: https://issues.apache.org/jira/browse/SOLR-1001 Project: Solr

[jira] Updated: (SOLR-999) XPathRecordReader fails on XMLs with nodes mixed with CDATA content

2009-02-02 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar updated SOLR-999: --- Description: XPathRecordReader fails with mixed ontent example: {code:xml}

Re: Leading Wildcard Search

2009-02-02 Thread Koji Sekiguchi
Mark, How about introducing ReverseStringFilter into Lucene to solve this kind of problem? :) https://issues.apache.org/jira/browse/LUCENE-1398 Thank you, Koji smock wrote: Hi Mark, Thanks! That clears things up quite a bit. Are there plans to incorporate a solr 'wildcard index' to

Re: Leading Wildcard Search

2009-02-02 Thread Mark Miller
Nice Koji, I hadn't seen that. I'll take some time to look closer at the patch. I'm going to take a look at your new Lucene Highlighter code when I get some time too. Sounds like good stuff. - Mark Koji Sekiguchi wrote: Mark, How about introducing ReverseStringFilter into Lucene to solve

[jira] Created: (SOLR-1002) Change SolrIndexSearcher to use insertWithOverflow as Lucene does

2009-02-02 Thread Mark Miller (JIRA)
Change SolrIndexSearcher to use insertWithOverflow as Lucene does - Key: SOLR-1002 URL: https://issues.apache.org/jira/browse/SOLR-1002 Project: Solr Issue Type: Improvement

[jira] Updated: (SOLR-1002) Change SolrIndexSearcher to use insertWithOverflow as Lucene does

2009-02-02 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated SOLR-1002: -- Attachment: SOLR-1002.patch first stab Change SolrIndexSearcher to use insertWithOverflow as Lucene

Re: StatsComponent Multivalued Fields

2009-02-02 Thread Ryan McKinley
Sorry no real advice on the UnInvertedField -- its a bit voodo to me, but take a crack at it. That said depending on the size of your index and query response time needs, using the TermEnum may be just fine. On Feb 2, 2009, at 6:17 PM, smock wrote: Thanks - I really appreciate your

[jira] Commented: (SOLR-844) A SolrServer impl to front-end multiple urls

2009-02-02 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12669863#action_12669863 ] Noble Paul commented on SOLR-844: - Hoss. Thanks bq.If the behavior of this class can be

[jira] Commented: (SOLR-921) SolrResourceLoader must cache name vs class

2009-02-02 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12669864#action_12669864 ] Noble Paul commented on SOLR-921: - We have a usecase where we load/unload tens of 1000's (yes

solrindexsearcher delegation

2009-02-02 Thread Mark Miller
Is there a reason that SolrIndexSearcher still delegates to an IndexSearcher rather than extending it? All I can think is that maybe you don't want it identifiable as an IndexSearcher? Or that there is no reason currently... - Mark

[jira] Created: (SOLR-1003) XPathEntityprocessor must allow slurping all text from a given xml node and its children

2009-02-02 Thread Noble Paul (JIRA)
XPathEntityprocessor must allow slurping all text from a given xml node and its children Key: SOLR-1003 URL: https://issues.apache.org/jira/browse/SOLR-1003

[jira] Updated: (SOLR-1003) XPathEntityprocessor must allow slurping all text from a given xml node and its children

2009-02-02 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-1003: - Attachment: SOLR-1003.patch XPathEntityprocessor must allow slurping all text from a given xml node and

[jira] Updated: (SOLR-1001) using invariant request values from solrconfig.xml inside a data-config.xml regexp

2009-02-02 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-1001: - Attachment: SOLR-1001.patch this just fixes the RegexTransformer. We may take a look at the other

[jira] Issue Comment Edited: (SOLR-820) replicate After startup for new replication

2009-02-02 Thread Akshay K. Ukey (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12669686#action_12669686 ] akshay edited comment on SOLR-820 at 2/2/09 9:12 PM: - Patch in

[jira] Assigned: (SOLR-943) Make it possible to specify dataDir in solr.xml

2009-02-02 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar reassigned SOLR-943: -- Assignee: Shalin Shekhar Mangar Make it possible to specify dataDir in solr.xml

[jira] Updated: (SOLR-996) Improve Evaluator API

2009-02-02 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-996: Attachment: SOLR-996.patch Improve Evaluator API - Key: SOLR-996