[jira] Updated: (SOLR-846) Out Of memory doing delta import with fetch size set to -1

2008-11-12 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-846: Attachment: SOLR-846.patch A partial solution. Eventually we must stream it or persist the data. > Out Of me

[jira] Updated: (SOLR-844) A SolrServer impl to front-end multiple urls

2008-11-12 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-844: Description: Currently a {{CommonsHttpSolrServer}} can talk to only one server. This demands that the user h

[jira] Updated: (SOLR-844) A SolrServer impl to front-end multiple urls

2008-11-12 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-844: Description: Currently a {{CommonsHttpSolrServer}} can talk to only one server. This demands that the user h

Re: Katta's goodness for Solr

2008-11-12 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Thu, Nov 13, 2008 at 10:11 AM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > I'm not sure if you are commenting on how Katta does things in that > LoadBalancers part, but Katta doesn't do that as far as I know. Passing > shard URL in request is the Solr thing, but I think we concluded shard U

[jira] Assigned: (SOLR-236) Field collapsing

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic reassigned SOLR-236: - Assignee: (was: Otis Gospodnetic) > Field collapsing > > >

[jira] Updated: (SOLR-850) Addition of timeouts for distributed searching

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated SOLR-850: -- Assignee: Otis Gospodnetic I don't see any harm in this. Current values (undefined) mean "no tim

Re: Katta's goodness for Solr

2008-11-12 Thread Otis Gospodnetic
I'm not sure if you are commenting on how Katta does things in that LoadBalancers part, but Katta doesn't do that as far as I know. Passing shard URL in request is the Solr thing, but I think we concluded shard URLs can also live in "defaults" for the handler, no? Otis -- Sematext -- http://s

[jira] Created: (SOLR-853) Make DIH API friendly

2008-11-12 Thread Noble Paul (JIRA)
Make DIH API friendly - Key: SOLR-853 URL: https://issues.apache.org/jira/browse/SOLR-853 Project: Solr Issue Type: Improvement Components: contrib - DataImportHandler Reporter: Noble Paul DIH curr

Re: Katta's goodness for Solr

2008-11-12 Thread Noble Paul നോബിള്‍ नोब्ळ्
The way we do distributed search is not straightforward. Introducing extra layers (LoadBalancers) in between the shards looks like a hack to me. Moreover, passing in the shard URL in the request is not a very nice design. The clients should ideally be unaware of the fact that they are doing a dis

Re: DataImportHandler and Blobs

2008-11-12 Thread Grant Ingersoll
On Nov 12, 2008, at 12:56 PM, Shalin Shekhar Mangar wrote: I think the best way would be a TikaEntityProcessor which knows how to handle documents. I guess a typical use-case would be FileListEntityProcessor->TikaEntityProcessor as parent-child entities. Also see SOLR-833 which adds a FieldRea

[jira] Updated: (SOLR-852) Refactor common code in various handlers for working with ContentStream Objects

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated SOLR-852: - Attachment: SOLR-852.patch Remove the setErrorHeader stuff and just set it in the load. I will lik

[jira] Updated: (SOLR-799) Add support for hash based exact/near duplicate document handling

2008-11-12 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated SOLR-799: - Attachment: SOLR-799.patch This ensures the id field stays unique. Are there any other issues that need to

[jira] Updated: (SOLR-830) snappuller picks bad snapshot name

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated SOLR-830: -- Fix Version/s: 1.3.1 > snappuller picks bad snapshot name > -- >

[jira] Updated: (SOLR-346) need to improve snapinstaller to ignore non-snapshots in data directory

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated SOLR-346: -- Fix Version/s: 1.3.1 > need to improve snapinstaller to ignore non-snapshots in data directory >

[jira] Commented: (SOLR-852) Refactor common code in various handlers for working with ContentStream Objects

2008-11-12 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647067#action_12647067 ] Erik Hatcher commented on SOLR-852: --- Patch looks good, Grant. The abstract class of Conte

[jira] Updated: (SOLR-852) Refactor common code in various handlers for working with ContentStream Objects

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated SOLR-852: - Attachment: SOLR-852.patch Here's a first crack at refactoring common code for ContentStream handli

[jira] Work started: (SOLR-852) Refactor common code in various handlers for working with ContentStream Objects

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on SOLR-852 started by Grant Ingersoll. > Refactor common code in various handlers for working with ContentStream > Objects >

[jira] Commented: (SOLR-465) Add configurable DirectoryProvider so that alternate Directory implementations can be specified via solrconfig.xml

2008-11-12 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647034#action_12647034 ] Mark Miller commented on SOLR-465: -- bq. But couldn't one start the container with the approp

[jira] Commented: (SOLR-465) Add configurable DirectoryProvider so that alternate Directory implementations can be specified via solrconfig.xml

2008-11-12 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647027#action_12647027 ] Shalin Shekhar Mangar commented on SOLR-465: bq. But couldn't one start the conta

[jira] Updated: (SOLR-850) Addition of timeouts for distributed searching

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated SOLR-850: -- Priority: Minor (was: Major) Fix Version/s: (was: 1.3) 1.4 > Add

[jira] Commented: (SOLR-465) Add configurable DirectoryProvider so that alternate Directory implementations can be specified via solrconfig.xml

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647022#action_12647022 ] Otis Gospodnetic commented on SOLR-465: --- But couldn't one start the container with the

[jira] Created: (SOLR-852) Refactor common code in various handlers for working with ContentStream Objects

2008-11-12 Thread Grant Ingersoll (JIRA)
Refactor common code in various handlers for working with ContentStream Objects --- Key: SOLR-852 URL: https://issues.apache.org/jira/browse/SOLR-852 Project: Solr I

Re: ContentStream Strategy

2008-11-12 Thread Grant Ingersoll
See https://issues.apache.org/jira/browse/SOLR-852 I've got a start on this. Will post a patch soon. On Nov 12, 2008, at 2:06 PM, Erik Hatcher wrote: +1 - keep it DRY! Erik On Nov 12, 2008, at 1:19 PM, Grant Ingersoll wrote: So, I'm working on SOLR-284 (Rich Documents, aka Tika) an

[jira] Commented: (SOLR-465) Add configurable DirectoryProvider so that alternate Directory implementations can be specified via solrconfig.xml

2008-11-12 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647008#action_12647008 ] Mark Miller commented on SOLR-465: -- You will get an FSDirectory no matter which OS. Open is

[jira] Issue Comment Edited: (SOLR-851) Remove dependency on StAX Utils or make it optional

2008-11-12 Thread Gunnar Wagenknecht (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647004#action_12647004 ] gunnar edited comment on SOLR-851 at 11/12/08 11:12 AM: {

[jira] Commented: (SOLR-851) Remove dependency on StAX Utils or make it optional

2008-11-12 Thread Gunnar Wagenknecht (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647004#action_12647004 ] Gunnar Wagenknecht commented on SOLR-851: - {blockquote} Original Message -

[jira] Created: (SOLR-851) Remove dependency on StAX Utils or make it optional

2008-11-12 Thread Gunnar Wagenknecht (JIRA)
Remove dependency on StAX Utils or make it optional --- Key: SOLR-851 URL: https://issues.apache.org/jira/browse/SOLR-851 Project: Solr Issue Type: Wish Affects Versions: 1.3 Re

[jira] Commented: (SOLR-465) Add configurable DirectoryProvider so that alternate Directory implementations can be specified via solrconfig.xml

2008-11-12 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647002#action_12647002 ] Yonik Seeley commented on SOLR-465: --- Hmmm, can you expand on what's wrong? > Add configura

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Eric Pugh (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647003#action_12647003 ] Eric Pugh commented on SOLR-284: Grant, I am really excited that you are looking at this pat

Re: ContentStream Strategy

2008-11-12 Thread Erik Hatcher
+1 - keep it DRY! Erik On Nov 12, 2008, at 1:19 PM, Grant Ingersoll wrote: So, I'm working on SOLR-284 (Rich Documents, aka Tika) and I'm looking at the RichDocumentReqHandler included in that patch and I notice it has a lot of overlap with CSVRequestHandler and the XMLUpdateReque

[jira] Updated: (SOLR-850) Addition of timeouts for distributed searching

2008-11-12 Thread patrick o'leary (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] patrick o'leary updated SOLR-850: - Attachment: SOLR-850.patch Patches solr-1.3.0 release Allows for 2 timeout settings to be configur

[jira] Created: (SOLR-850) Addition of timeouts for distributed searching

2008-11-12 Thread patrick o'leary (JIRA)
Addition of timeouts for distributed searching -- Key: SOLR-850 URL: https://issues.apache.org/jira/browse/SOLR-850 Project: Solr Issue Type: Improvement Components: search Affects Ve

ContentStream Strategy

2008-11-12 Thread Grant Ingersoll
So, I'm working on SOLR-284 (Rich Documents, aka Tika) and I'm looking at the RichDocumentReqHandler included in that patch and I notice it has a lot of overlap with CSVRequestHandler and the XMLUpdateRequestHandler such that there seems to be a fair amount of commonality for dealing with t

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12646987#action_12646987 ] Grant Ingersoll commented on SOLR-284: -- {quote} 3. Tika provides a mechanism for impleme

Re: DataImportHandler and Blobs

2008-11-12 Thread Shalin Shekhar Mangar
On Wed, Nov 12, 2008 at 10:44 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > Am I understanding the DIH correctly in that it doesn't work with Blobs and > or binary things? I'm basing this off of JdbcDataSource.getARow() which > seems to be the place that populates the Map that is then passed t

[jira] Commented: (SOLR-561) Solr replication by Solr (for windows also)

2008-11-12 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12646968#action_12646968 ] Otis Gospodnetic commented on SOLR-561: --- I wonder if it might be useful to add copy thr

DataImportHandler and Blobs

2008-11-12 Thread Grant Ingersoll
Am I understanding the DIH correctly in that it doesn't work with Blobs and/or binary things? I'm basing this off of JdbcDataSource.getARow() which seems to be the place that populates the Map that is then passed to the Transformer. One of the things that I think might be interesting is,

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12646947#action_12646947 ] Grant Ingersoll commented on SOLR-284: -- Some initial thoughts on moving forward: I thin

[jira] Work started: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on SOLR-284 started by Grant Ingersoll. > Parsing Rich Document Types > --- > > Key: SOLR-284 > URL: https://is

Re: Katta's goodness for Solr

2008-11-12 Thread Grant Ingersoll
On Nov 11, 2008, at 1:15 PM, Otis Gospodnetic wrote: Quick thought. I saw Stefan's Katta presentation last night. Katta seems nice and simple. If I understood correctly, juicy stuff that is interesting to Solr is: - Katta has a notion of a Primary Master and N Secondary Slaves (no SPOF

[jira] Commented: (SOLR-651) A SearchComponent for fetching TF-IDF values

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12646911#action_12646911 ] Grant Ingersoll commented on SOLR-651: -- I committed: freq -> tf idf -> df tf-idf -> tf_

Re: [jira] Resolved: (SOLR-845) Solr 1.3.1 Release

2008-11-12 Thread Gunnar Wagenknecht
Ryan McKinley wrote: > Yes, you should make an issue related to the StAX dependency. Ok, will do. > Without upgrading to java 1.6, what is your thought on how to remove > the dependencies? I haven't looked if it's a JRE issue. Based on Shalin's response I assumed it's not. If JRE is an issue

[jira] Updated: (SOLR-847) Enhance the snappull command in ReplicationHandler to take in masterUrl

2008-11-12 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-847: Attachment: SOLR-847.patch take in an extra parameter while triggering snappull > Enhance the snappull comm

[jira] Resolved: (SOLR-833) A DataSource to read data from a field as a reader

2008-11-12 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar resolved SOLR-833. Resolution: Fixed Committed revision 713343. Thanks Noble! > A DataSource to read d

[jira] Updated: (SOLR-833) A DataSource to read data from a field as a reader

2008-11-12 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar updated SOLR-833: --- Attachment: SOLR-833.patch Bringing patch in sync with trunk (it broke due to SOLR-8

[jira] Resolved: (SOLR-842) Better error handling for DIH

2008-11-12 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar resolved SOLR-842. Resolution: Fixed Committed revision 713335. Thanks Noble! > Better error handling

[jira] Closed: (SOLR-495) Use jarjar links to transform javax.xml classpath to avoid collisions with other xml parsers.

2008-11-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Frössman closed SOLR-495. Resolution: Fixed Works without this workaround for me with the latest JBoss 4.2 and SolrJ 1.3 final. >

[jira] Updated: (SOLR-842) Better error handling for DIH

2008-11-12 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar updated SOLR-842: --- Attachment: SOLR-842.patch Adding a test case. I plan to commit shortly. > Better erro