[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-09-16 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12756154#action_12756154 ] Chris Harris commented on SOLR-284: --- This caught me by surprise, so I'm noting it here in

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-09-16 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12756259#action_12756259 ] Chris Harris commented on SOLR-284: --- Grant and company: I just noticed that the example

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-09-16 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12756266#action_12756266 ] Yonik Seeley commented on SOLR-284: --- bq. example solrconfig.xml at the head of SVN trunk

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-09-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12755058#action_12755058 ] Grant Ingersoll commented on SOLR-284: -- bq. How does a copyField necessitate storing the

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-09-14 Thread David Smiley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12755053#action_12755053 ] David Smiley commented on SOLR-284: --- Grant, your response confuses me. How does a

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-09-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12755008#action_12755008 ] Grant Ingersoll commented on SOLR-284: -- Yonik, any objections to me committing the

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-09-12 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12754644#action_12754644 ] Yonik Seeley commented on SOLR-284: --- bq. it is often the case where one wants all values

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-09-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12754646#action_12754646 ] Grant Ingersoll commented on SOLR-284: -- bq. What's the real use-case, to be able to

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-07-14 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12730929#action_12730929 ] Yonik Seeley commented on SOLR-284: --- OK, I've committed the above. I'll work on updating

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-07-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12728636#action_12728636 ] Grant Ingersoll commented on SOLR-284: -- bq. The date.format thing is interesting

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-07-07 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12728144#action_12728144 ] Yonik Seeley commented on SOLR-284: --- The current ext.metadata.prefix parameter adds the

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-07-07 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12728303#action_12728303 ] Yonik Seeley commented on SOLR-284: --- The date.format thing is interesting but shouldn't

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-07-01 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12726113#action_12726113 ] Yonik Seeley commented on SOLR-284: --- bq. My only request is that, if you're changing how

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-07-01 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12726116#action_12726116 ] Yonik Seeley commented on SOLR-284: --- bq. It is burdensome to have to add the ignores for

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-07-01 Thread Eric Pugh (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12726115#action_12726115 ] Eric Pugh commented on SOLR-284: I am out of the office 6/29 - 6/30. For urgent issues,

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-07-01 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12726122#action_12726122 ] Yonik Seeley commented on SOLR-284: --- I just tried setting ext.idx.attr=false, and I didn't

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-07-01 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12726123#action_12726123 ] Chris Harris commented on SOLR-284: --- {quote} bq. My only request is that, if you're

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-06-29 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12725237#action_12725237 ] Chris Harris commented on SOLR-284: --- bq. Apologies for not reviewing this sooner after it

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-06-29 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12725355#action_12725355 ] Grant Ingersoll commented on SOLR-284: -- bq. ext.ignore.und.fl I think this should be

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-06-28 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12724937#action_12724937 ] Grant Ingersoll commented on SOLR-284: -- bq. I just tried setting ext.idx.attr=false, and

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-06-28 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12724938#action_12724938 ] Grant Ingersoll commented on SOLR-284: -- I will review your comments more tomorrow.

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-06-27 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12724855#action_12724855 ] Yonik Seeley commented on SOLR-284: --- Not sure if I should open a new issue or keep

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-06-27 Thread Eric Pugh (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12724856#action_12724856 ] Eric Pugh commented on SOLR-284: I am out of the office 6/29 - 6/30. For urgent issues,

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-06-27 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12724859#action_12724859 ] Yonik Seeley commented on SOLR-284: --- ext.capture seems problematic in that one needs a

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-06-27 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12724862#action_12724862 ] Yonik Seeley commented on SOLR-284: --- I just tried setting ext.idx.attr=false, and I didn't

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-01-12 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12663150#action_12663150 ] Chris Harris commented on SOLR-284: --- bq. I could, however, see adding a flag to specify

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-01-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12663186#action_12663186 ] Grant Ingersoll commented on SOLR-284: -- I guess I'm fine with it. So, should we remove

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-01-11 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12662793#action_12662793 ] Grant Ingersoll commented on SOLR-284: -- bq. Hmmm ... that means that if i have a schema

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-01-10 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12662660#action_12662660 ] Grant Ingersoll commented on SOLR-284: -- bq. Should the schema designer just use the UUID

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2009-01-10 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12662696#action_12662696 ] Hoss Man commented on SOLR-284: --- bq. I put in the code b/c I figured it was better to generate

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-12-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12656018#action_12656018 ] Grant Ingersoll commented on SOLR-284: -- Forgot a couple of things on this: 1. To hook

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-12-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12656023#action_12656023 ] Rogério Pereira Araújo commented on SOLR-284: - Grant, lemme know how can I help.

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-12-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12656032#action_12656032 ] Grant Ingersoll commented on SOLR-284: -- OK, I just committed: 1. Upgraded to Tika 0.2

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-12-07 Thread Ryan McKinley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12654234#action_12654234 ] Ryan McKinley commented on SOLR-284: Looks like there are a bunch of duplicate .jar files

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-12-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12654235#action_12654235 ] Grant Ingersoll commented on SOLR-284: -- Thanks, Ryan, I will remove them. Parsing Rich

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-12-03 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12652993#action_12652993 ] Chris Harris commented on SOLR-284: --- Currently this patch deploys the Tika libs to

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-12-03 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12653137#action_12653137 ] Grant Ingersoll commented on SOLR-284: -- I think in multicore you can specify a shared

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-30 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12651813#action_12651813 ] Grant Ingersoll commented on SOLR-284: -- bq. Sort of related, I've noticed that

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-25 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12650634#action_12650634 ] Grant Ingersoll commented on SOLR-284: -- bq. Tika doesn't need to do this explicitly

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-24 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12650353#action_12650353 ] Hoss Man commented on SOLR-284: --- bq. if Tika returns a metadata field and you haven't made an

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12650359#action_12650359 ] Grant Ingersoll commented on SOLR-284: -- {quote} I'm not familiar with the state of the

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-24 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12650363#action_12650363 ] Chris Harris commented on SOLR-284: --- The 2008-11-15 01:12 PM version of SOLR-284.patch

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12650365#action_12650365 ] Grant Ingersoll commented on SOLR-284: -- {quote} The 2008-11-15 01:12 PM version of

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12650368#action_12650368 ] Grant Ingersoll commented on SOLR-284: -- I like how Erik has given names to contribs,

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-24 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12650431#action_12650431 ] Erik Hatcher commented on SOLR-284: --- bq. I'm not familiar with the state of the patch, but

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-22 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12649986#action_12649986 ] Grant Ingersoll commented on SOLR-284: -- {quote} I think I like where this is going.

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-20 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12649542#action_12649542 ] Chris Harris commented on SOLR-284: --- Is the latest patch supposed to contain a file

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-20 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12649551#action_12649551 ] Chris Harris commented on SOLR-284: --- A few comment on the ExtractingDocumentLoader: * I

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-15 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647875#action_12647875 ] Grant Ingersoll commented on SOLR-284: -- Still to do: 1. More unit tests 2. We need to

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647618#action_12647618 ] Grant Ingersoll commented on SOLR-284: -- Question for the people watching this: Would

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-14 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647619#action_12647619 ] Erik Hatcher commented on SOLR-284: --- I'd rather see the old (err, current) wiki page

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-14 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647632#action_12647632 ] Chris Harris commented on SOLR-284: --- Grant, I don't really care if you take over the old

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647670#action_12647670 ] Grant Ingersoll commented on SOLR-284: -- OK, I've created

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647748#action_12647748 ] Grant Ingersoll commented on SOLR-284: -- Things to do: 1. Documentation 2. Way more

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12646947#action_12646947 ] Grant Ingersoll commented on SOLR-284: -- Some initial thoughts on moving forward: I

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Eric Pugh (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647003#action_12647003 ] Eric Pugh commented on SOLR-284: Grant, I am really excited that you are looking at this

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12646987#action_12646987 ] Grant Ingersoll commented on SOLR-284: -- {quote} 3. Tika provides a mechanism for

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-09-02 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12627882#action_12627882 ] Chris Harris commented on SOLR-284: --- A couple of Tika things: I glanced at Tika yesterday,

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-08-31 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12627333#action_12627333 ] Chris Harris commented on SOLR-284: --- While we're on the subject of breaking changes, I'm

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-07-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12614992#action_12614992 ] Rogério Pereira Araújo commented on SOLR-284: - Who is working on tika based

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-07-19 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12615057#action_12615057 ] Otis Gospodnetic commented on SOLR-284: --- I don't think anyone is working on it

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-05-08 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12595360#action_12595360 ] Otis Gospodnetic commented on SOLR-284: --- +1 for Tika But also +1 for committing this in

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-05-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12595365#action_12595365 ] Grant Ingersoll commented on SOLR-284: -- I don't agree on committing it. If Tika is the

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-05-08 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12595372#action_12595372 ] Chris Harris commented on SOLR-284: --- I'm on the fence about whether this patch makes sense

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-05-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12594992#action_12594992 ] Grant Ingersoll commented on SOLR-284: -- Why not just use Tika (or Aperture, but it's

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-05-07 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12595007#action_12595007 ] Chris Harris commented on SOLR-284: --- I'm not sure this patch entirely reinvents the wheel,

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-05-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12595015#action_12595015 ] Grant Ingersoll commented on SOLR-284: -- I think Tika will actually take less effort,

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-05-06 Thread Michel Benevento (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12594730#action_12594730 ] Michel Benevento commented on SOLR-284: --- Hi, just new here, I am working on Rich

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-04-08 Thread Kristoffer Dyrkorn (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12586735#action_12586735 ] Kristoffer Dyrkorn commented on SOLR-284: - Very handy! It could be beneficial to

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-03-28 Thread Eric Pugh (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12583231#action_12583231 ] Eric Pugh commented on SOLR-284: Chris, I like what you are thinking... Really this is sort

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-03-28 Thread Eric Pugh (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12583233#action_12583233 ] Eric Pugh commented on SOLR-284: Oh, and don't forget to vote for it as well:

Re: [jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-03-26 Thread Chris Hostetter
: This patch is failing when i try to index large documents of size : 20MB(mainly excel and pdf) What is the error? What shows up in your logs? What is the value of multipartUploadLimitInKB in your solrconfig.xml say? -Hoss

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-03-25 Thread Chris Harris (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12582062#action_12582062 ] Chris Harris commented on SOLR-284: --- I'm thinking it would be handy if

Re: [jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-03-14 Thread Chris Hostetter
: I got the following error when try to launch solr in tomcat after applying : patch SOLR-284 : java.lang.RuntimeException: org.apache.lucene.index.CorruptIndexException: : Unknown format version: -4 at org.apache.solr.core.SolrCore.getSearcher( that error indicates that the version of lucene

Re: [jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-03-12 Thread pavan kumar donepudi
I got the following error when try to launch solr in tomcat after applying patch SOLR-284 *message* *Severe errors in solr configuration. Check your log files for more detailed information on what may be wrong. If you want solr to continue after configuration errors, change:

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2008-02-15 Thread Juho-Matti Stenberg (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12569275#action_12569275 ] Juho-Matti Stenberg commented on SOLR-284: -- I wrote a simple patch for

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2007-12-13 Thread Jonathan Hipkiss (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12551506 ] Jonathan Hipkiss commented on SOLR-284: --- This is crucial functionaility if Solr is to be accepted as a solution

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2007-11-12 Thread Eric Pugh (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12541879 ] Eric Pugh commented on SOLR-284: Juri, Thanks for the vote on the issue! The next time I update this patch to work

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2007-11-10 Thread Juri Kuehn (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12541535 ] Juri Kuehn commented on SOLR-284: - Hi Eric, thank you for this handler, works like a charm! I need to use non-numeric

[jira] Commented: (SOLR-284) Parsing Rich Document Types

2007-07-02 Thread Ryan McKinley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12509676 ] Ryan McKinley commented on SOLR-284: I haven't run this patch, but have a few questions... What is the *general*