[jira] [Commented] (TIKA-724) PDF text sometimes has extra space between letters

2011-11-17 Thread Ravish Bhagdev (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13151950#comment-13151950 ] Ravish Bhagdev commented on TIKA-724: - Is there a way to control this flag from Solr? W

[jira] [Commented] (TIKA-724) PDF text sometimes has extra space between letters

2011-11-17 Thread Ravish Bhagdev (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13151954#comment-13151954 ] Ravish Bhagdev commented on TIKA-724: - and also in tika.config > PDF te

[jira] [Commented] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-17 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13152041#comment-13152041 ] Michael McCandless commented on TIKA-782: - These changes look great! Cutover to Pus

[jira] [Commented] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-17 Thread Arjohn Kampman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13152076#comment-13152076 ] Arjohn Kampman commented on TIKA-782: - I'll make the necessary changes. Do you mind if

[jira] [Issue Comment Edited] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-17 Thread Arjohn Kampman (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13152076#comment-13152076 ] Arjohn Kampman edited comment on TIKA-782 at 11/17/11 2:36 PM: ---

[jira] [Commented] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-17 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13152119#comment-13152119 ] Michael McCandless commented on TIKA-782: - bq. I'll make the necessary changes. Th

[jira] [Commented] (TIKA-724) PDF text sometimes has extra space between letters

2011-11-17 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13152133#comment-13152133 ] Michael McCandless commented on TIKA-724: - Alas, no, I don't believe you can control

[jira] [Resolved] (TIKA-612) Specify PDFBox options via ParseContext

2011-11-17 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-612. - Resolution: Fixed Fix Version/s: 1.1 Assignee: Michael McCandless (was: Jul

[jira] [Updated] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-17 Thread Arjohn Kampman (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arjohn Kampman updated TIKA-782: Attachment: bin3.patch New patch with the requested changes. > Add support for parsi

[jira] [Commented] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-17 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13152246#comment-13152246 ] Michael McCandless commented on TIKA-782: - OK looks great Arjohn! Do you have an ex

[jira] [Commented] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-17 Thread Arjohn Kampman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13152252#comment-13152252 ] Arjohn Kampman commented on TIKA-782: - Unfortunately, the one that I used is confidentia

[jira] [Updated] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-17 Thread Arjohn Kampman (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arjohn Kampman updated TIKA-782: Attachment: logo.zip Bingo, found one in the published Enron data. It's an RTF with the Enron logo.

[jira] [Commented] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-17 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13152288#comment-13152288 ] Michael McCandless commented on TIKA-782: - That works for me: pre-patch we extract t

[jira] [Resolved] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-17 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-782. - Resolution: Fixed I made minor edits (fixing up whitespace; removing unused param), and whi

[jira] [Commented] (TIKA-734) Out of memory exception with Xlsx file less than 5 MB

2011-11-17 Thread Anirban Mitra (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13152329#comment-13152329 ] Anirban Mitra commented on TIKA-734: Hello , I am using the following code.

[jira] [Commented] (TIKA-734) Out of memory exception with Xlsx file less than 5 MB

2011-11-17 Thread Jukka Zitting (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13152405#comment-13152405 ] Jukka Zitting commented on TIKA-734: Did you see the parse() method [1] that returns a j