[jira] [Commented] (TIKA-1228) Embedded files not extracted properly from PDF

2014-02-03 Thread Jason Sherman (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889889#comment-13889889 ] Jason Sherman commented on TIKA-1228: - Thanks for the help. Another possibly related i

[jira] [Comment Edited] (TIKA-1228) Embedded files not extracted properly from PDF

2014-02-03 Thread Jason Sherman (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889889#comment-13889889 ] Jason Sherman edited comment on TIKA-1228 at 2/3/14 8:36 PM: - T

[jira] [Issue Comment Deleted] (TIKA-1228) Embedded files not extracted properly from PDF

2014-02-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1228: -- Comment: was deleted (was: I won't have time to fix this for a week or so, but, I'll take this unless a

[jira] [Resolved] (TIKA-1228) Embedded files not extracted properly from PDF

2014-02-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1228. --- Resolution: Fixed Fix Version/s: 1.5 Fixed in r1564042. Thank you, [~agi20dla], for reporting

[jira] [Comment Edited] (TIKA-1228) Embedded files not extracted properly from PDF

2014-02-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889697#comment-13889697 ] Tim Allison edited comment on TIKA-1228 at 2/3/14 6:11 PM: --- I won

[jira] [Comment Edited] (TIKA-1228) Embedded files not extracted properly from PDF

2014-02-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889697#comment-13889697 ] Tim Allison edited comment on TIKA-1228 at 2/3/14 6:09 PM: --- I won

[jira] [Commented] (TIKA-1228) Embedded files not extracted properly from PDF

2014-02-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889697#comment-13889697 ] Tim Allison commented on TIKA-1228: --- I won't have time to fix this for a week or so, but

[jira] [Updated] (TIKA-1228) Embedded files not extracted properly from PDF

2014-02-03 Thread Jason Sherman (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Sherman updated TIKA-1228: Attachment: pdf_with_doc_and_text_attached.pdf Sorry about that. I meant to attach the file in the

[jira] [Commented] (TIKA-1228) Embedded files not extracted properly from PDF

2014-02-03 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889560#comment-13889560 ] Nick Burch commented on TIKA-1228: -- Do you have a file which shows up the problem? And if

[jira] [Created] (TIKA-1228) Embedded files not extracted properly from PDF

2014-02-03 Thread Jason Sherman (JIRA)
Jason Sherman created TIKA-1228: --- Summary: Embedded files not extracted properly from PDF Key: TIKA-1228 URL: https://issues.apache.org/jira/browse/TIKA-1228 Project: Tika Issue Type: Bug

[jira] [Resolved] (TIKA-1224) Adding Source code (Java, Groovy, C) parser

2014-02-03 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen resolved TIKA-1224. Resolution: Fixed > Adding Source code (Java, Groovy, C) parser > --

[jira] [Commented] (TIKA-1224) Adding Source code (Java, Groovy, C) parser

2014-02-03 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889491#comment-13889491 ] Hong-Thai Nguyen commented on TIKA-1224: Commited on 1563902 > Adding Source code

[jira] [Closed] (TIKA-1227) Apache Tika 1.4 Duplicate extract data

2014-02-03 Thread vivek joshi (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vivek joshi closed TIKA-1227. - Resolution: Invalid Fix Version/s: 1.4 > Apache Tika 1.4 Duplicate extract data > -

[jira] [Commented] (TIKA-1227) Apache Tika 1.4 Duplicate extract data

2014-02-03 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889411#comment-13889411 ] Nick Burch commented on TIKA-1227: -- Sounds like there's a problem with your python code th

[jira] [Commented] (TIKA-1227) Apache Tika 1.4 Duplicate extract data

2014-02-03 Thread vivek joshi (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889395#comment-13889395 ] vivek joshi commented on TIKA-1227: --- Thanks Nick Burch, I tried on command line and it i

[jira] [Commented] (TIKA-1227) Apache Tika 1.4 Duplicate extract data

2014-02-03 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889387#comment-13889387 ] Nick Burch commented on TIKA-1227: -- I've just tried running tika-app directly on the comma

[jira] [Updated] (TIKA-1227) Apache Tika 1.4 Duplicate extract data

2014-02-03 Thread vivek joshi (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vivek joshi updated TIKA-1227: -- Attachment: tt1.doc File for which the Duplicated text is coming. Duplicate text from the heading "DEF

[jira] [Created] (TIKA-1227) Apache Tika 1.4 Duplicate extract data

2014-02-03 Thread vivek joshi (JIRA)
vivek joshi created TIKA-1227: - Summary: Apache Tika 1.4 Duplicate extract data Key: TIKA-1227 URL: https://issues.apache.org/jira/browse/TIKA-1227 Project: Tika Issue Type: Bug Compone

[jira] [Commented] (TIKA-245) Support of CHM Format

2014-02-03 Thread Prashanth Ramaswamy (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889317#comment-13889317 ] Prashanth Ramaswamy commented on TIKA-245: -- Nick, Thanks for your response. Unfort