[jira] [Created] (TIKA-643) tika hangs parsing doc file (attached)

2011-04-20 Thread Enrico Donelli (JIRA)
tika hangs parsing doc file (attached) -- Key: TIKA-643 URL: https://issues.apache.org/jira/browse/TIKA-643 Project: Tika Issue Type: Bug Components: parser Affects Versions: 1.0 Env

[jira] [Created] (TIKA-644) parsing of Microsoft Word doc with style "Heading X" where X>6 creates invalid HTML with tags , etc

2011-04-20 Thread chris hudson (JIRA)
parsing of Microsoft Word doc with style "Heading X" where X>6 creates invalid HTML with tags , etc --- Key: TIKA-644 URL: https://issues.apache.org/jira/brows

[jira] [Updated] (TIKA-644) parsing of Microsoft Word doc with style "Heading X" where X>6 creates invalid HTML with tags , etc

2011-04-20 Thread chris hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chris hudson updated TIKA-644: -- Description: org.apache.tika.parser.microsoft.WordExtractor will translate heading styles to "h" tags wi

Build failed in Jenkins: Tika-trunk #520

2011-04-20 Thread Apache Hudson Server
See Changes: [nick] TIKA-644 - When generating html headings from word, h6 is the highest the xhtml allows, so don't try generating h7 (or higher) even if Word has a 'Heading 7' style -- Start

[jira] [Resolved] (TIKA-644) parsing of Microsoft Word doc with style "Heading X" where X>6 creates invalid HTML with tags , etc

2011-04-20 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-644. - Resolution: Fixed Fix Version/s: 1.0 Assignee: Nick Burch Good spot! Fixed in r1095429. >

[jira] [Commented] (TIKA-643) tika hangs parsing doc file (attached)

2011-04-20 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13022230#comment-13022230 ] Nick Burch commented on TIKA-643: - Looks to be a bug in the new lower memory NPOIFS that was