[jira] [Commented] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-07 Thread Luke Butters (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16946387#comment-16946387 ] Luke Butters commented on TIKA-2955: Hi I made this PR: https://github.com/apache/tika

[jira] [Commented] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16946386#comment-16946386 ] ASF GitHub Bot commented on TIKA-2955: -- LukeButters commented on pull request #285: F

[jira] [Commented] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-07 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16946362#comment-16946362 ] Tim Allison commented on TIKA-2955: --- If you make the PR against master, I’ll cherry-pick

[jira] [Updated] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-07 Thread Luke Butters (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luke Butters updated TIKA-2955: --- Attachment: fix_with_tests.txt > PDF parsing to XHTML results in tika attempting to write invalid HTML

[jira] [Comment Edited] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-07 Thread Luke Butters (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16946270#comment-16946270 ] Luke Butters edited comment on TIKA-2955 at 10/7/19 9:53 PM: -

[jira] [Comment Edited] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-07 Thread Luke Butters (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16946270#comment-16946270 ] Luke Butters edited comment on TIKA-2955 at 10/7/19 9:41 PM: -

[jira] [Commented] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-07 Thread Luke Butters (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16946270#comment-16946270 ] Luke Butters commented on TIKA-2955: So [wikipedia Valid_characters_in_XML|https://en

[jira] [Comment Edited] (TIKA-2955) PDF parsing to XHTML results in tika attempting to write invalid HTML characters.

2019-10-07 Thread Luke Butters (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16946270#comment-16946270 ] Luke Butters edited comment on TIKA-2955 at 10/7/19 9:40 PM: -

[jira] [Commented] (TIKA-2941) OSGI bundle and app are not self-contained

2019-10-07 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16945889#comment-16945889 ] Bob Paulin commented on TIKA-2941: -- Just an update to provide some transparency around th