[jira] [Updated] (TIKA-2555) Text with [underline] + [another format] in word document generates overlapping html tags.

2018-02-01 Thread Serban Alexe (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Serban Alexe updated TIKA-2555: --- Priority: Minor (was: Major) > Text with [underline] + [another format] in word document generates >

[jira] [Commented] (TIKA-2561) Tika Parser includes oudated/vulnerable version of JSoup

2018-02-01 Thread Asela (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16349427#comment-16349427 ] Asela commented on TIKA-2561: - Hello [~talli...@mitre.org] , As I understand one of the featur

[jira] [Created] (TIKA-2562) tika server parse HTML removes DIVs around hyperlink & adds shape

2018-02-01 Thread NW Brad (JIRA)
NW Brad created TIKA-2562: - Summary: tika server parse HTML removes DIVs around hyperlink & adds shape Key: TIKA-2562 URL: https://issues.apache.org/jira/browse/TIKA-2562 Project: Tika Issue Type: B

[jira] [Updated] (TIKA-2562) tika server parse HTML removes DIVs around hyperlink & adds shape

2018-02-01 Thread NW Brad (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] NW Brad updated TIKA-2562: -- Description: Hyperlinks in a HTML document that are parsed via tika server: curl -X PUT --upload-file tika_adds_