Pascal Essiembre created TIKA-1837:
--
Summary: HtmlEncodingDetector wrongly detects charset from
commented meta
Key: TIKA-1837
URL: https://issues.apache.org/jira/browse/TIKA-1837
Project: Tika
[
https://issues.apache.org/jira/browse/TIKA-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107690#comment-15107690
]
Andreas Beeker commented on TIKA-1799:
--
I have no idea how osgi bundling works, but ad
[
https://issues.apache.org/jira/browse/TIKA-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107395#comment-15107395
]
Bob Paulin commented on TIKA-1799:
--
Actually I'd be careful using the wildcard here becaus
[
https://issues.apache.org/jira/browse/TIKA-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107392#comment-15107392
]
Bob Paulin commented on TIKA-1799:
--
So it's actually a pretty interesting question. If yo
[
https://issues.apache.org/jira/browse/TIKA-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107368#comment-15107368
]
Tim Allison commented on TIKA-1799:
---
[~kiwiwings], looks like we have to specify packages
[
https://issues.apache.org/jira/browse/TIKA-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107308#comment-15107308
]
Tim Allison commented on TIKA-1799:
---
Great. Thank you! That did it!
Apologies for the
[
https://issues.apache.org/jira/browse/TIKA-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107257#comment-15107257
]
Bob Paulin commented on TIKA-1799:
--
[~talli...@mitre.org]
Looks like the structure of org
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107221#comment-15107221
]
Tim Allison commented on TIKA-1836:
---
The better solution of course would be to add proper
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107216#comment-15107216
]
Tim Allison commented on TIKA-1836:
---
Y, done. I asked POI colleagues if they minded if w
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107212#comment-15107212
]
Jorge Spinsanti commented on TIKA-1836:
---
POI issue was report in 2014-08-22. Perhaps
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107212#comment-15107212
]
Jorge Spinsanti edited comment on TIKA-1836 at 1/19/16 7:08 PM:
-
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15106919#comment-15106919
]
Jorge Spinsanti edited comment on TIKA-1836 at 1/19/16 7:04 PM:
-
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107067#comment-15107067
]
Tim Allison edited comment on TIKA-1836 at 1/19/16 5:57 PM:
I c
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107080#comment-15107080
]
Tim Allison commented on TIKA-1836:
---
Not already fixed in POI: this is still open:
http
[
https://issues.apache.org/jira/browse/TIKA-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107077#comment-15107077
]
Tim Allison commented on TIKA-1799:
---
[~bobpaulin], I hate to bother you with this, but do
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107067#comment-15107067
]
Tim Allison edited comment on TIKA-1836 at 1/19/16 5:50 PM:
I c
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107067#comment-15107067
]
Tim Allison commented on TIKA-1836:
---
I concur with Ken, if I understand this correctly, w
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15106919#comment-15106919
]
Jorge Spinsanti commented on TIKA-1836:
---
POI is a dependency of TIKA. I think TIKA ca
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15106908#comment-15106908
]
Ken Krugler commented on TIKA-1836:
---
This seems to be an issue for POI, as per the messag
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jorge Spinsanti updated TIKA-1836:
--
Attachment: test.doc
File used to find the issue.
> Convertion DOC->TXT failed due to POI issue
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jorge Spinsanti updated TIKA-1836:
--
Component/s: parser
> Convertion DOC->TXT failed due to POI issue
> -
Jorge Spinsanti created TIKA-1836:
-
Summary: Convertion DOC->TXT failed due to POI issue
Key: TIKA-1836
URL: https://issues.apache.org/jira/browse/TIKA-1836
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated TIKA-1835:
Attachment: TIKA-1835.patch
Patch for trunk. Adds support for iframe and link element link extraction
[
https://issues.apache.org/jira/browse/TIKA-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated TIKA-1835:
Flags: Patch,Important (was: Important)
> LinkContentHandler skips iframe and rel tags
> ---
[
https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15106752#comment-15106752
]
Tim Allison commented on TIKA-1824:
---
Thank you, [~bobpaulin]! Again, this is fantastic.
[
https://issues.apache.org/jira/browse/TIKA-1833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15106723#comment-15106723
]
Tim Allison commented on TIKA-1833:
---
Ha. Ok. Great to hear. It doesn't surprise me tha
[
https://issues.apache.org/jira/browse/TIKA-1823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Luca Moretti updated TIKA-1823:
---
Attachment: blocks_and_tables.dwf
I found this file on the Autodesk website that could be a suitably li
Markus Jelsma created TIKA-1835:
---
Summary: LinkContentHandler skips iframe and rel tags
Key: TIKA-1835
URL: https://issues.apache.org/jira/browse/TIKA-1835
Project: Tika
Issue Type: Bug
28 matches
Mail list logo