[
https://issues.apache.org/jira/browse/TIKA-1753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ben McCann closed TIKA-1753.
Resolution: Later
> Improper word concatenation when extracting pdf
>
[
https://issues.apache.org/jira/browse/TIKA-1753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960254#comment-14960254
]
Ben McCann commented on TIKA-1753:
--
Closing this since it's being tracked at PDFBOX now
> Improper word
GitHub user wiedsche opened a pull request:
https://github.com/apache/tika/pull/59
fix for TIKA-1772 contributed by wiedsche
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/wiedsche/tika TIKA-1772
Alternatively you can review
[
https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960290#comment-14960290
]
ASF GitHub Bot commented on TIKA-1772:
--
GitHub user wiedsche opened a pull request:
Alexander Widera created TIKA-1772:
--
Summary: Mimetype of VTT files
Key: TIKA-1772
URL: https://issues.apache.org/jira/browse/TIKA-1772
Project: Tika
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Alexander Widera updated TIKA-1772:
---
Attachment: upc-video-subtitles-en.vtt
Added example vtt file as attachment.
Thanks for
[
https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960504#comment-14960504
]
Hudson commented on TIKA-1772:
--
SUCCESS: Integrated in tika-trunk-jdk1.7 #869 (See
[
https://issues.apache.org/jira/browse/TIKA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960503#comment-14960503
]
Hudson commented on TIKA-1773:
--
SUCCESS: Integrated in tika-trunk-jdk1.7 #869 (See
Andreas Hirtzel created TIKA-1773:
-
Summary: No XML Metadata output for JP2 files
Key: TIKA-1773
URL: https://issues.apache.org/jira/browse/TIKA-1773
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andreas Hirtzel updated TIKA-1773:
--
Attachment: testJPEG.jp2
converted testfile (using Photoshop)
> No XML Metadata output for JP2
[
https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960454#comment-14960454
]
Nick Burch commented on TIKA-1772:
--
Thanks for the patch! Couple of minor points - we normally sort the
[
https://issues.apache.org/jira/browse/TIKA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960444#comment-14960444
]
Nick Burch commented on TIKA-1773:
--
Are you able to convert an existing Tika test image (eg
[
https://issues.apache.org/jira/browse/TIKA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960599#comment-14960599
]
Nick Burch commented on TIKA-1773:
--
Ah, I think I've found the issue. Based on
[
https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960722#comment-14960722
]
Nick Burch edited comment on TIKA-1772 at 10/16/15 1:46 PM:
Thanks for that.
[
https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Burch resolved TIKA-1772.
--
Resolution: Fixed
Fix Version/s: 1.11
Thanks for that. Looks like we can also do mime magic
[
https://issues.apache.org/jira/browse/TIKA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960821#comment-14960821
]
Hudson commented on TIKA-1772:
--
SUCCESS: Integrated in tika-trunk-jdk1.7 #871 (See
[
https://issues.apache.org/jira/browse/TIKA-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960963#comment-14960963
]
Ben Summers commented on TIKA-1358:
---
Evernote have kindly open sourced some code to extract text from
17 matches
Mail list logo