[
https://issues.apache.org/jira/browse/TIKA-897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13258241#comment-13258241
]
Nick Burch commented on TIKA-897:
-
We had support for detecting XML files that are ASCII,
[
https://issues.apache.org/jira/browse/TIKA-700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13245435#comment-13245435
]
Nick Burch commented on TIKA-700:
-
Upgraded to POI 3.8 Final in r1309005.
[
https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13245436#comment-13245436
]
Nick Burch commented on TIKA-792:
-
Thanks for the feedback Marek. As of r1309005 we're now
[
https://issues.apache.org/jira/browse/TIKA-887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13241134#comment-13241134
]
Nick Burch commented on TIKA-887:
-
Is the problem still present in Tika 1.1? Only there were
[
https://issues.apache.org/jira/browse/TIKA-886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13240469#comment-13240469
]
Nick Burch commented on TIKA-886:
-
For cases where the OPCPackage is opened in
[
https://issues.apache.org/jira/browse/TIKA-886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13240470#comment-13240470
]
Nick Burch commented on TIKA-886:
-
Changed in r1306411, the two cases of
[
https://issues.apache.org/jira/browse/TIKA-877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13232861#comment-13232861
]
Nick Burch commented on TIKA-877:
-
Hmm, that commit wasn't supposed to break anything, it
[
https://issues.apache.org/jira/browse/TIKA-876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13230299#comment-13230299
]
Nick Burch commented on TIKA-876:
-
Can you upload a small example file?
When you try to
[
https://issues.apache.org/jira/browse/TIKA-853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13211801#comment-13211801
]
Nick Burch commented on TIKA-853:
-
If we do need to buffer it all into memory, then there
[
https://issues.apache.org/jira/browse/TIKA-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13210205#comment-13210205
]
Nick Burch commented on TIKA-863:
-
I'm not sure if we should be setting it as
[
https://issues.apache.org/jira/browse/TIKA-864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13210207#comment-13210207
]
Nick Burch commented on TIKA-864:
-
If we did store them on a ThreadLocal, then how would we
[
https://issues.apache.org/jira/browse/TIKA-865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13210209#comment-13210209
]
Nick Burch commented on TIKA-865:
-
I've had a go at fixing this in r1245426. It'd be good if
[
https://issues.apache.org/jira/browse/TIKA-866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13210219#comment-13210219
]
Nick Burch commented on TIKA-866:
-
If the Tika Config file is missing elements (eg only has
[
https://issues.apache.org/jira/browse/TIKA-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13210236#comment-13210236
]
Nick Burch commented on TIKA-863:
-
I'm not sure what the best way is to provide an
[
https://issues.apache.org/jira/browse/TIKA-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13210245#comment-13210245
]
Nick Burch commented on TIKA-863:
-
We could check for the TikaConfig on the ParseContext,
[
https://issues.apache.org/jira/browse/TIKA-858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13207345#comment-13207345
]
Nick Burch commented on TIKA-858:
-
Additionally, what reference did you find for the chosen
[
https://issues.apache.org/jira/browse/TIKA-612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13205376#comment-13205376
]
Nick Burch commented on TIKA-612:
-
The conclusion was to expose the options on the PDFParser
[
https://issues.apache.org/jira/browse/TIKA-818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13205473#comment-13205473
]
Nick Burch commented on TIKA-818:
-
Temp files created through TemporaryResources are already
[
https://issues.apache.org/jira/browse/TIKA-747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13204630#comment-13204630
]
Nick Burch commented on TIKA-747:
-
Getting the central sync to work turned out to be much
[
https://issues.apache.org/jira/browse/TIKA-853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13202238#comment-13202238
]
Nick Burch commented on TIKA-853:
-
We don't want to have a System.gc call in production
[
https://issues.apache.org/jira/browse/TIKA-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13202928#comment-13202928
]
Nick Burch commented on TIKA-857:
-
Not sure that this issue should have been resolved, as
[
https://issues.apache.org/jira/browse/TIKA-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13202962#comment-13202962
]
Nick Burch commented on TIKA-857:
-
Looking at the patch, my only comment is wondering if we
[
https://issues.apache.org/jira/browse/TIKA-853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13197782#comment-13197782
]
Nick Burch commented on TIKA-853:
-
It's a Windows thing, because Windows won't let you
[
https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13198047#comment-13198047
]
Nick Burch commented on TIKA-842:
-
One thing to bear in mind is that we try to map the
[
https://issues.apache.org/jira/browse/TIKA-855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13198415#comment-13198415
]
Nick Burch commented on TIKA-855:
-
I believe we're currently missing language profiles for
[
https://issues.apache.org/jira/browse/TIKA-853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13196841#comment-13196841
]
Nick Burch commented on TIKA-853:
-
I've looked at the code again, and I can't spot anything
[
https://issues.apache.org/jira/browse/TIKA-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13196953#comment-13196953
]
Nick Burch commented on TIKA-850:
-
PasswordProvider added in r1238616, based on the above
[
https://issues.apache.org/jira/browse/TIKA-853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13196077#comment-13196077
]
Nick Burch commented on TIKA-853:
-
Ah, we weren't closing the stream in all cases. This is
[
https://issues.apache.org/jira/browse/TIKA-852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13195598#comment-13195598
]
Nick Burch commented on TIKA-852:
-
It looks like the Apache Licensed MP4Parser
[
https://issues.apache.org/jira/browse/TIKA-851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13194790#comment-13194790
]
Nick Burch commented on TIKA-851:
-
I'm not sure if we're going to be able to differentiate
[
https://issues.apache.org/jira/browse/TIKA-851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13194813#comment-13194813
]
Nick Burch commented on TIKA-851:
-
It looks like most files (not sure if it's all of them
[
https://issues.apache.org/jira/browse/TIKA-851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13194854#comment-13194854
]
Nick Burch commented on TIKA-851:
-
From
[
https://issues.apache.org/jira/browse/TIKA-851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13194891#comment-13194891
]
Nick Burch commented on TIKA-851:
-
I've added the audio/x-m4a alias in r1236734.
[
https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13194916#comment-13194916
]
Nick Burch commented on TIKA-842:
-
Following the confirmation from the IPTC that we can use
[
https://issues.apache.org/jira/browse/TIKA-747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13193046#comment-13193046
]
Nick Burch commented on TIKA-747:
-
Following discussions on the list, I've decided to
[
https://issues.apache.org/jira/browse/TIKA-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13193124#comment-13193124
]
Nick Burch commented on TIKA-850:
-
Currently, the objects set onto the ParseContext are:
*
[
https://issues.apache.org/jira/browse/TIKA-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13193131#comment-13193131
]
Nick Burch commented on TIKA-850:
-
Based on this, I think the best option may be to have a
[
https://issues.apache.org/jira/browse/TIKA-818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13192072#comment-13192072
]
Nick Burch commented on TIKA-818:
-
Are you sure the scratchFile should be the real file
[
https://issues.apache.org/jira/browse/TIKA-849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13192079#comment-13192079
]
Nick Burch commented on TIKA-849:
-
We might be able to use the same handler, but it'd need
[
https://issues.apache.org/jira/browse/TIKA-839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13192107#comment-13192107
]
Nick Burch commented on TIKA-839:
-
Thanks for this, applied r1235233.
[
https://issues.apache.org/jira/browse/TIKA-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13192184#comment-13192184
]
Nick Burch commented on TIKA-850:
-
Does anyone have a feeling for if the password should be
[
https://issues.apache.org/jira/browse/TIKA-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13192189#comment-13192189
]
Nick Burch commented on TIKA-760:
-
NPE check added in r1235284.
NPE
[
https://issues.apache.org/jira/browse/TIKA-675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13192200#comment-13192200
]
Nick Burch commented on TIKA-675:
-
We could probably do this with a wrapper parser, which
[
https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13192203#comment-13192203
]
Nick Burch commented on TIKA-241:
-
Has there been any luck getting junrar into Maven Central
[
https://issues.apache.org/jira/browse/TIKA-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13192242#comment-13192242
]
Nick Burch commented on TIKA-770:
-
I've updated the three remaining ones in r1235321, along
[
https://issues.apache.org/jira/browse/TIKA-818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13190965#comment-13190965
]
Nick Burch commented on TIKA-818:
-
Tika does already handle its own temporary files, via
[
https://issues.apache.org/jira/browse/TIKA-844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13191214#comment-13191214
]
Nick Burch commented on TIKA-844:
-
Thanks, patch applied in r1234861.
[
https://issues.apache.org/jira/browse/TIKA-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13191228#comment-13191228
]
Nick Burch commented on TIKA-845:
-
I think the current logic isn't quite correct. Rather
[
https://issues.apache.org/jira/browse/TIKA-849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13191242#comment-13191242
]
Nick Burch commented on TIKA-849:
-
Sample file committed in r1234886, along with a unit test
[
https://issues.apache.org/jira/browse/TIKA-849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13191259#comment-13191259
]
Nick Burch commented on TIKA-849:
-
Test and parser change committed in r1234904, thanks
It
[
https://issues.apache.org/jira/browse/TIKA-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13190834#comment-13190834
]
Nick Burch commented on TIKA-848:
-
We can keep this open until it's fixed in PDFBox, and
[
https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13189839#comment-13189839
]
Nick Burch commented on TIKA-792:
-
Are you able to share one of the files that triggers
[
https://issues.apache.org/jira/browse/TIKA-507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13189861#comment-13189861
]
Nick Burch commented on TIKA-507:
-
Thanks for this patch, sorry it has taken so long to get
[
https://issues.apache.org/jira/browse/TIKA-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13189888#comment-13189888
]
Nick Burch commented on TIKA-843:
-
Do we want to set a timezone on these? For a date with no
[
https://issues.apache.org/jira/browse/TIKA-841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13188469#comment-13188469
]
Nick Burch commented on TIKA-841:
-
Fixed in r1232902, with code similar to the
[
https://issues.apache.org/jira/browse/TIKA-805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13186861#comment-13186861
]
Nick Burch commented on TIKA-805:
-
Thanks, applied in r1231905.
[
https://issues.apache.org/jira/browse/TIKA-87?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13186896#comment-13186896
]
Nick Burch commented on TIKA-87:
I believe this is no longer an issue, because of the recent
[
https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13186948#comment-13186948
]
Nick Burch commented on TIKA-86:
Turning the file magic into a Tika xml match shouldn't be
[
https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13187026#comment-13187026
]
Nick Burch commented on TIKA-86:
RegEx magic could be interesting, with a bit of care to
[
https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13187153#comment-13187153
]
Nick Burch commented on TIKA-842:
-
Did you manage to confirm that the IPTC Spec license
[
https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13187183#comment-13187183
]
Nick Burch commented on TIKA-842:
-
I think we'll need the OK from Apache Legal for this,
[
https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13187190#comment-13187190
]
Nick Burch commented on TIKA-842:
-
LEGAL-122 created for this
IPTC
[
https://issues.apache.org/jira/browse/TIKA-360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13185646#comment-13185646
]
Nick Burch commented on TIKA-360:
-
Fractions will be supported when we upgrade to POI 3.8
[
https://issues.apache.org/jira/browse/TIKA-695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13185001#comment-13185001
]
Nick Burch commented on TIKA-695:
-
Thanks for the sample files. Based on them, I've added
[
https://issues.apache.org/jira/browse/TIKA-695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13180140#comment-13180140
]
Nick Burch commented on TIKA-695:
-
Would it be possible for you to create some sample files
[
https://issues.apache.org/jira/browse/TIKA-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13178637#comment-13178637
]
Nick Burch commented on TIKA-838:
-
This breaks the CLIRR check, so I'll have to defer to
[
https://issues.apache.org/jira/browse/TIKA-837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13178638#comment-13178638
]
Nick Burch commented on TIKA-837:
-
Thanks, patch applied in r1226657.
Make
[
https://issues.apache.org/jira/browse/TIKA-793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13177081#comment-13177081
]
Nick Burch commented on TIKA-793:
-
Comment (COM/COMM) tag handling fixed in r1225480 - it
[
https://issues.apache.org/jira/browse/TIKA-835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13177574#comment-13177574
]
Nick Burch commented on TIKA-835:
-
winmail.dat is a TNEF file, which POI supports through
[
https://issues.apache.org/jira/browse/TIKA-830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13177021#comment-13177021
]
Nick Burch commented on TIKA-830:
-
I think the ForkParser instanceof check is a good
[
https://issues.apache.org/jira/browse/TIKA-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13176068#comment-13176068
]
Nick Burch commented on TIKA-831:
-
I've enabled the last test in r1224864 - I had to switch
[
https://issues.apache.org/jira/browse/TIKA-827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13176069#comment-13176069
]
Nick Burch commented on TIKA-827:
-
I'm not a big fan of the temp file idea, so I've had a
[
https://issues.apache.org/jira/browse/TIKA-793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13176070#comment-13176070
]
Nick Burch commented on TIKA-793:
-
I've tracked this to two bugs. Both relate to the
[
https://issues.apache.org/jira/browse/TIKA-829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13175881#comment-13175881
]
Nick Burch commented on TIKA-829:
-
Thanks, patch applied in r1224675.
Tika
[
https://issues.apache.org/jira/browse/TIKA-830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13175882#comment-13175882
]
Nick Burch commented on TIKA-830:
-
If we're not going to support the ForkParser like this,
[
https://issues.apache.org/jira/browse/TIKA-826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13175240#comment-13175240
]
Nick Burch commented on TIKA-826:
-
POI doesn't support .xlsb files, and nor is it likely to
[
https://issues.apache.org/jira/browse/TIKA-823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13173820#comment-13173820
]
Nick Burch commented on TIKA-823:
-
Note that it looks like the strings are prefixed with a 4
[
https://issues.apache.org/jira/browse/TIKA-819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13172825#comment-13172825
]
Nick Burch commented on TIKA-819:
-
You have to explicitly ask for embedded files to be
[
https://issues.apache.org/jira/browse/TIKA-700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13172950#comment-13172950
]
Nick Burch commented on TIKA-700:
-
Upgraded to POI 3.8 beta 5 in r1221109.
[
https://issues.apache.org/jira/browse/TIKA-805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13172951#comment-13172951
]
Nick Burch commented on TIKA-805:
-
The patch doesn't seem to apply cleanly against trunk, is
[
https://issues.apache.org/jira/browse/TIKA-757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13172974#comment-13172974
]
Nick Burch commented on TIKA-757:
-
I believe that as of r1221115 most of these are now
[
https://issues.apache.org/jira/browse/TIKA-818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13173000#comment-13173000
]
Nick Burch commented on TIKA-818:
-
I've just gone to make the change, and discovered that
[
https://issues.apache.org/jira/browse/TIKA-811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13168288#comment-13168288
]
Nick Burch commented on TIKA-811:
-
Do you know if 2.5.0-RC3 available in Maven Central, or
[
https://issues.apache.org/jira/browse/TIKA-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13168369#comment-13168369
]
Nick Burch commented on TIKA-806:
-
You can always get a false positive with mime magic
[
https://issues.apache.org/jira/browse/TIKA-812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13168902#comment-13168902
]
Nick Burch commented on TIKA-812:
-
If we put in a slightly higher priority match for
[
https://issues.apache.org/jira/browse/TIKA-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13168056#comment-13168056
]
Nick Burch commented on TIKA-806:
-
If you use DefaultDetector it isn't an issue, as the
[
https://issues.apache.org/jira/browse/TIKA-803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13168124#comment-13168124
]
Nick Burch commented on TIKA-803:
-
As of r1213560, the message body is now wrapped in a div
[
https://issues.apache.org/jira/browse/TIKA-805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13168154#comment-13168154
]
Nick Burch commented on TIKA-805:
-
Thanks Yegor!
Assuming no objections, I'll apply this
[
https://issues.apache.org/jira/browse/TIKA-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13167314#comment-13167314
]
Nick Burch commented on TIKA-808:
-
I've added some unit tests in r1213131 for this case.
[
https://issues.apache.org/jira/browse/TIKA-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13167324#comment-13167324
]
Nick Burch commented on TIKA-809:
-
This should be improved when we move to POI 3.8 beta 5,
[
https://issues.apache.org/jira/browse/TIKA-804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13167037#comment-13167037
]
Nick Burch commented on TIKA-804:
-
Questions are best asked on the Mailing Lists, rather
[
https://issues.apache.org/jira/browse/TIKA-804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13167038#comment-13167038
]
Nick Burch commented on TIKA-804:
-
Seems to parse just fine as a regular outlook file
[
https://issues.apache.org/jira/browse/TIKA-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13167043#comment-13167043
]
Nick Burch commented on TIKA-806:
-
The file format allows for the directory entries to occur
[
https://issues.apache.org/jira/browse/TIKA-800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13162730#comment-13162730
]
Nick Burch commented on TIKA-800:
-
Looks like the issue is that ArchiveInputStream (from
[
https://issues.apache.org/jira/browse/TIKA-800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13162735#comment-13162735
]
Nick Burch commented on TIKA-800:
-
In that case, maybe it's best to have the wrapping done
[
https://issues.apache.org/jira/browse/TIKA-800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13163238#comment-13163238
]
Nick Burch commented on TIKA-800:
-
Fixed in r1210736 by wrapping the ArchiveInputStream, the
[
https://issues.apache.org/jira/browse/TIKA-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13163304#comment-13163304
]
Nick Burch commented on TIKA-802:
-
I have just retried with the 1.0 version of tika-app, and
[
https://issues.apache.org/jira/browse/TIKA-797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13161568#comment-13161568
]
Nick Burch commented on TIKA-797:
-
Good spot! Thanks, patch applied in r1209438.
[
https://issues.apache.org/jira/browse/TIKA-791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13158444#comment-13158444
]
Nick Burch commented on TIKA-791:
-
One thing - I'm not sure that we should be returning the
[
https://issues.apache.org/jira/browse/TIKA-697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13158070#comment-13158070
]
Nick Burch commented on TIKA-697:
-
Thanks for this
I've tweaked the existing mime magic in
1 - 100 of 129 matches
Mail list logo