[
https://issues.apache.org/jira/browse/TIKA-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4267:
--
Summary: Not getting correct mime type for a few file extensions. example:
csv
[
https://issues.apache.org/jira/browse/TIKA-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4267:
--
Affects Version/s: 1.28.4
> Not getting correct mimet type for few file extensions. exam
[
https://issues.apache.org/jira/browse/TIKA-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17851598#comment-17851598
]
Tilman Hausherr edited comment on TIKA-4267 at 6/3/24 12:06 PM
[
https://issues.apache.org/jira/browse/TIKA-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17851598#comment-17851598
]
Tilman Hausherr edited comment on TIKA-4267 at 6/3/24 12:07 PM
[
https://issues.apache.org/jira/browse/TIKA-4267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17851598#comment-17851598
]
Tilman Hausherr commented on TIKA-4267:
---
The current version is 2.9.2, please retry with that one
[
https://issues.apache.org/jira/browse/TIKA-1907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-1907:
--
Fix Version/s: 3.0.0
> Big Pdf parsing to text - Out of mem
[
https://issues.apache.org/jira/browse/TIKA-4254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17845590#comment-17845590
]
Tilman Hausherr edited comment on TIKA-4254 at 5/12/24 9:40 AM:
THausherr
[
https://issues.apache.org/jira/browse/TIKA-4254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17845566#comment-17845566
]
Tilman Hausherr commented on TIKA-4254:
---
Why would we ever run the test twice in the same
, Apr 29, 2024 at 10:47 AM Tilman Hausherr
wrote:
The positive side is that it's less interruptions.
One negative side is that there seems to be a maximum. Today it didn't
report the AWS update, which was detected in the past.
Tilman
changing quickly, then that might be an argument for daily.
On Apr 10, 2024, at 12:53 PM, Tilman Hausherr
wrote:
I'm fine with daily because this way we can learn ASAP if there are
troubles with new dependency versions, although I'm now too busy.
Tilman
-- Original-Nachricht --
Von: Tim Allison
[
https://issues.apache.org/jira/browse/TIKA-4245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17840922#comment-17840922
]
Tilman Hausherr commented on TIKA-4245:
---
The file claims to be utf-16 but it isn't. If I change
[
https://issues.apache.org/jira/browse/TIKA-4245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17840908#comment-17840908
]
Tilman Hausherr commented on TIKA-4245:
---
Happens also with the tika app GUI.
> Tika does not
[
https://issues.apache.org/jira/browse/TIKA-4245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4245:
--
Description:
We use org.apache.tika.parser.AutoDetectParser to get the content of html
files
[
https://issues.apache.org/jira/browse/TIKA-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17839745#comment-17839745
]
Tilman Hausherr edited comment on TIKA-4166 at 4/22/24 3:27 PM:
It turned
[
https://issues.apache.org/jira/browse/TIKA-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17839745#comment-17839745
]
Tilman Hausherr commented on TIKA-4166:
---
It turned out to be something different than the missing
Hi,
We look what the CVE is about. Some CVEs are irrelevant (see recent rant
from Tim) and we can add an exclusion in the OSS section. Sometimes all
what is needed is to update a dependency or add it in the management
section or exclude it (in the assumptions that the tests cover everything).
[
https://issues.apache.org/jira/browse/TIKA-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17839652#comment-17839652
]
Tilman Hausherr commented on TIKA-4166:
---
The latest Apache parent update means a javadoc update
[
https://issues.apache.org/jira/browse/TIKA-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17836236#comment-17836236
]
Tilman Hausherr commented on TIKA-4240:
---
I prefer daily but if more people feel pressured or annoyed
[
https://issues.apache.org/jira/browse/TIKA-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4240:
--
Component/s: build
> Change dependabot to wee
[
https://issues.apache.org/jira/browse/TIKA-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17836224#comment-17836224
]
Tilman Hausherr commented on TIKA-4240:
---
Not a burden (that was Eric, sort-of), I just don't have
I'm fine with daily because this way we can learn ASAP if there are troubles
with new dependency versions, although I'm now too busy.
Tilman
-- Original-Nachricht --
Von: Tim Allison
Betreff: Bump dependabot to weekly?
Datum: 10.04.2024, 18:08 Uhr
An:
All,
Tilman has been doing heroic
[
https://issues.apache.org/jira/browse/TIKA-4238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17834529#comment-17834529
]
Tilman Hausherr commented on TIKA-4238:
---
This was a low-hanging fruit. I could also have done
[
https://issues.apache.org/jira/browse/TIKA-4238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17834529#comment-17834529
]
Tilman Hausherr edited comment on TIKA-4238 at 4/6/24 2:12 PM
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4218:
--
Affects Version/s: 2.9.1
> Run regression tests to support 2.9.2 rele
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr resolved TIKA-4218.
---
Assignee: Tim Allison
Resolution: Fixed
> Run regression tests to support 2.9.2 rele
[
https://issues.apache.org/jira/browse/TIKA-4171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr reassigned TIKA-4171:
-
Assignee: Tim Allison
> Tika server only returns last value for PDFs that have multi
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4218:
--
Fix Version/s: 2.9.2
> Run regression tests to support 2.9.2 rele
[
https://issues.apache.org/jira/browse/TIKA-4171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr resolved TIKA-4171.
---
Resolution: Fixed
> Tika server only returns last value for PDFs that have multi
[
https://issues.apache.org/jira/browse/TIKA-4238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr resolved TIKA-4238.
---
Resolution: Fixed
> replace some deprecated c
Tilman Hausherr created TIKA-4239:
-
Summary: Update to 2.9.3
Key: TIKA-4239
URL: https://issues.apache.org/jira/browse/TIKA-4239
Project: Tika
Issue Type: Task
Components: build
[
https://issues.apache.org/jira/browse/TIKA-4239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4239:
--
Affects Version/s: 2.9.2
> Update to 2.9.3
> ---
>
> Ke
[
https://issues.apache.org/jira/browse/TIKA-4162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr resolved TIKA-4162.
---
Assignee: Tilman Hausherr
Resolution: Fixed
> Update to 2.
Tilman Hausherr created TIKA-4238:
-
Summary: replace some deprecated code
Key: TIKA-4238
URL: https://issues.apache.org/jira/browse/TIKA-4238
Project: Tika
Issue Type: Task
Affects
I've created 2.9.3 version in JIRA administration. Someone (Tim?) please
set the 2.9.2 version to released or whatever (I didn't want to touch
that part)
Tilman
[
https://issues.apache.org/jira/browse/TIKA-4236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4236:
--
Fix Version/s: 2.9.3
> tika-parser-nlp-module has an unnecessary Guava depende
[
https://issues.apache.org/jira/browse/TIKA-4236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4236:
--
Fix Version/s: (was: 2.9.2)
> tika-parser-nlp-module has an unnecessary Guava depende
[
https://issues.apache.org/jira/browse/TIKA-4236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr resolved TIKA-4236.
---
Assignee: Tilman Hausherr
Resolution: Fixed
> tika-parser-nlp-module has an unnecess
[
https://issues.apache.org/jira/browse/TIKA-4236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4236:
--
Fix Version/s: 2.9.2
3.0.0
> tika-parser-nlp-module has an unnecessary Gu
[
https://issues.apache.org/jira/browse/TIKA-4236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17834385#comment-17834385
]
Tilman Hausherr commented on TIKA-4236:
---
I found only a test dependency mentioned directly. It's
[
https://issues.apache.org/jira/browse/TIKA-4236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17834282#comment-17834282
]
Tilman Hausherr commented on TIKA-4236:
---
https://tika.apache.org/
"The Apache Tika PMC ha
[
https://issues.apache.org/jira/browse/TIKA-4236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17834277#comment-17834277
]
Tilman Hausherr edited comment on TIKA-4236 at 4/5/24 12:21 PM
[
https://issues.apache.org/jira/browse/TIKA-4236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17834277#comment-17834277
]
Tilman Hausherr commented on TIKA-4236:
---
Is this what you had in mind?
https://github.com/apache
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17833807#comment-17833807
]
Tilman Hausherr commented on TIKA-4231:
---
Yes it is text, but the PDF is using a feature that we
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17833385#comment-17833385
]
Tilman Hausherr commented on TIKA-4231:
---
No this is not being worked on. You'll have to use OCR
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17832291#comment-17832291
]
Tilman Hausherr commented on TIKA-4231:
---
I have attached an extraction with pdfbox 2.0.31: [^arabic
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4231:
--
Attachment: arabic-pdfbox.txt
> Parsing Arabic PDF is returning bad d
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17832284#comment-17832284
]
Tilman Hausherr commented on TIKA-4231:
---
This doesn't change my argument. The latest version
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17832258#comment-17832258
]
Tilman Hausherr commented on TIKA-4231:
---
The current tika version is 2.9.1, soon to be 2.9.2
[
https://issues.apache.org/jira/browse/TIKA-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4228:
--
Affects Version/s: 2.9.0
> Tika parser crashes JVM when it gets metadata and embedded obje
+1
successful build on Windows 10, oracle jdk 1.8.0_391
Tilman
On 26.03.2024 16:52, Tim Allison wrote:
A candidate for the Tika 2.9.2 release is available at:
https://dist.apache.org/repos/dist/dev/tika/2.9.2
The release candidate is a zip archive of the sources in:
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830954#comment-17830954
]
Tilman Hausherr commented on TIKA-4218:
---
6FOMNUPGPA6IG66Z4NIUEQIVOR5ON46Q (an MP4 file) has a loss
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830604#comment-17830604
]
Tilman Hausherr commented on TIKA-4218:
---
To be honest I didn't look further, because these problems
[
https://issues.apache.org/jira/browse/TIKA-4171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830110#comment-17830110
]
Tilman Hausherr edited comment on TIKA-4171 at 3/23/24 5:50 PM:
We have
[
https://issues.apache.org/jira/browse/TIKA-4171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4171:
--
Attachment: testPDF_XFA_govdocs1_258578.pdf.html
> Tika server only returns last value for P
[
https://issues.apache.org/jira/browse/TIKA-4171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830113#comment-17830113
]
Tilman Hausherr commented on TIKA-4171:
---
Proposed change: add these 3 lines before the last one
[
https://issues.apache.org/jira/browse/TIKA-4171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830110#comment-17830110
]
Tilman Hausherr commented on TIKA-4171:
---
We have a regression with the file [^876503.pdf
[
https://issues.apache.org/jira/browse/TIKA-4171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4171:
--
Attachment: 876503.pdf
> Tika server only returns last value for PDFs that have multi
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830105#comment-17830105
]
Tilman Hausherr commented on TIKA-4218:
---
Follow up in TIKA-4171
> Run regression tests to supp
[
https://issues.apache.org/jira/browse/TIKA-4171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr reopened TIKA-4171:
---
> Tika server only returns last value for PDFs that have multiple of the same
&g
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830097#comment-17830097
]
Tilman Hausherr commented on TIKA-4218:
---
Confirmed, I reverted just that change and then the text
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830094#comment-17830094
]
Tilman Hausherr edited comment on TIKA-4218 at 3/23/24 3:59 PM:
Oops
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830094#comment-17830094
]
Tilman Hausherr commented on TIKA-4218:
---
Oops, or it's part of XFA, I just found it too.
>
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830093#comment-17830093
]
Tilman Hausherr commented on TIKA-4218:
---
I found one difference: "Enter the full
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830079#comment-17830079
]
Tilman Hausherr commented on TIKA-4218:
---
The word "party" appears 36 times in the jso
[ https://issues.apache.org/jira/browse/TIKA-4218 ]
Tilman Hausherr deleted comment on TIKA-4218:
---
was (Author: tilman):
There are also improvements not in my own test results, e.g. the "FOP" pdf
file. Either something went wro
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830071#comment-17830071
]
Tilman Hausherr commented on TIKA-4218:
---
There are also improvements not in my own test results, e.g
[
https://issues.apache.org/jira/browse/TIKA-4218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830069#comment-17830069
]
Tilman Hausherr commented on TIKA-4218:
---
Weird indeed, 876503.pdf didn't appear in the PDFBox
[
https://issues.apache.org/jira/browse/TIKA-4206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4206:
--
Description:
I see TIKA-216 which aims to prevent Zip bombs, but I'm seeing what looks like
[
https://issues.apache.org/jira/browse/TIKA-4214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr closed TIKA-4214.
-
Resolution: Duplicate
Duplicate of TIKA-4199.
> Update apache compress in tika to 1.26+ for
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17826996#comment-17826996
]
Tilman Hausherr commented on TIKA-4199:
---
The original error you reported wasn't really a bug
[ https://issues.apache.org/jira/browse/TIKA-4166 ]
Tilman Hausherr deleted comment on TIKA-4166:
---
was (Author: tilman):
I've reverted it and will investigate / fix this later. Seems to be a problem
with angus-activation.
> depende
[
https://issues.apache.org/jira/browse/TIKA-4166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17824953#comment-17824953
]
Tilman Hausherr commented on TIKA-4166:
---
I've reverted it and will investigate / fix this later
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr resolved TIKA-4199.
---
Resolution: Fixed
Commons-Compress has been updated to 1.26.1, I have reverted the workaround
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr reassigned TIKA-4199:
-
Assignee: Tilman Hausherr
> commons-compress 1.26.0 breaks Apache Tika 2.
[
https://issues.apache.org/jira/browse/TIKA-4203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4203:
--
Fix Version/s: 3.0.0
> Add @deprecated annotation where nee
[
https://issues.apache.org/jira/browse/TIKA-4203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4203:
--
Affects Version/s: 3.0.0
> Add @deprecated annotation where nee
[
https://issues.apache.org/jira/browse/TIKA-4203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr resolved TIKA-4203.
---
Resolution: Fixed
> Add @deprecated annotation where nee
Tilman Hausherr created TIKA-4203:
-
Summary: Add @deprecated annotation where needed
Key: TIKA-4203
URL: https://issues.apache.org/jira/browse/TIKA-4203
Project: Tika
Issue Type: Task
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4199:
--
Fix Version/s: 2.9.2
3.0.0
> commons-compress 1.26.0 breaks Apache T
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17818937#comment-17818937
]
Tilman Hausherr commented on TIKA-4199:
---
I tried an another solution
{code:java
[
https://issues.apache.org/jira/browse/TIKA-4201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17818873#comment-17818873
]
Tilman Hausherr commented on TIKA-4201:
---
Yeah, makes sense.
> Add hard limit to stream read
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17818867#comment-17818867
]
Tilman Hausherr edited comment on TIKA-4199 at 2/20/24 3:37 PM:
{quote}I'm
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17818867#comment-17818867
]
Tilman Hausherr commented on TIKA-4199:
---
{quote}I'm not declaring this a problem with commons
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17818823#comment-17818823
]
Tilman Hausherr commented on TIKA-4199:
---
After merging I discovered that the SevenZWrapper class
[
https://issues.apache.org/jira/browse/TIKA-4200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr closed TIKA-4200.
-
Resolution: Duplicate
Our CI is failing because of the CVE :-( Duplicate of TIKA-4199. I'm still
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17818774#comment-17818774
]
Tilman Hausherr edited comment on TIKA-4199 at 2/20/24 11:57 AM:
-
I'm
[
https://issues.apache.org/jira/browse/TIKA-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17818774#comment-17818774
]
Tilman Hausherr commented on TIKA-4199:
---
I'm working on it
https://github.com/apache/pdfbox/pull
[
https://issues.apache.org/jira/browse/TIKA-3841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-3841:
--
Summary: An exception occurred when parsing some word documents using tika,
tika_exception
[
https://issues.apache.org/jira/browse/TIKA-3841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-3841:
--
Summary: An exception occurred when parsing some word documents using
tikatika_exception
[
https://issues.apache.org/jira/browse/TIKA-4183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr closed TIKA-4183.
-
Resolution: Duplicate
duplicate of TIKA-4162, it was done there on 17.11.2023
[
https://issues.apache.org/jira/browse/TIKA-4162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4162:
--
Fix Version/s: 2.9.2
> Update to 2.9.2
> ---
>
> Ke
mention that there are already 2 votes and
you're still missing one. However ponymail is not showing me the vote of Tilman
Hausherr. Do you know what happened there?
Chris
PS: I'm not subscribed to this list, so please keep me in CC
[
https://issues.apache.org/jira/browse/TIKA-4162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4162:
--
Affects Version/s: 2.9.1
> Update to 2.9.2
> ---
>
> Ke
[
https://issues.apache.org/jira/browse/TIKA-4172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr closed TIKA-4172.
-
Resolution: Not A Bug
> Apple binary file incorrectly identified as text/x-sql due to filen
[
https://issues.apache.org/jira/browse/TIKA-4173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17796450#comment-17796450
]
Tilman Hausherr commented on TIKA-4173:
---
It wasn't really a problem locally, I only had to change
[
https://issues.apache.org/jira/browse/TIKA-4173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17796431#comment-17796431
]
Tilman Hausherr commented on TIKA-4173:
---
I noticed that it didn't have the correct version, but I
[
https://issues.apache.org/jira/browse/TIKA-4172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17789647#comment-17789647
]
Tilman Hausherr commented on TIKA-4172:
---
Your file starts with 00 14 64 30
[
https://issues.apache.org/jira/browse/TIKA-4172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17789542#comment-17789542
]
Tilman Hausherr commented on TIKA-4172:
---
application/octet-stream is defined as the default
[
https://issues.apache.org/jira/browse/TIKA-4172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17789318#comment-17789318
]
Tilman Hausherr commented on TIKA-4172:
---
https://tika.apache.org/2.1.0/detection.html
"
[
https://issues.apache.org/jira/browse/TIKA-4172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17788982#comment-17788982
]
Tilman Hausherr edited comment on TIKA-4172 at 11/23/23 5:05 AM:
-
Which
1 - 100 of 845 matches
Mail list logo