[
https://issues.apache.org/jira/browse/TIKA-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16016811#comment-16016811
]
Luis Filipe Nassif commented on TIKA-2372:
--
7zip supports it and dozens of other formats (iso,
I think this would be ok if we added a warning that -z is different and a
pointer to changing the config?
On 2017-05-18 17:02 (-0400), Nick Burch wrote:
> Hi All>
>
> I've just been caught out by the Tika App's -z on a PDF not extracting the >
> embedded images. I think we probably shouldn't
Hi All
I've just been caught out by the Tika App's -z on a PDF not extracting the
embedded images. I think we probably shouldn't tweak the default config
for the other Tika App modes, but what about extract? Any reason why we
shouldn't turn on the PDF Parser option "extractInlineImages" when
[
https://issues.apache.org/jira/browse/TIKA-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16016352#comment-16016352
]
Hudson commented on TIKA-2372:
--
UNSTABLE: Integrated in Jenkins build Tika-trunk #1271 (See
[
https://issues.apache.org/jira/browse/TIKA-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16016324#comment-16016324
]
Nick Burch commented on TIKA-2372:
--
For a GPL licensed library, catacombae
Nick Burch created TIKA-2372:
Summary: OSX DMG support
Key: TIKA-2372
URL: https://issues.apache.org/jira/browse/TIKA-2372
Project: Tika
Issue Type: Improvement
Components: parser
[
https://issues.apache.org/jira/browse/TIKA-1334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16016303#comment-16016303
]
Tom Barber commented on TIKA-1334:
--
my new guy is looking for an excuse to get started in programming I'll
Please DO NOT use Apache Tika for malware scanning. Please use a package that
is designed for malware detection.
From: Prateek Agarwal [mailto:pra.a...@gmail.com]
Sent: Thursday, May 18, 2017 8:17 AM
To: Allison, Timothy B. ; dev@tika.apache.org
Subject: Re: TikaInputStream
+1 Thank you!
-Original Message-
From: Chris Mattmann [mailto:mattm...@apache.org]
Sent: Thursday, May 18, 2017 10:15 AM
To: dev@tika.apache.org
Subject: Re: Tika 1.15
Hey Tim,
I am, Luis is, you are, that’s probably a good enough start. I’ll roll the RC
this afternoon, early AM
Hey Tim,
I am, Luis is, you are, that’s probably a good enough start. I’ll roll the RC
this afternoon, early
AM pacific tomorrow!
Cheers,
Chris
On 5/18/17, 3:56 AM, "Allison, Timothy B." wrote:
Yes, yes we are...if you and fellow devs are ok with the log message in
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16015808#comment-16015808
]
Chris A. Mattmann commented on TIKA-2359:
-
totally agree! this is good for 1.15! thanks Tim and
[
https://issues.apache.org/jira/browse/TIKA-2368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann resolved TIKA-2368.
-
Resolution: Fixed
Assignee: Tim Allison
Fix Version/s: 1.15
thanks Tim!
>
[
https://issues.apache.org/jira/browse/TIKA-2368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16015800#comment-16015800
]
Chris A. Mattmann commented on TIKA-2368:
-
+1
> Clean up SentimentParser dependencies
>
Julien Massiera created TIKA-2371:
-
Summary: Check properties presence - PDFParser
Key: TIKA-2371
URL: https://issues.apache.org/jira/browse/TIKA-2371
Project: Tika
Issue Type: Improvement
Hello all,
Over the last year or so we in Commons have been working towards a newly
released component “commons-text,”, and we were wondering if folks wanted
to begin consuming commons-text so that we can consolidate the maintenance
of the code performing edit distances and similarity scores (for
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16015636#comment-16015636
]
Hudson commented on TIKA-2359:
--
SUCCESS: Integrated in Jenkins build Tika-trunk #1270 (See
[
https://issues.apache.org/jira/browse/TIKA-2368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16015635#comment-16015635
]
Hudson commented on TIKA-2368:
--
SUCCESS: Integrated in Jenkins build Tika-trunk #1270 (See
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16015601#comment-16015601
]
Luis Filipe Nassif commented on TIKA-2359:
--
Hi [~talli...@mitre.org]! I am ok with the message for
[
https://issues.apache.org/jira/browse/TIKA-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16015597#comment-16015597
]
Hudson commented on TIKA-2370:
--
SUCCESS: Integrated in Jenkins build Tika-trunk #1269 (See
Yes, yes we are...if you and fellow devs are ok with the log message in
TIKA-2359.
Happy to change that message if there are any concerns/recommendations.
Onward! Thank you!
Cheers,
Tim
-Original Message-
From: Chris Mattmann [mailto:mattm...@apache.org]
Sent: Wednesday,
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16015571#comment-16015571
]
Tim Allison edited comment on TIKA-2359 at 5/18/17 10:55 AM:
-
I just added
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16015571#comment-16015571
]
Tim Allison commented on TIKA-2359:
---
How about:
{noformat}
LOG.info("Tesseract OCR is
[
https://issues.apache.org/jira/browse/TIKA-2360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-2360.
---
Resolution: Fixed
Sounds like we're in concurrence. Again, [~chrismattmann], apologies for
moving
[
https://issues.apache.org/jira/browse/TIKA-2368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16015560#comment-16015560
]
Tim Allison commented on TIKA-2368:
---
I added tika-translate to the exclusion list. We'll still get an
[
https://issues.apache.org/jira/browse/TIKA-2368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2368:
--
Priority: Minor (was: Blocker)
> Clean up SentimentParser dependencies
>
While Apache Tika can be used to support forensic analysis/malware detection,
it is NOT designed to identify malware. DO NOT rely on Apache Tika to identify
malware.
I'd recommend using clamav or a commercial antivirus program.
If you want to use Tika for another reason (text/metadata
[
https://issues.apache.org/jira/browse/TIKA-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2370:
--
Description: [~icirellik] opened https://github.com/apache/tika/pull/181
to point out that we're not
[
https://issues.apache.org/jira/browse/TIKA-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2370:
--
Description: [~icirellik] opened https://github.com/apache/tika/pull/181
to point out that we're not
[
https://issues.apache.org/jira/browse/TIKA-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-2370.
---
Resolution: Fixed
Fix Version/s: 1.15
Thank you [~icirellik]!
> Close Font in TrueTypeParser
>
Tim Allison created TIKA-2370:
-
Summary: Close Font in TrueTypeParser
Key: TIKA-2370
URL: https://issues.apache.org/jira/browse/TIKA-2370
Project: Tika
Issue Type: Bug
Reporter: Tim
30 matches
Mail list logo