Hi,
What do you mean, the detection is faulty? What is the expected result in
that case?
Thanks,
Tyler
On Mar 3, 2015 1:10 AM, "Oleg Tikhonov" wrote:
> Hi,
> Just for the record ...
> It can happen if a file contains context that at least written in two
> different languages. For instance, the
Hi,
Just for the record ...
It can happen if a file contains context that at least written in two
different languages. For instance, the first half of file, say, is a German
and the second one, say ... a French. In such case detection would be
faulty.
Br,
Oleg
On 3 Mar 2015 04:03, "Tyler Palsulich
[
https://issues.apache.org/jira/browse/TIKA-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14344407#comment-14344407
]
Hudson commented on TIKA-995:
-
SUCCESS: Integrated in tika-trunk-jdk1.7 #525 (See
[https://buil
[
https://issues.apache.org/jira/browse/TIKA-998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich closed TIKA-998.
Resolution: Won't Fix
This is >2 years old. So, I'm closing as Won't Fix. Feel free to reopen if
you'
[
https://issues.apache.org/jira/browse/TIKA-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich resolved TIKA-995.
--
Resolution: Fixed
Assignee: Tyler Palsulich
Really dropping the ball on these issues, [~mark
[
https://issues.apache.org/jira/browse/TIKA-885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich closed TIKA-885.
Resolution: Not a Problem
Awesome. Thank you!
> Possible ConcurrentModificationException while access
[
https://issues.apache.org/jira/browse/TIKA-885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14344307#comment-14344307
]
Luis Filipe Nassif commented on TIKA-885:
-
This can be closed with "Not a problem",
[
https://issues.apache.org/jira/browse/TIKA-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich closed TIKA-993.
Resolution: Cannot Reproduce
This issue is >2 years old and has no attachment for the text. So, I'm cl
[
https://issues.apache.org/jira/browse/TIKA-987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14344119#comment-14344119
]
Tyler Palsulich commented on TIKA-987:
--
Issue still exists in Tika 1.8-SNAPSHOT. Didn't
[
https://issues.apache.org/jira/browse/TIKA-985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14344076#comment-14344076
]
Tyler Palsulich commented on TIKA-985:
--
Again, sorry for letting this fall off, [~marku
[
https://issues.apache.org/jira/browse/TIKA-980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14344068#comment-14344068
]
Tyler Palsulich commented on TIKA-980:
--
Sorry for having this issue fall off the radar,
[
https://issues.apache.org/jira/browse/TIKA-978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich resolved TIKA-978.
--
Resolution: Fixed
Seems to be fixed in 1.8-SNAPSHOT:
{code}
[INFO] BUILD SUCCESS
[INFO] -
[
https://issues.apache.org/jira/browse/TIKA-974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich updated TIKA-974:
-
Fix Version/s: 2.0
> No longer return charset info in Metadata's CONTENT_ENCODING
>
[
https://issues.apache.org/jira/browse/TIKA-972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich resolved TIKA-972.
--
Resolution: Fixed
Marking as Fixed, since PDFBOX-1512 was fixed in PDFBox 1.8.8 (Tika's current
v
[
https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343876#comment-14343876
]
Tyler Palsulich commented on TIKA-891:
--
I was mistaken when I said there are only 3 PUT
[
https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich updated TIKA-891:
-
Fix Version/s: (was: 1.8)
1.9
> Use POST in addition to PUT on method calls i
Tyler Palsulich created TIKA-1564:
-
Summary: Organize tika-server package structure
Key: TIKA-1564
URL: https://issues.apache.org/jira/browse/TIKA-1564
Project: Tika
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343845#comment-14343845
]
Tyler Palsulich commented on TIKA-891:
--
Ah... I meant they were both equally right/wron
[
https://issues.apache.org/jira/browse/TIKA-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343665#comment-14343665
]
Tim Allison edited comment on TIKA-944 at 3/2/15 9:53 PM:
--
Some ite
[
https://issues.apache.org/jira/browse/TIKA-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343665#comment-14343665
]
Tim Allison edited comment on TIKA-944 at 3/2/15 9:53 PM:
--
Some ite
[
https://issues.apache.org/jira/browse/TIKA-964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-964.
--
Resolution: Won't Fix
Let's move users of tika-app's server to Apache CXF's JAX-RS server in the
tika-ser
[
https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343816#comment-14343816
]
Sergey Beryozkin edited comment on TIKA-891 at 3/2/15 9:44 PM:
---
[
https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343816#comment-14343816
]
Sergey Beryozkin commented on TIKA-891:
---
If I said PUT was the same as POST I'd be pro
[
https://issues.apache.org/jira/browse/TIKA-964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343795#comment-14343795
]
Vitaliy Filippov commented on TIKA-964:
---
Oh, thanks. I see the new interface is very s
[
https://issues.apache.org/jira/browse/TIKA-964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343765#comment-14343765
]
Tim Allison edited comment on TIKA-964 at 3/2/15 9:10 PM:
--
As a new
[
https://issues.apache.org/jira/browse/TIKA-964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343765#comment-14343765
]
Tim Allison commented on TIKA-964:
--
As a newbie to JAX-RS a few months ago(?), I was initia
[
https://issues.apache.org/jira/browse/TIKA-964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343753#comment-14343753
]
Vitaliy Filippov commented on TIKA-964:
---
I use old tika server to index binary files i
[
https://issues.apache.org/jira/browse/TIKA-964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343750#comment-14343750
]
Tyler Palsulich commented on TIKA-964:
--
+1, Tim. Come to think of it, I'm not sure what
[
https://issues.apache.org/jira/browse/TIKA-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich closed TIKA-1545.
-
Resolution: Won't Fix
Fix Version/s: (was: 1.8)
This type of thing should be more of a d
[
https://issues.apache.org/jira/browse/TIKA-1301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343743#comment-14343743
]
Tim Allison edited comment on TIKA-1301 at 3/2/15 8:52 PM:
---
Moved
[
https://issues.apache.org/jira/browse/TIKA-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343742#comment-14343742
]
Hudson commented on TIKA-758:
-
SUCCESS: Integrated in tika-trunk-jdk1.7 #524 (See
[https://buil
[
https://issues.apache.org/jira/browse/TIKA-1301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343743#comment-14343743
]
Tim Allison commented on TIKA-1301:
---
Moved to a new server: 162.242.228.174:9998
> Estab
[
https://issues.apache.org/jira/browse/TIKA-964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343735#comment-14343735
]
Tim Allison commented on TIKA-964:
--
My preference would be to close as "not going to fix" b
[
https://issues.apache.org/jira/browse/TIKA-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343717#comment-14343717
]
Tim Allison commented on TIKA-758:
--
And then I remembered
[this|https://issues.apache.org/
[
https://issues.apache.org/jira/browse/TIKA-758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich resolved TIKA-758.
--
Resolution: Fixed
Worarounds removed in r1663415. Thanks, Tim!
> Address TODOs when we upgrade to
[
https://issues.apache.org/jira/browse/TIKA-964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343675#comment-14343675
]
Tyler Palsulich commented on TIKA-964:
--
This seems like a worthy change. But, it looks
[
https://issues.apache.org/jira/browse/TIKA-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343672#comment-14343672
]
Tim Allison commented on TIKA-758:
--
Y, we should be good. Please remove or let me know if
[
https://issues.apache.org/jira/browse/TIKA-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343665#comment-14343665
]
Tim Allison commented on TIKA-944:
--
There's a slight disconnect in how we handle extraction
[
https://issues.apache.org/jira/browse/TIKA-960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich closed TIKA-960.
Resolution: Not a Problem
I tested this locally with
[setSuppressDuplicateOverlappingText|http://tika
[
https://issues.apache.org/jira/browse/TIKA-955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343631#comment-14343631
]
Tim Allison commented on TIKA-955:
--
Let's leave this one open. It is on my list to get to
[
https://issues.apache.org/jira/browse/TIKA-959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich closed TIKA-959.
Resolution: Cannot Reproduce
This issue is >2 years old and we don't have a test file. Likely an encod
[
https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343620#comment-14343620
]
Tyler Palsulich commented on TIKA-954:
--
91468cee-fb0a-4692-adfd-c2b3cb0613da.docx now t
[
https://issues.apache.org/jira/browse/TIKA-955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343624#comment-14343624
]
Tyler Palsulich commented on TIKA-955:
--
Is there interest in implementing this? Does an
[
https://issues.apache.org/jira/browse/TIKA-954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich updated TIKA-954:
-
Description:
Stack trace produced with attached docx file
{code}
2012-07-13_04:45:36.86910 java.lang
[
https://issues.apache.org/jira/browse/TIKA-456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343606#comment-14343606
]
Tyler Palsulich commented on TIKA-456:
--
bq. If we allow the user to configure the Threa
[
https://issues.apache.org/jira/browse/TIKA-953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich resolved TIKA-953.
--
Resolution: Fixed
Since we're now at Compress 1.9 and both files are detected correctly, I'm
clos
[
https://issues.apache.org/jira/browse/TIKA-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343599#comment-14343599
]
Tyler Palsulich commented on TIKA-944:
--
We're definitely a lot closer! Language detecti
[
https://issues.apache.org/jira/browse/TIKA-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343566#comment-14343566
]
Nick Burch commented on TIKA-944:
-
With the server as-is, you can get the metadata as text,
[
https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343520#comment-14343520
]
Tyler Palsulich commented on TIKA-891:
--
[~serg.berezhnoy], I think I misunderstood your
Don't need a 2.0 label, can just mark that as the fix version! It can't
hurt to tag them early.
Does anyone else have any input on issue labeling, fix versions, affects
versions, issue type, priority, etc? Right now, I feel like I pick some
attributes arbitrarily (especially priority below blocker
[
https://issues.apache.org/jira/browse/TIKA-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich updated TIKA-928:
-
Fix Version/s: 2.0
> Separation of Tika Core Properties From Metadata Processing
> -
[
https://issues.apache.org/jira/browse/TIKA-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343500#comment-14343500
]
Tyler Palsulich commented on TIKA-944:
--
+1. It would be nice to expose more of Tika's a
[
https://issues.apache.org/jira/browse/TIKA-770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich updated TIKA-770:
-
Fix Version/s: 2.0
> New ODF metadata keys
> -
>
> Key: TIKA-770
[
https://issues.apache.org/jira/browse/TIKA-675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich closed TIKA-675.
Resolution: Fixed
Marking as fixed. Please see the RecursiveParserWrapper. Thanks Nick.
> PackageExtr
[
https://issues.apache.org/jira/browse/TIKA-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1489:
--
Attachment: testPDF_no_extract_yes_accessibility_owner_user.pdf
testPDF_no_extract_yes_acc
[
https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343373#comment-14343373
]
Sergey Beryozkin commented on TIKA-891:
---
Just to clarify: I've no objections to migrat
[
https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343330#comment-14343330
]
Sergey Beryozkin commented on TIKA-891:
---
Why are both PUT and POST out ? GET does not
[
https://issues.apache.org/jira/browse/TIKA-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343331#comment-14343331
]
Nick Burch commented on TIKA-928:
-
Yup, come Tika 2.0 we can remove the backwards compatible
[
https://issues.apache.org/jira/browse/TIKA-917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich updated TIKA-917:
-
Labels: new-parser (was: )
> Parser for executables (metadata)
> -
[
https://issues.apache.org/jira/browse/TIKA-942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343308#comment-14343308
]
Tyler Palsulich commented on TIKA-942:
--
Hi [~jukkaz], it looks like we now automaticall
[
https://issues.apache.org/jira/browse/TIKA-938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tyler Palsulich closed TIKA-938.
Resolution: Cannot Reproduce
This issue is > 2 years old and we don't have the document. So, I'm closi
[
https://issues.apache.org/jira/browse/TIKA-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343301#comment-14343301
]
Tyler Palsulich commented on TIKA-928:
--
Hey Nick. Is this another issue we'll touch up
[
https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343297#comment-14343297
]
Tyler Palsulich commented on TIKA-891:
--
So, if PUT and POST are out, it looks like the
[
https://issues.apache.org/jira/browse/TIKA-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343261#comment-14343261
]
Tim Allison commented on TIKA-1559:
---
[~sashkap], thank you for raising this issue and off
[
https://issues.apache.org/jira/browse/TIKA-456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343193#comment-14343193
]
Tim Allison commented on TIKA-456:
--
[~tpalsulich], until you pinged me on this, I regret th
[
https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14342977#comment-14342977
]
Sergey Beryozkin commented on TIKA-891:
---
IMHO it might make sense to keep PUT as depre
66 matches
Mail list logo