[
https://issues.apache.org/jira/browse/TIKA-4181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17832336#comment-17832336
]
ASF GitHub Bot commented on TIKA-4181:
--
bartek commented on code in PR #1702:
URL: ht
[
https://issues.apache.org/jira/browse/TIKA-4181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17832337#comment-17832337
]
ASF GitHub Bot commented on TIKA-4181:
--
bartek commented on code in PR #1702:
URL: ht
bartek commented on code in PR #1702:
URL: https://github.com/apache/tika/pull/1702#discussion_r1544981545
##
tika-pipes/tika-grpc/src/main/proto/tika.proto:
##
Review Comment:
For your consideration @nddipiazza, I ran `buf lint` on this protobuf (as I
am syncing it to a l
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17832293#comment-17832293
]
Aamir commented on TIKA-4231:
-
No, this doesn't look better. Actually, I would say that it loo
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17832291#comment-17832291
]
Tilman Hausherr commented on TIKA-4231:
---
I have attached an extraction with pdfbox 2
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated TIKA-4231:
--
Attachment: arabic-pdfbox.txt
> Parsing Arabic PDF is returning bad data
> -
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aamir updated TIKA-4231:
Affects Version/s: 2.9.1
> Parsing Arabic PDF is returning bad data
>
>
>
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aamir updated TIKA-4231:
Description:
Attached is a PDF with Arabic text in it.
When parsed using tika version 2.6.0 or 2.9.1, it produces g
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17832289#comment-17832289
]
Aamir commented on TIKA-4231:
-
The problem persists with 2.9.1
I am updating the versions in t
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17832284#comment-17832284
]
Tilman Hausherr commented on TIKA-4231:
---
This doesn't change my argument. The latest
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aamir updated TIKA-4231:
Description:
Attached is a PDF with Arabic text in it.
When parsed using tika version 2.6.0, it produces gibberish
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17832260#comment-17832260
]
Aamir commented on TIKA-4231:
-
Sorry, I meant tika-parsers-standard-package 2.6.0
> Parsing A
[
https://issues.apache.org/jira/browse/TIKA-4231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17832258#comment-17832258
]
Tilman Hausherr commented on TIKA-4231:
---
The current tika version is 2.9.1, soon to
Aamir created TIKA-4231:
---
Summary: Parsing Arabic PDF is returning bad data
Key: TIKA-4231
URL: https://issues.apache.org/jira/browse/TIKA-4231
Project: Tika
Issue Type: Bug
Affects Versions: 2.6.0
nddipiazza opened a new pull request, #1702:
URL: https://github.com/apache/tika/pull/1702
Add an Apache Tika GRPC Server
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To
Never mind, I found a way to make it work with JUnit 5 after some googling.
On Fri, Mar 29, 2024 at 3:01 AM Nicholas DiPiazza <
nicholas.dipia...@gmail.com> wrote:
> Is there some easy way I can relax the JUnit 4 ban for the gRPC service?
>
>
>
Is there some easy way I can relax the JUnit 4 ban for the gRPC service?