[jira] [Commented] (TIKA-3374) Non-Unicode archive entry name is garbled

2021-04-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17336999#comment-17336999 ] ASF GitHub Bot commented on TIKA-3374: -- Ryan421 commented on a change in pull request #433: URL:

[GitHub] [tika] Ryan421 commented on a change in pull request #433: [TIKA-3374] Apply charset detection for archive entry name

2021-04-29 Thread GitBox
Ryan421 commented on a change in pull request #433: URL: https://github.com/apache/tika/pull/433#discussion_r623514986 ## File path: tika-parsers/tika-parsers-classic/tika-parsers-classic-modules/tika-parser-pkg-module/src/main/java/org/apache/tika/parser/pkg/PackageParser.java

[jira] [Commented] (TIKA-3376) Improve handling of write limit reached in new /tika json endpoint

2021-04-29 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17335762#comment-17335762 ] Hudson commented on TIKA-3376: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #214 (See

[jira] [Commented] (TIKA-3374) Non-Unicode archive entry name is garbled

2021-04-29 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17335746#comment-17335746 ] Hudson commented on TIKA-3374: -- SUCCESS: Integrated in Jenkins build Tika » tika-branch1x-jdk8 #122 (See

[jira] [Commented] (TIKA-3376) Improve handling of write limit reached in new /tika json endpoint

2021-04-29 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17335747#comment-17335747 ] Hudson commented on TIKA-3376: -- SUCCESS: Integrated in Jenkins build Tika » tika-branch1x-jdk8 #122 (See

[jira] [Created] (TIKA-3376) Improve handling of write limit reached in new /tika json endpoint

2021-04-29 Thread Tim Allison (Jira)
Tim Allison created TIKA-3376: - Summary: Improve handling of write limit reached in new /tika json endpoint Key: TIKA-3376 URL: https://issues.apache.org/jira/browse/TIKA-3376 Project: Tika

[jira] [Commented] (TIKA-3374) Non-Unicode archive entry name is garbled

2021-04-29 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17335680#comment-17335680 ] Hudson commented on TIKA-3374: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #213 (See

[jira] [Commented] (TIKA-3374) Non-Unicode archive entry name is garbled

2021-04-29 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17335545#comment-17335545 ] Hudson commented on TIKA-3374: -- UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk8 #212 (See

[jira] [Commented] (TIKA-3374) Non-Unicode archive entry name is garbled

2021-04-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17335483#comment-17335483 ] ASF GitHub Bot commented on TIKA-3374: -- tballison merged pull request #433: URL:

[GitHub] [tika] tballison merged pull request #433: [TIKA-3374] Apply charset detection for archive entry name

2021-04-29 Thread GitBox
tballison merged pull request #433: URL: https://github.com/apache/tika/pull/433 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please

[jira] [Commented] (TIKA-3374) Non-Unicode archive entry name is garbled

2021-04-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17335476#comment-17335476 ] ASF GitHub Bot commented on TIKA-3374: -- tballison commented on a change in pull request #433: URL:

[GitHub] [tika] tballison commented on a change in pull request #433: [TIKA-3374] Apply charset detection for archive entry name

2021-04-29 Thread GitBox
tballison commented on a change in pull request #433: URL: https://github.com/apache/tika/pull/433#discussion_r623052485 ## File path: tika-parsers/tika-parsers-classic/tika-parsers-classic-modules/tika-parser-pkg-module/src/main/java/org/apache/tika/parser/pkg/PackageParser.java

Re: Release 1.27?

2021-04-29 Thread Tim Allison
Thank you Konstantin! I’m not planning on updating POI because ooxml schemas lite didn’t have enough classes for our unit tests. Andi recently made some updates on their trunk, and I haven’t had a chance to confirm those fixes work :(. If we wanted to drop the full ooxml schemas into our jar, I

Re: Release 1.27?

2021-04-29 Thread Konstantin Gribov
+1 for release. Are you planning to merge TIKA-3164 (update to POI 5.0.0) for this release? -- Best regards, Konstantin Gribov. On Wed, Apr 28, 2021 at 9:36 PM Oleg Tikhonov wrote: > +1 > > On Wed, Apr 28, 2021, 19:22 Tim Allison wrote: > > > All, > > > > There have been a number of key

[jira] [Updated] (TIKA-3164) Upgrade to POI 5.0.0 when available

2021-04-29 Thread Konstantin Gribov (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-3164: Issue Type: Task (was: Bug) > Upgrade to POI 5.0.0 when available >

[jira] [Commented] (TIKA-3374) Non-Unicode archive entry name is garbled

2021-04-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17335171#comment-17335171 ] ASF GitHub Bot commented on TIKA-3374: -- Ryan421 commented on pull request #433: URL:

[GitHub] [tika] Ryan421 commented on pull request #433: [TIKA-3374] Apply charset detection for archive entry name

2021-04-29 Thread GitBox
Ryan421 commented on pull request #433: URL: https://github.com/apache/tika/pull/433#issuecomment-828968582 Add unit test with a dummy EncodingDetector to verify the charset detection flow is executed.