[PR] Bump aws.version from 1.12.683 to 1.12.684 [tika]

2024-03-20 Thread via GitHub
dependabot[bot] opened a new pull request, #1671: URL: https://github.com/apache/tika/pull/1671 Bumps `aws.version` from 1.12.683 to 1.12.684. Updates `com.amazonaws:aws-java-sdk-s3` from 1.12.683 to 1.12.684 Changelog Sourced from

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829294#comment-17829294 ] ASF GitHub Bot commented on TIKA-4038: -- sandeshkr419 commented on PR #1130: URL:

Re: [PR] TIKA-4038: Remove shading of `tika-parsers-standard-package` [tika]

2024-03-20 Thread via GitHub
sandeshkr419 commented on PR #1130: URL: https://github.com/apache/tika/pull/1130#issuecomment-2010569496 Thanks @tballison @THausherr - I'm able to upgrade tika now. Last question, when are we expecting 2.9.2 to be available/released? -- This is an automated message from the Apache

Re: [PR] Support for adding custom tika configuration [tika-helm]

2024-03-20 Thread via GitHub
ahilmathew commented on code in PR #15: URL: https://github.com/apache/tika-helm/pull/15#discussion_r1532770531 ## README.md: ## @@ -84,6 +89,27 @@ while true; do kubectl --namespace tika-test port-forward $POD_NAME 9998:$CONTAI * Install it: - with Helm 3: `helm install

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829292#comment-17829292 ] ASF GitHub Bot commented on TIKA-4038: -- tballison commented on PR #1130: URL:

Re: [PR] TIKA-4038: Remove shading of `tika-parsers-standard-package` [tika]

2024-03-20 Thread via GitHub
tballison commented on PR #1130: URL: https://github.com/apache/tika/pull/1130#issuecomment-2010508279 https://github.com/apache/tika/pull/1130#issuecomment-2010280292 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829287#comment-17829287 ] ASF GitHub Bot commented on TIKA-4038: -- sandeshkr419 commented on PR #1130: URL:

Re: [PR] TIKA-4038: Remove shading of `tika-parsers-standard-package` [tika]

2024-03-20 Thread via GitHub
sandeshkr419 commented on PR #1130: URL: https://github.com/apache/tika/pull/1130#issuecomment-2010476782 Thanks @tballison and @THausherr for the quick help on addressing this. I still have a blocker in consuming 2.8 and above tika dependencies. Would you happen to have any

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829284#comment-17829284 ] ASF GitHub Bot commented on TIKA-4038: -- tballison commented on PR #1130: URL:

Re: [PR] TIKA-4038: Remove shading of `tika-parsers-standard-package` [tika]

2024-03-20 Thread via GitHub
tballison commented on PR #1130: URL: https://github.com/apache/tika/pull/1130#issuecomment-2010467082 Doh! Thank you, @THausherr . I'm happy to cherry-pick that bit as well. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[jira] [Commented] (TIKA-4211) Tika extractor fails to extract embedded excel from pptx

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829281#comment-17829281 ] ASF GitHub Bot commented on TIKA-4211: -- tballison opened a new pull request, #1670: URL:

[PR] TIKA-4211 -- first attempt/shot in the dark [tika]

2024-03-20 Thread via GitHub
tballison opened a new pull request, #1670: URL: https://github.com/apache/tika/pull/1670 Thanks for your contribution to [Apache Tika](https://tika.apache.org/)! Your help is appreciated! Before opening the pull request, please verify that * there is an open issue on the

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829280#comment-17829280 ] ASF GitHub Bot commented on TIKA-4038: -- THausherr commented on PR #1130: URL:

Re: [PR] TIKA-4038: Remove shading of `tika-parsers-standard-package` [tika]

2024-03-20 Thread via GitHub
THausherr commented on PR #1130: URL: https://github.com/apache/tika/pull/1130#issuecomment-2010370590 Yes; although I see that your last improvement wasn't added to 2.9.2, I'll do it. @gastaldi you can test with a snapshot

[jira] [Commented] (TIKA-4213) Improvements to jdbc pipes reporter

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829275#comment-17829275 ] ASF GitHub Bot commented on TIKA-4213: -- tballison opened a new pull request, #1669: URL:

[PR] TIKA-4213 -- improve jdbc pipes reporter [tika]

2024-03-20 Thread via GitHub
tballison opened a new pull request, #1669: URL: https://github.com/apache/tika/pull/1669 Thanks for your contribution to [Apache Tika](https://tika.apache.org/)! Your help is appreciated! Before opening the pull request, please verify that * there is an open issue on the

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829269#comment-17829269 ] ASF GitHub Bot commented on TIKA-4038: -- tballison commented on PR #1130: URL:

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829270#comment-17829270 ] ASF GitHub Bot commented on TIKA-4038: -- tballison commented on PR #1130: URL:

Re: [PR] TIKA-4038: Remove shading of `tika-parsers-standard-package` [tika]

2024-03-20 Thread via GitHub
tballison commented on PR #1130: URL: https://github.com/apache/tika/pull/1130#issuecomment-2010281749 I think the iworks and compress thing is fixed in 1.26.1. @THausherr does that sound right? The iworks issue rings a bell... -- This is an automated message from the Apache Git Service.

Re: [PR] TIKA-4038: Remove shading of `tika-parsers-standard-package` [tika]

2024-03-20 Thread via GitHub
tballison commented on PR #1130: URL: https://github.com/apache/tika/pull/1130#issuecomment-2010280292 Sorry, I haven't looked carefully at your gradle file...is it pulling in transitive dependencies, like `tika-parser-misc-office-module` for example? -- This is an automated message from

Re: About Tika 2.9.2 release date

2024-03-20 Thread Tim Allison
Fellow devs and community, I'd like to fix TIKA-4211 before the next release. It has been a while since our last 2.x release. What do you think about aiming for starting the voting process early next week? Any other blockers? On Tue, Mar 19, 2024 at 7:49 PM Shu Peng wrote: > Dear Tika Team, >

[jira] [Commented] (TIKA-4211) Tika extractor fails to extract embedded excel from pptx

2024-03-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829265#comment-17829265 ] Tim Allison commented on TIKA-4211: --- This is awesome. Thank you! I'll see if I can create a demo file

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2024-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17828943#comment-17828943 ] ASF GitHub Bot commented on TIKA-4038: -- gastaldi commented on PR #1130: URL:

Re: [PR] TIKA-4038: Remove shading of `tika-parsers-standard-package` [tika]

2024-03-20 Thread via GitHub
gastaldi commented on PR #1130: URL: https://github.com/apache/tika/pull/1130#issuecomment-2009549387 No idea what can be causing that, perhaps @tballison might know -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[jira] [Commented] (TIKA-4211) Tika extractor fails to extract embedded excel from pptx

2024-03-20 Thread Xiaohong Yang (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17828942#comment-17828942 ] Xiaohong Yang commented on TIKA-4211: - For step 0: if you run java -jar tika-app-2.9.1.jar -J -t