Re: [PR] TIKA-4252: switch to using the parse context for additional http headers [tika]

2024-06-06 Thread via GitHub
tballison merged PR #1778: URL: https://github.com/apache/tika/pull/1778 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] TIKA-4252: switch to using the parse context for additional http headers [tika]

2024-06-03 Thread via GitHub
tballison commented on PR #1778: URL: https://github.com/apache/tika/pull/1778#issuecomment-2145904427 Ha, @nddipiazza. I did earlier this morning. I chose your choices over mine in the merge, largely. See

Re: [PR] TIKA-4252: switch to using the parse context for additional http headers [tika]

2024-06-03 Thread via GitHub
nddipiazza commented on PR #1778: URL: https://github.com/apache/tika/pull/1778#issuecomment-2145900710 sure will do @tballison sorry didn't see this until now -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

Re: [PR] TIKA-4260 -- add ParseContext to fetchers and emitters [tika]

2024-06-03 Thread via GitHub
tballison closed pull request #1776: TIKA-4260 -- add ParseContext to fetchers and emitters URL: https://github.com/apache/tika/pull/1776 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [PR] TIKA-4260 -- add ParseContext to fetchers and emitters [tika]

2024-06-03 Thread via GitHub
tballison commented on PR #1776: URL: https://github.com/apache/tika/pull/1776#issuecomment-2144945579 I merged this into @nddipiazza 's TIKA-4252 PR. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [PR] Bump com.github.albfernandez:juniversalchardet from 2.4.0 to 2.5.0 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1795: URL: https://github.com/apache/tika/pull/1795 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump org.xerial:sqlite-jdbc from 3.45.3.0 to 3.46.0.0 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1796: URL: https://github.com/apache/tika/pull/1796 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump com.google.guava:guava from 33.2.0-jre to 33.2.1-jre [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1797: URL: https://github.com/apache/tika/pull/1797 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump commons-net:commons-net from 3.10.0 to 3.11.0 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1798: URL: https://github.com/apache/tika/pull/1798 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump org.apache.maven.plugins:maven-javadoc-plugin from 3.6.3 to 3.7.0 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1799: URL: https://github.com/apache/tika/pull/1799 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump org.freemarker:freemarker from 2.3.32 to 2.3.33 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1800: URL: https://github.com/apache/tika/pull/1800 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump org.apache.jackrabbit:oak-jackrabbit-api from 1.62.0 to 1.64.0 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1802: URL: https://github.com/apache/tika/pull/1802 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump org.apache.maven.plugin-tools:maven-plugin-annotations from 3.13.0 to 3.13.1 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1793: URL: https://github.com/apache/tika/pull/1793 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump com.google.errorprone:error_prone_annotations from 2.27.1 to 2.28.0 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1801: URL: https://github.com/apache/tika/pull/1801 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump org.apache.maven.plugins:maven-shade-plugin from 3.5.3 to 3.6.0 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1803: URL: https://github.com/apache/tika/pull/1803 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump org.apache.maven.plugins:maven-enforcer-plugin from 3.4.1 to 3.5.0 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1792: URL: https://github.com/apache/tika/pull/1792 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump aws.version from 1.12.730 to 1.12.734 [tika]

2024-06-02 Thread via GitHub
THausherr merged PR #1794: URL: https://github.com/apache/tika/pull/1794 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[PR] Bump org.apache.maven.plugins:maven-shade-plugin from 3.5.3 to 3.6.0 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1803: URL: https://github.com/apache/tika/pull/1803 Bumps [org.apache.maven.plugins:maven-shade-plugin](https://github.com/apache/maven-shade-plugin) from 3.5.3 to 3.6.0. Commits

[PR] Bump org.apache.jackrabbit:oak-jackrabbit-api from 1.62.0 to 1.64.0 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1802: URL: https://github.com/apache/tika/pull/1802 Bumps org.apache.jackrabbit:oak-jackrabbit-api from 1.62.0 to 1.64.0. [![Dependabot compatibility

[PR] Bump com.google.errorprone:error_prone_annotations from 2.27.1 to 2.28.0 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1801: URL: https://github.com/apache/tika/pull/1801 Bumps [com.google.errorprone:error_prone_annotations](https://github.com/google/error-prone) from 2.27.1 to 2.28.0. Release notes Sourced from

[PR] Bump commons-net:commons-net from 3.10.0 to 3.11.0 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1798: URL: https://github.com/apache/tika/pull/1798 Bumps commons-net:commons-net from 3.10.0 to 3.11.0. [![Dependabot compatibility

[PR] Bump org.freemarker:freemarker from 2.3.32 to 2.3.33 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1800: URL: https://github.com/apache/tika/pull/1800 Bumps org.freemarker:freemarker from 2.3.32 to 2.3.33. [![Dependabot compatibility

[PR] Bump org.apache.maven.plugins:maven-javadoc-plugin from 3.6.3 to 3.7.0 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1799: URL: https://github.com/apache/tika/pull/1799 Bumps [org.apache.maven.plugins:maven-javadoc-plugin](https://github.com/apache/maven-javadoc-plugin) from 3.6.3 to 3.7.0. Commits

[PR] Bump com.google.guava:guava from 33.2.0-jre to 33.2.1-jre [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1797: URL: https://github.com/apache/tika/pull/1797 Bumps [com.google.guava:guava](https://github.com/google/guava) from 33.2.0-jre to 33.2.1-jre. Release notes Sourced from

[PR] Bump org.xerial:sqlite-jdbc from 3.45.3.0 to 3.46.0.0 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1796: URL: https://github.com/apache/tika/pull/1796 Bumps [org.xerial:sqlite-jdbc](https://github.com/xerial/sqlite-jdbc) from 3.45.3.0 to 3.46.0.0. Release notes Sourced from

[PR] Bump aws.version from 1.12.730 to 1.12.734 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1794: URL: https://github.com/apache/tika/pull/1794 Bumps `aws.version` from 1.12.730 to 1.12.734. Updates `com.amazonaws:aws-java-sdk-s3` from 1.12.730 to 1.12.734 Changelog Sourced from

[PR] Bump org.apache.maven.plugin-tools:maven-plugin-annotations from 3.13.0 to 3.13.1 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1793: URL: https://github.com/apache/tika/pull/1793 Bumps [org.apache.maven.plugin-tools:maven-plugin-annotations](https://github.com/apache/maven-plugin-tools) from 3.13.0 to 3.13.1. Commits

[PR] Bump com.github.albfernandez:juniversalchardet from 2.4.0 to 2.5.0 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1795: URL: https://github.com/apache/tika/pull/1795 Bumps [com.github.albfernandez:juniversalchardet](https://github.com/albfernandez/juniversalchardet) from 2.4.0 to 2.5.0. Release notes Sourced from

[PR] Bump org.apache.maven.plugins:maven-enforcer-plugin from 3.4.1 to 3.5.0 [tika]

2024-06-02 Thread via GitHub
dependabot[bot] opened a new pull request, #1792: URL: https://github.com/apache/tika/pull/1792 Bumps [org.apache.maven.plugins:maven-enforcer-plugin](https://github.com/apache/maven-enforcer) from 3.4.1 to 3.5.0. Release notes Sourced from

Re: [PR] Tika 4266 [tika]

2024-05-31 Thread via GitHub
tballison commented on PR #1791: URL: https://github.com/apache/tika/pull/1791#issuecomment-2141697075 Accidentally included maven build cache in this PR. Closing and will reopen shortly. -- This is an automated message from the Apache Git Service. To respond to the message, please log

Re: [PR] Tika 4266 [tika]

2024-05-31 Thread via GitHub
tballison closed pull request #1791: Tika 4266 URL: https://github.com/apache/tika/pull/1791 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[PR] Tika 4266 [tika]

2024-05-30 Thread via GitHub
tballison opened a new pull request, #1791: URL: https://github.com/apache/tika/pull/1791 Thanks for your contribution to [Apache Tika](https://tika.apache.org/)! Your help is appreciated! Before opening the pull request, please verify that * there is an open issue on the

Re: [PR] TIKA-4220 -- revert workaround [tika]

2024-05-30 Thread via GitHub
tballison merged PR #1790: URL: https://github.com/apache/tika/pull/1790 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] TIKA-4229 - Add a Microsoft Graph Fetcher [tika]

2024-05-30 Thread via GitHub
bartek commented on code in PR #1698: URL: https://github.com/apache/tika/pull/1698#discussion_r1620663162 ## tika-pipes/tika-fetchers/tika-fetcher-microsoft-graph/src/main/java/org/apache/tika/pipes/fetchers/microsoftgraph/MicrosoftGraphFetcher.java: ## @@ -0,0 +1,140 @@ +/* +

Re: [PR] TIKA-4252: switch to using the parse context for additional http headers [tika]

2024-05-30 Thread via GitHub
tballison commented on PR #1778: URL: https://github.com/apache/tika/pull/1778#issuecomment-2139485380 @nddipiazza I don't mean to cause you more work... is it possible to rebase on the TIKA-4260 branch or merge into that maybe and we can work together there? -- This is an

Re: [PR] TIKA-4221 -- revert workaround [tika]

2024-05-30 Thread via GitHub
tballison merged PR #1789: URL: https://github.com/apache/tika/pull/1789 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[PR] TIKA-4220 -- revert workaround [tika]

2024-05-30 Thread via GitHub
tballison opened a new pull request, #1790: URL: https://github.com/apache/tika/pull/1790 Thanks for your contribution to [Apache Tika](https://tika.apache.org/)! Your help is appreciated! Before opening the pull request, please verify that * there is an open issue on the

[PR] TIKA-4221 -- revert workaround [tika]

2024-05-30 Thread via GitHub
tballison opened a new pull request, #1789: URL: https://github.com/apache/tika/pull/1789 Thanks for your contribution to [Apache Tika](https://tika.apache.org/)! Your help is appreciated! Before opening the pull request, please verify that * there is an open issue on the

Re: [PR] Bump org.codehaus.mojo:exec-maven-plugin from 3.2.0 to 3.3.0 [tika]

2024-05-27 Thread via GitHub
THausherr merged PR #1781: URL: https://github.com/apache/tika/pull/1781 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump commons-cli:commons-cli from 1.7.0 to 1.8.0 [tika]

2024-05-27 Thread via GitHub
THausherr merged PR #1780: URL: https://github.com/apache/tika/pull/1780 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump io.netty:netty-bom from 4.1.109.Final to 4.1.110.Final [tika]

2024-05-27 Thread via GitHub
THausherr merged PR #1782: URL: https://github.com/apache/tika/pull/1782 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump jakarta.websocket:jakarta.websocket-api from 2.1.1 to 2.2.0 [tika]

2024-05-27 Thread via GitHub
THausherr merged PR #1783: URL: https://github.com/apache/tika/pull/1783 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump org.apache.commons:commons-compress from 1.26.1 to 1.26.2 [tika]

2024-05-27 Thread via GitHub
THausherr merged PR #1784: URL: https://github.com/apache/tika/pull/1784 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump com.google.protobuf:protobuf-java from 3.25.3 to 4.27.0 [tika]

2024-05-27 Thread via GitHub
dependabot[bot] closed pull request #1788: Bump com.google.protobuf:protobuf-java from 3.25.3 to 4.27.0 URL: https://github.com/apache/tika/pull/1788 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [PR] Bump com.google.protobuf:protobuf-java from 3.25.3 to 4.27.0 [tika]

2024-05-27 Thread via GitHub
THausherr commented on PR #1788: URL: https://github.com/apache/tika/pull/1788#issuecomment-2132728855 @dependabot ignore this major version -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [PR] Bump com.google.cloud:google-cloud-storage from 2.38.0 to 2.39.0 [tika]

2024-05-27 Thread via GitHub
THausherr merged PR #1786: URL: https://github.com/apache/tika/pull/1786 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump com.google.protobuf:protobuf-java from 3.25.3 to 4.27.0 [tika]

2024-05-27 Thread via GitHub
dependabot[bot] commented on PR #1788: URL: https://github.com/apache/tika/pull/1788#issuecomment-2132728913 OK, I won't notify you about version 4.x.x again, unless you re-open this PR. -- This is an automated message from the Apache Git Service. To respond to the message, please log on

Re: [PR] Bump aws.version from 1.12.726 to 1.12.730 [tika]

2024-05-27 Thread via GitHub
THausherr merged PR #1787: URL: https://github.com/apache/tika/pull/1787 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Bump org.springframework:spring-context from 5.3.35 to 5.3.36 [tika]

2024-05-27 Thread via GitHub
THausherr merged PR #1785: URL: https://github.com/apache/tika/pull/1785 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[PR] Bump aws.version from 1.12.726 to 1.12.730 [tika]

2024-05-27 Thread via GitHub
dependabot[bot] opened a new pull request, #1787: URL: https://github.com/apache/tika/pull/1787 Bumps `aws.version` from 1.12.726 to 1.12.730. Updates `com.amazonaws:aws-java-sdk-s3` from 1.12.726 to 1.12.730 Changelog Sourced from

[PR] Bump com.google.protobuf:protobuf-java from 3.25.3 to 4.27.0 [tika]

2024-05-27 Thread via GitHub
dependabot[bot] opened a new pull request, #1788: URL: https://github.com/apache/tika/pull/1788 Bumps [com.google.protobuf:protobuf-java](https://github.com/protocolbuffers/protobuf) from 3.25.3 to 4.27.0. Commits See full diff in

Re: [PR] Bump org.apache.maven:maven-model from 3.9.6 to 3.9.7 [tika]

2024-05-26 Thread via GitHub
THausherr merged PR #1779: URL: https://github.com/apache/tika/pull/1779 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[PR] Bump com.google.cloud:google-cloud-storage from 2.38.0 to 2.39.0 [tika]

2024-05-26 Thread via GitHub
dependabot[bot] opened a new pull request, #1786: URL: https://github.com/apache/tika/pull/1786 Bumps [com.google.cloud:google-cloud-storage](https://github.com/googleapis/java-storage) from 2.38.0 to 2.39.0. Release notes Sourced from

[PR] Bump org.springframework:spring-context from 5.3.35 to 5.3.36 [tika]

2024-05-26 Thread via GitHub
dependabot[bot] opened a new pull request, #1785: URL: https://github.com/apache/tika/pull/1785 Bumps [org.springframework:spring-context](https://github.com/spring-projects/spring-framework) from 5.3.35 to 5.3.36. Release notes Sourced from

[PR] Bump org.apache.commons:commons-compress from 1.26.1 to 1.26.2 [tika]

2024-05-26 Thread via GitHub
dependabot[bot] opened a new pull request, #1784: URL: https://github.com/apache/tika/pull/1784 Bumps org.apache.commons:commons-compress from 1.26.1 to 1.26.2. [![Dependabot compatibility

[PR] Bump jakarta.websocket:jakarta.websocket-api from 2.1.1 to 2.2.0 [tika]

2024-05-26 Thread via GitHub
dependabot[bot] opened a new pull request, #1783: URL: https://github.com/apache/tika/pull/1783 Bumps [jakarta.websocket:jakarta.websocket-api](https://github.com/eclipse-ee4j/websocket-api) from 2.1.1 to 2.2.0. Commits See full diff in

[PR] Bump io.netty:netty-bom from 4.1.109.Final to 4.1.110.Final [tika]

2024-05-26 Thread via GitHub
dependabot[bot] opened a new pull request, #1782: URL: https://github.com/apache/tika/pull/1782 Bumps [io.netty:netty-bom](https://github.com/netty/netty) from 4.1.109.Final to 4.1.110.Final. Commits

[PR] Bump org.codehaus.mojo:exec-maven-plugin from 3.2.0 to 3.3.0 [tika]

2024-05-26 Thread via GitHub
dependabot[bot] opened a new pull request, #1781: URL: https://github.com/apache/tika/pull/1781 Bumps [org.codehaus.mojo:exec-maven-plugin](https://github.com/mojohaus/exec-maven-plugin) from 3.2.0 to 3.3.0. Release notes Sourced from

[PR] Bump commons-cli:commons-cli from 1.7.0 to 1.8.0 [tika]

2024-05-26 Thread via GitHub
dependabot[bot] opened a new pull request, #1780: URL: https://github.com/apache/tika/pull/1780 Bumps commons-cli:commons-cli from 1.7.0 to 1.8.0. [![Dependabot compatibility

[PR] Bump org.apache.maven:maven-model from 3.9.6 to 3.9.7 [tika]

2024-05-26 Thread via GitHub
dependabot[bot] opened a new pull request, #1779: URL: https://github.com/apache/tika/pull/1779 Bumps [org.apache.maven:maven-model](https://github.com/apache/maven) from 3.9.6 to 3.9.7. Release notes Sourced from

[PR] TIKA-4252: switch to using the parse context for additional http headers [tika]

2024-05-26 Thread via GitHub
nddipiazza opened a new pull request, #1778: URL: https://github.com/apache/tika/pull/1778 * add a parse context * allow additional data to be sent int the parse context to the fetch method -- This is an automated message from the Apache Git Service. To respond to the message, please

Re: [PR] TIKA-4252 fetch tuple metadata [tika]

2024-05-26 Thread via GitHub
nddipiazza closed pull request #1774: TIKA-4252 fetch tuple metadata URL: https://github.com/apache/tika/pull/1774 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [PR] TIKA-4260 -- add ParseContext to fetchers and emitters [tika]

2024-05-24 Thread via GitHub
tballison commented on PR #1776: URL: https://github.com/apache/tika/pull/1776#issuecomment-2130252325 I'm now getting a clean build with `-DskipTests` lol... That's a step at least. The big TODO is to add serialization of the ParseContext in

Re: [PR] TIKA-4261 -- add attachment type metadata filter [tika]

2024-05-24 Thread via GitHub
tballison merged PR #1777: URL: https://github.com/apache/tika/pull/1777 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[PR] TIKA-4261 -- add attachment type metadata filter [tika]

2024-05-24 Thread via GitHub
tballison opened a new pull request, #1777: URL: https://github.com/apache/tika/pull/1777 Thanks for your contribution to [Apache Tika](https://tika.apache.org/)! Your help is appreciated! Before opening the pull request, please verify that * there is an open issue on the

Re: [PR] TIKA-4259 [tika]

2024-05-24 Thread via GitHub
tballison merged PR #1775: URL: https://github.com/apache/tika/pull/1775 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] TIKA-4260 -- add ParseContext to fetchers and emitters [tika]

2024-05-24 Thread via GitHub
tballison commented on PR #1776: URL: https://github.com/apache/tika/pull/1776#issuecomment-2129532368 Current status -- only working in tika-core. Much more needs to be done throughout the repo to get this working. -- This is an automated message from the Apache Git Service. To respond

[PR] TIKA-4260 -- add ParseContext to fetchers and emitters [tika]

2024-05-24 Thread via GitHub
tballison opened a new pull request, #1776: URL: https://github.com/apache/tika/pull/1776 Thanks for your contribution to [Apache Tika](https://tika.apache.org/)! Your help is appreciated! Before opening the pull request, please verify that * there is an open issue on the

[PR] TIKA-4259 [tika]

2024-05-23 Thread via GitHub
tballison opened a new pull request, #1775: URL: https://github.com/apache/tika/pull/1775 Thanks for your contribution to [Apache Tika](https://tika.apache.org/)! Your help is appreciated! Before opening the pull request, please verify that * there is an open issue on the

Re: [PR] TIKA-4252 fetch tuple metadata [tika]

2024-05-23 Thread via GitHub
nddipiazza commented on PR #1774: URL: https://github.com/apache/tika/pull/1774#issuecomment-2127120285 oops not quite right - need to sync up with @tballison to make sure i'm covering his needs and not just my own -- This is an automated message from the Apache Git Service. To respond

[PR] TIKA-4252 fetch tuple metadata [tika]

2024-05-22 Thread via GitHub
nddipiazza opened a new pull request, #1774: URL: https://github.com/apache/tika/pull/1774 Add ability to add Tika Fetch Metadata -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-21 Thread via GitHub
tballison commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2123275951 At long last, I think we're all set. Thank you @fpiesche for opening this and for all of your work on it! I'm sorry I took the more daft option, but here we are. If you still want,

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-21 Thread via GitHub
tballison closed pull request #19: Add Github CI workflows for multi-arch Docker images -- TIKA-4258 URL: https://github.com/apache/tika-docker/pull/19 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

Re: [PR] first draft build local multi-arch images [tika-docker]

2024-05-21 Thread via GitHub
tballison merged PR #21: URL: https://github.com/apache/tika-docker/pull/21 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Upgrade from jammy to noble [tika-docker]

2024-05-21 Thread via GitHub
tballison merged PR #22: URL: https://github.com/apache/tika-docker/pull/22 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-21 Thread via GitHub
nextgens commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2122027255 I have just tried ``apache/tika:2.9.2-alpha-multi-arch-full`` on an Ampere A1 (arm64) box and that seems to work fine -- This is an automated message from the Apache Git Service. To

Re: [PR] TIKA-4257 -- lower dbf priority [tika]

2024-05-20 Thread via GitHub
tballison merged PR #1773: URL: https://github.com/apache/tika/pull/1773 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[PR] TIKA-4257 -- lower dbf priority [tika]

2024-05-20 Thread via GitHub
tballison opened a new pull request, #1773: URL: https://github.com/apache/tika/pull/1773 Thanks for your contribution to [Apache Tika](https://tika.apache.org/)! Your help is appreciated! Before opening the pull request, please verify that * there is an open issue on the

Re: [PR] Upgrade from jammy to noble [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #22: URL: https://github.com/apache/tika-docker/pull/22#issuecomment-2121117348 Looks like `minimal` gets a little smaller and `full` gets a little bigger, but nothing eye-opening. -- This is an automated message from the Apache Git Service. To respond to the

Re: [PR] Upgrade from jammy to noble [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #22: URL: https://github.com/apache/tika-docker/pull/22#issuecomment-2121108427 It looks like tesseract 5.3.4 had made it into noble and we don't have to pull from `ppa:alex-p/tesseract-ocr5` any more. Let me know if there are any objections to this upgrade.

[PR] Upgrade from jammy to noble [tika-docker]

2024-05-20 Thread via GitHub
tballison opened a new pull request, #22: URL: https://github.com/apache/tika-docker/pull/22 (no comment) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe,

Re: [PR] first draft build local multi-arch images [tika-docker]

2024-05-20 Thread via GitHub
stumpylog commented on PR #21: URL: https://github.com/apache/tika-docker/pull/21#issuecomment-2120927505 Seems good to me. I built it a couple times, and the recent updates makes the builder clean up better -- This is an automated message from the Apache Git Service. To respond to the

Re: [PR] first draft build local multi-arch images [tika-docker]

2024-05-20 Thread via GitHub
stumpylog commented on code in PR #21: URL: https://github.com/apache/tika-docker/pull/21#discussion_r1607058472 ## docker-tool.sh: ## @@ -17,15 +17,24 @@ # specific language governing permissions and limitations # under the License. +stop_and_die() { + docker buildx

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120876458 > I think building multiarch with buildx requires QEMU, but as long as that's available on the host doing the builds just running buildx should be perfectly fine - that's all the

Re: [PR] first draft build local multi-arch images [tika-docker]

2024-05-20 Thread via GitHub
stumpylog commented on PR #21: URL: https://github.com/apache/tika-docker/pull/21#issuecomment-2120874896 I pulled the image on a Pi (arm64) and had no problems pulling or starting. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-20 Thread via GitHub
hegerdes commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120869924 > Wow...it looks like it actually worked?! > > Can you all give this a shot?

Re: [PR] first draft build local multi-arch images [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #21: URL: https://github.com/apache/tika-docker/pull/21#issuecomment-2120848940 Building `full` now... -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [PR] first draft build local multi-arch images [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #21: URL: https://github.com/apache/tika-docker/pull/21#issuecomment-2120847590 Got it. Thank you @stumpylog ! How's this:

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120845395 Wow...it looks like it actually worked?! Can you all give this a shot?

Re: [PR] first draft build local multi-arch images [tika-docker]

2024-05-20 Thread via GitHub
stumpylog commented on PR #21: URL: https://github.com/apache/tika-docker/pull/21#issuecomment-2120837966 This works for me, except for the pushing obviously. One minor annoyance, if the build does fail, the builder will still be around, so it will have to be manually removed before

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120807457 Let's add other registries on a later ticket? How's this look? https://github.com/apache/tika-docker/pull/21 I haven't tested it. -- This is an automated message from

[PR] first draft build local multi-arch images [tika-docker]

2024-05-20 Thread via GitHub
tballison opened a new pull request, #21: URL: https://github.com/apache/tika-docker/pull/21 (no comment) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe,

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-20 Thread via GitHub
fpiesche commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120688287 I think building multiarch with buildx requires QEMU, but as long as that's available on the host doing the builds just running buildx should be perfectly fine - that's all the github

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120577030 > If securing the credentials required for dockerhub is the only concern, I think using github container registry instead may be a great solution.

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120574718 How's this for a proposed way forward? We basically keep our current workflow on the release manager's laptop/hardware. We modify our build scripts to build a single-arch image,

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-20 Thread via GitHub
nextgens commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120530390 If securing the credentials required for dockerhub is the only concern, I think using github container registry instead may be a great solution.

Re: [PR] Add Github CI workflows for multi-arch Docker images -- TIKA-4258 [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120501490 It looks like Airflow at least has moved away from github actions and moved towards a release manager building locally and pushing to dockerhub --

Re: [PR] Add Github CI workflows for multi-arch Docker images [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120479707 I opened: https://issues.apache.org/jira/browse/TIKA-4258 to track this on our JIRA. I also opened an issue on infra. -- This is an automated message from the Apache Git Service. To

Re: [PR] Add Github CI workflows for multi-arch Docker images [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120452143 Pinged asf infra on credentials and how to do this for an asf project. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

Re: [PR] Add Github CI workflows for multi-arch Docker images [tika-docker]

2024-05-20 Thread via GitHub
tballison commented on PR #19: URL: https://github.com/apache/tika-docker/pull/19#issuecomment-2120444104 Let me ping infra at asf to see what we need to do to get this working as an action. I _think_ that's the blocker for me. -- This is an automated message from the Apache Git Service.

  1   2   3   4   5   6   7   8   9   10   >