[PR] Bump io.projectreactor:reactor-core from 3.5.10 to 3.5.11 [tika]

2023-10-10 Thread via GitHub


dependabot[bot] opened a new pull request, #1395:
URL: https://github.com/apache/tika/pull/1395

   Bumps 
[io.projectreactor:reactor-core](https://github.com/reactor/reactor-core) from 
3.5.10 to 3.5.11.
   
   Release notes
   Sourced from https://github.com/reactor/reactor-core/releases;>io.projectreactor:reactor-core's
 releases.
   
   v3.5.11
   
   What's Changed
   :warning: Update considerations and deprecations
   
   ensures addCap always returns value with flag by https://github.com/OlegDokuka;>@​OlegDokuka in https://redirect.github.com/reactor/reactor-core/pull/3610;>reactor/reactor-core#3610
   
   :sparkles: New features and improvements
   
   provides extra check for contextualName presence by https://github.com/OlegDokuka;>@​OlegDokuka in https://redirect.github.com/reactor/reactor-core/pull/3611;>reactor/reactor-core#3611
   Handling 1.0.0 of context-propagation by https://github.com/chemicL;>@​chemicL in https://redirect.github.com/reactor/reactor-core/pull/3609;>reactor/reactor-core#3609
   
   :lady_beetle: Bug fixes
   
   ensures that FluxBufferTime uses proper index 
value during onNext check by https://github.com/OlegDokuka;>@​OlegDokuka in https://redirect.github.com/reactor/reactor-core/pull/3614;>reactor/reactor-core#3614
   
   :book: Documentation, Tests and Build
   
   adds CI nightly builds by https://github.com/OlegDokuka;>@​OlegDokuka in https://redirect.github.com/reactor/reactor-core/pull/3581;>reactor/reactor-core#3581
   Bump actions/setup-java from 3.12.0 to 3.13.0 in /.github/workflows by 
https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3585;>reactor/reactor-core#3585
   Bump actions/checkout from 3.1.0 to 4.1.0 in /.github/workflows by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3586;>reactor/reactor-core#3586
   Bump gradle/gradle-build-action from 2.7.0 to 2.9.0 in 
/.github/workflows by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3594;>reactor/reactor-core#3594
   
   :up: Dependency Upgrades
   
   Bump io.spring.nohttp from 0.0.10 to 0.0.11 by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3497;>reactor/reactor-core#3497
   Bump org.junit:junit-bom from 5.9.2 to 5.10.0 by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3556;>reactor/reactor-core#3556
   Bump me.champeau.gradle.japicmp from 0.4.1 to 0.4.2 by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3595;>reactor/reactor-core#3595
   Bump jmhVersion from 1.36 to 1.37 by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3596;>reactor/reactor-core#3596
   Bump byteBuddyVersion from 1.14.5 to 1.14.8 by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3601;>reactor/reactor-core#3601
   Bump com.gradle.enterprise from 3.14.1 to 3.15 by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3602;>reactor/reactor-core#3602
   Bump ch.qos.logback:logback-classic from 1.2.11 to 1.2.12 by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3476;>reactor/reactor-core#3476
   Bump de.undercouch.download from 5.4.0 to 5.5.0 by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3608;>reactor/reactor-core#3608
   Bump com.gradle.enterprise from 3.15 to 3.15.1 by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-core/pull/3613;>reactor/reactor-core#3613
   
   Full Changelog: https://github.com/reactor/reactor-core/compare/v3.5.10...v3.5.11;>https://github.com/reactor/reactor-core/compare/v3.5.10...v3.5.11
   
   
   
   Commits
   
   https://github.com/reactor/reactor-core/commit/c6b7f8778bd398ba38f019f6e43977cafb0a68e5;>c6b7f87
 [release] Prepare and release 3.5.11
   https://github.com/reactor/reactor-core/commit/eb8da030295e1c99455b0e6950c205bf2fdec8d7;>eb8da03
 Merge-ignore 3.4.33 into 3.5.11
   https://github.com/reactor/reactor-core/commit/65bdb3af7cb4f75ddac63242d604711e3df300f8;>65bdb3a
 [release] Next development version 3.4.34-SNAPSHOT
   https://github.com/reactor/reactor-core/commit/ad902b9d13529454b2c43d30d1ddc4ef3a00e5fb;>ad902b9
 [release] Prepare and release 3.4.33
   https://github.com/reactor/reactor-core/commit/759375e40e4146398444ced83bd25909f1e9d6f6;>759375e
 ensures that proper index is used during onNext 
check (https://redirect.github.com/reactor/reactor-core/issues/3614;>#3614)
   https://github.com/reactor/reactor-core/commit/a887af84072827eec23628d53ecf5a4a97db014b;>a887af8
 provides extra check for contextualName presence 

[PR] Bump io.netty:netty-bom from 4.1.99.Final to 4.1.100.Final [tika]

2023-10-10 Thread via GitHub


dependabot[bot] opened a new pull request, #1394:
URL: https://github.com/apache/tika/pull/1394

   Bumps [io.netty:netty-bom](https://github.com/netty/netty) from 4.1.99.Final 
to 4.1.100.Final.
   
   Commits
   
   https://github.com/netty/netty/commit/58df783eb4fc50f95a1061dc4274020d6804caf4;>58df783
 [maven-release-plugin] prepare release netty-4.1.100.Final
   https://github.com/netty/netty/commit/58f75f665aa81a8cbcf6ffa74820042a285c5e61;>58f75f6
 Merge pull request from GHSA-xpw8-rcwv-8f8p
   https://github.com/netty/netty/commit/491144865ad683f43ef1db64256aed8fd84a50be;>4911448
 Do not fail when compressing empty HttpContent (https://redirect.github.com/netty/netty/issues/13655;>#13655)
   https://github.com/netty/netty/commit/caca5e5a1e19a26a3cff3560b79e7ade18398540;>caca5e5
 When read PoolSubpage's variant fields, it should lock on PoolSubpage's head 
...
   https://github.com/netty/netty/commit/d97f2a5606aaccf7494ff29d7229be1349fe746a;>d97f2a5
 Update checkout action to latest version (https://redirect.github.com/netty/netty/issues/13649;>#13649)
   https://github.com/netty/netty/commit/275341f01cbc0e6230c5cc3b47cca3f24611c6da;>275341f
 Fix issue with unrecognized JVM option while running with Java 11 (https://redirect.github.com/netty/netty/issues/13648;>#13648)
   https://github.com/netty/netty/commit/5db037beedca8aa5b6a486fa515cc6af013ced74;>5db037b
 Speedup max direct memory estimation via Unsafe (https://redirect.github.com/netty/netty/issues/13643;>#13643)
   https://github.com/netty/netty/commit/ce5c78cec192e935ff7191d94db06d6abdc27dae;>ce5c78c
 Update actions to the latest version (https://redirect.github.com/netty/netty/issues/13644;>#13644)
   https://github.com/netty/netty/commit/d7a8169f1b9bb4cf7708c066d26490da6be53cce;>d7a8169
 [maven-release-plugin] prepare for next development iteration
   See full diff in https://github.com/netty/netty/compare/netty-4.1.99.Final...netty-4.1.100.Final;>compare
 view
   
   
   
   
   
   [![Dependabot compatibility 
score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=io.netty:netty-bom=maven=4.1.99.Final=4.1.100.Final)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores)
   
   Dependabot will resolve any conflicts with this PR as long as you don't 
alter it yourself. You can also trigger a rebase manually by commenting 
`@dependabot rebase`.
   
   [//]: # (dependabot-automerge-start)
   [//]: # (dependabot-automerge-end)
   
   ---
   
   
   Dependabot commands and options
   
   
   You can trigger Dependabot actions by commenting on this PR:
   - `@dependabot rebase` will rebase this PR
   - `@dependabot recreate` will recreate this PR, overwriting any edits that 
have been made to it
   - `@dependabot merge` will merge this PR after your CI passes on it
   - `@dependabot squash and merge` will squash and merge this PR after your CI 
passes on it
   - `@dependabot cancel merge` will cancel a previously requested merge and 
block automerging
   - `@dependabot reopen` will reopen this PR if it is closed
   - `@dependabot close` will close this PR and stop Dependabot recreating it. 
You can achieve the same result by closing it manually
   - `@dependabot show  ignore conditions` will show all of 
the ignore conditions of the specified dependency
   - `@dependabot ignore this major version` will close this PR and stop 
Dependabot creating any more for this major version (unless you reopen the PR 
or upgrade to it yourself)
   - `@dependabot ignore this minor version` will close this PR and stop 
Dependabot creating any more for this minor version (unless you reopen the PR 
or upgrade to it yourself)
   - `@dependabot ignore this dependency` will close this PR and stop 
Dependabot creating any more for this dependency (unless you reopen the PR or 
upgrade to it yourself)
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@tika.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[PR] Bump com.google.guava:guava from 32.1.2-jre to 32.1.3-jre [tika]

2023-10-10 Thread via GitHub


dependabot[bot] opened a new pull request, #1393:
URL: https://github.com/apache/tika/pull/1393

   Bumps [com.google.guava:guava](https://github.com/google/guava) from 
32.1.2-jre to 32.1.3-jre.
   
   Release notes
   Sourced from https://github.com/google/guava/releases;>com.google.guava:guava's 
releases.
   
   32.1.3
   Maven
   dependency
 groupIdcom.google.guava/groupId
 artifactIdguava/artifactId
 version32.1.3-jre/version
 !-- or, for Android: --
 version32.1.3-android/version
   /dependency
   
   Jar files
   
   https://repo1.maven.org/maven2/com/google/guava/guava/32.1.3-jre/guava-32.1.3-jre.jar;>32.1.3-jre.jar
   https://repo1.maven.org/maven2/com/google/guava/guava/32.1.3-android/guava-32.1.3-android.jar;>32.1.3-android.jar
   
   Guava requires https://github.com/google/guava/wiki/UseGuavaInYourBuild#what-about-guavas-own-dependencies;>one
 runtime dependency, which you can download here:
   
   https://repo1.maven.org/maven2/com/google/guava/failureaccess/1.0.1/failureaccess-1.0.1.jar;>failureaccess-1.0.1.jar
   
   Javadoc
   
   http://guava.dev/releases/32.1.3-jre/api/docs/;>32.1.3-jre
   http://guava.dev/releases/32.1.3-android/api/docs/;>32.1.3-android
   
   JDiff
   
   http://guava.dev/releases/32.1.3-jre/api/diffs/;>32.1.3-jre vs. 
32.1.2-jre
   http://guava.dev/releases/32.1.3-android/api/diffs/;>32.1.3-android vs. 
32.1.2-android
   http://guava.dev/releases/32.1.3-android/api/androiddiffs/;>32.1.3-android
 vs. 32.1.3-jre
   
   Changelog
   
   Changed Gradle Metadata to include dependency versions directly. This 
may address https://redirect.github.com/google/guava/issues/6657;>Could not 
find some-dependency errors that some users have 
reported (which might be a result of users' excluding 
guava-parent). (c6d35cf1a5)
   collect: Changed 
Multisets.unmodifiableMultiset(set).removeIf(predicate) to throw 
an exception always, even if nothing matches predicate. 
(61dbccfda3)
   graph: Fixed the behavior of 
Graph/ValueGraph views for a node when that node is 
removed from the graph. (950799691c)
   io: Fixed Files.createTempDir and 
FileBackedOutputStream under https://redirect.github.com/google/guava/issues/6634;>Windows 
services, a rare use case. (The fix actually covers only Java 9+ 
because Java 8 would require an additional approach. Let us know if you need 
support under Java 8.) (f87f68cd3e)
   net: Made MediaType.parse allow and skip over 
whitespace around the / and = separator tokens in 
addition to the ; separator, for which it was already being 
allowed. (2786f83291)
   util.concurrent: Tweaked Futures.getChecked 
constructor-selection behavior: The method continues to prefer to call 
constructors with a String parameter, but now it breaks ties based 
on whether the constructor has a Throwable parameter. Beyond that, 
the choice of constructor remains undefined. (For this and other reasons, we 
discourage the use of getChecked.) (59cfb2267a)
   
   
   
   
   Commits
   
   See full diff in https://github.com/google/guava/commits;>compare view
   
   
   
   
   
   [![Dependabot compatibility 
score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=com.google.guava:guava=maven=32.1.2-jre=32.1.3-jre)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores)
   
   Dependabot will resolve any conflicts with this PR as long as you don't 
alter it yourself. You can also trigger a rebase manually by commenting 
`@dependabot rebase`.
   
   [//]: # (dependabot-automerge-start)
   [//]: # (dependabot-automerge-end)
   
   ---
   
   
   Dependabot commands and options
   
   
   You can trigger Dependabot actions by commenting on this PR:
   - `@dependabot rebase` will rebase this PR
   - `@dependabot recreate` will recreate this PR, overwriting any edits that 
have been made to it
   - `@dependabot merge` will merge this PR after your CI passes on it
   - `@dependabot squash and merge` will squash and merge this PR after your CI 
passes on it
   - `@dependabot cancel merge` will cancel a previously requested merge and 
block automerging
   - `@dependabot reopen` will reopen this PR if it is closed
   - `@dependabot close` will close this PR and stop Dependabot recreating it. 
You can achieve the same result by closing it manually
   - `@dependabot show  ignore conditions` will show all of 
the ignore conditions of the specified dependency
   - `@dependabot ignore this major version` will close this PR and stop 
Dependabot creating any more for this major version (unless you reopen the PR 
or upgrade to it yourself)
   - `@dependabot ignore this minor version` will close this PR and stop 
Dependabot creating any more for this minor version (unless you reopen the PR 
or upgrade to it yourself)
   - `@dependabot ignore this dependency` will close this PR and stop 
Dependabot creating any more for this dependency (unless you reopen the PR or 

[PR] Bump reactor.netty.version from 1.1.11 to 1.1.12 [tika]

2023-10-10 Thread via GitHub


dependabot[bot] opened a new pull request, #1392:
URL: https://github.com/apache/tika/pull/1392

   Bumps `reactor.netty.version` from 1.1.11 to 1.1.12.
   Updates `io.projectreactor.netty:reactor-netty-core` from 1.1.11 to 1.1.12
   
   Release notes
   Sourced from https://github.com/reactor/reactor-netty/releases;>io.projectreactor.netty:reactor-netty-core's
 releases.
   
   v1.1.12
   
   Reactor Netty 1.1.12 is part of 
2022.0.12 Release Train.
   This is a recommended update for all Reactor Netty 1.1.x 
users.
   What's Changed
   :sparkles: New features and improvements
   
   Depend on Reactor Core v3.5.11 by https://github.com/OlegDokuka;>@​OlegDokuka in 
d68cea13c00dfa149f96e7029e1dcb3ca8a2cf7e, see https://github.com/reactor/reactor-core/releases/tag/v3.5.11;>release 
notes
   Depend on Netty v4.1.100.Final by https://github.com/OlegDokuka;>@​OlegDokuka in 
682358364a24faa2e6218ce43b89bf67e812b724
   Depend on netty-incubator-transport-native-io_uring 
v0.0.23.Final by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-netty/issues/2915;>#2915
   Depend on Netty QUIC Codec v0.0.51.Final by https://github.com/violetagg;>@​violetagg in https://redirect.github.com/reactor/reactor-netty/issues/2921;>#2921
   
   :lady_beetle: Bug fixes
   
   Request for read interest when channel is unwritable while 
HttpClient.send(Mono) is used by https://github.com/pderop;>@​pderop in https://redirect.github.com/reactor/reactor-netty/issues/2864;>#2864, 
https://redirect.github.com/reactor/reactor-netty/issues/2902;>#2902, 
https://redirect.github.com/reactor/reactor-netty/issues/2903;>#2903
   Ensure HttpClient#mapConnect() / #doOnRequestError() are 
called when HttpClient#headersWhen() is used by https://github.com/violetagg;>@​violetagg in https://redirect.github.com/reactor/reactor-netty/issues/2912;>#2912
   Ensure Expect header is handled correctly with 
HTTP/2 by https://github.com/violetagg;>@​violetagg in https://redirect.github.com/reactor/reactor-netty/issues/2916;>#2916
   Ensure connection closed when non 200 status for request 
with Expect header and incoming data not completed by https://github.com/violetagg;>@​violetagg in https://redirect.github.com/reactor/reactor-netty/issues/2919;>#2919
   
   :book: Documentation, Tests and Build
   
   Add GraalVM smoke tests by https://github.com/violetagg;>@​violetagg in https://redirect.github.com/reactor/reactor-netty/issues/2899;>#2899
   Add nightly build with Reactor Core 3.6 
SNAPSHOTs by https://github.com/violetagg;>@​violetagg in https://redirect.github.com/reactor/reactor-netty/issues/2911;>#2911
   Add smoke tests for Reactor Core feature 
Hooks.enableAutomaticContextPropagation() by https://github.com/violetagg;>@​violetagg in https://redirect.github.com/reactor/reactor-netty/issues/2922;>#2922, 
https://redirect.github.com/reactor/reactor-netty/issues/2925;>#2925, 
https://redirect.github.com/reactor/reactor-netty/issues/2926;>#2926
   
   :up: Build/Test Dependency Upgrades
   
   Bump hoverfly-java-junit5 to version 0.15.0 by 
https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-netty/issues/2898;>#2898
   Bump org.gradle.test-retry to version 1.5.6 by 
https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-netty/issues/2918;>#2918
   Bump build-info-extractor-gradle to version 
4.33.6 by https://github.com/dependabot;>@​dependabot in https://redirect.github.com/reactor/reactor-netty/issues/2923;>#2923
   Bump Gradle to version 7.6.3 by https://github.com/violetagg;>@​violetagg in https://redirect.github.com/reactor/reactor-netty/issues/2924;>#2924
   Bump netty-tcnative-boringssl-static to version 
v2.0.62.Final by https://github.com/violetagg;>@​violetagg in https://redirect.github.com/reactor/reactor-netty/issues/2929;>#2929
   
   Full Changelog: https://github.com/reactor/reactor-netty/compare/v1.1.11...v1.1.12;>https://github.com/reactor/reactor-netty/compare/v1.1.11...v1.1.12
   
   
   
   Commits
   
   https://github.com/reactor/reactor-netty/commit/d68cea13c00dfa149f96e7029e1dcb3ca8a2cf7e;>d68cea1
 [release] Prepare and release 1.1.12
   https://github.com/reactor/reactor-netty/commit/682358364a24faa2e6218ce43b89bf67e812b724;>6823583
 Merge-ignore 1.0.39 into 1.0.3
   https://github.com/reactor/reactor-netty/commit/04db10680091b83f3ba26c16731adad0f0a0a1d2;>04db106
 updates compatibility version
   https://github.com/reactor/reactor-netty/commit/5f964ebbcfc5d723699d996ad0486a39db014df9;>5f964eb
 [release] Back to snapshots, next is 1.0.39-SNAPSHOT
   https://github.com/reactor/reactor-netty/commit/8bddf51c0836bff0a7b4645d0b285b243b9845d9;>8bddf51
 [release] Prepare and release 1.0.38
   https://github.com/reactor/reactor-netty/commit/43268793cceee228a98572359a80d9d008deeaeb;>4326879
 Merge-ignore 1.0.38 into 1.1.12
   https://github.com/reactor/reactor-netty/commit/a1c7978dd6f6b47dcc54637a69f9543669757440;>a1c7978
 

[jira] [Updated] (TIKA-4153) Update RFC822 detection, again

2023-10-10 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-4153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated TIKA-4153:
--
Description: 
On the user list, Kashif Khan supplied the following example of a file that 
does not start with RFC822 fields, is detected as RFC822...leading to loss of 
text.

{noformat}
Some text here 1.
Some text here 2.
Some text here 3.
Original Message
From: some_m...@abc.com 
Sent: Thursday, October 31, 2019 9:52 AM
To: Some person, (The XYZ group)
Subject: RE: Mr. Random person phone call: MESSAGE
Hi,
I am available now to receive the call.
Some text here 4.
Some text here 5.**Some text here 6.
{noformat}

>From what I can tell, we last modified the rfc822 detection on TIKA-4125, with 
>the major minShouldMatch refactoring on TIKA-3153.

  was:
On the user list, Kashif Khan supplied the following example of a file that 
does not start with RFC822 fields, is detected as RFC822...leading to loss of 
text.

{noformat}
Some text here 1.
Some text here 2.
Some text here 3.
Original Message
From: some_m...@abc.com 
Sent: Thursday, October 31, 2019 9:52 AM
To: Some person, (The XYZ group)
Subject: RE: Mr. Random person phone call: MESSAGE
Hi,
I am available now to receive the call.
Some text here 4.
Some text here 5.**Some text here 6.
{noformat}

>From what I can tell, we last modified the rfc822 detection on TIKA-3153.


> Update RFC822 detection, again
> --
>
> Key: TIKA-4153
> URL: https://issues.apache.org/jira/browse/TIKA-4153
> Project: Tika
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Minor
>
> On the user list, Kashif Khan supplied the following example of a file that 
> does not start with RFC822 fields, is detected as RFC822...leading to loss of 
> text.
> {noformat}
> Some text here 1.
> Some text here 2.
> Some text here 3.
> Original Message
> From: some_m...@abc.com 
> Sent: Thursday, October 31, 2019 9:52 AM
> To: Some person, (The XYZ group)
> Subject: RE: Mr. Random person phone call: MESSAGE
> Hi,
> I am available now to receive the call.
> Some text here 4.
> Some text here 5.**Some text here 6.
> {noformat}
> From what I can tell, we last modified the rfc822 detection on TIKA-4125, 
> with the major minShouldMatch refactoring on TIKA-3153.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (TIKA-4153) Update RFC822 detection, again

2023-10-10 Thread Tim Allison (Jira)
Tim Allison created TIKA-4153:
-

 Summary: Update RFC822 detection, again
 Key: TIKA-4153
 URL: https://issues.apache.org/jira/browse/TIKA-4153
 Project: Tika
  Issue Type: Task
Reporter: Tim Allison


On the user list, Kashif Khan supplied the following example of a file that 
does not start with RFC822 fields, is detected as RFC822...leading to loss of 
text.

{noformat}
Some text here 1.
Some text here 2.
Some text here 3.
Original Message
From: some_m...@abc.com 
Sent: Thursday, October 31, 2019 9:52 AM
To: Some person, (The XYZ group)
Subject: RE: Mr. Random person phone call: MESSAGE
Hi,
I am available now to receive the call.
Some text here 4.
Some text here 5.**Some text here 6.
{noformat}

>From what I can tell, we last modified the rfc822 detection on TIKA-3153.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (TIKA-3948) Migrate to jakarta in Tika 3.x

2023-10-10 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17773768#comment-17773768
 ] 

Tim Allison commented on TIKA-3948:
---

[~desruisseaux], yep, build and unit tests work with our TIKA-3948 branch!  
Thank you!

> Migrate to jakarta in Tika 3.x
> --
>
> Key: TIKA-3948
> URL: https://issues.apache.org/jira/browse/TIKA-3948
> Project: Tika
>  Issue Type: Task
>Reporter: Tim Allison
>Priority: Major
>  Labels: tika-3x
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Tika parser not parsing email content

2023-10-10 Thread Kashif Khan
Hi team,
I have been working on the Tika parser to parse a few text files and it has
been working fine until I have come to an issue where it is not able to
parse the text file if it contains 'email/message contents'.
This means if the text file contains any of the terms like 'From: ', 'To:
', or 'Sent: ', it will fail to parse the text correctly.
In my case, the parser is deleting the lines of text files and only a
single line remains out of 40 lines.

I am sharing a snippet of the text file for an example:

>
> *Some text here 1.*
> *Some text here 2.*
> *Some text here 3.*
> *Original Message-*
> *From: some_m...@abc.com *
> *Sent: Thursday, October 31, 2019 9:52 AM*
> *To: Some person, (The XYZ group)*
> *Subject: RE: Mr. Random person phone call: MESSAGE*
> *Hi,*
> *I am available now to receive the call.*
> *Some text here 4.*
> *Some text here 5.**Some text here 6.*


The Tika parser is reducing the above text to only one line as below:

> *Subject: RE: Mr. Random person phone call: MESSAGE*


Note that this is happening in the version later than Tika 1.19, with 1.19
is parsing the contents perfectly fine.

Could you please help me to understand the issue or please suggest some
path forward to this?
This will be very helpful.

Thanks in advance.
-Kashif


Re: [PR] Bump org.eclipse.jetty:jetty-bom from 9.4.52.v20230823 to 9.4.53.v20231009 [tika]

2023-10-10 Thread via GitHub


THausherr merged PR #1391:
URL: https://github.com/apache/tika/pull/1391


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@tika.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org