Re: How to deal with the recursive content in Tika 2
That helped with the recursive parser test.

Thanks, Sergey

On Thu, Apr 28, 2022 at 4:37 PM Sergey Beryozkin wrote:

> Great, will give it a try asap
>
> Cheers, Sergey
>
> On Thu, Apr 28, 2022 at 4:22 PM Tim Allison wrote:
>
>> Give this a try:
>>
>> https://github.com/apache/tika/blob/main/tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java#L60
>>
>> On Thu, Apr 28, 2022 at 11:12 AM Sergey Beryozkin wrote:
>> >
>> > Hi Tim, All
>> >
>> > We have a pending issue in Quarkus Tika to upgrade to Tika 2.
>> > One of the problems is that according to a user's comment the recursive
>> > content is treated somehow differently in Tika2, specifically, this code:
>> >
>> > https://github.com/quarkiverse/quarkus-tika/blob/main/runtime/src/main/java/io/quarkus/tika/TikaParser.java#L95
>> >
>> > attempts to get a collection of the parsed outer and embedded documents by
>> > accessing them as
>> >
>> > metadata.get(AbstractRecursiveParserWrapperHandler.TIKA_CONTENT);
>> >
>> > What is the equivalent way to achieve the same with Tika 2 ?
>> >
>> > Thanks, Sergey
Re: How to deal with the recursive content in Tika 2
Great, will give it a try asap.

Cheers, Sergey
Re: How to deal with the recursive content in Tika 2
Give this a try:

https://github.com/apache/tika/blob/main/tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java#L60
How to deal with the recursive content in Tika 2
Hi Tim, All

We have a pending issue in Quarkus Tika to upgrade to Tika 2. One of the problems is that, according to a user's comment, recursive content is treated differently in Tika 2. Specifically, this code:

https://github.com/quarkiverse/quarkus-tika/blob/main/runtime/src/main/java/io/quarkus/tika/TikaParser.java#L95

attempts to get a collection of the parsed outer and embedded documents by accessing them as

metadata.get(AbstractRecursiveParserWrapperHandler.TIKA_CONTENT);

What is the equivalent way to achieve the same with Tika 2?

Thanks, Sergey
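
For readers landing on this thread, here is a minimal sketch of what the suggested change might look like. It assumes that the linked line in TikaCoreProperties is the TIKA_CONTENT property (the thread itself only gives the link), and that Tika 2.x with tika-core and a parsers module is on the classpath. The class name RecursiveContentExample and its two helper methods are hypothetical and are not taken from the Quarkus Tika code.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.List;

import org.apache.tika.exception.TikaException;
import org.apache.tika.metadata.Metadata;
import org.apache.tika.metadata.TikaCoreProperties;
import org.apache.tika.parser.AutoDetectParser;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.parser.RecursiveParserWrapper;
import org.apache.tika.sax.BasicContentHandlerFactory;
import org.apache.tika.sax.RecursiveParserWrapperHandler;
import org.xml.sax.SAXException;

// Hypothetical helper class, for illustration only.
final class RecursiveContentExample {

    // Parses the stream and returns one Metadata object for the outer
    // document followed by one per embedded document.
    static List<Metadata> parseRecursively(InputStream stream)
            throws IOException, SAXException, TikaException {
        RecursiveParserWrapper wrapper =
                new RecursiveParserWrapper(new AutoDetectParser());
        // Plain-text handlers; -1 means no write limit.
        RecursiveParserWrapperHandler handler = new RecursiveParserWrapperHandler(
                new BasicContentHandlerFactory(
                        BasicContentHandlerFactory.HANDLER_TYPE.TEXT, -1));
        wrapper.parse(stream, handler, new Metadata(), new ParseContext());
        return handler.getMetadataList();
    }

    // Assumed Tika 2 replacement for
    // metadata.get(AbstractRecursiveParserWrapperHandler.TIKA_CONTENT).
    // May return null when no text was extracted for that document.
    static String contentOf(Metadata metadata) {
        return metadata.get(TikaCoreProperties.TIKA_CONTENT);
    }
}
```

Under these assumptions, a caller would iterate over the list returned by parseRecursively and call contentOf on each entry, treating a null result as "no extracted text".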