Re: How to deal with the recursive content in Tika 2

2022-04-29 Thread Sergey Beryozkin
That helped with the recursive parser test

Thanks, Sergey

On Thu, Apr 28, 2022 at 4:37 PM Sergey Beryozkin 
wrote:

> Great, will give it a try asap
>
> Cheers, Serget
>
> On Thu, Apr 28, 2022 at 4:22 PM Tim Allison  wrote:
>
>> Give this a try:
>>
>> https://github.com/apache/tika/blob/main/tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java#L60
>>
>> On Thu, Apr 28, 2022 at 11:12 AM Sergey Beryozkin 
>> wrote:
>> >
>> > Hi Tim, All
>> >
>> > We have a pending issue in Quarkus Tika to upgrade to Tika 2.
>> > One of the problems is that according to a user's comment the recursive
>> > content is treated somehow differently in Tika2, specifically, this
>> code:
>> >
>> >
>> https://github.com/quarkiverse/quarkus-tika/blob/main/runtime/src/main/java/io/quarkus/tika/TikaParser.java#L95
>> >
>> > attempts to get a collection of the parsed outer and embedded documents
>> by
>> > accessing them as
>> >
>> > metadata.get(AbstractRecursiveParserWrapperHandler.TIKA_CONTENT);
>> >
>> > What is the equivalent way to achieve the same with Tika 2 ?
>> >
>> > Thanks, Sergey
>>
>


Re: How to deal with the recursive content in Tika 2

2022-04-28 Thread Sergey Beryozkin
Great, will give it a try asap

Cheers, Serget

On Thu, Apr 28, 2022 at 4:22 PM Tim Allison  wrote:

> Give this a try:
>
> https://github.com/apache/tika/blob/main/tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java#L60
>
> On Thu, Apr 28, 2022 at 11:12 AM Sergey Beryozkin 
> wrote:
> >
> > Hi Tim, All
> >
> > We have a pending issue in Quarkus Tika to upgrade to Tika 2.
> > One of the problems is that according to a user's comment the recursive
> > content is treated somehow differently in Tika2, specifically, this code:
> >
> >
> https://github.com/quarkiverse/quarkus-tika/blob/main/runtime/src/main/java/io/quarkus/tika/TikaParser.java#L95
> >
> > attempts to get a collection of the parsed outer and embedded documents
> by
> > accessing them as
> >
> > metadata.get(AbstractRecursiveParserWrapperHandler.TIKA_CONTENT);
> >
> > What is the equivalent way to achieve the same with Tika 2 ?
> >
> > Thanks, Sergey
>


Re: How to deal with the recursive content in Tika 2

2022-04-28 Thread Tim Allison
Give this a try:
https://github.com/apache/tika/blob/main/tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java#L60

On Thu, Apr 28, 2022 at 11:12 AM Sergey Beryozkin  wrote:
>
> Hi Tim, All
>
> We have a pending issue in Quarkus Tika to upgrade to Tika 2.
> One of the problems is that according to a user's comment the recursive
> content is treated somehow differently in Tika2, specifically, this code:
>
> https://github.com/quarkiverse/quarkus-tika/blob/main/runtime/src/main/java/io/quarkus/tika/TikaParser.java#L95
>
> attempts to get a collection of the parsed outer and embedded documents by
> accessing them as
>
> metadata.get(AbstractRecursiveParserWrapperHandler.TIKA_CONTENT);
>
> What is the equivalent way to achieve the same with Tika 2 ?
>
> Thanks, Sergey


How to deal with the recursive content in Tika 2

2022-04-28 Thread Sergey Beryozkin
Hi Tim, All

We have a pending issue in Quarkus Tika to upgrade to Tika 2.
One of the problems is that according to a user's comment the recursive
content is treated somehow differently in Tika2, specifically, this code:

https://github.com/quarkiverse/quarkus-tika/blob/main/runtime/src/main/java/io/quarkus/tika/TikaParser.java#L95

attempts to get a collection of the parsed outer and embedded documents by
accessing them as

metadata.get(AbstractRecursiveParserWrapperHandler.TIKA_CONTENT);

What is the equivalent way to achieve the same with Tika 2 ?

Thanks, Sergey