Re: [basex-talk] html document retrieval runs out of main memory

2022-04-08 Thread Graydon Saunders
Of course there is a way! Thank you; that is indeed helpful. (I continue to be impressed that you can get your brain to hold all of this at one time.) -- Graydon On Fri, Apr 8, 2022 at 6:10 PM Christian Grün wrote: > Which leads to "is there a way to get the type of an item?" >> > > You can u

Re: [basex-talk] html document retrieval runs out of main memory

2022-04-08 Thread Christian Grün
> > Which leads to "is there a way to get the type of an item?" > You can use inspect:type for that [1]. Hope this helps Christian [1] https://docs.basex.org/wiki/Inspection_Module#inspect:type

Re: [basex-talk] html document retrieval runs out of main memory

2022-04-08 Thread Graydon Saunders
Hi Christian - Alas, the data is a client's and confidential. for $remote in $paths let $name as xs:string := file:name($remote) let $target as xs:string := file:resolve-path($name,$targetBase) let $fetched as item() := http:send-request(, $remote)[2] return if ($fetche

Re: [basex-talk] html document retrieval runs out of main memory

2022-04-08 Thread Christian Grün
Hi Graydon, Maybe it’s TagSoup that has problems to convert some specific HTML files to XML. Did you try to write the responses to disk and parse them in a second step? If your input data is not confidential, could you possibly provide us with an example that runs out of the box? Best, Christian

[basex-talk] html document retrieval runs out of main memory

2022-04-07 Thread Graydon Saunders
Hello -- I'm using the basexgui to run (minus some identifying actual values defined previously in the query) (: for each path, retrieve the document :) for $remote in $paths let $name as xs:string := file:name($remote) let $target as xs:string := file:resolve-path($name,$targetBase) let $f