Hello,

which problems did you encounter? This problem should be solvable using
a small XQuery, basically putting what you describe in natural languages
in XQuery so our processor understands it.

I don't think it would make any sense to add such a specific format.
There are simply way to many possible combinations - You want archive
files extracted, others might want not to do this. In the end we would
end up with a very complex definition language - And what's the point if
we already have a standardized query language like XQuery, which can
achieve the same thing?

Cheers
Dirk

On 11/25/2015 05:38 PM, E. Wray Johnson wrote:
> Here is what I want to do: For a given folder and all its subfolders
> on my physical dive, mirror its contents including the contents of
> archives, parsing xml, json,html, text, etc. using their respective
> parser skipping invalids, and adding all other files as raw. I want
> archive files (*.zip, *.doxc) to be added as raw, however I want the
> text inside archive files like docx (ms-word) to be indexed and any
> files in the archives files that match a filter to be indexed.
>
> Note: It would be nice if there was a single db:add method that
> allowed me to specify a map of filters to parsers with options, where
> all files that do not match a filter (or are invalid) will be
> optionally added as raw.

-- 
Dirk Kirsten, BaseX GmbH, http://basexgmbh.de
|-- Firmensitz: Blarerstrasse 56, 78462 Konstanz
|-- Registergericht Freiburg, HRB: 708285, Geschäftsführer:
|   Dr. Christian Grün, Dr. Alexander Holupirek, Michael Seiferle
`-- Phone: 0049 7531 28 28 676, Fax: 0049 7531 20 05 22

Reply via email to