Franck Delahaye wrote:
> 
> I'd like to index the full text of the xml documents I want to store in
> dbXML.
> To be more precise, what I call the "full text" is any text inside a tag.
> The goal is to index Xpath queries like:
> /document-root-tag/a-content-tag//[contains(text(), "a text criteria")]
> where "a-content-tag" is a "highly" structured tag (nesting level of a
> descendant may be up to 10).
> is it possible to do that ?

Franck,

This shouldn't be too difficult. Basically, you could pull a Collection
document-by-document into DOM document nodes, then using a TreeWalker
(there are demos for this in Xerces) you walk the entire tree filtering
for Text nodes. And write a method that for any provided node the path
to the node is returned, in the form of a series of element names with
the correct delimiters to make it an XPath query. No barrier I can see
except the time and effort of doing it.

Murray

...........................................................................
Murray Altheim                         <mailto:murray.altheim&#x40;sun.com>
XML Technology Center, Java and XML Software
Sun Microsystems, Inc., MS MPK17-102, 1601 Willow Rd., Menlo Park, CA 94025

               Rally against the evils of iceburg lettuce! 
            Grab a kitchen knife and join the Balsamic Jihad!

Reply via email to