Franck Delahaye wrote: > > I'd like to index the full text of the xml documents I want to store in > dbXML. > To be more precise, what I call the "full text" is any text inside a tag. > The goal is to index Xpath queries like: > /document-root-tag/a-content-tag//[contains(text(), "a text criteria")] > where "a-content-tag" is a "highly" structured tag (nesting level of a > descendant may be up to 10). > is it possible to do that ?
Franck, This shouldn't be too difficult. Basically, you could pull a Collection document-by-document into DOM document nodes, then using a TreeWalker (there are demos for this in Xerces) you walk the entire tree filtering for Text nodes. And write a method that for any provided node the path to the node is returned, in the form of a series of element names with the correct delimiters to make it an XPath query. No barrier I can see except the time and effort of doing it. Murray ........................................................................... Murray Altheim <mailto:murray.altheim@sun.com> XML Technology Center, Java and XML Software Sun Microsystems, Inc., MS MPK17-102, 1601 Willow Rd., Menlo Park, CA 94025 Rally against the evils of iceburg lettuce! Grab a kitchen knife and join the Balsamic Jihad!
