Thanks Noble, I gave those examples a try. If I use <field column="body" xpath="/book/body/chapter/p" /> I only get the text from the last <p> element, not from all elements.
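
To make the target concrete, here is a minimal sketch of the result I'm after, written outside of Solr with Python's standard xml.etree.ElementTree module. It is only an illustration of the desired output, not how XPathEntityProcessor works internally; the helper name paragraph_texts and the BOOK_XML sample are made up for this example.

```python
import xml.etree.ElementTree as ET

# Sample document, same shape as the one in the original question.
BOOK_XML = """<book>
  <author>Joe Smith</author>
  <title>World Atlas</title>
  <body>
    <chapter>
      <p>Content I want is here</p>
      <p>More content I want is here.</p>
      <p>Still more content here.</p>
    </chapter>
  </body>
</book>"""


def paragraph_texts(xml_text, path="./body/chapter/p"):
    """Return the markup-free text of every element matched by path.

    Hypothetical helper: the path is relative to the <book> root element.
    """
    root = ET.fromstring(xml_text)
    # itertext() walks nested elements too, so inline markup inside a <p>
    # is dropped while its text is kept.
    return ["".join(p.itertext()).strip() for p in root.findall(path)]


texts = paragraph_texts(BOOK_XML)   # one entry per <p>, not just the last
body = " ".join(texts)              # single string for a "body" column
print(body)
```

What I get today corresponds to texts[-1] alone; what I want is the whole list (or the joined string).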

If I use <field column="body" xpath="/book/body/chapter" flatten="true"/> or <field column="body" xpath="/book/body/chapter/" flatten="true"/> I don't get back anything for the body column.

So the first example is close, but it only gets the text for the last <p> element. If I could get all <p> elements at the same level, that would be what I need. The double-slash (/book/body/chapter//p) doesn't seem to be supported.

Thanks,
-Jay

2009/7/1 Noble Paul നോബിള് नोब्ळ् <noble.p...@corp.aol.com>

> complete xpath is not supported
>
> /book/body/chapter/p
>
> should work.
>
> if you wish all the text under <chapter> irrespective of nesting, tag
> names use this
> <field column="body" xpath="/book/body/chapter" flatten="true"/>
>
> On Thu, Jul 2, 2009 at 5:31 AM, Jay Hill<jayallenh...@gmail.com> wrote:
> > I'm using the XPathEntityProcessor to parse an xml structure that looks
> > like this:
> >
> > <book>
> >   <author>Joe Smith</author>
> >   <title>World Atlas</title>
> >   <body>
> >     <chapter>
> >       <p>Content I want is here</p>
> >       <p>More content I want is here.</p>
> >       <p>Still more content here.</p>
> >     </chapter>
> >   </body>
> > </book>
> >
> > The author and title parse out fine:
> >
> > <field column="title" xpath="/book/title"/>
> > <field column="author" xpath="/book/author"/>
> >
> > But I can't get at the data inside the <p> tags. I want to get all
> > non-markup text inside the body tag with something like this:
> >
> > <field column="body" xpath="/book/body/chapter//p"/>
> >
> > but that is not supported.
> >
> > Does anyone know of a way that I can get the content within the <p> tags
> > without the markup?
> >
> > Thanks,
> > -Jay
>
> --
> -----------------------------------------------------
> Noble Paul | Principal Engineer| AOL | http://aol.com