Re: Non-HTML XPathExtraction

Michele Mostarda Fri, 14 Sep 2012 07:24:56 -0700

Hi Brian,

On 13 September 2012 00:50, Brian Sletten <[email protected]> wrote:


> Greetings.
>
> I am interested in something similar to the XPathExtractor but for regular
> XML documents, not HTML.  Is there such a thing?  It seems that the
> SingleDocumentExtraction/XPathExtractor pair is based on the assumption of
> HTML.  I've been spelunking in the code this afternoon and it appears as if
> it might be possible if you were able to feed a non-HTMLDocumentImpl into
> the process.
>

Currently Any23 doesn't handle generic XML. The XPathExtractor was meant to
extract fragment of well known HTML pages.
For your purpose why don't use just XSLT[1] ?


>
> Before I spend any more time, I thought I'd ask. Congrats on the new home
> and status. This is a tremendously useful infrastructure. Glad to see it
> getting the recognition it deserves.
>

Thanks a lot!


>
> Regards,
>
> Brian


The best,
Mic

[1] http://en.wikipedia.org/wiki/XSLT


-- 
Michele Mostarda
Senior Software Engineer
skype: michele.mostarda
twitter: micmos
mail: [email protected]
site : http://www.michelemostarda.com

Re: Non-HTML XPathExtraction

Reply via email to