Hello all. I'm just getting up to speed on Cocoon and finding it all
quite fascinating. No doubt I'll have many more questions in the
coming months.
Anyway, today's question is about XPath. I'm trying to customize the
HTML Generator 'scraper' example to extract bits from a given HTML
page. I have a rudimentary knowledge of XPath syntax, but evidently
not enough, because I'm stuck.
Here's a sample of the HTML to be scraped:
-----------
<table width="100%" border="0">
<bis t="pr" f="p020326.htm">
<tr>
<td nowrap align="right" valign="top">
26 Mar 2002
</td>
<td valign="top">
Financial Stability Forum holds its seventh meeting
(<a href="p020326.htm">Read</a>)
</td>
</tr>
</bis>
<bis t="pr" f="p020318.htm">
<tr>
<td nowrap align="right" valign="top">
[snip]
</td>
</tr>
</bis>
Etc.
----------
The <bis ...> elements are used by another, non-XML process, but it
seemed to me it should be a no-brainer to write an XPath expression
that would select the content between each <bis> and </bis> so it can
be transformed.
However, it isn't. Can anyone point me in the right general direction
here?
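
To make the goal concrete, here is a minimal sketch of the selection I'm
after, written in Python rather than Cocoon. It rests on assumptions: the
sample has been hand-tidied into well-formed XML (nowrap given a value,
the table closed), and the <bis> elements are still present after
tidying. I have not verified that Cocoon's HTML generator preserves them.
The expression is roughly //bis[@t='pr'], restricted to the XPath subset
that Python's ElementTree supports.

```python
import xml.etree.ElementTree as ET

# Hand-tidied copy of the sample above (an assumption, not Cocoon output):
# nowrap has a value and </table> is added so the text parses as XML.
SAMPLE = """
<table width="100%" border="0">
<bis t="pr" f="p020326.htm">
<tr>
<td nowrap="nowrap" align="right" valign="top">
26 Mar 2002
</td>
<td valign="top">
Financial Stability Forum holds its seventh meeting
(<a href="p020326.htm">Read</a>)
</td>
</tr>
</bis>
<bis t="pr" f="p020318.htm">
<tr>
<td nowrap="nowrap" align="right" valign="top">
[snip]
</td>
</tr>
</bis>
</table>
"""

root = ET.fromstring(SAMPLE)

# Select every <bis> element whose t attribute is "pr".
items = root.findall(".//bis[@t='pr']")

results = []
for bis in items:
    # The first cell of the row inside each <bis> holds the date.
    cells = bis.findall("./tr/td")
    date = " ".join("".join(cells[0].itertext()).split())
    results.append((bis.get("f"), date))

for filename, date in results:
    print(filename, date)
```

If an expression like this finds nothing in the real pipeline, one thing
worth checking is whether the <bis> elements are in the generated XML at
all before the XPath is applied.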
-----------------------------
Brent Eades, Almonte, Ontario
http://www.almonte.com
http://www.bankofcanada.ca