Hi there,
I submitted a patch as well. It does not need any external library,
just an XML parser and an XSLT processor, which are (I think) part of
the JDK anyway.
My solution uses an XSL stylesheet that converts the different RSS feed
formats into one format, and I then parse that format with a normal XML
parser.
Since the XSL supports most RSS formats, people may be interested in
taking a look, and I can fix smaller issues if it gets contributed to
SVN.
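
A minimal sketch of the approach described above, using only the XSLT and DOM APIs bundled with the JDK. This is not the submitted patch: the class name RssNormalizer, the inline stylesheet, and the normalized <feed>/<entry> format are hypothetical and only illustrate the idea. The stylesheet here covers just RSS 0.9x/2.0 items (no namespace) and RSS 1.0 items (http://purl.org/rss/1.0/ namespace).

```java
import java.io.StringReader;
import java.io.StringWriter;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

class RssNormalizer {

    // Hypothetical stylesheet: maps every feed item to <entry><title/><link/></entry>.
    static final String XSL =
        "<xsl:stylesheet version='1.0'"
        + " xmlns:xsl='http://www.w3.org/1999/XSL/Transform'"
        + " xmlns:rss1='http://purl.org/rss/1.0/'"
        + " exclude-result-prefixes='rss1'>"
        + "<xsl:template match='/'><feed>"
        // RSS 0.9x / 2.0: elements are in no namespace
        + "<xsl:for-each select='//item'><entry>"
        + "<title><xsl:value-of select='title'/></title>"
        + "<link><xsl:value-of select='link'/></link>"
        + "</entry></xsl:for-each>"
        // RSS 1.0 (RDF): elements are in the RSS 1.0 namespace
        + "<xsl:for-each select='//rss1:item'><entry>"
        + "<title><xsl:value-of select='rss1:title'/></title>"
        + "<link><xsl:value-of select='rss1:link'/></link>"
        + "</entry></xsl:for-each>"
        + "</feed></xsl:template></xsl:stylesheet>";

    /** Returns one {title, link} pair per feed item. */
    static List<String[]> parse(String feedXml) throws Exception {
        // Step 1: convert the incoming feed to the single normalized format.
        Transformer transformer = TransformerFactory.newInstance()
            .newTransformer(new StreamSource(new StringReader(XSL)));
        StringWriter normalized = new StringWriter();
        transformer.transform(new StreamSource(new StringReader(feedXml)),
                              new StreamResult(normalized));

        // Step 2: read the normalized format with a plain XML (DOM) parser.
        Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
            .parse(new InputSource(new StringReader(normalized.toString())));
        NodeList entries = doc.getElementsByTagName("entry");
        List<String[]> result = new ArrayList<>();
        for (int i = 0; i < entries.getLength(); i++) {
            Element entry = (Element) entries.item(i);
            result.add(new String[] {
                entry.getElementsByTagName("title").item(0).getTextContent(),
                entry.getElementsByTagName("link").item(0).getTextContent()
            });
        }
        return result;
    }

    public static void main(String[] args) throws Exception {
        String rss2 = "<rss version='2.0'><channel><title>Demo</title>"
            + "<item><title>A</title><link>http://example.com/a</link></item>"
            + "</channel></rss>";
        for (String[] e : parse(rss2)) {
            System.out.println(e[0] + " -> " + e[1]);
        }
    }
}
```

Supporting another feed dialect in a design like this would mean adding a further for-each branch to the stylesheet; the Java side only ever sees the normalized format.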
Stefan
On 21.07.2005 at 17:07, Chris Mattmann wrote:
Hi Andrzej,
At the time that I was working diligently on this plugin (April/May), I
had done some thorough research into finding what I felt would be the
most flexible, reliable way to parse RSS files. The RSS feed parser out
of the jakarta-commons sandbox was what I found, and I stand by it. I
understand your concerns about its reliance on several libraries, but it
just comes with the territory in this case. However, as noted in
http://issues.apache.org/jira/browse/NUTCH-30 by Kevin Burton, when
feedparser 2.0 comes out, the reliance on the external libraries will be
removed, so I think that by adopting the feedparser-based plugin right
now, we have a clear upgrade path that leads us to the plugin's
independence of external libraries, without changing (much of) the
underlying source code.
That's my two cents.
Thanks!
Cheers,
Chris Mattmann
On 7/20/05 11:58 PM, "Andrzej Bialecki" <[EMAIL PROTECTED]> wrote:
[EMAIL PROTECTED] wrote:
Hi,
Does anyone know why Chris Mattmann's RSS plugin (
http://issues.apache.org/jira/browse/NUTCH-30 ) wasn't put in the
repository, and whether there are plans to revive it and include it?
That's probably my fault. I was almost ready to import it, but then
during the final review I hesitated - I'm wary of pulling in so many
dependencies. Then other things got in the way, and I sort of dropped it
for the moment...
If there's no way to parse RSS reliably other than using these dozens of
libraries, so be it. Is this the case?
______________________________________________
Chris A. Mattmann
[EMAIL PROTECTED]
Staff Member
Modeling and Data Management Systems Section (387)
Data Management Systems and Technologies Group
_________________________________________________
Jet Propulsion Laboratory Pasadena, CA
Office: 171-266B Mailstop: 171-246
_______________________________________________________
Disclaimer: The opinions presented within are my own and do not reflect
those of either NASA, JPL, or the California Institute of Technology.