Hi there,
I had a patch submitted as well, that does not need any external library just a xml parser and a xslt processor that is - i think - part of jdk anyway. My solution works with a xsl that convert different rss feed to one format and than I parsed this format with a normal xml parser. Since the xsl support most rss formats may people are interested to take a look and I can may fix smaller issues in case it will be contributed to the svn.

Stefan
Am 21.07.2005 um 17:07 schrieb Chris Mattmann:

Hi Andrzej,

At the time that I was working diligently on this plugin (April/ May), I had done some thorough research into finding what I felt would be the most flexible, reliable way to parse RSS files. The RSS feed parser out of the jakarta-commmons sandbox was what I found, and I stand by it. I understand your concerns however about its reliance on several libraries, but it just
comes with the territory in this case. However, as noted in:
http://issues.apache.org/jira/browse/NUTCH-30  by Kevin Burton, when
feedparser 2.0 comes out, the reliance on the external libraries will be removed, so I think that by adopting the feedparser based plugin right now, we have a clear upgrade path that leads us to the plugin's independence of external libraries, without changing (much of) the underlying source code.

That's my two cents.

Thanks!

Cheers,
  Chris Mattmann



On 7/20/05 11:58 PM, "Andrzej Bialecki" <[EMAIL PROTECTED]> wrote:


[EMAIL PROTECTED] wrote:

Hi,

Does anyone know why Chris Mattmann's RSS plugin (
http://issues.apache.org/jira/browse/NUTCH-30 ) wasn't put in the
repository, and whether there are plans to revive it and include it?


That's probably my fault. I was almost ready to import it, but then
during the final review I hesitated - I'm wary of pulling in so many
dependencies. Then other things got in the way, and I sort of dropped it
for the moment...

If there's no way to parse RSS reliably other than using these dozens of
libraries, so be it. Is this the case?


______________________________________________
Chris A. Mattmann
[EMAIL PROTECTED]
Staff Member
Modeling and Data Management Systems Section (387)
Data Management Systems and Technologies Group

_________________________________________________
Jet Propulsion Laboratory            Pasadena, CA
Office: 171-266B                        Mailstop:  171-246
_______________________________________________________

Disclaimer: The opinions presented within are my own and do not reflect
those of either NASA, JPL, or the California Institute of Technology.








Reply via email to