Hi,
* Nico Golde <[email protected]> [2012-03-01 14:20]:
* Jörg Sommer <[email protected]> [2012-02-28 15:08]:
[...]
while the feed at http://datenkanal.org/feeds/atom10.xml contains more
than one entry, newsbeuter shows only one:

% grep -cF '<entry>' atom10.xml
7

The headline from newsbeuter:

newsbeuter 2.5 - Artikel im Feed 'Datenkanal' (0 ungelesen, 1 gesamt) - 
http://datenkanal.org/feeds/atom10.xml

This might be because the XML is broken. In the second entry there "<audio
controls preload="none">" is wrong. I'm not sure if that is the reason why
newsbeuter stops after the second entry.

Andreas, without reading the source of your pull parser, could that be a
reason?

The XML of RSS feeds is parsed with libxml2, so if the XML is seriously broken like in this case, libxml2 will simply stop parsing. I used libxml2 that it will accept even slightly broken XML files, but this particular case is just too much for the XML parser to cope with.

The pull parser is only used for parsing the HTML content of individual items.

Cheers,
Andreas



--
To UNSUBSCRIBE, email to [email protected]
with a subject of "unsubscribe". Trouble? Contact [email protected]

Reply via email to