2011/7/4 Jörn Kottmann <[email protected]>:
> Do you know how to filter tags like this one: {{date|November 13, 2004}} ?
>
> The current implementation just turns it into {{date}}, but such tags must
> either be replaced by the date or just be removed.
Yes, you should try to add support for the "date" template by writing
a handler that can extract the interesting part, "November 13, 2004",
and put it into the text buffer. I knew about this bug but did not find
the time to investigate, sorry.
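
To make the idea concrete, here is a minimal sketch in Python of what such a
handler could do. It is not the project's code: the names DATE_TEMPLATE and
expand_date_templates are hypothetical, and it assumes the template always has
the simple form {{date|<text>}} with no nested templates.

```python
import re

# Hypothetical pattern: matches {{date|<text>}} where <text> contains
# no braces or pipes. Nested templates are deliberately not handled.
DATE_TEMPLATE = re.compile(r"\{\{\s*date\s*\|([^{}|]*)\}\}", re.IGNORECASE)


def expand_date_templates(markup: str) -> str:
    """Replace each {{date|...}} template with its argument text."""
    return DATE_TEMPLATE.sub(lambda m: m.group(1).strip(), markup)


if __name__ == "__main__":
    print(expand_date_templates("Published {{date|November 13, 2004}}."))
```

The same approach would probably carry over to byline and similar templates,
by swapping the template name in the pattern.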
> I have the same issue for byline, etc.
>
> The wikinews dump also contains pages which are not news articles,
> I now filter them based on the presence of the {{publish}} tag, and then
> cut the article where the text ends, based on the presence of headings
> or tags.
>
> I changed the link handling a bit, because many links seem to be inter-wiki
> links, which the current implementation filters out.
Indeed. We might want to make this behavior controllable through
constructor arguments.
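
As a rough illustration of what I mean, here is a Python sketch with a
constructor flag. Everything in it is an assumption made for the example: the
class name LinkFilter, the keep_interwiki_links argument, and the simplistic
rule that any link target containing ":" counts as an inter-wiki link (which
would also catch namespace links such as categories).

```python
import re

# Matches [[target]] or [[target|label]] without nested brackets.
LINK = re.compile(r"\[\[([^\[\]|]+)(?:\|([^\[\]]*))?\]\]")


class LinkFilter:
    """Hypothetical converter step that turns wiki links into plain text."""

    def __init__(self, keep_interwiki_links: bool = False):
        # When False, inter-wiki links are dropped entirely.
        self.keep_interwiki_links = keep_interwiki_links

    def to_text(self, markup: str) -> str:
        def replace(match):
            target, label = match.group(1), match.group(2)
            # Simplifying assumption: a prefixed target is inter-wiki.
            is_interwiki = ":" in target
            if is_interwiki and not self.keep_interwiki_links:
                return ""
            # Prefer the label; otherwise use the target without its prefix.
            return label if label else target.split(":")[-1]

        return LINK.sub(replace, markup)
```

The default keeps the filtering behavior, so existing callers would not be
affected unless they pass the flag explicitly.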
--
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel