Hi,
I'm noticing some strange output from the latest build of the Extraction
Framework, namely in the wikipedia_links and geo_coordinates output
files. In wikipedia_links I get
http://purl.org/dc/elements/1.1/language de . instead of
http://purl.org/dc/elements/1.1/language de@de . and in
I'm trying to use the WikiParser to determine the category list of a
Wikipedia page.
The category tags are represented as TextNode objects, but when I print out
the result of toWikiText, I get an empty string. Should categories be TextNodes,
and if so, what's the correct way to extract the category name from the