[Wikidata-bugs] [Maniphest] [Commented On] T145754: Non conform turtle syntax for RDF dump

2016-09-15 Thread D063520
D063520 added a comment.
Thank you very much Daniel for taking this over and addressing it so fast. DennisTASK DETAILhttps://phabricator.wikimedia.org/T145754EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: D063520Cc: Smalyshev, daniel, Aklapper, D063520, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Created] T145754: Non conform turtle syntax for RDF dump

2016-09-15 Thread D063520
D063520 created this task.D063520 added a project: Wikidata.Herald added a subscriber: Aklapper.
TASK DESCRIPTIONHello,

I would like to report a bug in the rdf dump offered by wikidata. It is great that you offer the data in rdf!
I downloaded the following dump:

wikidata-20160829-all-BETA.ttl

Unfortunately it is not valid turtle syntax. If you parse it you will get an error. It appears around the entity Q815674. Unfortunately one of the labels is "\a". This is not accept in turtle due to the backslash. I found it very difficult to find this error and it was also difficult to eliminate it since it is a 70 gb big file. I would suggest in future to parse once the file and check if it is valid before publishing.

Thank you
d063520TASK DETAILhttps://phabricator.wikimedia.org/T145754EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: D063520Cc: Aklapper, D063520, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs