On 26/11/2021 19:09, Andy Seaborne wrote:


On 26/11/2021 16:51, Marco Neumann wrote:
I got the following 107 warnings during parsing of the complete Wikidata
truthy - version of 2021-11-17

Those are included in the report at start-of-thread.
Full report:
https://gist.github.com/afs/15719d46299bcf7346e3c314ac109040

The archives don't present the plain text nicely:

The summary as un-re-formatted plain text:

https://gist.github.com/afs/7b20b36391e186b262cdb485dbfae681

The [U+D83C] are surrogate pairs and should not appear in UTF-8

U+D83C U+DF1F is 🌟 (U+1F31F) and should be encoded into bytes as UTF-8: xF0 x9F x8C x9F

https://en.wiktionary.org/wiki/%F0%9F%8C%9F

Reply via email to