Smalyshev added a comment. |
the goal here is to get anything readable
I get this, but how Chinese label is anything readable for a person who doesn't read Chinese?
anything based on the Latin character set
Ok, this is a bit presuming but I guess a workable heuristic. This means we probably need a predetermined prioritized list of languages, probably a configurable one. I wonder how much value we will get beyond "always fallback to English" though. I.e. how many entities do not have English label but do have label that would be readable by significant percent of readers and in how many cases we'll make the right choice?
In short, I understand the idea now but I am not sure how feasible it is to implement consistently, how much value it would add (need more data on that) and how well it would be received by the users when we create a fixed ranking of languages. I'd think going beyond Latin charset has most potential for people being upset.
Cc: Esc3300, Smalyshev, Aklapper, Yurik, EBjune, mschwarzer, Avner, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331
_______________________________________________ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs