Denny,

Yes, yet the timing is good for these great developments you're making with
languages in Wikidata4Wiktionary.

Cheers,
Scott



On Fri, Apr 7, 2017 at 11:50 AM, Denny Vrandečić <vrande...@gmail.com>
wrote:

> Scott,
>
> I assume you realized that the article by Norvig you cited was rather
> intentionally published on April 1st.
>
> Cheers,
> Denny
>
> On Fri, Apr 7, 2017 at 11:04 AM Scott MacLeod <worlduniversityandschool@
> gmail.com> wrote:
>
>> I tried to see how the ISO codes and IANA language subtags compare with
>> Glottolog's 8,444 entries under languages (http://glottolog.org/
>> glottolog/language) and Ethnologue's 7,099 living languages (
>> https://www.ethnologue.com/), but couldn't find any comparisons or
>> comparative lists.
>>
>> Will it be possible with these new developments in Wikidata to query for
>> these possibilities, and leave the options open for a growing list of
>> languages, as well as an universal translator?
>>
>> And how will invented languages be added, such as Krell, Elvish and
>> Klingon (and even other species' languages in emergent interspecies'
>> communications), and possibly per OpenNMT (Neural Machine Translation) -
>> http://opennmt.net/ (and possibly GNMT); see also Peter Norvig's recent
>> article in the regards to OpenNMT and invented languages -
>> https://medium.com/@peternorvig/last-tweets-of-the-krell-82b8cb74c320 (and
>> per http://scott-macleod.blogspot.com/2017/04/falco-peregrinus-
>> smartphone-that-could.html).
>>
>> Scott
>>
>>
>>
>> On Fri, Apr 7, 2017 at 10:13 AM, Daniel Kinzler <
>> daniel.kinz...@wikimedia.de> wrote:
>>
>> Am 07.04.2017 um 01:34 schrieb Denny Vrandečić:
>> >     I foresee that might be a bit of a problem for external tools
>> consuming
>> >     this data - how they would figure out what language it is if it's
>> >     doesn't have a code? We could of course generate fake codes like
>> >     mis-x-q12345, maybe that would work.
>> >
>> > Q-items for languages already have a property to state their language
>> code. It's
>> > just an extra hop away.
>>
>> We want ISO codes (or rather, IANA language subtags [1]), so we can use
>> them in
>> HTML lang attributes, and in RDF literals. This allows interoperability
>> with
>> standard tools.
>>
>> For this reason, I also favor a mixed approach, that allows standard
>> language
>> tags to be used whenever possible. I have some ideas on how that could
>> work, but
>> no definite plan yet.
>>
>> Something like de+Q1980305 could work; when generating HTML or RDF, we'd
>> just
>> drop the suffix. For transligual entries (e.g. the for number symbol i),
>> we
>> could use e.g. mis+Q1140046.
>>
>>
>> [1]
>> https://www.iana.org/assignments/language-subtag-
>> registry/language-subtag-registry
>>
>> --
>> Daniel Kinzler
>> Principal Platform Engineer
>>
>> Wikimedia Deutschland
>> Gesellschaft zur Förderung Freien Wissens e.V.
>>
>> _______________________________________________
>> Wikidata mailing list
>> Wikidata@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>>
>>
>>
>> --
>>
>> --
>> - Scott MacLeod - Founder & President
>> - World University and School
>> - http://worlduniversityandschool.org
>>
>> - 415 480 4577 <(415)%20480-4577>
>> - http://scottmacleod.com
>>
>>
>> - CC World University and School - like CC Wikipedia with best
>> STEM-centric CC OpenCourseWare - incorporated as a nonprofit university and
>> school in California, and is a U.S. 501 (c) (3) tax-exempt educational
>> organization.
>>
>>
>> IMPORTANT NOTICE: This transmission and any attachments are intended only
>> for the use of the individual or entity to which they are addressed and may
>> contain information that is privileged, confidential, or exempt from
>> disclosure under applicable federal or state laws.  If the reader of this
>> transmission is not the intended recipient, you are hereby notified that
>> any use, dissemination, distribution, or copying of this communication is
>> strictly prohibited.  If you have received this transmission in error,
>> please notify me immediately by email or telephone.
>>
>> World University and School is sending you this because of your interest
>> in free, online, higher education. If you don't want to receive these,
>> please reply with 'unsubscribe' in the body of the email, leaving the
>> subject line intact. Thank you.
>>
>
> _______________________________________________
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>


-- 

-- 
- Scott MacLeod - Founder & President
- 415 480 4577
- http://scottmacleod.com

- World University and School
- http://worlduniversityandschool.org

- CC World University and School - like CC Wikipedia with best STEM-centric
CC OpenCourseWare - incorporated as a nonprofit university and school in
California, and is a U.S. 501 (c) (3) tax-exempt educational organization.


IMPORTANT NOTICE: This transmission and any attachments are intended only
for the use of the individual or entity to which they are addressed and may
contain information that is privileged, confidential, or exempt from
disclosure under applicable federal or state laws.  If the reader of this
transmission is not the intended recipient, you are hereby notified that
any use, dissemination, distribution, or copying of this communication is
strictly prohibited.  If you have received this transmission in error,
please notify me immediately by email or telephone.

World University and School is sending you this because of your interest in
free, online, higher education. If you don't want to receive these, please
reply with 'unsubscribe' in the body of the email, leaving the subject line
intact. Thank you.
_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata

Reply via email to