Re: [Wikitech-l] Now live: Shared structured data

2016-12-30 Thread mathieu stumpf guntz
As to my mind it's a very interesting topic, I searched a bit more. https://www.w3.org/International/articles/article-text-size.en which quotes http://www-01.ibm.com/software/globalization/guidelines/a3.html According to which, for strings in English source that are over 70 characters,

Re: [Wikitech-l] Now live: Shared structured data

2016-12-29 Thread mathieu stumpf guntz
Le 28/12/2016 à 23:08, Yuri Astrakhan a écrit : The 400 chat limit is to be in sync with Wikidata, which has the same limitation. The origins of this limit is to encourage storage of "values" rather than full strings (sentences). Well, that's probably not the best constraints for a glossary

Re: [Wikitech-l] Now live: Shared structured data

2016-12-28 Thread Yuri Astrakhan
The 400 chat limit is to be in sync with Wikidata, which has the same limitation. The origins of this limit is to encourage storage of "values" rather than full strings (sentences). Also, it discourages storage of wiki markup. On Wed, Dec 28, 2016, 16:45 mathieu stumpf guntz <

Re: [Wikitech-l] Now live: Shared structured data

2016-12-28 Thread mathieu stumpf guntz
Thank you Yuri. Is there some rational explanation behind this limits? I understand the limit over performance concern, and 2Mb seems already very large for intented glossaries. But 400 chars might be problematic for some definition I guess, especially since translations can lead to varying

Re: [Wikitech-l] Now live: Shared structured data

2016-12-25 Thread Yuri Astrakhan
Hi Mathieu, yes, I think you can totally build up this glossary in a dataset. Just remember that each string can be no longer then 400 chars, and total size under 2mb. On Sun, Dec 25, 2016, 10:45 mathieu stumpf guntz < psychosl...@culture-libre.org> wrote: > Hi Yuri, > > Seems very interesting.

Re: [Wikitech-l] Now live: Shared structured data

2016-12-25 Thread mathieu stumpf guntz
Hi Yuri, Seems very interesting. Am I wrong thinking this could helpto create multi-lingual glossary as drafted in https://phabricator.wikimedia.org/T150263#2860014 ? Le 22/12/2016 à 20:30, Yuri Astrakhan a écrit : Gift season! We have launched structured data on Commons, available from

Re: [Wikitech-l] Now live: Shared structured data

2016-12-22 Thread David Cuenca Tudela
Anyway, this is great news! I hope that it gets adopted by the community. Congratulations, Yuri! I was going to suggest a Wikidata property, but I see that the data type for datasets is not there yet: https://phabricator.wikimedia.org/T151334 On Thu, Dec 22, 2016 at 8:48 PM, Yuri Astrakhan

Re: [Wikitech-l] Now live: Shared structured data

2016-12-22 Thread Yuri Astrakhan
Micru, thanks, I think Datasets sounds like a good name too! On Thu, Dec 22, 2016 at 2:44 PM David Cuenca Tudela wrote: > On Thu, Dec 22, 2016 at 8:38 PM, Brad Jorsch (Anomie) < > bjor...@wikimedia.org > > wrote: > > > On Thu, Dec 22, 2016 at 2:30 PM, Yuri Astrakhan < >

Re: [Wikitech-l] Now live: Shared structured data

2016-12-22 Thread David Cuenca Tudela
On Thu, Dec 22, 2016 at 8:38 PM, Brad Jorsch (Anomie) wrote: > On Thu, Dec 22, 2016 at 2:30 PM, Yuri Astrakhan > wrote: > > > Gift season! We have launched structured data on Commons, available from > > all wikis. > > > > I was momentarily

Re: [Wikitech-l] Now live: Shared structured data

2016-12-22 Thread Yuri Astrakhan
Yes, there seem to have been a bit of a naming collision. Tabular data and map data have been jointly known as structured data, but there is also the Structured Data project, which IMO should be called Structured Metadata project :) Naming suggestions are welcome! P.S. Brad, I'm sorry tabular

Re: [Wikitech-l] Now live: Shared structured data

2016-12-22 Thread Brad Jorsch (Anomie)
On Thu, Dec 22, 2016 at 2:30 PM, Yuri Astrakhan wrote: > Gift season! We have launched structured data on Commons, available from > all wikis. > I was momentarily excited, then I read a little farther and discovered this isn't about

[Wikitech-l] Now live: Shared structured data

2016-12-22 Thread Yuri Astrakhan
Gift season! We have launched structured data on Commons, available from all wikis. TLDR; One data store. Use everywhere. Upload table data to Commons, with localization, and use it to create wiki tables, lists, or use directly in graphs. Works for GeoJSON maps too. Must be licensed as CC0. Try