from:"David Cuenca"

Re: [Wikidata-l] External identifiers vs. Wikidata-internal links data

2015-04-04 Thread David Cuenca

Hi all,

On a side note I would like to mention that labels and aliases are
external identifiers, perhaps not as accurate as database IDs, as natural
language users tend to stretch the conceptual boundaries, but they can also
be referenced and sourced.

If, as Lydia says, database IDs will have their own section (either only in
the frontend, or also in the backend), I would recommend to use a similar
method for sourcing+qualifying labels, as natural language identifiers
and database identifiers share the same problems.

On Freebase, database keys had a separate tab. Not sure if that is the
right approach...

Cheers,
Micru

On Sat, Apr 4, 2015 at 3:45 PM, Andy Mabbett a...@pigsonthewing.org.uk
wrote:

 On 4 April 2015 at 10:20, Thomas Douillard thomas.douill...@gmail.com
 wrote:

  I guess a class of properties ''external identifier definition property''
  with isbn  instance of external identifier prop could be useful as
 well.

 The property for an ORCID iD (P 496), for example, is an instance of
 Wikidata
 property for authority control for people (Q19595382). That, in turn
 is a sub-class of Wikidata property for authority control
 (Q18614948)

 --
 Andy Mabbett
 @pigsonthewing
 http://pigsonthewing.org.uk

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Sitelinks to Redirects

2014-10-22 Thread David Cuenca

On Wed, Oct 22, 2014 at 10:47 AM, svetlana svetl...@fastmail.com.au wrote:

 If we have a need in pointing (at Wikibase/Wikidata) to redirects on a
 regular basis, it might be time to rethink the relevant project design.


I think that rethinking the project design is the right approach here. To
link to redirects is as bad as leaving relevant article sections
unconnected. The challenge is to find another way to associate an article
section with an item without using redirects.

I have opened a bug report to gather ideas:
https://bugzilla.wikimedia.org/show_bug.cgi?id=72347

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Associating WD items with WP article sections (it was: redirects, workflow)

2014-10-20 Thread David Cuenca

Another thing that I am seeing now is that parsoid plans to add IDs to all
elements:
https://www.mediawiki.org/wiki/Parsoid/MediaWiki_DOM_spec/Element_IDs

I don't know enough about it to see if those element IDs could be used as
section identifiers, but it might be well worth to ask about it.

Cheers,
Micru



On Mon, Oct 20, 2014 at 10:24 PM, James Heald j.he...@ucl.ac.uk wrote:

 I think we have to look at what people actually use: overwhelmingly, that
 is redirects, not labelled section transclusion.

 Redirects are lightweight, in terms of editor time, and they do the job.

 Transclusion is of course valuable in particular cases; but trying to
 maintain multiple different contexts for the same material would be quite a
 headache - tricky to create and maintain, and utterly inflexible if
 somebody wants to re-shape the article.

 The bottom line here is that we should face reality: people are not going
 to create labelled section transclusions, still less have to put up with
 maintaining them, just to make wikidata have some more sitelinks.
 Realisticly, we would end up with a handful of such transclusions, at
 most.   Whereas people create redirects every day.


 Yes, a redirect is just a redirect.  It's not perfect.  But it's usually
 good enough.

 Take Daniel Havell for instance:
https://en.wikipedia.org/w/index.php?title=Daniel_Havell

 the section it points to is recognisably part of a larger article.  In
 fact it has been written to be part of that larger article, and depends on
 it for context.  The section is not standalone content.  This is the deal
 with redirects, one which I do believe readers understand and accept.

 Yes, if somebody changed the section name, the redirect would no longer
 point to the section (unless they had left an anchor). But the redirect
 would still point to the right article; and given that at best redirects
 are just redirects, and depend on the rest of the article for context
 anyway, the less precise link is not *such* a big loss.


 The key thing about allowing sitelinks to redirects is that they are the
 mechanism that is actually used.

 The sitelink should point to where on the wiki there is content that
 matches the actual meaning of the item.  If that happens to be a redirect,
 so be it.

 The best can be the enemy of the good.  I simply do not believe that
 labelled section transclusions will happen to any great degree; and I think
 editors would find even those that did to be an endless pain to maintain.
 They are not a promise for which it is worth sacrificing the benefits of
 all the redirects we should and could be sitelinking to.

   -- James.



 On 20/10/2014 20:44, Derric Atzrott wrote:

 There are major problems using redirects as sitelinks. The top one is
 that they do not always point to the concept they should, and even if
 they do, there is no guarantee that this redirect will keep pointing
 to the same place (normally to a section of another article), since
 the section title can change.

 Wikipedia supports section labelling:
 https://en.wikipedia.org/wiki/Help:Labeled_section_transclusion


 I would support this as a solution.  It seems to solve the issue that
 using
 redirects in site links wishes to solve.

 Thank you,
 Derric Atzrott


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] How to express property constraints correctly?

2014-10-20 Thread David Cuenca

I'm thinking of possible ways to represent constraints as items (see [1]),
like those in: https://www.wikidata.org/wiki/Property_talk:P19

However some of these are not easy to translate into Wikidata proper. For
example:
place of birth (P19) Conflicts with instance of (P31): criminal delict
(Q1456832)

With the current tools it could be expressed as convolutely as wished for,
but it won't be compatible with the semantic web.

Of course it would be much nicer and compatible to implement OWL2 property
restrictions (some_values_from, all_values_from, none_of, etc) as snak
types which could be re-used for any property. Then it would be as easy as:

place of birth (P19)
 property restriction [none of]
   instance of (P31): criminal delict (Q1456832)


Thoughts?

Cheers,
Micru



[1] https://www.wikidata.org/wiki/Wikidata:Constraint_violation_report_input
[2] http://www.w3.org/TR/owl2-syntax/#Object_Property_Restrictions
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Sitelinks to Redirects

2014-10-19 Thread David Cuenca

On Sat, Oct 18, 2014 at 1:12 PM, Andy Mabbett a...@pigsonthewing.org.uk
wrote:

 On 18 October 2014 08:15, Gerard Meijssen gerard.meijs...@gmail.com
 wrote:

  I think I requested P1472, I forgot all about it. It takes so long before

 The proposal was mine:

https://www.wikidata.org/wiki/Property_talk:P1472


Actually there where two proposals, the one by Gerard was submitted in
January
https://www.wikidata.org/wiki/Wikidata:Property_proposal/Sister_projects#Commons_Creator

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Sitelinks to Redirects

2014-10-19 Thread David Cuenca

On Sun, Oct 19, 2014 at 6:54 PM, rupert THURNER rupert.thur...@gmail.com
wrote:

 would it make sense to use wikidata for such tasks as well?


Wikidata already represents more granular information than an article, the
real problem is that the only way that we have to bind a piece of
information in Wikipedia to its Wikidata representation is through the
article name.
This is of course derived from the technological limitations of mediawiki
which treats each article as a blob of text.

On Wikisource we use Labeled Section Transclusion to define regions of a
mediawiki page that can be transcluded into other pages:
https://www.mediawiki.org/wiki/Extension:Labeled_Section_Transclusion

It is normally used with the following format:

## stable_section_identifier ##
some text here


In a way, it is like creating a local variable, since you assign an
identifier to a section that later on can be referred to regardless of the
changes in the text or in the title. I wonder if this is something that
could be adapted for Wikipedia in a way that users could mark article
sections with unique identifiers and then link those stable section
identifiers in Wikidata.

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Item both subclass and instance?

2014-10-05 Thread David Cuenca

Hi Eric,

The idea is to separate in those edge cases the perceptual model (what is
recognized) from the name given to it. As such you can consider ethanol
and chemical compound metaclasses (names) pointing to a label-less class
that represents the model.

In practical terms what it would entail is:
- remove the labels from Q153
- create one item for the label ethanol and one item for the label
chemical compound
- link Q153 with those names using has name
- ethanol is no longer an instance, but a class that can take different
names, ethanol being more specific

Do you realize that after this scrap everything and start again from the
beginning the resulting structure is more comprehensive and encompasses
previous efforts? Entity stays as it is, but now it can be examined by
its qualities and they can be taken apart if needed. The term cognizable
is 1:1 compatible with the subclass/instance model, but it emphasizes the
necessity of having an observer for it to be meaningful. Classes do not
exist in isolation, it is in fact very naive to keep the notion of
objectivity when dealing with observation. Even logic needs a system to be
executed, and based on which reality models? How were they produced? And
how are those models and the models based on them verified? The problem
with logicians is that they think themselves isolated from the world,
however even they were born from a womb.

And have you seen who introduced the term negentropy? If you check the
names behind it you will see that those ideas are in fact standing on the
shoulders of giants. It is hard to model life without understanding first
that life itself is survival-oriented.

Those papers are interesting but fail to address basic questions like: who
decides identity? How does the observer interact with it? Where does
information come from? And without tackling those questions, and by
extension, emergent processes, logic seems like a deux ex machina that
appears from nothingness and acts in nothingness.

Best,
Micru

On Sun, Oct 5, 2014 at 6:49 PM, Emw emw.w...@gmail.com wrote:

 Hi David,

 How does your treatise relate to the fact that
 https://www.wikidata.org/wiki/Q153 has statements that entail the
 following? [1]

 ethanol
 *instance of* chemical compound
 *subclass of* chemical compound

 How would it resolve that specific problem?

 Such statements make Wikidata incompatible with ChEBI and other major
 ontologies, like Gene Ontology and Disease Ontology, which use *instance
 of* (i.e. rdf:type, P31) and *subclass of* (i.e. rdfs:subClassOf, P279,
 is_a) as recommended in the Relation Ontology (RO) [2] and the Basic Formal
 Ontology (BFO).

 For Wikidata to be interoperable with other major ontologies in the
 Semantic Web, we cannot scrap everything and start again from the
 beginning.  We must stand on the shoulders of giants.  Doing away with
 entity [3] as a the top of the *subclass of* hierarchy and introducing
 a raft of idiosyncratic ontological constructs like identifiables,
 cognizables, negentropy and system survival-oriented direction to
 Wikidata is probably not the way to go.

 Regarding your concerns about the ability of existing approaches to
 represent emergent properties and natural language, I recommend perusing
 [4], an influential paper that informs DOLCE and BFO -- particularly the
 section on The Role of Identity Criteria.  I also highly recommend
 lectures 3 and 1 in
 http://ontology.buffalo.edu/smith/IntroOntology_Course.html for gaining a
 perspective on how BFO maintainers think about things like distinguishing
 objects and representations, and that upper ontology's philosophical
 roots.  Given your interest in phenomenology and Husserl, you may also be
 interested in what BFO maintainers have written on those subjects in e.g.
 [5] with regard to ontology.

 Best,
 Eric

 [1] https://www.wikidata.org/wiki/Q153 currently states ethanol *subclass
 of* alcohol, but given alcohol *subclass of* organic compound and
 organic compound *subclass of* chemical compound, it is entailed that
 ethanol *subclass of* chemical compound.
 [2] Barry Smith et al. (2005).  *Relations in Biomedical Ontologies*.
 http://genomebiology.com/2005/6/5/r46
 [3] Entity item on Wikidata.  https://www.wikidata.org/wiki/Q35120.
 Mapped to owl:Thing.
 [4] Nicola Guarino (1998). *Some Ontological Principles for Designing
 Upper Level Lexical Resources*.  http://arxiv.org/pdf/cmp-lg/9809002v1
 [5] Barry Smith (1989).  *Husserl: Logic and Formal Ontology*.
 http://ontology.buffalo.edu/smith/articles/lfo.html




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Item both subclass and instance?

2014-10-03 Thread David Cuenca

Hi Eric,

I have been following this issue for a long time, and I haven't found any
satisfactory answer from anyone or from any source. Totally disappointed I
decided to scrap everything and start again from the beginning, working on
a foundation that eventually would allow to represent emergent systems and
natural language. Pulling from that thread I got some interesting insights,
that I hope you find useful:
https://docs.google.com/document/d/1dc99zyxdPX8t6Ept_JjSxNalij2PNdYt-A3TG2e4cIw/edit?usp=sharing

If you don't have time or patience to read the whole 27 pages (which is
just an introduction that needs refinement and to be expanded):
- in real life there is no difference between calling something
metaclass, identifier or information, they all refer to an observer
system encoding an observed signal together with a pattern recognition model
- in real life instantiation is an observer system putting a frame on an
observed system. This frame might be more or less justified considering the
intrinsic qualities of what is placed inside, but in the end it is entirely
observer-defined, so quite useless other than to point to an agreed or
arbitrary bottom concept chosen by the observer

( I want to emphasize in real life because sometimes I notice a focus in
devising mathematical sound approaches and not so much approaches based
on reality itself.  )

In my opinion the solution to atoms is to separate identifiers from the
entity with a new property (has/identifier of or has/manifestation of)
and use instance_of only when really necessary.

I hope we don't have to resort to Captain Metaphysics :)
http://existentialcomics.com/comic/47

Cheers,
Micru

On Fri, Sep 26, 2014 at 2:59 PM, Emw emw.w...@gmail.com wrote:

 The statement ethanol *instance of* chemical compound is ontologically
 incorrect.  Importantly, it is also incompatible with ChEBI, the most
 widely-used chemistry ontology.

 The matter of how to apply *instance of* (P31, rdf:type) and *subclass of*
 (P279, rdfs:subClassOf) on Wikidata in relation to chemical entities has
 been, as Thomas puts it, a long discussion [1-5].  Hopefully with a wider
 audience and experts like Markus Krötzsch and Denny Vrandečić now
 interested, we can come to a resolution at least in the particular domain
 of chemical compounds.  Since it concerns interoperability with another
 large Semantic Web project, I have copied Janna Hastings and Alan
 Ruttenberg on this discussion.  Janna coordinates ChEBI.  Alan coordinates
 BFO, the upper ontology used by ChEBI and many other major ontologies in
 the natural sciences, like Gene Ontology and Disease Ontology.

 Denny indicates how the statement Porsche 356 *instance of *car would
 be incorrect in Wikidata even though Porsche 356 *is a* car is
 acceptable in everyday speech.  Similarly, ethanol *instance of*
 chemical compound is incorrect in Wikidata even though ethanol *is a*
 chemical compound is acceptable in less formal contexts.

 A key difference between talk about cars and talk about chemicals is that,
 with cars, we have familiar terms like car model that distinguish
 concrete instances (that *particular* car you see on the street) from
 abstract instances (i.e. metaclasses, classes that are also instances,
 the *kind* of car that you see on the street).  We do not have a
 well-known term like chemical model or chemical compound type to
 distinguish classes (types) of chemicals and instances (tokens) of
 chemicals.  When one speaks of the properties of ethanol or hydrogen, it is
 understood that the subject is *all concrete, particular, spatiotemporal
 tokens, i.e. instances *of ethanol and hydrogen -- not just a specific
 ethanol molecule floating in that container before you on a Saturday with
 friends, but all molecules that we label ethanol everywhere.

 Thus, in order to formally classify ethanol itself as opposed to some
 particular ethanol molecule, we must say for an item like
 http://www.wikidata.org/wiki/Q153: ethanol *subclass of* chemical
 compound and not ethanol *instance of* chemical compound.  (On
 Wikidata, the statement is more precisely ethanol *subclass of *alcohol,
 but it is entailed from the statements alcohol *subclass of* organic
 compound and organic compound *subclass of* chemical compound that
 ethanol *subclass of* chemical compound.)

 A common defense of statements like ethanol *instance of* chemical
 compound is that Wikidata will never have items about any concrete
 molecules of ethanol, so, since ethanol is a leaf node in our concept
 taxonomy, it makes sense to state that ethanol is an instance.  That
 interpretation of instance is short-sighted.  It precludes us from ever
 talking about particular tokens of ethanol, or particular aggregates of
 such objects, without overhauling our chemistry ontology.  Excluding
 consideration of metaclasses like chemical compound type, the fact that
 an entity is a leaf node in a concept hierarchy is a necessary but not
 sufficient condition for

Re: [Wikidata-l] Linking to Wikipedia page using Wikidata ID

2014-08-29 Thread David Cuenca

Hi Nick,

I have opened this bug report where you can subscribe or add your use case
(if you think my suggestion is not appropriate):
https://bugzilla.wikimedia.org/show_bug.cgi?id=70159

There is also this related enhancement proposal for error handling:
https://bugzilla.wikimedia.org/show_bug.cgi?id=70127

Regards,
Micru


On Thu, Aug 28, 2014 at 2:08 PM, Nicholas Humfrey 
nicholas.humf...@bbc.co.uk wrote:

  Fantastic, thanks!

  Could you put linking to the user's preferred language (Accept-Language
 header?) on the backlog?

  nick.


   From: David Cuenca dacu...@gmail.com
 Reply-To: Discussion list for the Wikidata project. 
 wikidata-l@lists.wikimedia.org
 Date: Thursday, 28 August 2014 12:59
 To: Discussion list for the Wikidata project. 
 wikidata-l@lists.wikimedia.org
 Subject: Re: [Wikidata-l] Linking to Wikipedia page using Wikidata ID

 Hi Nick,

 You are lucky, that feature has been released this week:
 https://www.wikidata.org/wiki/Special:GoToLinkedPage

  For instance
 https://www.wikidata.org/wiki/Special:GoToLinkedPage/enwiki/Q732383

  Cheers,
  Micru


 On Thu, Aug 28, 2014 at 12:52 PM, Nicholas Humfrey 
 nicholas.humf...@bbc.co.uk wrote:

  Hello,

  We have been working on associating every episode of BBC Desert Island
 Discs to a Wikipedia page about that person.

  Example:
 http://www.bbc.co.uk/programmes/p0093x3l
 http://en.wikipedia.org/wiki/David_Croft
 https://en.wikipedia.org/wiki/David_Croft
 http://www.wikidata.org/wiki/Q2072654
 https://www.wikidata.org/wiki/Q2072654

  Since we created some of the links to Wikipedia, those pages have
 changed into disambiguation pages. Is it possible to link to a Wikipedia
 page using the Wikidata ID, via some kind of redirect URL? Thus ensuring
 that we continue to link to the correct article?

  Ideally it would even link to the users preferred language…


  Thanks!

  nick.


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




 --
 Etiamsi omnes, ego non

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] new features deployed \o/

2014-08-20 Thread David Cuenca

Lydia,
I have a question about pages in Wikidata connected to their items. In the
example you give:
https://www.wikidata.org/wiki/Q914807

How can be the page linked? I guess I have to enter it in the section
Pages on other sites linked to this item, but when I click add, there
is no suggester. I tried wikidata, wikidatawiki, data, but nothing
happens.

By the way the Wikinews section appears as wikibase-sitelinks-wikinews,
but probably you are already aware of it.

Thanks for the update!
Micru


On Wed, Aug 20, 2014 at 8:10 AM, Lydia Pintscher 
lydia.pintsc...@wikimedia.de wrote:

 Hey folks :)

 As announced last week we just deployed a number of new features. Those
 are:

 * Wikinews is now able to manage its sitelinks via Wikidata.
 https://www.wikidata.org/wiki/Wikidata:Wikinews for
 questions/coordination/...
 * Wikidata is now also its own client. This means you can for example
 add a sitelink to Wikidata:Help to the item for all main help pages.
 You are able to make use of the data in items on other pages in
 Wikidata with Lua. (Arbitrary access has been enabled for Wikidata
 for this but when data in an item changes we will not be able to purge
 the page using the data yet.)
 * Sitelinks for projects with just one sitelink in the group (like
 Commons, Wikidata and in the future Meta for example) are now grouped
 together in one sitelink group.
 * Badges for good and featured articles can be stored on Wikidata
 right next to the sitelink. We have badges for featured and good
 articles. More can be added on request later. Thanks to Bene* and
 lazowik for this feature.
 * Redirects between items can be created. When two items are merged
 one of them can be turned into a redirect. This way our identifiers
 can be considered much more stable by 3rd parties for example. It also
 makes it unnecessary to delete duplicate items. This will reduce the
 workload of our admins considerably.
 * We have the new datatype monolingual text. This allows you to make
 statements with a string and an associated language.

 Known issues/limitations:
 * Redirects can so far only be created via the API
 * Arbitrary access on Wikidata to the data on Wikidata itself is only
 possible via Lua. The parser function still needs to be adapted.
 * Badges can not yet be shown on the Wikipedias etc. This will follow next
 week.
 * Diffs for badges changes have a link to a wrong target
 (https://bugzilla.wikimedia.org/show_bug.cgi?id=69758)


 Cheers
 Lydia

 --
 Lydia Pintscher - http://about.me/lydia.pintscher
 Product Manager for Wikidata

 Wikimedia Deutschland e.V.
 Tempelhofer Ufer 23-24
 10963 Berlin
 www.wikimedia.de

 Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

 Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
 unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
 Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Announce: WikiProject Structured Data for Commons

2014-08-19 Thread David Cuenca

Thanks for the stats, Gerard. Two thoughts:
- With so many items without description I wonder why we don't have the
automatic descriptions gadget enabled by default.
- There are many items without statements, but not that many articles
without a category -- would it be possible to have a game that suggests
instance of/subclass of based on the statements of the items in the same or
in the upper category?

Thanks
Micru


On Tue, Aug 19, 2014 at 10:39 AM, Gerard Meijssen gerard.meijs...@gmail.com
 wrote:

 Hoi,
 I am not in research. I am into making Wikidata into a reasonable
 resource. To achieve this I add gazillions of statements and I am happy
 when people focus on templates. However without the data, templates that
 make use of Wikidata are niche applications. Without some mature
 understanding of what Wikidata is at this time concentrating on templates
 is an exercise in building enabling technology.

 My point is for the community to have reasonable expectations. So many
 people consider Wikidata to be useless. That is fine but imho it helps when
 the baseline of where Wikidata is at this time is understood.
 Thanks,
  GerardM

 The statistics that explain this best can be found here ..
 https://tools.wmflabs.org/wikidata-todo/stats.php?reverse


 On 19 August 2014 10:27, Luca Martinelli martinellil...@gmail.com wrote:

 Ok, I got the point. What you probably  need to consider is that focusing
 on one goal does not mean at all that we have to dismiss all the others. At
 least, *I* do not think so.

 You want to focus on research? Fine, do it. I'd like to focus on
 templates. That's fine too, I guess. We're both working to let Wikidata be
 appreciated - by separate audiences.

 L.
 Il 19/ago/2014 07:57 Gerard Meijssen gerard.meijs...@gmail.com ha
 scritto:

  Hoi,
 What is the point of Wiktionary, WIkipedia, Wikispecies et al as a WMF
 project? Like Wikidata they all help us share in the sum of all knowledge.
 Wikidata already provides an application in being the vehicle for
 interlanguage links.

 The low hanging fruit of Wikidata is not sharing info in templates, it
 is in providing search results where a Wikipedia does NOT have an article.
 It is used for this and it does have a measurable impact.  It is nice to
 have the ambition to share data in templates but be realistic. The quality
 of the data in Wikidata does not merit this at this time. The community
 insists on sources and frankly it is assassine to expect that in the first
 few years it will be available near the level that some demand. This is
 only based on the data that is there. That is the next problem we do not
 have enough data. We are still at the stage where we are harvesting data
 for the first time. Harvesting big amounts, not one item at a time.

 It is important to have goals, and it is nice that at the start
 providing data to templates was seen as an initial goal. However it will
 not be like with Pallas Athena when she came from the head of Zeus in full
 armour. This goal is achievable and we are making big strides in that
 direction BUT we need smaller goals, small applications that grow our
 content in both quality and quantity. As I wrote on my blog, we need to
 think in terms of confidence in our data and not so much in sources. Amir
 is finishing a tool that will allow us to compare data for humans in the
 English, German and Italian Wikipedia. That will be a massive step in the
 right direction.

 I care about Wikidata and I know that at this time those freakingly hard
 templates are the least of our worries. More problematic is that people
 think of Wikidata as a service product for Wikipedia and limit their
 thinking to templates. The existing search extension with WDQ is there. It
 works really well. It is dismissed probably because it demonstrates that
 ALL Wikipedias cover less than 50% of the subjects known to us. We know all
 of them because of Wikidata.

 So yeah by all means blow the horn about our aspiration of servicing
 templates in those projects that can handle this. It is fine. It is not
 realistic and even counter productive as an aspiration when we do not
 appreciate the reality as we have it at this time.
 Thanks,
   GerardM


 On 18 August 2014 14:41, Luca Martinelli martinellil...@gmail.com
 wrote:

 2014-08-17 17:00 GMT+02:00 Gerard Meijssen gerard.meijs...@gmail.com:
  Hoi,
  Importing data from Wikidata (where do you want it??) is just one
  application. There are so many potential applications for structured
 data
  and Wikidata implicitly covers the sum of all knowledge as we know it
 (in
  the Wikimedia projects) so there are opportunities galore.
 
  For people not to know how to is a given. I do not care to know
 about
  Wikipedia templates because they are freaking impossibly hard.

 Yet, if we don't use the Wikidata data in the freaking impossibly
 hard Wikipedia templates, what is the point of Wikidata as a
 Wikimedia project?

 I remember that this project had among its first

Re: [Wikidata-l] Commons Wikibase

2014-08-19 Thread David Cuenca

Markus, is this related with the idea of creating a database of
automatically inferred statements that you presented some time ago?

Cheers,
Micru


On Tue, Aug 19, 2014 at 12:30 PM, Markus Krötzsch 
mar...@semantic-mediawiki.org wrote:

 On 19.08.2014 12:20, Gerard Meijssen wrote:

 Hoi,
 I cannot parse this ..


 What Thomas is saying is that classification (putting things into
 categories) and querying (finding things based on certain properties) can
 be combined in a natural way. In ontology languages like OWL, you can make
 statements that say, roughly speaking, that all results of query A belong
 to class B. This allows you to build a classification partly
 automatically, and to ensure that your classification is always consistent
 with your data.

 From this perspective, you can view Wikidata's stored queries as similar
 to OWL's class expressions in that they allow you to define classes based
 on the data given for each item, without having to go through all the items
 to add classes manually. What Thomas is referring to is of course slightly
 more advanced still, but maybe this clarifies part of the idea.

 Cheers,

 Markus



 On 19 August 2014 11:43, Thomas Douillard thomas.douill...@gmail.com
 mailto:thomas.douill...@gmail.com wrote:

 Note that in Wikidata we are developping methods and tools to class
 items in « classes », which in short are sets of real world things
 or events. In languages like the w3c language and standards OWL2
 https://en.wikipedia.org/wiki/OWL2. In this language you can

 assign a class to an element (a media in the common case) to a class
 (that can be seen as a better defined category) either by creating a
 statement « this media belongs to that category » (in Wikidata this
 is done by using the « instance of » property
 https://www.wikidata.org/wiki/Property:P31) or by associating a so

 called «class expression» in OWL (an analog of a query but more
 powerful) Then in OWL any item who satisfy the criteria of the query
 or class expression associated to a class belongs to that class
 without stating it explicitely.  In short, the possibility to assign
 an arbitrary class to an item when a query is not enough will also
 be possible with just a metadata repository, we may in the future
 even be able to mix these two ways to class medias.


 2014-08-19 10:14 GMT+02:00 James Heald j.he...@ucl.ac.uk
 mailto:j.he...@ucl.ac.uk:


 Also there might be queries one might want to run on the
 categories, which would be another reason to include them in
 Commons Wikibase.

-- J.



 On 19/08/2014 07:00, Gerard Meijssen wrote:

 Hoi,
 I know the categories in Commons exist. I also know that you
 do not have to
 add categories when an image is uploaded. Many people do not
 consider the
 categories because they are just there and are not easy nor
 obvious without
 a long study.

 They are there and they evolve. When the community finds
 that they are no
 longer useful, there will be others who still want to work
 on it. They can,
 it is a harmless occupation. Why would we consider removing
 category
 structures as long as someone cares about them ??
 Thanks,
  GerardM



 _
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 mailto:Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/__mailman/listinfo/wikidata-l

 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org mailto:Wikidata-l@lists.wikimedia.org
 

 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Announce: WikiProject Structured Data for Commons

2014-08-19 Thread David Cuenca

It is possible to represent linguistic patterns as items but it needs some
work to conceptualize a structure that can be used in several languages.
Then you could implement a complex query that selects the one that displays
the most information.

The biggest hurdle I see is that we are not storing enough linguistic data.
We could already do it without any further development, but I think that we
need more time to understand thoroughly the structure that is emerging.

Maybe it could be thought as a research project to explore the
possibilities.


On Tue, Aug 19, 2014 at 4:34 PM, Thomas Douillard 
thomas.douill...@gmail.com wrote:

 One answer would be deeper answer of autodescription into Wikidata. I
 don't know how to make this flexible and generic enough to make this only
 depedant of Wikibase model concept though.

 One rough and not thought throw proposition would be: associate a
 (mediawiki?) template to a query or a class for each language, which
 generates a description if an item matches the query according to the
 values of the properties.

 hen we could add an API call that uses the template.


 2014-08-19 16:13 GMT+02:00 Lydia Pintscher lydia.pintsc...@wikimedia.de:

 On Tue, Aug 19, 2014 at 11:19 AM, David Cuenca dacu...@gmail.com wrote:
  Thanks for the stats, Gerard. Two thoughts:
  - With so many items without description I wonder why we don't have the
  automatic descriptions gadget enabled by default.

 I am a bit worried about enabling this by default for everyone as a
 gadget. We need the descriptions in a lot of places where people
 search for items. The next big one will be Commons. But _a lot_ more
 will come in the future. Think for example of tagging your blog post
 on Wordpress with Wikidata concepts. You'll need the descriptions. If
 we enable automatic descriptions on Wikidata now we will actively
 discourage people from entering more descriptions. That would be bad
 as 3rd parties then don't get the benefit of them.
 I am also hesitant to build this into Wikibase directly as it'd need
 quite some domain-knowledge for all I can tell at this point. That's
 something we need to avoid.
 Anyone got ideas how to get out of this?


 Cheers
 Lydia

 --
 Lydia Pintscher - http://about.me/lydia.pintscher
 Product Manager for Wikidata

 Wikimedia Deutschland e.V.
 Tempelhofer Ufer 23-24
 10963 Berlin
 www.wikimedia.de

 Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

 Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
 unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
 Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Commons Wikibase

2014-08-19 Thread David Cuenca

On Tue, Aug 19, 2014 at 5:59 PM, Markus Krötzsch 
mar...@semantic-mediawiki.org wrote:

 I guess it is related, yes. Although currently I am first focussing on
 efficient query answering -- inference will come after that :-) Whether one
 stores the results in a database or not is an implementation detail, btw.,
 that is maybe not essential for a user.


Ok, ok, no hurries :-)

Oh, and of course all of this seems to be rather off-topic for the current
 subject Commons Wikibase :-) All that is relevant there has already been
 said I guess (many categories could be expressed by queries to improve
 results; a gentle, community-led transition will be possible and preferred;
 categories won't be switched off just because Wikidata is switched on).


Actually I have one last question :)  At the moment Gerard is using is a
list of:value on category item pages which has the effect of being the
inverse of instance of. And then he adds further conditions as
qualifiers, see:
https://www.wikidata.org/wiki/Q6562

While this method of works for simple categories, more complex ones would
be hard to model using this method, like
https://www.wikidata.org/wiki/Q8380098

I was thinking of modelling it like:
Category:Discoverers of extrasolar planets is a list of human
Category:Discoverers of extrasolar planets has items used as value of
discoverer

Of course it would require to have a link between the item discoverer and
the property discoverer, but would that make sense?

Thanks,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Language code and site code in the links section

2014-08-18 Thread David Cuenca

Would it be possible to lift the limitation of 1-sitelink = 1-wikipage for
wikis that host several languages?
In a way that: 1-sitelink + 1-language = 1-wikipage

Or are there other technical limitations?

Thanks
Micru


On Mon, Aug 18, 2014 at 3:38 PM, Lydia Pintscher 
lydia.pintsc...@wikimedia.de wrote:

 On Sat, Aug 16, 2014 at 5:50 PM, David Cuenca dacu...@gmail.com wrote:
  There are 3 columns in the sitelink section: Language, Code and
 Linked
  page. Usually language-code and wiki-code have a permanent 1:1
  correspondence , but not in all projects (incubator, meta, wikidata).
 
  Would it be possible to add a configuration option so that for wikis that
  host several languages the language code is also requested?
 
  In this case it would be better to change the column order to wiki
 code,
  language and linked page, that way the users would enter first the
  wiki-code and then if that wiki has the multiple language configuration
  enabled, it would request the language code as well.
 
  For wikis that have pages translated into multiple languages (meta,
  wikidata) the code mul can be used.

 Can you name a specific case where you'd want this? Even for wikis
 with several languages in one we can only link one page and have no
 way to distinguish between them. Or am I missing something?


 Cheers
 Lydia

 --
 Lydia Pintscher - http://about.me/lydia.pintscher
 Product Manager for Wikidata

 Wikimedia Deutschland e.V.
 Tempelhofer Ufer 23-24
 10963 Berlin
 www.wikimedia.de

 Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

 Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
 unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
 Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Summary #122

2014-08-17 Thread David Cuenca

They are slightly different. One refers to the work the character appears
in, and the other refers to the fictional universe. For instance:
Frodo Baggins (Q177329) present in work The Lord of the Rings (Q15228)
Frodo Baggins (Q177329) from narrative or fictional universe Tolkien's
legendarium (Q81738)

Cheers,
Micru


On Sun, Aug 17, 2014 at 3:34 PM, Sjoerd de Bruin sjoerddebr...@me.com
wrote:

 Hello,

 So we have https://www.wikidata.org/wiki/Property:P1441 now. What's the
 difference between that property and
 https://www.wikidata.org/wiki/Property:P1080 ?

 Greetings,

 Sjoerd de Bruin
 sjoerddebr...@me.com

 Op 17 aug. 2014, om 15:27 heeft John Lewis johnflewi...@gmail.com het
 volgende geschreven:

 Hi everyone,

 You can view the latest summary at
 https://www.wikidata.org/wiki/Wikidata:Status_updates/2014_08_16.

 Discussions

- Open RfOS: John F. Lewis

 https://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Oversight/John_F._Lewis

 Other Noteworthy Stuff

- Help choose which banner will be featured on the new Main page! Click
here to view the two banner candidates
https://www.wikidata.org/wiki/Wikidata:Portal_Redesign/Banner#Candidates 
 and
leave your feedback before August 20th 16:00 UTC.
- Wikidata Translate https://tools.wmflabs.org/hay/wdtranslate, a
Wikidata-based Google translator open source clone.

 Did you know?

- Newest properties: journey destination
https://www.wikidata.org/wiki/Property:P1444, score method
https://www.wikidata.org/wiki/Property:P1443, grave picture
https://www.wikidata.org/wiki/Property:P1442, present in work
https://www.wikidata.org/wiki/Property:P1441, Fide ID
https://www.wikidata.org/wiki/Property:P1440, Norsk filmografi ID
https://www.wikidata.org/wiki/Property:P1439, Jewish Encyclopedia ID
https://www.wikidata.org/wiki/Property:P1438, plea
https://www.wikidata.org/wiki/Property:P1437, collection size
https://www.wikidata.org/wiki/Property:P1436
- Newest WikiProjects
https://www.wikidata.org/wiki/Wikidata:WikiProjects: WikiProject
Movies https://www.wikidata.org/wiki/Wikidata:WikiProject_Movies

 Development

- Finished a large number of new features and got them ready for
roll-out. More in this email
https://lists.wikimedia.org/pipermail/wikidata-l/2014-August/004328.html
.
- Wikibase made a big step forward to finally switch to DataModel 1.0.
- Improved support for entity IDs bigger than 2 billion (32 bit
integer).
- We had to adapt Wikibase to some major changes (more major than
usual, partly caused by discussions at Wikimania) in MediaWiki core: The
default Vector skin became it’s own component and the ResourceLoader got
some small but important updates.
- Continued work on refactoring code of the user interface to make it
ready for new design
- Wrote a script to get number of users having wikidata in their
recent changes/watchlist from the database

 See current sprint items http://sb.wmflabs.org/p/wikidata/ for what
 we’re working on next.
 You can see all open bugs related to Wikidata here
 https://bugzilla.wikimedia.org/buglist.cgi?emailcc1=1list_id=151540resolution=---emailtype1=exactemailassigned_to1=1query_format=advancedemail1=wikidata-bugs%40lists.wikimedia.org
  Monthly Tasks

- Hack on one of these

 https://bugzilla.wikimedia.org/buglist.cgi?keywords=need-volunteer%2C%20keywords_type=allwordsemailcc1=1resolution=---emailtype1=exactemailassigned_to1=1query_format=advancedemail1=wikidata-bugs%40lists.wikimedia.orglist_id=162515
.
- Help fix these items
https://www.wikidata.org/wiki/Wikidata:The_Game/Flagged_items which
have been flagged using Wikidata - The Game.
- Help develop the next summary here!
https://www.wikidata.org/wiki/Wikidata:Status_updates/Next
- Contribute to a Showcase item
https://www.wikidata.org/wiki/Wikidata:Showcase

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] Language code and site code in the links section

2014-08-16 Thread David Cuenca

There are 3 columns in the sitelink section: Language, Code and Linked
page. Usually language-code and wiki-code have a permanent 1:1
correspondence , but not in all projects (incubator, meta, wikidata).

Would it be possible to add a configuration option so that for wikis that
host several languages the language code is also requested?

In this case it would be better to change the column order to wiki code,
language and linked page, that way the users would enter first the
wiki-code and then if that wiki has the multiple language configuration
enabled, it would request the language code as well.

For wikis that have pages translated into multiple languages (meta,
wikidata) the code mul can be used.

Thanks
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] DBpedia - Wikidata mapping questions

2014-08-15 Thread David Cuenca

Hi Marco,

Thanks for getting in touch. Looking at the list there seems to be many
wrong mappings.

Is there any way that we can collaborate to increase the number of matches
or does it have to do with qualifiers? And do you plan to document 1:1
matches on your OntologyProperty namespace?

Thanks and regards,
Micru


On Wed, Aug 13, 2014 at 12:55 PM, Marco Fossati hell.j@gmail.com
wrote:

 Hi everyone,

 The following spreadsheet contains the latest DBpedia - Wikidata mapping
 effort both for properties and classes (cf. the 2 sheets) made by the GSoC
 student so far.

 https://docs.google.com/spreadsheets/d/18W9xb5KXorotJ7qM58AsqK-
 myAHEjiDgnZrqNXHEnwk/edit#gid=716082733

 Since I am his main mentor, feel free to ask me further details.
 Hope this helps!

 Cheers,

 Marco


 On 8/13/14, 9:14 AM, David Cuenca wrote:


 On Wed, Aug 13, 2014 at 12:55 AM, Daniel Kinzler
 daniel.kinz...@wikimedia.de mailto:daniel.kinz...@wikimedia.de wrote:

 I think it would be cool to use http://mappings.dbpedia.org/ - it
 already has
 mappings for hundreds of templates in different language versions to
 the dbpedia
 vocabulary. If we added a mapping for wikidata - dbpedia, it
 should be easy
 enough to derive a mapping between a wikipedia template and
 wikidata. Actually,
 I was under the impression the wikidata - dbpedia already existed,
 but I don't
 see it on the site right now. Adding Anja Jentzsch, she might know.

 Even if it proves too tricky to map individual template parameters
 this way, it
 should still be possible to at least detect the class (instanceof)
 of an item
 based on the templates the respective wikipedia page contains. We
 should
 definitely map dpedia classes to wikidata items.


 Sergey Skovorodkin is working on a dbpedia-wikidata mapping as a GsoC
 for DBpedia this year
 https://github.com/dbpedia/extraction-framework/wiki/
 GSoC-2014-Progress-Sergey-Skovorodkin

 I'm CCing him to see if he can report the status or if he might need
 help with our properties, maybe we are missing some or need to clarify
 some others. Sometimes we are using qualifiers to extend properties
 instead of creating new ones.

 Cheers,
 Micru


 --
 Marco Fossati
 http://about.me/marco.fossati
 Twitter: @hjfocs
 Skype: hell_j




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] integration of VisualEditor and Wikidata

2014-08-13 Thread David Cuenca

On Wed, Aug 13, 2014 at 12:55 AM, Daniel Kinzler 
daniel.kinz...@wikimedia.de wrote:

 I think it would be cool to use http://mappings.dbpedia.org/ - it already
 has
 mappings for hundreds of templates in different language versions to the
 dbpedia
 vocabulary. If we added a mapping for wikidata - dbpedia, it should be
 easy
 enough to derive a mapping between a wikipedia template and wikidata.
 Actually,
 I was under the impression the wikidata - dbpedia already existed, but I
 don't
 see it on the site right now. Adding Anja Jentzsch, she might know.

 Even if it proves too tricky to map individual template parameters this
 way, it
 should still be possible to at least detect the class (instanceof) of an
 item
 based on the templates the respective wikipedia page contains. We should
 definitely map dpedia classes to wikidata items.


Sergey Skovorodkin is working on a dbpedia-wikidata mapping as a GsoC for
DBpedia this year
https://github.com/dbpedia/extraction-framework/wiki/GSoC-2014-Progress-Sergey-Skovorodkin

I'm CCing him to see if he can report the status or if he might need help
with our properties, maybe we are missing some or need to clarify some
others. Sometimes we are using qualifiers to extend properties instead of
creating new ones.

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Wikidata: CSV, Shapefile, etc.

2014-08-13 Thread David Cuenca

Yuri and others, I asked not long ago in Commons and there is no opposition
as long as it is for supporting visualizations. It would require modifying
what is allowed in commons, though. There are interesting comments on the
talk page
https://meta.wikimedia.org/wiki/Requests_for_comment/How_to_deal_with_open_datasets

Data.wikisource.org could be another option, but it would require more
effort to set up, and some time for discussions.

Cheers,
Micru


On Wed, Aug 13, 2014 at 6:15 PM, Yuri Astrakhan yastrak...@wikimedia.org
wrote:

 This can already be done by changing JsonConfig configuration. I propose
 we add a Data namespace to the *commons
 https://commons.wikimedia.org/*. Moreover, with the recent work on Graph
 https://www.mediawiki.org/wiki/Extension:Graph extension, I was
 thinking of storing graphing related data there as well. JsonConfig
 currently supports php-code-based validation, but adding json-schema
 https://www.google.com/webhp?sourceid=chrome-instantion=1espv=2ie=UTF-8#q=json%20schemasafe=off
 validation should not be too difficult.

 Data:* -- accepts any valid JSON, without additional validation
 Data:SubNamespace:* -- Custom validated domain-specific data

 For graphs, we might have shape data as well as statistics data. So we
 could declare:
 Data:Graph:* -- accepts JSON that represents entire graph -- vega
 http://trifacta.github.io/vega/ grammar,
 and Data:SomethingElse:* for all snippets of graphs that will be pulled in
 by the vega dynamically.

 Also, I could fairly easily add support for CSV and TSV if we decide that
 it is needed, so that Data:Tsv:* pages would be forced to have the same
 number of columns on each line.




 On Wed, Aug 13, 2014 at 11:48 AM, Dario Taraborelli 
 dtarabore...@wikimedia.org wrote:

 There's a proposal I posted a while ago to store generic datasets that
 can be represented in a tabular or JSON format in a dedicated project
 namespace with dedicated handlers:

 http://meta.wikimedia.org/wiki/DataNamespace
 http://meta.m.wikimedia.org/wiki/DataNamespace

 There's some good discussion on the talk page on the differences between
 this type of data and structure data hosted on Wikidata and where this
 thing could live (it could live on any Wikimedia wiki, including Commons or
 Meta). It looks like this could be a good fit for shapefiles and I'd love
 to hear your thoughts if you have a moment to read this.

 Dario

 On Aug 5, 2014, at 8:43, Paul Houle ontolo...@gmail.com wrote:

 I'm intensely interested in links to shapefiles from databases such as
 Wikidata,  DBpedia and Freebase.  In particular I'd like to get Natural
 Earth hooked up

 http://www.naturalearthdata.com/

 It's definitely a weakness of current generic databases that they use the
 'point GIS' model that is so popular in the social media world.


  On Tue, Aug 5, 2014 at 10:14 AM, Magnus Manske 
 magnusman...@googlemail.com wrote:

 We don't have shapefiles yet, but a lot of property types such as
 geographic coordinates (as in, one per item, ideally...), external
 identifiers (e.g. VIAF), dates, etc.

 A (reasonably) simple way to mass-add statements to Wikidata is this
 tool:
 http://tools.wmflabs.org/wikidata-todo/quick_statements.php

 A combination of spreadsheet apps, shell commands, and/or a good text
 editor should allow you to convert many CSVs into the tool's input format.

 Cheers,
 Magnus


 On Tue, Aug 5, 2014 at 3:01 PM, Brylie Christopher Oxley 
 bry...@gnumedia.org wrote:

 I would like to contribute data to Wikidata that is in the form of CSV
 files,
 geospatial shapefiles, etc.

 Is there currently, or planned, functionality to store general
 structured data
 on Wikidata?
 --
 Brylie Christopher Oxley
 http://gnumedia.org

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




 --
 Paul Houle
 Expert on Freebase, DBpedia, Hadoop and RDF
 (607) 539 6254paul.houle on Skype   ontolo...@gmail.com

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] integration of VisualEditor and Wikidata

2014-08-13 Thread David Cuenca

On Wed, Aug 13, 2014 at 9:52 PM, James Forrester jforres...@wikimedia.org
wrote:

 If the infobox really has no options and just builds itself entirely
 automatically, why not just move it out of the wikitext/etc. content
 entirely and display it always? This makes pages much easier to edit in
 wikitext (no KiB of {{…}} at the top, no confusing stuff at all, nothing to
 break) and simplifies a lot of things…


I support moving the infobox out of wikitext and into page settings, and
allowing only one infobox per article. Sometimes there are more than one
which makes things unnecesarly confusing. For instance:
https://en.wikipedia.org/wiki/Bonnie_and_Clyde

it could use an infobox group of humans that displays data of each person
in the group instead of using twice infobox person.

However I'm saying this as a Wikidatan and a rational person, not as
Wikipedian :)

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] {{Universal infocard}} in ruwiki, and cross-wiki coordination

2014-08-12 Thread David Cuenca

Actually you don't even need to specify which infobox because that can be
automated using P1423 (infobox's main topic) and P1424 (topic's main
infobox). For instance if an item has instance of:person, then you can
check which infobox is connected to the item Q5 and use that infobox. These
properties are still not used widely, but there is no reason not to.

Since Wikidata will be soon its own client, then it is also possible to
store (and use) Lua modules on Wikidata and synchronize them across all
Wikipedias with a bot. However this is sometimes a bit tricky, since
Wikipedias usually like to have their own version. For instance in cawiki
the Authority template gives more relevance to Catalan registers and
contains translated text.

GerardM also suggested an article creation wizard that would first check
if the topic exists in Wikidata. If it does, the infobox could be
automatically inserted on creation. If it doesn't some basic facts could be
asked about the article to create a wikidata item. And based on that data
some automatic text could be generated. Magic! :)

Micru



On Tue, Aug 12, 2014 at 3:00 PM, Amir E. Aharoni 
amir.ahar...@mail.huji.ac.il wrote:

 Thanks to David's comment earlier today about editing , I found this page
 in the Russian Wikipedia:
 https://ru.wikipedia.org/wiki/Module:Universal_infocard

 It's a Lua module that shows a person infobox, pulling the data from
 Wikidata and without giving any parameters in the wiki source code.

 This is essentially the fulfillment of Wikidata's promise to make editing
 articles with infoboxes easier, and it's wonderful. There are more things
 to do, but I already want to thank everyone involved.

 But now the question of cross-wiki synchronization arises. Wikidata.org is
 cross-wiki by definition. Templates such as {{Universal infocard}} should
 be cross-wiki as well.

 Why? To make translation easier. The article about the Slovak poet
 Bohuslav Tablic is available in Russian, but not in English. I'd love to
 translate it to English, but I'll have to use {{Infobox writer}} and fill
 it manually with data. This is doable, but it would be far more efficient
 to pull the data from Wikidata. This will become even more acute when the
 ContentTranslation, and that should happen Some Time Soon. Millions of such
 articles could be translated, and using Wikidata well will save the
 translators millions of minutes.

 I am not saying that all projects should have the same templates. For
 example, I am not concerned with the visual design of the templates - this
 is up to the communities and the designers. But the way in which the data
 is used should be sync'ed.

 What does it entail?
 Synchronizing the code of the Lua modules? Can these, maybe, be made into
 a Lua library that is maintained in MediaWiki source, rather than as
 on-wiki modules?
 Major cross-wiki collaboration in functional specification of data to be
 used in infoboxes?

 What else?


 --
 Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
 http://aharoni.wordpress.com
 ‪“We're living in pieces,
 I want to live in peace.” – T. Moore‬

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] integration of VisualEditor and Wikidata

2014-08-12 Thread David Cuenca

@Bene*: The WE-Framework already has an interface for dealing with
statement ranks, have you tried the latest version?
https://ru.wikipedia.org/wiki/%D0%92%D0%B8%D0%BA%D0%B8%D0%BF%D0%B5%D0%B4%D0%B8%D1%8F:WE-Framework

As for mapping Wikipedia templates to wikidata properties I think James F.
mentioned something about including that into templatedata
https://en.wikipedia.org/wiki/Wikipedia:VisualEditor/TemplateData

Cheers,
Micru

On Tue, Aug 12, 2014 at 11:19 PM, Bene* benestar.wikime...@gmail.com
wrote:

Hello,

My Dream scenario is that the VE understands that the data is pulled from
Wikidata and shows a dialog that is similar to the current template
parameters. I see the old mayor's name in that dialog, I write the new
mayor's name, and the new value is stored in Wikidata. Of course, it must
be taken into account that the name is likely not just a string, but a
label of the Wikidata item.

The problem I see here is that we mostly do not want to replace a value
but rather add another preferred one and state that the old one is now
deprecated. This means, we also want to have the old major on the Wikidata
item but the new major should be marked as preferred. I think we have to
think a lot how this can be implemented best in Visual Editor. Maybe we
should just add the value entered there just as another preferred value but
this is rather a guess because the data might also be simply wrong and
should be replaced. We thus have to investigate how often the value should
be replaced and how often the old value should be kept as well. Then we can
decide a way how the visual editor should use this data without being to
complicated but also offering all options to create the data as intended.

My acceptable-but-suboptimal scenario is taking the user to
https://www.wikidata.org/wiki/Q6850543 . This is probably a useful
workflow for the tech-savvy editors, but it's suboptimal for a casual
editor. I'll go as far as saying that for a casual editor it may be
(relatively) more comfortable to edit parameters in a MediaWiki template
(|mayor =[[Milan Ftáčnik]]) than to go to
https://www.wikidata.org/wiki/Q6850543 and find the value.

We might take the user to the item's page on Wikidata if he wants to make
more enhanced edits like replacing an old edit.

Does anybody have any more ideas about it?

For the technical implementation, we have to find a way which data comes
from which item and which property. Maybe we can add some data-item and
data-property attributes to the HTML tag in which the data is stored so
that the visual editor can easy find these transclusions. However, this
might also be a security issue if the attributes are added to elements not
containing Wikidata data.

All in all, you are surely not late and this discussion has just started
at Wikimania (although it is planned much longer to have something like
that). Therefore, I think this feature hasn't hightes priority at the
moment and needs lots of cooperation with the Visual Editor team. Perhaps
we will start developing not before 2016. (Only my personal guess. :-)

Best regards,
Bene

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

--
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] Great editor for Wikipedia infoboxes/templates connected to Wikidata

2014-08-09 Thread David Cuenca

User:Vlsergey from ruwiki presented yesterday at the Wikidata Meetup a new
wonderful infobox editor for Wikipedia infoboxes:
https://www.wikidata.org/wiki/Wikidata:Project_chat#Edit_directly_from_infocard_.2F_infobox

And also an improved authority template which you can see in action at Obama
https://ru.wikipedia.org/wiki/%D0%9E%D0%B1%D0%B0%D0%BC%D0%B0,_%D0%91%D0%B0%D1%80%D0%B0%D0%BA#.D0.A1.D1.81.D1.8B.D0.BB.D0.BA.D0.B8

Plus some interfaces for editing person info, taxons, and work/edition
source info:
https://www.wikidata.org/wiki/Wikidata:Project_chat/Archive/2014/07#WEF_gadgets_update

I think these are great improvements for editing wikidata from wikipedia
and I hope you can spread the word in your local wikis about these
wonderful tools.

Thanks!
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] New initiative to promote friendliness

2014-07-29 Thread David Cuenca

:D

https://www.wikidata.org/wiki/Wikidata:Lounge/Jokes!

I tried my best! :D

*backs up slowly*



On Tue, Jul 29, 2014 at 11:13 AM, Maximilian Klein isa...@gmail.com wrote:

 Thanks Mircu, this is a great idea with the right intentions.

 Max Klein
 ‽ http://notconfusing.com/


 On Fri, Jul 25, 2014 at 9:46 PM, David Cuenca dacu...@gmail.com wrote:

 Exactly! That's the spirit, it is much better to cement openess and
 tolerance while they are there :)
 And just by setting up that page we'll have a place to chill out :)

 Thanks
 Micru


 On Fri, Jul 25, 2014 at 5:12 PM, Markus Krötzsch 
 mar...@semantic-mediawiki.org wrote:

 On 25/07/14 15:28, David Cuenca wrote:

 Worried about the harshness that lately has been developing, I have
 started a new initiative to counter that by promoting more dialogue,
 civility, and friendliness
 https://www.wikidata.org/wiki/Wikidata:Shelter

 When you are with friends you don't need to put any rule or condition,
 because they are people you like to be with. If each one does a little
 effort to become someone who everybody likes to be with, then all
 harshness will disappear and Wikidata will become even more wonderful
 \o/
 And even if we don't reach utopia tomorrow, at least we'll be on the
 right direction :)

 I would like to invite everyone to participate in this initiative and to
 come up with new ways to reduce conflict before it happens.


 Very nice. Thanks for taking the initiative. In general, I already find
 Wikidata a great, friendly community to work with, but it is better to
 cement this now to be sure it stays this way :-) After all, friendliness
 can always be improved in online communication (already by remembering that
 some people are more sensitive than others to the tone of an email ;-). I
 think we are already doing very well on encouragement and openness, but
 again it's important to keep this up.

 Best wishes,

 Markus


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




 --
 Etiamsi omnes, ego non

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] New initiative to promote friendliness

2014-07-25 Thread David Cuenca

Exactly! That's the spirit, it is much better to cement openess and
tolerance while they are there :)
And just by setting up that page we'll have a place to chill out :)

Thanks
Micru


On Fri, Jul 25, 2014 at 5:12 PM, Markus Krötzsch 
mar...@semantic-mediawiki.org wrote:

 On 25/07/14 15:28, David Cuenca wrote:

 Worried about the harshness that lately has been developing, I have
 started a new initiative to counter that by promoting more dialogue,
 civility, and friendliness
 https://www.wikidata.org/wiki/Wikidata:Shelter

 When you are with friends you don't need to put any rule or condition,
 because they are people you like to be with. If each one does a little
 effort to become someone who everybody likes to be with, then all
 harshness will disappear and Wikidata will become even more wonderful \o/
 And even if we don't reach utopia tomorrow, at least we'll be on the
 right direction :)

 I would like to invite everyone to participate in this initiative and to
 come up with new ways to reduce conflict before it happens.


 Very nice. Thanks for taking the initiative. In general, I already find
 Wikidata a great, friendly community to work with, but it is better to
 cement this now to be sure it stays this way :-) After all, friendliness
 can always be improved in online communication (already by remembering that
 some people are more sensitive than others to the tone of an email ;-). I
 think we are already doing very well on encouragement and openness, but
 again it's important to keep this up.

 Best wishes,

 Markus


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Wikidata just got 10 times easier to use

2014-07-01 Thread David Cuenca

I noticed that sometimes it is a bit sluggish when showing the suggestions,
maybe it is my network, no idea. And it would be nice if it could suggest
the three basic properties (instance of, sublcass of, part of) when the
item is empty and suggest further properties based on that initial value,

Other than that, good job!



On Tue, Jul 1, 2014 at 10:00 PM, Lydia Pintscher 
lydia.pintsc...@wikimedia.de wrote:

 On Tue, Jul 1, 2014 at 9:52 PM, Derric Atzrott
 datzr...@alizeepathology.com wrote:
  We still need to tweak it a bit here and there, yeah. We're working on
  that right now. Also it will get smarter as more statements are added
  to items.
 
  Even with some somewhat off suggestions this will be a wonderful tool.

 \o/
 We've just changed the threshold a bit so it should give less but more
 fitting suggestions. We'll play around with that setting a bit more
 over the next days to find the one that's right for us.

  Thank you to everyone who worked on making this happen!  This really
  is going to make Wikidata so much easier to use.
 
  Is there any documentation on how it chooses which entities to
  suggest?

 It basically creates a table of correlations for properties over all
 items in Wikidata. So if say date of birth and place of birth are used
 together a lot they get a high correlation. When you then have an item
 with no place of birth but a date of birth it will suggest that
 because of the high correlation.


 Cheers
 Lydia

 --
 Lydia Pintscher - http://about.me/lydia.pintscher
 Product Manager for Wikidata

 Wikimedia Deutschland e.V.
 Tempelhofer Ufer 23-24
 10963 Berlin
 www.wikimedia.de

 Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

 Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
 unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
 Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Wikidata just got 10 times easier to use

2014-07-01 Thread David Cuenca

Markus, could your algorithm work together with human direction? Like, if
we entered which properties are common for a class, and then a user creates
an instance of that class, would the algorithm be able to sort those
properties based on how often they appear on the database?

Thanks,
Micru


On Tue, Jul 1, 2014 at 10:23 PM, Markus Krötzsch 
mar...@semantic-mediawiki.org wrote:

 On 01/07/14 22:14, Markus Krötzsch wrote:
 ...


 (2) Grade I listed building
 http://tools.wmflabs.org/wikidata-exports/miga/?classes#_cat=Classes/Id=
 Q15700818


 Related properties: English Heritage list number, masts, Minor Planet
 Center observatory code, home port, coordinate location, OS grid
 reference, mother house, architect, manager/director, Emporis ID,
 MusicBrainz place ID, country, architectural style, visitors per year,
 Commons category, Structurae ID (structure), officially opened by,
 floors above ground, inspired by, religious order, number of platforms,
 street, owned by, diocese

 These are computed fully automatically from the data, with no manual
 filtering or user input. But don't get me wrong -- great work! Brilliant
 to have such a thing integrated into the UI. In any case, my algorithm
 for computing the related properties is certainly very different from
 theirs; I am sure it also has its glitches.


 P.S. One weakness of my algorithm you can already see: it has troubles
 estimating the relevance of very rare properties, such as Minor Planet
 Center observatory code above. A single wrong annotation may then lead to
 wrong suggestions. Also, it seems from my list under (2) that some Grade I
 listed buildings are ships. This seems to be an error that is amplified by
 the fact that property masts is used only 11 times in the dataset I
 evaluated (last week's data). I guess the new property suggester rather
 errs on the other side, being tricked into suggesting very frequent
 properties even in places that don't need them.

 -- Markus



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] Monolingual text and old languages

2014-06-29 Thread David Cuenca

Hi,

Will the monolngual text datatype support old languages? There are
settlements that had an official name in a now dead language. Besides of
no language, will there be an option to mark such names as custom and
use the qualifier language (p407) instead?

Thanks,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Monolingual text and old languages

2014-06-29 Thread David Cuenca

On Sun, Jun 29, 2014 at 3:01 PM, Daniel Kinzler daniel.kinz...@wikimedia.de
 wrote:

 This is correct. However, there will be no custom option: the set of
 languages
  supported by wikidata is a well-defined list, controlled by the
 configuration of
 the site. It's basically the list of languages MediaWiki supports for the
 user
 interface, plus a few special ones. Adding a language can be done based on
 community consensus.


What about constructed or invented languages? A list will work for most
cases but there always will be edge cases and adding them to the list will
be too much.

*Ph'nglui mglw'nafh Cthulhu R'lyeh wgah'nagl fhtagn.*

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Monolingual text and old languages

2014-06-29 Thread David Cuenca

For instance in Template:Infobox fictional country there is a field for
fictional mottos, like the one for Syldavia, the fictional country that
appears in The Adventures of Tintin:

Syldavia (Q1751922) motto Eih bennek, eih blavek.
   language Syldavian (Q2708019)



On Sun, Jun 29, 2014 at 6:33 PM, Lydia Pintscher 
lydia.pintsc...@wikimedia.de wrote:

 Hey :)

 On Sun, Jun 29, 2014 at 3:08 PM, David Cuenca dacu...@gmail.com wrote:
  What about constructed or invented languages? A list will work for most
  cases but there always will be edge cases and adding them to the list
 will
  be too much.
 
  Ph'nglui mglw'nafh Cthulhu R'lyeh wgah'nagl fhtagn.

 Which languages are you worried about specifically? What kind of data
 would you want to record for them? (I'm trying to get a feeling for
 how large the problem is and which parts of it are already solved.)


 Cheers
 Lydia

 --
 Lydia Pintscher - http://about.me/lydia.pintscher
 Product Manager for Wikidata

 Wikimedia Deutschland e.V.
 Tempelhofer Ufer 23-24
 10963 Berlin
 www.wikimedia.de

 Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

 Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
 unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
 Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] first draft for new user interface design

2014-06-17 Thread David Cuenca

On Tue, Jun 17, 2014 at 10:50 AM, Lydia Pintscher 
lydia.pintsc...@wikimedia.de wrote:

 I really don't want us to add this additional complexity at this
 point. We're trying really hard to make editing friendlier and easier
 to understand. I think this would go in the opposite direction.


Not necessarily. It is just the same concept as statements, just in a
different section. I have left a mockup on the ui page.



 Let's move the discussion to
 https://www.wikidata.org/wiki/Wikidata:UI_redesign_input please.


Ok! :-)

Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] first draft for new user interface design

2014-06-16 Thread David Cuenca

Considering how hard is to please everyone I think it is a good start. The
color scheme is much better than the omg-this-is-so-depressing grey that
we have right now. And the picture plus the boxes look like they have the
right size. I'm not so sure about the statements size, but if they can be
shown collapsed, ok.

As Thomas I also miss a collection of all references in a box on the right
that could be drag and dropped on any item.
And some sorting options, like alphabetical, by property number, or some
other basic property sorting (metadata properties on top).

The only thing I'm not convinced about is the edition of label/alias
statements. Labels/aliases should be edited as statements so we can have
more variety and add qualifiers and references to them.

Micru


On Mon, Jun 16, 2014 at 10:37 PM, Thomas Douillard 
thomas.douill...@gmail.com wrote:

 Interesting. My first feeling is for references cloning : I had in mind
 some kind of boxes like the one on the left collumn to collect statements
 references, that could be drag and dropped or copy paste to statements to
 add reference to them.




 2014-06-16 22:01 GMT+02:00 Lydia Pintscher lydia.pintsc...@wikimedia.de:

 Hey folks :)

 I just uploaded the first drafts for the new user interface design:
 https://www.wikidata.org/wiki/Wikidata:UI_redesign_input   It's not
 finished yet but I wanted to hear what you're thinking about the
 direction we're going. Please leave any feedback you have on that page
 so we can keep it all in one place.


 Cheers
 Lydia

 --
 Lydia Pintscher - http://about.me/lydia.pintscher
 Product Manager for Wikidata

 Wikimedia Deutschland e.V.
 Tempelhofer Ufer 23-24
 10963 Berlin
 www.wikimedia.de

 Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

 Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
 unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
 Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] RFC about part of

2014-06-14 Thread David Cuenca

This RFC was pending after several discussions. How to clarify the property
part of?
https://www.wikidata.org/wiki/Wikidata:Requests_for_comment/Refining_%22part_of%22

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Wikidata query feature: status and plans

2014-06-11 Thread David Cuenca

Gerard,
Normally users don't want information about human settlements, but about
municipalities, which may contain several human settlements. Ideally we
should have both and let the user decide what to look for.

Thanks,
Micru


On Tue, Jun 10, 2014 at 11:08 PM, Gerard Meijssen gerard.meijs...@gmail.com
 wrote:

 Hoi,
 As far as I am concerned, it is relevant to compare settlements in
 whatever country they are. A British city is always located in the United
 Kingdom and even more precise it is in the administrative unit of a
 county or whatever. When it is a city for historical reasons, this can be
 indicated with a qualifier.

 In this way it is is a settlement and the rest can be deduced. Having
 specific types of settlements for countries is imho not necessary in this
 way.
 Thanks,
  GerardM


 On 10 June 2014 22:14, David Cuenca dacu...@gmail.com wrote:

 Hi Gerard,

 I think we should not aim for a perfect system, just for a better
 one. In our case we don't need to reproduce all cases, just identify the
 most relevant ones and to clarify when to use each and label/describe them
 clearly.

 Part of is understood, but in so many possible ways that its meaning
 gets diluted into uselessness.

 Thanks,
 Micru



 On Tue, Jun 10, 2014 at 9:50 PM, Gerard Meijssen 
 gerard.meijs...@gmail.com wrote:

 Hoi,

 I fear that when words like mereology are expected to be understood, we
 will fall into the trap where our communities fear what we have been
 sniffing. It will just alienate them.

 Part of is something that is understood. There may be academic reasons
 that make sense to the people who care about them. The question I think we
 should take serious is if that is really where we want to go.
 Thanks,
  GerardM


 On 10 June 2014 20:21, David Cuenca dacu...@gmail.com wrote:

 I think we should drop part of and start using a better mereological
 system
 https://en.wikipedia.org/wiki/Mereology#Various_systems
 http://plato.stanford.edu/entries/mereology/image1.png

 Cheers,
 Micru


 On Tue, Jun 10, 2014 at 8:05 PM, Joe Filceolaire filceola...@gmail.com
  wrote:

 Even where there is complete agreement that a human settlement is a
 'city' there is still usually a question over the population of that city.
 The question is down to what to include.

 A city in many cases is understood to include the contiguous built up
 area but this will often extend far beyond the original administrative
 region that bears the name. So we have the City of London (the central
 business district, corresponding to the medieval and Roman city), Greater
 London (The collection of contiguous urban boroughs that area part of the
 Greater London administrative entity - ironically this does not include 
 the
 City of London but does include the City of Westminster), all the 
 built
 up areas out to the Metropolitan green belt (includes bits of every
 county adjacent to Greater London), or all areas within commuting distance
 of Central London (with the train services this includes a lot of area and
 it is getting bigger as faster trains are deployed).

 When do two cities become one? London and Westminster? Buda and Pest?
 Minneapolis and St Paul? Dallas and Fort Worth? Kansas MI and Kansas KA?
 Dusseldorf, Essen and Dortmund? Detroit and Windsor?

 Joe


 On Tue, Jun 10, 2014 at 12:50 PM, Andrew Gray 
 andrew.g...@dunelm.org.uk wrote:

 On 10 June 2014 09:20, Markus Krötzsch mar...@semantic-mediawiki.org
 wrote:

  The class city is used for relatively large and permanent human
  settlement[s] [1], which does not say much (because the vagueness
 of
  relatively). Maybe we should even wonder if city is a good
 class to use
  in Wikidata. Saying that something has been awarded city status in
 the UK
  (Q1867820) has a clear meaning. Saying that something is a human
  settlement is also rather clear. But drawing the line between
 village,
  city and town is quite tricky, and will probably never be done
 uniformly
  across the data.
 
  Conclusion: if you are looking for, say, human settlements with
 more than
  100k inhabitants, then you should be searching for just that (which
 I think
  is basically what you also are saying below :-).

 OSM has had a lot of problems with this as well, I think - labelling
 something as a city is one of those very slippery terms that
 everyone thinks is obvious but never quite agrees on what the obvious
 bit is :-)

 I wonder if we should think about how best to make sure people know
 this. Perhaps there is a role for the human-readable pages to have
 disambiguation-type notes on them? If you are aiming to do a search
 based on instances of 'city', we recommend you try instances of
 'human settlement' instead...

 --
 - Andrew Gray
   andrew.g...@dunelm.org.uk

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing

Re: [Wikidata-l] Wikidata query feature: status and plans

2014-06-10 Thread David Cuenca

I think we should drop part of and start using a better mereological
system
https://en.wikipedia.org/wiki/Mereology#Various_systems
http://plato.stanford.edu/entries/mereology/image1.png

Cheers,
Micru


On Tue, Jun 10, 2014 at 8:05 PM, Joe Filceolaire filceola...@gmail.com
wrote:

 Even where there is complete agreement that a human settlement is a 'city'
 there is still usually a question over the population of that city. The
 question is down to what to include.

 A city in many cases is understood to include the contiguous built up area
 but this will often extend far beyond the original administrative region
 that bears the name. So we have the City of London (the central business
 district, corresponding to the medieval and Roman city), Greater London
 (The collection of contiguous urban boroughs that area part of the Greater
 London administrative entity - ironically this does not include the City
 of London but does include the City of Westminster), all the built up
 areas out to the Metropolitan green belt (includes bits of every county
 adjacent to Greater London), or all areas within commuting distance of
 Central London (with the train services this includes a lot of area and it
 is getting bigger as faster trains are deployed).

 When do two cities become one? London and Westminster? Buda and Pest?
 Minneapolis and St Paul? Dallas and Fort Worth? Kansas MI and Kansas KA?
 Dusseldorf, Essen and Dortmund? Detroit and Windsor?

 Joe


 On Tue, Jun 10, 2014 at 12:50 PM, Andrew Gray andrew.g...@dunelm.org.uk
 wrote:

 On 10 June 2014 09:20, Markus Krötzsch mar...@semantic-mediawiki.org
 wrote:

  The class city is used for relatively large and permanent human
  settlement[s] [1], which does not say much (because the vagueness of
  relatively). Maybe we should even wonder if city is a good class to
 use
  in Wikidata. Saying that something has been awarded city status in the
 UK
  (Q1867820) has a clear meaning. Saying that something is a human
  settlement is also rather clear. But drawing the line between
 village,
  city and town is quite tricky, and will probably never be done
 uniformly
  across the data.
 
  Conclusion: if you are looking for, say, human settlements with more
 than
  100k inhabitants, then you should be searching for just that (which I
 think
  is basically what you also are saying below :-).

 OSM has had a lot of problems with this as well, I think - labelling
 something as a city is one of those very slippery terms that
 everyone thinks is obvious but never quite agrees on what the obvious
 bit is :-)

 I wonder if we should think about how best to make sure people know
 this. Perhaps there is a role for the human-readable pages to have
 disambiguation-type notes on them? If you are aiming to do a search
 based on instances of 'city', we recommend you try instances of
 'human settlement' instead...

 --
 - Andrew Gray
   andrew.g...@dunelm.org.uk

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Wikidata query feature: status and plans

2014-06-10 Thread David Cuenca

Hi Gerard,

I think we should not aim for a perfect system, just for a better one.
In our case we don't need to reproduce all cases, just identify the most
relevant ones and to clarify when to use each and label/describe them
clearly.

Part of is understood, but in so many possible ways that its meaning gets
diluted into uselessness.

Thanks,
Micru



On Tue, Jun 10, 2014 at 9:50 PM, Gerard Meijssen gerard.meijs...@gmail.com
wrote:

 Hoi,

 I fear that when words like mereology are expected to be understood, we
 will fall into the trap where our communities fear what we have been
 sniffing. It will just alienate them.

 Part of is something that is understood. There may be academic reasons
 that make sense to the people who care about them. The question I think we
 should take serious is if that is really where we want to go.
 Thanks,
  GerardM


 On 10 June 2014 20:21, David Cuenca dacu...@gmail.com wrote:

 I think we should drop part of and start using a better mereological
 system
 https://en.wikipedia.org/wiki/Mereology#Various_systems
 http://plato.stanford.edu/entries/mereology/image1.png

 Cheers,
 Micru


 On Tue, Jun 10, 2014 at 8:05 PM, Joe Filceolaire filceola...@gmail.com
 wrote:

 Even where there is complete agreement that a human settlement is a
 'city' there is still usually a question over the population of that city.
 The question is down to what to include.

 A city in many cases is understood to include the contiguous built up
 area but this will often extend far beyond the original administrative
 region that bears the name. So we have the City of London (the central
 business district, corresponding to the medieval and Roman city), Greater
 London (The collection of contiguous urban boroughs that area part of the
 Greater London administrative entity - ironically this does not include the
 City of London but does include the City of Westminster), all the built
 up areas out to the Metropolitan green belt (includes bits of every
 county adjacent to Greater London), or all areas within commuting distance
 of Central London (with the train services this includes a lot of area and
 it is getting bigger as faster trains are deployed).

 When do two cities become one? London and Westminster? Buda and Pest?
 Minneapolis and St Paul? Dallas and Fort Worth? Kansas MI and Kansas KA?
 Dusseldorf, Essen and Dortmund? Detroit and Windsor?

 Joe


 On Tue, Jun 10, 2014 at 12:50 PM, Andrew Gray andrew.g...@dunelm.org.uk
  wrote:

 On 10 June 2014 09:20, Markus Krötzsch mar...@semantic-mediawiki.org
 wrote:

  The class city is used for relatively large and permanent human
  settlement[s] [1], which does not say much (because the vagueness of
  relatively). Maybe we should even wonder if city is a good class
 to use
  in Wikidata. Saying that something has been awarded city status in
 the UK
  (Q1867820) has a clear meaning. Saying that something is a human
  settlement is also rather clear. But drawing the line between
 village,
  city and town is quite tricky, and will probably never be done
 uniformly
  across the data.
 
  Conclusion: if you are looking for, say, human settlements with more
 than
  100k inhabitants, then you should be searching for just that (which I
 think
  is basically what you also are saying below :-).

 OSM has had a lot of problems with this as well, I think - labelling
 something as a city is one of those very slippery terms that
 everyone thinks is obvious but never quite agrees on what the obvious
 bit is :-)

 I wonder if we should think about how best to make sure people know
 this. Perhaps there is a role for the human-readable pages to have
 disambiguation-type notes on them? If you are aiming to do a search
 based on instances of 'city', we recommend you try instances of
 'human settlement' instead...

 --
 - Andrew Gray
   andrew.g...@dunelm.org.uk

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




 --
 Etiamsi omnes, ego non

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] What is the point of labels?

2014-06-06 Thread David Cuenca

Other possibilities (not necessarily all together):
- new datatype label, with the value to be indexed as an alias
- use qualifiers on label statements to add lexical data or to link with
declination tables
- consider labels internally as independent items, but don't display them
as such


On Thu, Jun 5, 2014 at 4:28 PM, David Cuenca dacu...@gmail.com wrote:

 When I drafted the functional structure that is appearing on items [1],
 Gerard pointed out that it is drifting into the lexical area. That made me
 think that while useful to have lexical data as an independent item as we
 discussed last year for Wiktionary, the current structure q item label
 string doesn't seem to be compatible with that wish, or at least it would
 be more difficult to maintain the same label twice. And it is not just one
 label per item, there are many, and each one might have different lexical
 properties.

 For more efficiency, it seems that we would need statements like q item
 label lexical item to reflect that separation, but that adds further
 complexity, because according to the latest Wikidata:Wiktionary proposal
 [2], the lexical item (W) also contains senses/meanings (S). This is
 recurrent, as we already have Q items as the basis for meaning... or at
 least a concept that is more or less shared among languages. The only
 difference between Q items and the proposed S items is that S items
 represent only one of the lexeme meanings for one particular language, but
 other than that they have the same nature as Q items (it should be possible
 to add subclass of and other statements to them).

 Labels, aliases, and name properties are just normal statements where one
 of them is preferred, I have been wondering why don't we treat them as
 such... That way we could have some coherence, and have both Q items and
 S items as the units of meaning/sense and later on move the labels
 (lexemes), which now are strings, to the lexical items (W items in the
 example on the page Wikidata:Wiktionary).

 Summing up, labels in their current form make complete sense now, but when
 considered together with lexical information, it seems that it would be
 convenient to treat all of them as statements that later on could link with
 W items. And as Joe pointed out, there are many more properties that are
 equivalent to a label, just more specific, and that now don't show up in
 the suggester, nor up above of the page where they should.

 I know that Wiktionary is still in the future and that there are many
 other priorities on the way, however since the representation of the items
 is being re-considered, I think it is a good moment to think about how to
 move little by little in the right direction. I also would like to point
 out that by keeping lexical information in wikidata, its complexity is
 going to increase inevitably. If new users already struggling to understand
 it now, I cannot imagine how will they cope with added elements...

 Micru

 [1] http://lists.wikimedia.org/pipermail/wikidata-l/2014-June/003941.html
 [2] https://www.wikidata.org/wiki/Wikidata:Wiktionary




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] What is the point of labels?

2014-06-06 Thread David Cuenca

On Fri, Jun 6, 2014 at 3:08 PM, Daniel Kinzler daniel.kinz...@wikimedia.de
wrote:

 SKOS is a good fit for Wikidata data items. For modeling Wiktionary, LEMON
 fits
  a lot better http://lemon-model.net/.


Could you please elaborate on how to share the label between q items and
the future lexical items? It is not very clear to me what you have in mind.

OTOH, is there any possibility to have some property values indexed as
aliases? (like the ones Joe mentioned [1]: birth name, pseudonym)

Thanks
Micru

[1] http://lists.wikimedia.org/pipermail/wikidata-l/2014-June/003944.html
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] What is the point of labels?

2014-06-06 Thread David Cuenca

On Fri, Jun 6, 2014 at 5:05 PM, Daniel Kinzler daniel.kinz...@wikimedia.de
wrote:

 The software shouldn't know about special properties. A bot could know
 about
 them, and automatically add aliases.


Labels and aliases are acting as special properties, just that we cannot
decide which property to use instead of the default.
Now we have to enter twice the property, for instance title and again as
label title, same for names, same for official names of towns, same for
everything... and even more so when the monolingual datatype is available.
Yes, a bot could do that, but perhaps we should ask ourselves if to have
all information twice makes sense?

Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] What is the point of labels?

2014-06-06 Thread David Cuenca

On Fri, Jun 6, 2014 at 7:08 PM, Lydia Pintscher 
lydia.pintsc...@wikimedia.de wrote:

 It's really too early to make all these decisions. We're still at
 least a year away from any significant progress towards Wiktionary
 unless anything changes. Until then we will have learned a lot more
 and a lot of things will have changed. Let's focus on the next big
 things and not lose focus: queries and Commons support :)


Ah, don't worry Daniel's answer regarding future W item connection was
enough concerning Wiktionary. I also want to see queries and Commons
support soon :-)

My main motivation to start this discussion was linked with the planned UI
revamp.
That is what the user sees and it conveys meaning, and not only that, users
learn it and get attached to it. If there is any change to be made to
labels/aliases it would be better to do it before, and to plan the mockups
with those changes in mind.

I remember that you mentioned that external identifier statements should
appear in another area of the screen? I don't know which mechanism would be
used to accomplish that, but maybe it would help if the same method is
applied to properties that represent a label?
Just giving ideas, I have no preference, actually everything would be
better than having to maintain (repeated) aliases by bot :)

Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Item metadata

2014-06-05 Thread David Cuenca

On Thu, Jun 5, 2014 at 3:15 PM, Joe Filceolaire filceola...@gmail.com
wrote:

 Note that 'historic name' is not among these. Historic name is just an
 official name (or one of the other name types) with an 'end date' qualifier.
 These names are, I believe, worth putting in statements so we can qualify
 the statements with dates, type of name, language, references etc. The
 label and aliases are merely search terms and should, I believe, include
 all of the names above plus common misspellings.


Thanks, Joe, for clearing this up. It didn't occur to me that there are so
many! And now that I think of it, there are more subclasses of label, like
title or original title.
I have been thinking about how this connects with lexical information,
perhaps it is better to start a separate thread, since the title of this
one is misleading.

Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Item metadata

2014-06-04 Thread David Cuenca

In order to clarify my previous email I have prepared this functional
taxonomy of the elements that in my opinion form an item:

Meta
+ Labels
-- Multilingual main label (no sources, no qualifiers)
-- Multilingual alternative labels (no sources, no qualifiers)
-- Multilingual historic/alternative labels (will be possible with
monolingual datatype+qualifiers)
-- External identifiers (currently treated as statements)
-- Sitelinks (how each project calls the concept)
+ Metainformation
-- Concept taxonomy (part of/ subclass of)
-- Concept life-cycle (sometimes irrelevant, other times expressed with
start/end date, no deprecation possible)
-- Concept precedence (possible to express as statements succeeded
by/preceded by, but perhaps needs disambiguation from content)
-- Concept dependence (endemic to, bound to [experience/cognition/etc],
recognition methods, etc)
-- Source for the concept (not defined, sometimes irrelevant)

Content
+ Current information
-- Statements
+ Historic information
-- Deprecated statements
+ Open data information
-- Related to outcome on RFC about open datasets
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Item metadata

2014-06-04 Thread David Cuenca

Interesting appreciation... I didn't draft it with lexical information in
mind. If we are struggling to make it different, perhaps there is no
difference?

Thanks,
Micru


On Wed, Jun 4, 2014 at 3:55 PM, Gerard Meijssen gerard.meijs...@gmail.com
wrote:

 Hoi,
 Nice. However, it is not something we can cope with at this time. It makes
 sense to first be able to include lexical items because what you are
 getting into is firmly in that area.
 Thanks,
  GerardM


 On 3 June 2014 20:30, David Cuenca dacu...@gmail.com wrote:

 Continuing with the discussion of last week about the nature of
 properties I follow with my personal crusade to foster a better
 understanding of Wikidata (which sometimes means asking difficult questions
 :)). This time I ask about items, or concepts for that matter.

 To start with I cherry-pick a very insightful question posed by Markus
 last week, that unfortunately I left unanswered:
 The main question is Did the reference say that pianos are
 instruments? but not Did the reference say pianos are instruments because
 of the definition of 'piano'? Therefore, we don't need to put this
 information in our labels.

 To my mind that is a problem that, as the chicken and the egg, can be
 settled with just a word: emergence. There is no such thing as a piano or a
 concept of a piano. But both of them, concept and object, co-evolved over
 time and now we recognize certain objects as pianos. Timeline:
 https://en.wikipedia.org/wiki/Piano#History
 There have been so many innovations upon innovations, versions, and even
 name changes, that what we call now piano is very different from what it
 was long ago. Same can be said about other concepts like country of
 citizenship, which is not a valid concept when talking about historic
 people.

 When we are creating an item we are capturing a moment of time of the
 past, according to a source in a different past. Eventually this item might
 change its label, change its meaning, or become obsolete. So when I look in
 Wikidata for:
 - a way to reflect label changes over time: yes, that will be possible
 with the mono-lingual datatype + qualifiers, creating a property label
 - a way to reflect that the concept is obsolete: perhaps it could be
 reflected with start/end date
 - a way to indicate a different item with a related meaning: it can be
 done with properties

 This information is not about the item itself, but we treat it as other
 statements.

 In my opinion these kind of statements are different (as labels, or
 descriptions), since they don't refer to the represented entity, but to the
 container that represents the entity. Like the walls of a bubble.
 I can imagine that there will be some confusion between labels that can
 accept qualifiers, other than don't, and aliases that can edited in one
 language but not in other, and all this not grouped with other statements
 that belong to the same metadata group.

 So I candidly ask: does it make sense to treat item metadata statements
 just as any other statement? Would it bring more confusion or less?

 Cheers,
 Micru

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] Item metadata

2014-06-03 Thread David Cuenca

Continuing with the discussion of last week about the nature of properties
I follow with my personal crusade to foster a better understanding of
Wikidata (which sometimes means asking difficult questions :)). This time I
ask about items, or concepts for that matter.

To start with I cherry-pick a very insightful question posed by Markus last
week, that unfortunately I left unanswered:
The main question is Did the reference say that pianos are instruments?
but not Did the reference say pianos are instruments because of the
definition of 'piano'? Therefore, we don't need to put this information in
our labels.

To my mind that is a problem that, as the chicken and the egg, can be
settled with just a word: emergence. There is no such thing as a piano or a
concept of a piano. But both of them, concept and object, co-evolved over
time and now we recognize certain objects as pianos. Timeline:
https://en.wikipedia.org/wiki/Piano#History
There have been so many innovations upon innovations, versions, and even
name changes, that what we call now piano is very different from what it
was long ago. Same can be said about other concepts like country of
citizenship, which is not a valid concept when talking about historic
people.

When we are creating an item we are capturing a moment of time of the past,
according to a source in a different past. Eventually this item might
change its label, change its meaning, or become obsolete. So when I look in
Wikidata for:
- a way to reflect label changes over time: yes, that will be possible with
the mono-lingual datatype + qualifiers, creating a property label
- a way to reflect that the concept is obsolete: perhaps it could be
reflected with start/end date
- a way to indicate a different item with a related meaning: it can be done
with properties

This information is not about the item itself, but we treat it as other
statements.

In my opinion these kind of statements are different (as labels, or
descriptions), since they don't refer to the represented entity, but to the
container that represents the entity. Like the walls of a bubble.
I can imagine that there will be some confusion between labels that can
accept qualifiers, other than don't, and aliases that can edited in one
language but not in other, and all this not grouped with other statements
that belong to the same metadata group.

So I candidly ask: does it make sense to treat item metadata statements
just as any other statement? Would it bring more confusion or less?

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] What is the point of properties?

2014-05-30 Thread David Cuenca

On Thu, May 29, 2014 at 9:04 PM, Andrew Gray andrew.g...@dunelm.org.uk
wrote:

 One other issue to bear in mind: it's *simple* to have properties as a
 separate thing. I have been following this discussion with some
 interest but... well, I don't think I'm particularly stupid, but most
 of it is completely above my head.

 Saying here are items, here are a set of properties you can define
 relating to them, here's some notes on how to use properties is going
 to get a lot more people able to contribute than if they need to start
 understanding theoretical aspects of semantic relationships...


Definitely, I cannot agree more. TBH, the original question of this thread
was already settled some messages ago.
I understand that it might result confusing that we have wandered off into
other realms, so I consider that it is better to consider this thread
closed and I will consider opening a new one with the right topic (which is
quite different as it started :-P)

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] What is the point of properties?

2014-05-30 Thread David Cuenca

And to summarize the answer of the original question to future readers. The
point of properties is:
a) to help humans to better understand Wikidata
b) to help programmers (also humans :P) build the software running it
c) to make a distinction between concepts found in the world and the
concepts that have been interiorized by the community

There might be more, but those are the main points that suggest that it is
better to keep properties and items separate even if their essence is the
same.

Thank you all for this learning experience :-)

Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] What is the point of properties?

2014-05-28 Thread David Cuenca

Since the very beginning I have kept myself busy with properties, thinking
about which ones fit, which ones are missing to better describe reality,
how integrate into the ones that we have. The thing is that the more I work
with them, the less difference I see with normal items and if soon
there will be statements allowed in property pages, the difference will
blur even more.
I can understand that from the software development point of view it might
make sense to have a clear difference. Or for the community to get a deeper
understanding of the underlying concepts represented by words.

But semantically I see no difference between:
cement (Q45190) emissivity (P1295) 0.54
and
cement (Q45190) emissivity (Q899670) 0.54

Am I missing something here? Are properties really needed or are we adding
unnecessary artificial constraints?

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] What is the point of properties?

2014-05-28 Thread David Cuenca

On Wed, May 28, 2014 at 10:37 AM, Daniel Kinzler 
daniel.kinz...@wikimedia.de wrote:

 More fundamentally, they are semantically different: an item describes a
 concept
 in the real world, while a property is a structural component used for
 such a
 description.


As I perceive it, a property is a normal item (concept) imbued with the
option to use it as predicate and allow it to use different datatypes.
There is no property that cannot be expressed as an item, even properties
that represent an identifier, they also could be said that they are a
concept in the real world.
I understand that from the software side you need to make a difference
between basic concepts (items) and concepts that can be used as
predicates (properties). From the community side we also need to
scrutinize and rinse the concepts that hide behind the words before using
them as predicates, but sometimes it is good to stop and consider what are
we really doing.


Yes, properies are simmilar to data items, and in some cases, there may be
 an
 item representing the same concept that is represented by a property
 entity.


I haven't found yet a property that couldn't be expressed as an item.


 I don't see why that is a problem, while I can see a lot of confusion
 arising from
 mixing them.


 It is not a problem now but I considered interesting to analyze what is
the substance of the distinction. If properties and concepts are separate
in the end we will be reproducing their ontological structure when
organizing them. So then it might not make sense to use subproperty of to
organize properties, but just corresponds to item.

Gerard, thanks for bringing the example of OmegaWiki, it is interesting
that two independent communities came to the same thoughts without any
contact between them :)

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] What is the point of properties?

2014-05-28 Thread David Cuenca

 still
 have items and properties share a page and somehow define which
 statements/claims refer to which concept, but this does not seem to make
 things easier for users.

 These are, for me, the two main reasons why it makes sense to keep
 properties apart from items on a technical level. Besides this, it is also
 convenient to separate the 1000-something properties from the 15-million
 something items for reasons of maintenance.

 Best regards,

 Markus



 On 28/05/14 09:25, David Cuenca wrote:

 Since the very beginning I have kept myself busy with properties,
 thinking about which ones fit, which ones are missing to better describe
 reality, how integrate into the ones that we have. The thing is that the
 more I work with them, the less difference I see with normal items
 and if soon there will be statements allowed in property pages, the
 difference will blur even more.
 I can understand that from the software development point of view it
 might make sense to have a clear difference. Or for the community to get
 a deeper understanding of the underlying concepts represented by words.

 But semantically I see no difference between:
 cement (Q45190) emissivity (P1295) 0.54
 and
 cement (Q45190) emissivity (Q899670) 0.54

 Am I missing something here? Are properties really needed or are we
 adding unnecessary artificial constraints?

 Cheers,
 Micru


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] What is the point of properties?

2014-05-28 Thread David Cuenca

Markus,

Ok, now I understand that same as wouldn't be a good name for the
confusion it would cause. However the property subject of as it is now
wouldn't be a good candidate either. Its meaning is that a certain
statement is represented by another item (that is why it is only allowed to
be used as qualifier).

Perhaps a better name would be corresponds with item and the inverse
corresponds with property. Just by having these connections, a lot of
information can be inferred from the connected item.

Consider the following example with occupation (P106), and occupation
(Q13516667):
- I cannot find any clear subproperty of for p106, but there is a clear
subclass of:human behaviour for the item
- human behaviour is part of human
- human can have a statement intrinsic property (property proposal
still under discussion) with values birthday (Q47223) and an (eventual)
date of death. It can be expanded in the future to include newly created
properties like height, weight, eye color, etc
- birthday (Q47223) corresponds with property date of birth (P569)

Out of this I reach the following conclusions:
- the taxonomy of properties is going to be weak, since there is not always
a clear subpropertyOf unless created artificially (more work)
- the standard taxonomy of items (subclass of/part of) is sufficient
to automatically reach meaningful constraints and inference (less work)
- by adding manually the constraints to the property itself we are
duplicating information which will require volunteer effort to maintain
(more work)

My recommendation is to rely mainly on the main taxonomy instead of
creating a parallel property taxonomy, and then think of ways to extract
information from the main taxonomy to convert it automatically into
constraints.
All the maintenance takes effort, so the more it can be automated, the more
efficient volunteers will be. And if we can simplify the maintenance of
properties, we will be able to simplify the creation of properties too,
specially when we face the next surge which will come with the datatype
number with units.

Cheers,
Micru



On Wed, May 28, 2014 at 2:48 PM, Markus Krötzsch 
mar...@semantic-mediawiki.org wrote:

 David,

 Regarding the question of how to classify properties and how to relate
 them to items:

 * same as (in the sense of owl:sameAs) is not the right concept here. In
 fact, it has often been discouraged to use this on the Web, since it has
 very strong implications: it means that in all uses of the one identifier,
 one could just as well use the other identifier, and that it is
 indistinguishable if something has been said about the one or the other.
 That seems too strong here, at least for most cases.

 * In the world of OWL DL, sameAs specifically refers to individuals, not
 to classes or properties. Saying P sameAs Q does not imply that P and Q
 have the same extension as properties. For the latter, OWL has the
 relationship owl:equivalentProperties. This distinction of instance level
 and schema level is similar to the distinction we have between instance
 of and subclass of.

 * Therefore, I would suggest to use a property called subproperty of as
 one way of relating properties (analogously to subclass of). It has to be
 checked if this actually occurs in Wikidata (do we have any properties that
 would be in this relation, or do we make it a modelling principle to have
 only the most specific properties in Wikidata?).

 * The relationship from properties to items could be modelled with the
 existing property subject of (P805).

 * It might be useful to also have a taxonomic classification of
 properties. For example, we already group properties into properties for
 people, organisations, etc. Such information could also be added with a
 specific property (this would be a bit more like a category system on
 property pages). On the other hand, some of this might coincide with
 constraint information that could be expressed as claims. For instance,
 person properties might be those with Type (i.e., rdfs:domain)
 constraint human. By the way, our constraint system could use some
 systematisation -- there are many overlaps in what you can do with one
 constraint or another.

 Cheers,

 Markus


 On 28/05/14 12:14, David Cuenca wrote:

 Markus,
 The explanation about the implications of renaming/deleting makes most
 sense and just that justifies already the separation in two.
 It is equally true that when we create a property, we might have
 cleaned the original concept so much that it might differ (even
 slightly) with the understood concept that the item represents. However,
 even after that process, the new concept is still an item...

 The process of imbuing a concept with permanent characteristics (adding
 a datatype) and the practical approach, also seems to recommend keeping
 items and properties separate.
 Thanks for showing me that reasoning :)

 I am still wondering about how are we going to classify properties.
 Maybe it will require a broader

Re: [Wikidata-l] What is the point of properties?

2014-05-28 Thread David Cuenca

Markus,

I share your dissatisfaction with part of because that language construct
hides many different conceptual relationships that should be cleared out, I
think we'll have some community discussion work to do in that regard. One
of the uses is: what is the relationship between a human and his behavior?
I would say that the human has been defined as having human behavior
(or the reverse). But if you have a better suggestion to express this
concept I would be really glad to hear it.

Now that you mention it, yes, I agree that only a property called
corresponds with item makes sense in this context, but not the inverse.

I would like to make a further distinction regarding constraints. The
nature of constraints is not to set arbitrary limits but to reflect
patterns that naturally appear in concepts. On that regard, I hate the word
constraint, because it means that we are placing a straitjacket on
reality, when it is the other way round, recurring patterns in the real
world make us expect that a value will fall within the bonds of our
expectations.
I think that we should seriously consider using the term expectation from
now on because we don't constrain the values per se, we expect them to
have a value, and when the value departs from the expected value, then it
sets an alarm that might reflect an error or not.

Once made that distinction, yes, you are right, considering that we are
separating properties and items, our expectations do not belong to the data
itself, they belong to the property.

However, I would like to go to bring the conversation to a deeper level.
What is that what makes the concept of addition (Q32043) to be that? What
is in physical object (Q223557) that we, sentient beings, can perceive
and agree to treat as a concept? I mention those two because one is purely
abstract, and the other one is purely physical. And I would say that
addition (Q32043) has been defined as having associativity (Q177251)
and physical object (Q223557) has been repeatedly observed to have
density (Q29539). We can argue whether the second is an expectation or
not, but the first is definitely not, someone defined an addition like
that and this information can be sourced. Even more, we could also say that
also physical object (Q223557) has been defined as having density
(Q29539), and I guess we could find sources for that statement too.

With all this I want to make the point that there are two sources of
expectations:
- from our experience seeing repetitions and patterns in the values
(male/female/etc between 10 and 50), which belong to the property
- from the agreed definition of the concept itself, which belong to the data

Cheers,
Micru

PS: this is a re-post because my previous message was bounced back for
being too long :)
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Using external vocabularies (like RDA) in WikiData ?

2014-05-28 Thread David Cuenca

Hello Jean-Baptiste and welcome!
regarding the bibliographic aspects of your question, I have to mention
that we have a page where we collect and discuss these properties
https://www.wikidata.org/wiki/Wikidata:Books_task_force

As you can see, we already use the FRBR model and we were considering
mapping external ontologies like FaBiO or others
https://www.wikidata.org/wiki/Wikidata_talk:Books_task_force#Mapping_external_ontologies

But of course we are always short on volunteers and this is not done yet,
so if you would like to help out finding equivalences with RDA or/and FaBiO
we would be most grateful. What needs to be done is to find out where are
the exact relationships and where we use a different way to express the
same concept, and if we miss any property, then propose it for creation.
That could be a previous step before the connection can be established
directly from the property page, as Daniel mentioned.

Cheers,
Micru



On Wed, May 28, 2014 at 3:05 PM, Jean-Baptiste Pressac 
jean-baptiste.pres...@univ-brest.fr wrote:

  Hello,
 I am reading the documentation of WikiData where I learned that new
 properties could be suggested for discussion. But this means adding knew
 properties to  WikiData. However, is it possible to use existing RDF
 vocabularies like the RDF implementation http://www.rdaregistry.info/of
 RDA http://www.loc.gov/aba/rda/ a cataloging norm based on the 
 FRBRhttp://www.ifla.org/publications/functional-requirements-for-bibliographic-recordsconceptual
  model (Functional Requirements for Bibliographic Records) (see
 also What is FRBR? http://www.loc.gov/cds/downloads/FRBR.PDF) ?

 Unless this could be considered like librarian stuff, the FRBR conceptual
 model is an interresting way of expressing relations between any work
 (book, music, movie...) and their authors because it makes a distinction
 between the work (20.000 lieues sous les mers, the novel written by Jules
 Verne) and its manifestations (the publication of this novel by Hetzel in
 Paris in 1871). FRBR suggest two more levels expression (which I don't
 understood yet) and item (an explary of the book). This model was used by
 the Bibliothèque Nationale de France (BNF) for its web site data.bnf.fr,
 the open data portal of the BNF.

 What I mean, is that I could ask for a new property in WikiData like
 p:writerOf, but why not using rdaw:author (rdaw:
 http://rdaregistry.info/Elements/w/) or rather than having a WikiData
 property p:workTitle using rdaw:titleOfTheWork ?

 Thanks,

 --
 Jean-Baptiste Pressac
 Traitement et analyse de bases de données
 Centre de Recherche Bretonne et Celtique
 UMS 3554
 20 rue Duquesne
 CS 93837
 29238 Brest cedex 3

 tel : +33 (0)2 98 01 68 95
 fax : +33 (0)2 98 01 63 93


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] What is the point of properties?

2014-05-28 Thread David Cuenca

Markus,


On Thu, May 29, 2014 at 12:53 AM, Markus Krötzsch 
mar...@semantic-mediawiki.org wrote:

 This is an easy question once you have been clear about what human
 behaviour is. According to enwiki, it is a range of behaviours *exhibited
 by* humans.


Settled :) Let's leave it at defined as a trait of


 What would anybody do with this data? In what application could it be of
 interest?


Well, our goal it to gather the whole human knowledge, not to use it. I can
think of several applications, but let's leave that open. Never
underestimate human creativity ;-)



 Moreover, as a great Icelandic ontologist once said: There is definitely,
 definitely, definitely no logic, to human behaviour ;-)


Definitely, that is why we spend so much time in front of flickering
squares making them flicker even more. It makes total sense :P



 I think constraints are already understood in this way. The name comes
 from databases, where a constraint violation is indeed a rather hard
 error. On the other hand, ironically, constraints (as a technical term) are
 often considered to be a softer form of modelling than (onto)logical
 axioms: a constraint can be violated while a logical axiom (as the name
 suggests) is always true -- if it is not backed by the given data, new data
 will be inferred. So as a technical term, constraint is quite appropriate
 for the mechanism we have, although it may not be the best term to clarify
 the intention.


Ok, I will not fight traditional labels nor conventions. I was interested
in pointing out to the inappropriateness of using a word inside our
community with a definition that doesn't matches its use, when there is
another word that matches perfectly and conveys its meaning better to users.

Some important ideas like classification (instance of/subclass of) belong
 completely to the analytical realm. We don't observe classes, we define
 them. A planet is what we call a planet, and this can change even if the
 actual lumps in space are pretty much the same.


Agreed. Better labels could be defined as instance of/defined as
subclass of


 Now inferences are slightly different. If we know that X implies Y, then
 if A says X we can infer that (implicitly) A says Y. That is a logical
 relationship (or rule) on the level of what is claimed, rather than on the
 level of statements. Note that we still need to have a way to find out that
 X implies Y, which is a content-level claim that should have its own
 reference somewhere. We mainly use inference in this sense with subclass
 of in reasonator or when checking constraints. In this case, the
 implications are encoded as subclass-of statements (If X is a piano, then
 X is an instrument). This allows us to have references on the implications.


Nope, nope, nope. I was not referring to hard implications, but to
heuristic ones.

Consider that these properties in the item namespace:
defined as a trait of
defined as having
defined as instance of

Would translate as these constraints in the property namespace:
likely to be a trait of
likely to have
likely to be an instance of



 In general, an interesting question here is what the status of subclass
 of really is. Do we gather this information from external sources (surely
 there must be a book that tells us that pianos are instruments) or do we as
 a community define this for Wikidata (surely, the overall hierarchy we get
 is hardly the universal class hierarchy of the world but a very specific
 classification that is different from other classifications that may exist
 elsewhere)? Best not to think about it too much and to gather sources
 whenever we have them ;-)


I think it is good to think about it and to consider options to deal with
it. Like for instance:
defined as instance of corresponds with item Wikimedia community
concept
We already have items that refer to concepts that only make sense for us,
so no change in that regard.

At the moment, hard constraints (from definitions) and soft constraints
 (expectations) are simply mixed, and maybe this is fine since we handle
 them in a similar fashion (humans need to look how to fix the situation).
 Most constraints, even those that refer to definitions, are rather soft
 anyway since we apply them to statements, not to hard facts. Hard
 constraints can only occur in cases where the *encoding* of a statement in
 Wikidata is wrong (not the intended statement as such, but how it was
 translated to data).


As explained above, expectations inferred from definitions should not be
treated as hard constraints, but as soft ones.

Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Tables

2014-05-27 Thread David Cuenca

Please, leave your comments here too:
https://meta.wikimedia.org/wiki/Requests_for_comment/How_to_deal_with_open_datasets

I've been gathering comments from several people, and in the next days I
will try to summarize these suggestions to be discussed on irc.

Thanks,
Micru


On Tue, May 27, 2014 at 2:12 PM, Ilario Valdelli valde...@gmail.com wrote:

 I am not an expert of Wikidata but I work a lot in integration of
 databases with several middlewares or tools.

 I think that the best way is to ask a solution to store and to download
 data in a format compatible with datasheets like CSV.

 Excel is considered also a tool to do some basic analysis, but it can be
 connected easily to a data source (if well structured).

 Excel itself is not a good approach to store data, so it's not a good
 solution to keep the data in excel format in a database.

 Doesn't make sense to store a 2D tables in a database in my opinion
 because the data have no sense and they are not helpful to anyone.

 They can be stored like a text file, but I would not imagine the series of
 errors that can be generated importing these data again.

 Regards


 On Tue, May 27, 2014 at 1:15 PM, Jan Kučera kozuc...@gmail.com wrote:

 Hi there,

 can tables be stored within wikidata database? I mean simple 2D tables
 like excel spreadsheets...

 Jan

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




 --
 Ilario Valdelli
 Wikimedia CH
 Verein zur Förderung Freien Wissens
 Association pour l’avancement des connaissances libre
 Associazione per il sostegno alla conoscenza libera
 Switzerland - 8008 Zürich
 Wikipedia: Ilario https://meta.wikimedia.org/wiki/User:Ilario
 Facebook: Ilario Valdelli https://www.facebook.com/ivaldelli
 Twitter: Ilario Valdelli https://twitter.com/ilariovaldelli
 Linkedin: Ilario Valdellihttp://www.linkedin.com/profile/view?id=6724469
  Tel: +41764821371
 http://www.wikimedia.ch

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Subclass of/instance of

2014-05-27 Thread David Cuenca

On Thu, May 15, 2014 at 5:03 PM, Markus Krötzsch 
mar...@semantic-mediawiki.org wrote:

 I applaud your comparison of inferencing with a form of decompression. I
 think this is a nice intuition (in fact, some people have researched
 semantic compression where one tries to reduce the size of a knowledge
 base by eliminating things that follow from the rest anyway).


Markus, sorry the delay answering to this, I had to let the ideas grow for
a while.

I also like the idea of decompression, that is what makes your database of
inferred data even more useful. There is a lot of data that can be
inferred, and not just from following the relationships, but by computing.
For instance population density which can be calculated from area and
population, or aggregates of the population of each town in a district.

Another source of inferred statements are wp categories. Most of them are
very easily translatable into statements, and the other way round too. A
place where to store and  process these inferences would be most useful if
WD is not the right place.

You also say: Constraints are a great start. We should now ask how we
could improve the management of constraints in the future, and which
constraints we will have then.
The first step will be having them as statements, then having them as
queries, and finally automating their correction, either by semi-automatic
tools, or with gamification. How to automatically transform a constraint
into a game to solve the outliers it might be also an interesting topic.
And of course, more far fetched, but nevertheless relevant is how to
connect the property to a perceptual mechanism.

About improving the reliability: yes, as wikidata grows bigger some
statements become more important. There is something to be learnt about how
neural nets work, specially strengthening most-used (or traveled, or
accepted, or viewed) connections. Another process little understood now is
the need to forget, or in wikidata terms auto-deprecate information that is
no longer current. Not very relevant now, but something to keep in mind for
the next years.

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Weekyl summary #110

2014-05-25 Thread David Cuenca

John,
I appreciate that you prepare these summaries so diligently. If you
wouldn't do it I would miss so many important things.

Thank you!

Cheers,
Micru


On Sun, May 25, 2014 at 6:31 PM, John Lewis johnflewi...@gmail.com wrote:

 Hey everyone,

 Here is the latest weekly summary

 Discussions[edithttps://www.wikidata.org/w/index.php?title=Wikidata:Status_updates/2014_05_24action=editsection=1
 ]

- Discussion about Draft 
 articleshttps://www.wikidata.org/wiki/Wikidata:Project_chat#What_do_do_with_Wikipedia:Draft_articles

 Events 
 https://www.wikidata.org/wiki/Wikidata:Events/Press/Blogshttps://www.wikidata.org/wiki/Wikidata:Press_coverage
 [edithttps://www.wikidata.org/w/index.php?title=Wikidata:Status_updates/2014_05_24action=editsection=2
 ]

- Wikidata office hour 
 loghttps://meta.wikimedia.org/wiki/IRC_office_hours/Office_hours_2014-05-19
- Wikiconference USA http://wikiconferenceusa.org/ in New York on
May 30th and June 1st

 Other Noteworthy 
 Stuff[edithttps://www.wikidata.org/w/index.php?title=Wikidata:Status_updates/2014_05_24action=editsection=3
 ]

- Editors may now include their ORCIDhttps://en.wikipedia.org/wiki/ORCID 
 identifiers
(and others, such as VIAF https://en.wikipedia.org/wiki/VIAF) on
their user pages, using the{{Authority 
 controlhttps://www.wikidata.org/wiki/Template:Authority_control
}} template. You can register for an ORCID at http://orcid.org
- Play the Wikidata Game https://tools.wmflabs.org/wikidata-game

  
 Development[edithttps://www.wikidata.org/w/index.php?title=Wikidata:Status_updates/2014_05_24action=editsection=5
 ]

- jQuery 1.9 compatibility fixes
- More fixes for gadgets (authority control, preview gadget)
- Improve formatting and handling of snak formatting errors
- Continued working on monolingual text data type
- Continued making messages easier to understand
- Removed a bunch of unused code, interfaces and methods

 See current sprint 
 itemshttps://bugzilla.wikimedia.org/buglist.cgi?list_id=218716resolution=---resolution=LATERresolution=DUPLICATEemailtype1=substringemailassigned_to1=1query_format=advancedbug_status=ASSIGNEDemail1=wikidata
  for
 what we’re working on next.

 You can view the commits currently in review 
 herehttps://gerrit.wikimedia.org/r/#/q/(+project:mediawiki/extensions/Wikibase+OR+project:mediawiki/extensions/Diff+OR+project:mediawiki/extensions/DataValues+OR+project:mediawiki/extensions/WikibaseSolr+OR+project:mediawiki/extensions/Ask+OR+project:mediawiki/extensions/WikibaseQuery+OR+project:mediawiki/extensions/WikibaseDatabase+OR+project:mediawiki/extensions/WikibaseQueryEngine+OR+project:mediawiki/extensions/WikibaseDataModel+)+status:open,n,z
  and
 the ones that have been merged 
 herehttps://gerrit.wikimedia.org/r/#/q/(+project:mediawiki/extensions/Wikibase+OR+project:mediawiki/extensions/Diff+OR+project:mediawiki/extensions/DataValues+OR+project:mediawiki/extensions/WikibaseSolr+OR+project:mediawiki/extensions/Ask+OR+project:mediawiki/extensions/WikibaseQuery+OR+project:mediawiki/extensions/WikibaseDatabase+OR+project:mediawiki/extensions/WikibaseQueryEngine+OR+project:mediawiki/extensions/WikibaseDataModel+)+status:merged,n,z
 .

 You can see all open bugs related to Wikidata 
 herehttps://bugzilla.wikimedia.org/buglist.cgi?emailcc1=1list_id=151540resolution=---emailtype1=exactemailassigned_to1=1query_format=advancedemail1=wikidata-bugs%40lists.wikimedia.org
  Monthly 
 Tasks[edithttps://www.wikidata.org/w/index.php?title=Wikidata:Status_updates/2014_05_24action=editsection=6
 ]

- Fix a format or content 
 violationhttps://www.wikidata.org/wiki/Wikidata:Database_reports/Constraint_violations/P512
  for
the academic degree (P512)https://www.wikidata.org/wiki/Property:P512
 property
- Hack on one of 
 thesehttps://bugzilla.wikimedia.org/buglist.cgi?keywords=need-volunteer%2C%20keywords_type=allwordsemailcc1=1resolution=---emailtype1=exactemailassigned_to1=1query_format=advancedemail1=wikidata-bugs%40lists.wikimedia.orglist_id=162515
.
- Help develop the next summary 
 here!https://www.wikidata.org/wiki/Wikidata:Status_updates/Next
- Contribute to a Showcase item
 https://www.wikidata.org/wiki/Wikidata:Showcase


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] Request for comments: How to deal with open datasets?

2014-05-15 Thread David Cuenca

Hi,

During the Zürich Hackathon I met several people that looked for solutions
about how to integrate external open datasets into our projects (mainly
Wikipedia, Wikidata). Since Wikidata is not the right tool to manage them
(reasons explained in the RFC as discussed during the Wikidata session), I
have felt convenient to centralize the discussion about potential
requirements, needs, and how to approach this new changing landscape that
didn't exist a few years ago.

You will find more details here
https://meta.wikimedia.org/wiki/Requests_for_comment/How_to_deal_with_open_datasets

Your comments, thoughts and ideas are appreciated!

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] [Wikitech-l] [Wikimedia-l] Request for comments: How to deal with open datasets?

2014-05-15 Thread David Cuenca

On Thu, May 15, 2014 at 1:42 PM, Cristian Consonni kikkocrist...@gmail.com
 wrote:

 Thanks for the pointer, How can I put this open data on Wikidata is a
 question that I have been asked many times, this page was needed.


Thanks for your comment!

On Thu, May 15, 2014 at 3:59 PM, Samuel Klein meta...@gmail.com wrote:

 Thanks Micru!  I think we should start by including datasets on
 wikisource, with descriptions about them (storing the files on commons
 where possible).   And adding more data formats to the formats
 accepted on commons.


I don't follow you... why would you put datasets on Wikisource when they
are only used in Wikipedia and have to be stored somewhere else? As it is
now, it doesn't seem a good dataset management solution.
Besides that it would conflict with its identity as repository for textual
sources..
About Commons I don't know if it is relevant to their mission as a sharing
media platform either... I hope someone from their community can share
their views.

Thanks for the input,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] A project to look forward to

2014-05-14 Thread David Cuenca

http://www.slideshare.net/petermurrayrust/the-content-mine-presented-at-uksg
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] When the source says the information provided is dubious

2014-05-07 Thread David Cuenca

Property proposal started as:
https://www.wikidata.org/wiki/Wikidata:Property_proposal/Generic#statement_disputed_by

I guess all additional parameters (page, chapter, etc) can go in the
references section.

We will be able to say things like:
birthfollowed bybaptism
---time span until next event 1-7 days
---disputed by GerardM

What about the uncertainty qualifier? What would be a good name? statement
considered uncertain by?

Thanks,
Micru


On Tue, May 6, 2014 at 5:26 PM, Gerard Meijssen
gerard.meijs...@gmail.comwrote:

 Hoi,
 In the Netherlands it used to be that people were baptised as soon as
 possible after birth. The notion that he must have been born a few days
 earlier is not necessarily correct.
  Thanks,
  GerardM


 On 6 May 2014 17:18, Joe Filceolaire filceola...@gmail.com wrote:

 Having a property with multiple values can mean a number of things:
 * All the values are equally valid e.g. because a work has multiple
 authors
 * All values are valid but one is preferred - usually the current value
 e.g. when we have population figures back over time or all the kings of
 Denmark.
 * One of the values is shown because it is widely used but is deprecated
 because it is wrong e.g. Beethoven born on 17 December 1770 (that his date
 of baptism so he must have been born a few days earlier).

 The case described by Freidrich where we have two (or more values) which
 are both disputed (because they can't both be right) although one value is
 more widely supported then this is harder to represent semantically. I
 would go with adding a 'disputed by' qualifier to BOTH claims and marking
 the more widely accepted value as 'rank:preferred'

 But that is just me

 Joe

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] When the source says the information provided is dubious

2014-05-06 Thread David Cuenca

On Tue, May 6, 2014 at 1:37 PM, Thomas Douillard thomas.douill...@gmail.com
 wrote:

 We could create a new qualifier like ''contradicted by'' or ''disputed
 by''. The sourcs are a problem though as we can source only the totality of
 a claim, not only a qualifier of this claim, so we would have to source all
 the sources for the claim and it's disputation sources in the source
 without order..


I have mixed feelings about that... it is good because it doesn't require
any development, it isn't that good because it mixes claim and source...
And having a reference rank to indicate if the source is supporting,
against or unsure about the claim seems too much work for the number of
times that such feature would be needed

Thanks,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] When the source says the information provided is dubious

2014-05-05 Thread David Cuenca

Hi,

I'm having some cases where a work has been attributed to an author by a
source, but the source itself says this attribution is dubious, or it is
contesting a previous attributions as spurious.

As I see it, the rank of the statement is not deprecated (in fact it is
normal or even preferred), but I have no way of representing this
claim uncertainty or claim rebuttal.

Is there any hidden parameter for this or should it be addressed with a
qualifier?

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] When the source says the information provided is dubious

2014-05-05 Thread David Cuenca

Hi Jane,

No, I was not referring to books in particular, but of course it could be
applied to books as well, and to works of art, and to many things in
general.
I agree that the statement is valuable and that it should be included, but
I don't know how to represent it.

Following your examples, what I am trying to represent is not what you say,
but instead:
a) uncertainty: it is hinted that Pete was the son of Klaus, but I have no
conclusive proof
b) rebuttal: Source A says that Pete was the younger brother of Klaus, I
can disprove that (but I cannot provide an alternative)

Cheers,
Micru


On Mon, May 5, 2014 at 2:10 PM, Jane Darnell jane...@gmail.com wrote:

 David,
 I assume you are referring to books. The same is true for works of
 art. The reason why these statements are still valuable is because it
 is an attribution based on grounds determined by someone somewhere and
 based on that loose statement alone are therefore considered of
 interest. You basically make a decision to include the statement or
 not, as you see fit.

 When it comes to people, one source may say Pete was the son of
 Klaus, while another source says Pete was the younger brother of
 Klaus. I think it's just a question of picking one on Wikidata to
 keep the family aspect of the relationship (whichever it is) intact,
 and sooner or later one or the other will be chosen. It's a wiki after
 all.
 Jane



 2014-05-05 11:24 GMT+02:00, David Cuenca dacu...@gmail.com:
  Hi,
 
  I'm having some cases where a work has been attributed to an author by a
  source, but the source itself says this attribution is dubious, or it
 is
  contesting a previous attributions as spurious.
 
  As I see it, the rank of the statement is not deprecated (in fact it is
  normal or even preferred), but I have no way of representing this
  claim uncertainty or claim rebuttal.
 
  Is there any hidden parameter for this or should it be addressed with a
  qualifier?
 
  Cheers,
  Micru
 

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Bot request: 250+ thousands person data

2014-04-29 Thread David Cuenca

On Tue, Apr 29, 2014 at 12:48 PM, Daniel Kinzler 
daniel.kinz...@wikimedia.de wrote:

 If you have something like between 1846 and 1855, you can use the
 before and
 after fields of the time value:

   time: +0001850-00-00T00:00:00Z,
   precision: 9,
   before: 4,
   after: 5

 This means the main value is 1850, given as a year, with a lower bound
 four
 years before and an upper bound 5 years after the main value (before and
 after
 are given in the unit specified by the precision value). The main value
 is
 what is going to be displayed per default; it will also be used for sorting
 query results (once we have queries).


Is it possible to have just an lower bond, leaving the upper one open? I am
thinking of uses like
https://www.wikidata.org/wiki/Wikidata:Property_proposal/Generic#earliest_date

For things like circa I don't see any clear solution other than
inventing some ranges...

Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Bot request: 250+ thousands person data

2014-04-28 Thread David Cuenca

On Mon, Apr 28, 2014 at 3:10 PM, Luca Martinelli
martinellil...@gmail.comwrote:

 I recalled the fact quite correctly:
 https://it.wikipedia.org/wiki/Modulo:Bio takes dates of birth and
 death from Wikidata. I think we can talk to extend the possibility to
 gender, and later to other fields.


That's perfect, because that means that the bot can just delete the text on
import.
What is missing are the fields brth/death date after and before for
uncertain dates (DataNascitaDopo/DataNascitaPrima?). I'm curious to see
that working :)

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Bot request: 250+ thousands person data

2014-04-27 Thread David Cuenca

Maybe it is possible to identify those cases and not use wd data for them?
They must represent a very tiny percentage of the total...

Cheers,
Micru


On Sun, Apr 27, 2014 at 12:28 PM, Amir Ladsgroup ladsgr...@gmail.comwrote:

 there are some problems in using bio template for example they used it for
 a group of people

 https://it.wikipedia.org/wiki/Fratelli_Wright


 On Sun, Apr 27, 2014 at 2:51 PM, David Cuenca dacu...@gmail.com wrote:

 @Nemo, Apper: Do you think you could import that data into the wd-repo
 AND make use of it via an inclusion template? In the past there was some
 animosity against importing data without their sources, but if it is data
 that is being used and displayed on Wikipedia, then I guess it would be
 regarded differently. Anyhow, these kinds of discussions are better on-wiki.

 On Sun, Apr 27, 2014 at 11:16 AM, Christian Thiele ap...@apper.dewrote:

 The german dates have to be parsed, and I think the uncertainty for many
 persons is the main problem (born 5th of July 1716 or 15th of July 1718,
 between 1918 and 1921). But of course for a lot of persons there are
 accurate information, which could be imported to Wikidata.


 For this kinds of situation you have the before and after of the time
 datatype:
 https://www.wikidata.org/wiki/Special:ListDatatypes

 The problem is that it is not visible yet...
 https://bugzilla.wikimedia.org/show_bug.cgi?id=61909

 Cheers,
  Micru

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




 --
 Amir


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Wikibase/ Wikidata for structuring biodiversity knowledge

2014-04-24 Thread David Cuenca

Hi Daniel,

Interesting report, about WD tagging I would also mention Denny's qLabel:
http://google-opensource.blogspot.de/2014/04/qlabel-multilingual-content-without.html

Regarding cost-efficient tagging options, there are automated
tagging/entity extraction methods like DBpedia Spotlight, which can be
further improved by using them together with human supervision/custody
around an established online community.

I'm also unsure about the mark-up of semantically unaware publications,
since marking-up requires access to an editable version of the text and
that requires an open license. Semantic annotations don't seem to suffer of
that problem and there are several open source initiatives in development.
https://thepund.it/
http://www.openannotation.org/Partners.html

Cheers,
Micru



On Wed, Apr 23, 2014 at 6:23 PM, Daniel Mietchen 
daniel.mietc...@googlemail.com wrote:

 Dear all,

 lately, I have been working on a report on mark-up and related
 approaches to structuring biodiversity information:

 https://docs.google.com/document/d/133szlTaabYakEeR6JF6FFYsDJH-bLdSgDby86XPxPDk/edit#

 Wikibase and Wikidata are featuring prominently in there, and I would
 appreciate your comments.

 Thanks and cheers,

 Daniel


 --

 http://www.naturkundemuseum-berlin.de/en/institution/mitarbeiter/mietchen-daniel/
 https://en.wikipedia.org/wiki/User:Daniel_Mietchen/Publications
 http://okfn.org
 http://wikimedia.org

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] [OHM] Should we map former endonyms?

2014-03-20 Thread David Cuenca

On Thu, Mar 20, 2014 at 9:45 AM, Gerard Meijssen
gerard.meijs...@gmail.comwrote:

 A similar thing can be considered for streets and stuff as well. Obviously
 it needs a lot of thought but from an abstract point of view, a street or
 an image, it is just another category of data. When it works for one type
 of data it could / should work for another type of data as well.


In that regard perhaps it would make more sense for OSM or another entity
to run a Wikibase Repository dedicated exclusively to geographic
entities/names.

Thanks,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] [OHM] Should we map former endonyms?

2014-03-20 Thread David Cuenca

On Thu, Mar 20, 2014 at 1:31 PM, Jo winfi...@gmail.com wrote:

 I think wikidata has the potential to tie it all together. There is no
 need to split the information over 2 databases.


It depends on how much granularity you want. If you want to use just
well-known entities, then for sure, wikidata can tie it all together. If
you want street level (or even building level) information, that would be
too much for Wikidata.

There was already a similar discussion regarding bibliographic data.
Should Wikidata collect *all* bibliographic data? And the consensus was
not all, just what is relevant as hinted by Wikipedia/Wikisource (and
common sense).

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] [OHM] Should we map former endonyms?

2014-03-20 Thread David Cuenca

On Thu, Mar 20, 2014 at 3:36 PM, Gerard Meijssen
gerard.meijs...@gmail.comwrote:

 Given that we want to collaborate with openstreetmap we could host it for
 them

I like the idea of a Wikibase-powered OSM data repository, it is a pity
that the WM Incubator is only for language versions of existing projects
and not for new projects... OTOH, since Wikidata is (or is supposed to be)
language-agnostic, couldn't we argue that domain-specific data projects are
to wikidata what language editions are to wikipedia?

I wonder how hard would be to set-up a labs Wikibase instance for OSM
developers to experiment with it? Or even if it would be enough interest?

Thanks,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] [OHM] Should we map former endonyms?

2014-03-20 Thread David Cuenca

On Thu, Mar 20, 2014 at 5:17 PM, Susanna Ånäs susanna.a...@gmail.comwrote:

 An independent project will require a lot of MediaWiki related knowledge
 that is not necessarily found in an initial group of interested
 individuals. Or combined OSM, MediaWiki  Wikidata knowledge, which may be
 even more sparse. It would be more relaxed in regard to rules and
 guidelines. Could it be re-integrated to Wikidata later, or would it run to
 in-evident oblivion?


It could be re-integrated, but I wouldn't start a wikibase repo only for
the specific case of historical data. If there is a sizeable community that
could mantain a full-fledged repository of geographic entities (as
understood in Wikidata terms), then the historic information could be a
subset of that. OSM can do it (and actually it is being done more or less),
but that is something that should be decided by their community.


 An integrated path would require complying to all guidelines eg. re:
 notability. It would cause a lot of waiting time for reaching consensus
 while defining properties - which is also needed in an independent project.


I think the main intersection points are entities and properties. With
entities it is already happening (using property
p402https://www.wikidata.org/wiki/Property:P402),
but with properties we still have no technical means of saying this
property in WD is the same as this other property in project X.


 Are you going to be in the Zürich hackathon to discuss this?

 Not sure yet, but I have seen that Katie and Daniel will be there and they
have a deeper technical knowledge than me :)

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] [OHM] Should we map former endonyms?

2014-03-20 Thread David Cuenca

On Thu, Mar 20, 2014 at 6:28 PM, Jo winfi...@gmail.com wrote:

 Using property P402 is not a very good idea since object ids in OSM aren't
 guaranteed to be stable. nodes, ways and relations each have their own
 'namespace' and sometimes information is refined by moving it from a node
 to a way or from a node or a way to a relation (multipolygon), usually this
 means he original object vanishes and property P402 isn't pointing anywhere
 anymore.

 The only way that makes sense is to add wikidata tags to OSM objects.


Or that OSM would have their own repository of entities and then we would
interlink both ;)

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] [OSM-talk] [OHM] Should we map former endonyms?

2014-03-20 Thread David Cuenca

Well, that is one part of the problem, which could be addressed in Wikidata
with a property official name with the datatype mono- or multi-lingual
string 
(plannedhttps://www.wikidata.org/wiki/Wikidata:Development_plan#Multi-lingual_text_datatype_.28optional.29,
but not available yet) plus the qualifiers start/end date.
The other part of the problem is that for different periods of time you
have different entities attached to geographic locations.

For instance after the Kingdom of Great Britain
https://www.wikidata.org/wiki/Q161885

Came the United Kingdom of Great Britain and Ireland
https://www.wikidata.org/wiki/Q174193

Yes, the name changed, but it it not just a name change, it is a different
entity.

Cheers,
Micru


On Thu, Mar 20, 2014 at 6:43 PM, Andrew Gray andrew.g...@dunelm.org.ukwrote:

 I think the problem is that we sometimes need to reflect more than just
 the single official name - at the moment we include multilingual names,
 which is great, and it's a bit of a backwards step to lose that ability for
 the past. Imagine if you're looking at an English or German map of Russia -
 all the names rendered with nice Latin-script equivalents - and you say
 okay, show me a 1970s map, only for everything to become Cyrillic
 instead. :-)

 It becomes more complicated if you have cases where the name changes in
 some languages and not others, or countries with multiple official
 languages where it changes in both.

 For example, we'd want to be able to record that in English the city of
 Tsaritsyn became Stalingrad on a certain date, and then later became
 Volgograd, just as we record that in Russian it went from Царицын to
 Сталинград to Волгоград.

 However, as you can see at the moment, the other names are simple
 strings with no dates or modifiers, so we can't convey this information.
 (Switching the interface to a different language will display the
 alternative names in those languages)

 https://www.wikidata.org/wiki/Q914

 Susanna: I'm not aware of what the plans are for this, but I'm reasonably
 sure we can't do it right now. However, I'm not completely up to speed on
 how dates/modifiers etc work, so someone on wikidata-l can no doubt correct
 me :-)

 Andrew.

 On 20 March 2014 13:24, Martin Koppenhoefer dieterdre...@gmail.com
 wrote:
 
 
  Am 20/mar/2014 um 13:51 schrieb Andrew Gray andrew.g...@dunelm.org.uk
 :
 
  Properties can have modifiers such as date, labels can't. So there's a
  bit of a challenge here - we would be able to construct a field that
  says historic name : Warschau (date:1939-45), but this would be
  shown as a historic name in Polish, German, English, Chinese...
 
 
  maybe that's not a problem as this was indeed the official name in that
 time?
 
  cheers,
  Martin

 --
 - Andrew Gray
 andrew.g...@dunelm.org.uk

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] [cultural-partners] Wikidata and GLAM

2014-03-19 Thread David Cuenca

On Thu, Mar 20, 2014 at 12:53 AM, Andy Mabbett a...@pigsonthewing.org.ukwrote:

 P.S. I have just proposed an accession number property for Wikidata,
 which may also be of interest:


 https://www.wikidata.org/wiki/Wikidata:Property_proposal/Unsorted#Accession_number


I don't think it is necessary, we alredy have catalog code:
https://www.wikidata.org/wiki/Property:P528

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] Interaction of VisualEditor with Wikidata

2014-03-17 Thread David Cuenca

Hi,

I have started writing some thoughts about a possible interaction between
VE and WD.
https://www.wikidata.org/wiki/User:Micru/Interaction_of_VisualEditor_with_Wikidata

I assumed the following requirements:
- link/unlink any template field to WD from VE
- edit WD data from VE

There are so many questions open...

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] New RFC: Organizing statements, sitelinks, and external identifiers

2014-03-13 Thread David Cuenca

Hi,

I have started a Request for Comments on how to organize statements,
sitelinks, and external identifiers. Hopefully the feedback gathered can
help with the UI redesign.
https://www.wikidata.org/wiki/Wikidata:Requests_for_comment/Organizing_statements,_sitelinks,_and_external_identifiers

Thanks,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Making Wikidata entries at the time of 'Article for Creation' publication

2014-03-12 Thread David Cuenca

On Wed, Mar 12, 2014 at 2:52 PM, Andy Mabbett a...@pigsonthewing.org.ukwrote:

 I'm not sugegsting that we make people enter even more information in
 Wikipedia; I'm suggesting that wikidata would benefit from capturing
 the data that is /already/ being entered into Wikipedia, not least via
 AfC, by the people I describe above; and that I and others who review
 and publish those articles would benefit from tool to save us the
 manual task of having to retype (into Wikidata) what we're already
 asked to type once (into the AfC tool) as part of that process.


We cannot get there yet, since we depend on many features still in
development:
1.- Simple data editing from VisualEditor
2.- Easy way to map wikipedia template fields to wikidata properties
3.- Migration of main infobox templates to make use of Wikidata

There are many needed features still not done, plus some more, which take a
long time to discuss, implement, and test. Of course when all that is
available then you should be able to have an infobox selection wizard
(possibly based on this structure [1]), and then by editing the fields on
VisualEditor the data would be automatically filled on Wikidata.

As said, it sounds easy, but this has many prerequisites that are still not
met.

My only advice: patience :-)

And if you want meanwhile you can help with the infobox mappings:
https://www.wikidata.org/wiki/Wikidata:Infobox_mappings

Cheers,
Micru

[1] https://en.wikipedia.org/wiki/Wikipedia:List_of_infoboxes
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Making Wikidata entries at the time of 'Article for Creation' publication

2014-03-12 Thread David Cuenca

On Wed, Mar 12, 2014 at 6:11 PM, Andy Mabbett a...@pigsonthewing.org.ukwrote:

 What does this have to do with AfC?

 Indeed, nowhere in your post do you mention AfC, once.


Because there is no difference between creating an article through
enwiki-AfC or creating an article on any of the other 270+ language
editions of WP. The general requirements are a simple interface and a
streamlined workflow. If later you want to re-package it as AfC or
anything else it is up to you, it doesn't affect the real problematic.

On Wed, Mar 12, 2014 at 7:09 PM, Dimitris Kontokostas jimk...@gmail.comwrote:

 Actually there is a lot of work done in this regard.
 We map Wikipedia templates to the DBpedia ontology [1] and already started
 marking equivalent properties to Wikidata (see [2] [3])


Dimitris, I meant integrating the mapping info into the WP template data
itself. Anyhow, thanks for the lead, maybe you could post it on the
Wikidata:Infobox_mappings page too? It might simplify their work.

Thanks,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Making Wikidata entries at the time of 'Article for Creation' publication

2014-03-11 Thread David Cuenca

On Tue, Mar 11, 2014 at 12:36 PM, Joe Filceolaire filceola...@gmail.comwrote:

 I agree that it makes sense that one of the first things to do when
 creating a new wikipedia article should be to create an infobox which is
 automagically linked to the creation of a wikidata item. There are a number
 of considerations related to this.

 1. Notability. The current rules for Wikidata means that it is not
 acceptable to create a wikidata item until after the Wikipedia article has
 been created.


Most of the time superseded by clearly identifiable conceptual or material
entity or structural need, but nevertheless important.



 2. Drafts. English Wikipedia has recently enabled a Draft name space for
 people to use to develop new articles. Articles in the Draft namespace are
 not indexed by Google and are not required to meet notability standards
 until they are transferred to the Main namespace. Should we change the
 rules on wikidata so Draft articles can be sitelinked and have wikidata
 items?


No idea how useful drafts are... or how often they move forward or get
deleted.




 3. Visual Editor. The visual editor already has some template editing
 functionality but it does not link to a wikidata item. To get Wikipedia
 editors editing wikidata we need an infobox creation wizard which will mean
 wikipedia editors can edit wikidata from inside wikipedia and it feels like
 they are editing an info box. Personally I think the first step should be
 to enable the wikibase client on wikidata so we can start developing these
 internationalised and localised infoboxes on wikidata which can then be
 redeployed to other WMF wikis.


Totally agree. Even better would be to have a Infobox Reasonator, meaning
a standard infobox that can be used for everything (perfect for small
wikis). But for all that first we need Lua for testing the templates on WD:
https://bugzilla.wikimedia.org/show_bug.cgi?id=47071
Which needs arbitrary access:
https://bugzilla.wikimedia.org/show_bug.cgi?id=47930

And perhaps linking Wikidata pages as sitelinks, so Wikipedias can find our
custom-made infoboxes easily
https://bugzilla.wikimedia.org/show_bug.cgi?id=55570

As for WP-templates there is the need to be able to specify to which WD
property should a template field default. No idea if that has been
tracked/discussed somewhere.

Coming back to the topic of new articles, I think it is important to note
that most articles are categorized, and categories can give a hint about
the properties common to all articles belonging to the category. It should
be possible to semi-automate the process of creating wikidata items with an
initial set of statements depending on the item category. Infoboxes also
give hints about base statements.

Thanks
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] rank related changes

2014-03-10 Thread David Cuenca

Hi,

I don't know which kind of wrong choices you are talking about, maybe you
can expand on that? If you would have preferred labels to be treated as
statements with ranks, qualifiers, etc, there are already properties like
birth name (P513), short name (P743), etc. which are properties that
you can use *now*, and nothing prevents you of proposing new properties in
that direction for historical names. True that they would become more
useful with the mono-/multi-lingual datatype, but it will be there some day.

I cannot forget about what a structured Wiktionary will do in abstraction
because that is a totally flawed argument. Same could have been said about
Wikidata some time ago about the potential benefits to Wikipedia... The
benefits are only known in abstraction, and only by doing it we'll know the
answer.

Even being technically more advanced, Omegwiki didn't gather a community
for a number of reasons. Wiktionary managed to gather a community, but has
strong technical flaws. Don't you think it is worth it to learn the lessons
from each site and forget about one-sided preferences?

Thanks,
Micru


On Mon, Mar 10, 2014 at 2:27 PM, Gerard Meijssen
gerard.meijs...@gmail.comwrote:

 Hoi,
 When you look at the current use of labels, they are used to identify
 items. When a statement is used on an item, it uses a combination of a
 property and another item. What we notice is that several not so smart
 choices have been made in the past that make the current use of labels
 problematic.  I remind you of the discussion of calling an item a list when
 it is used an a single instance.

 The notion that labels on statements are not used flies in the face of it
 being applied in Reasonator for instance.

 Forget about what a structured Wiktionary will do in abstraction, it is
 not where we are. We are at a point where we have labels and no clue (in a
 Wikidata context) if and what practical benefits embedding Wiktionary will
 bring. It is really nice to point to Lemon and say that it is a standard.
 But as far as I am concerned it is a lemon [1] when there is no translation
 in how the information can be leveraged.

 I can appreciate that we have simple labels at this time. It will not stay
 that way. The alternative is that including Wiktionary in Wikidata will
 truly become a lemon [1]. I will do what I can to help prevent that, my
 ambition is that Wikidata will truly replace Omegawiki.
 Thanks,
   GerardM

 [1] A defective https://en.wiktionary.org/wiki/defective or 
 inadequatehttps://en.wiktionary.org/wiki/inadequateitem.


 On 10 March 2014 14:03, David Cuenca dacu...@gmail.com wrote:

 On Mon, Mar 10, 2014 at 1:50 PM, Gerard Meijssen 
 gerard.meijs...@gmail.com wrote:

 Hoi,
 There is little point to integrating Wiktionary and the current proposal
 if it is unclear how that information is going to be used. The proposal for
 all its fancy words misses the point completely and you point it out really
 well: it is unclear how lexemes will interact in the UI and in search.


 Lexemes UI/search is not clear, but it can be clarified when needed. I
 don't think now it is the time, maybe after getting some experience with
 queries.



 The current labels will one on one coincide with lexemes. I think we can
 agree on this.


 Not always, not for all the languages, and not necessarily. A structured
 Wiktionary should be the interface to written (and hopefully spoken) human
 language. Q-item labels are for humans to have some clue what the concept
 is about, but the textual expression is a different topic.

 Thanks,
 Micru


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Queries - can they be stored as statements in Category/List items?

2014-03-08 Thread David Cuenca

On Fri, Mar 7, 2014 at 6:37 PM, Denny Vrandečić vrande...@gmail.com wrote:

 I still disagree - let me explain why. I think that trying to express a
 query definition into a single statement is very hard. Having a specific
 Query namespace allows us to create a completely new UI for them, allows us
 to use a different data model for Queries than for items, and allows us to
 treat Query pages very different (e.g. for caching) than, e.g. item pages.


Maybe I'm wrong, but to create a completely new UI for [queries] in my
head translates as more barriers for users to understand and navigate
Wikidata.
The query doesn't need to be a single statement, but a single property for
queries (I called it same as query but it could be named differently).
You should be able to combine two statements of such property, or more, to
create a complex query. What I think it also matters is to reuse the
concepts the users are familiar with as much as possible.




 For example, the different data model would allow us to restrict the
 number of queries on a page. If they were just a statement, what would stop
 a contributor from creating several such statements on one page?


On the mockup I intended to represent that the property same as query can
have several statements, but they combine into a single query.


 What happens when someone removes the same as query statement?


Same as now happens with other statements transcluded into wikipedia pages,
when somebody notices the deletion, it gets restored.


 What happens if someone adds it to the page for USA (e.g. same as query
 instance of-country, continent-North America,
 population-300M)?


It will get corrected, it is a wiki, right? :)


 Would this page suddenly be treated differently?


Not necessarily.


 Also, you already show in your mock up that the same as query statement
 requires plenty of special code (e.g. for the different visualizations,
 etc.)


Same requirements as having queries as independent pages.



 One option would be to have them as item pages, but then treat them
 continuously different. This would mean more and more exceptions and
 special casing in the code. I think that Queries and Items are sufficiently
 different to deserve their own treatment. In my personal opinion, this is
 provides sufficient reasons for Query pages and Item pages being distinct.


 No need for different treatment, or exceptions. Just a property that acts
as a query descriptor with as many modifiers as needed.



 What would be the advantage of having Queries being expressed in the
 Items? Less entities?


Yes!


 Less confusion about what these list of- and category-items mean?


Yes!


 Both reasons I don't find sufficiently enticing to change my opinion on
 this.


And better navigation, and more simplicity to create a query, and no need
of creating artificial divisions between pages that represent the same
concept...

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Queries - can they be stored as statements in Category/List items?

2014-03-07 Thread David Cuenca

Denny, sorry for the confusion, it is a complex topic, or it could also be
that I am terribly bad at explaining :)
Based on that item page I have made a mock-up which perhaps makes things
easier:
http://i.imgur.com/1dSfrqx.png

The reasoning for this being:
1) there is a well-defined set of queries that are equivalent to
categories/lists, so there is no need to have independent query pages
2) if a wikipedia wants to include query results on a page, it is quite
probable that the query already exists as a list/category
3) and if it doesn't then it will be *very* specific to that language
wikipedia. In that case there is no need to define a query page on
wikidata, but on the wikipedia page itself as an inclusion syntax command
or another similar module

You are right that it might be a bit preliminary, as there are not even
simple queries yet, but since this kind of decisions might have an impact
on later design, I think it is worth start presenting the concepts/options
now. Besides, ideas and a common understanding take time to develop, and
the RFC was started, so I thought it was worth giving it some attention.

Cheers,
Micru

On Fri, Mar 7, 2014 at 12:18 AM, Denny Vrandečić vrande...@gmail.comwrote:

 Since I am obviously bad at guessing what you mean, can you please
 explicate what you mean with replicate that functionality on Wikidata?

 Sorry, I am too dense to understand it.

 What do you want to happen, explicitly?

 I go to http://www.wikidata.org/wiki/Q6573995 - how should it be
 different from what it displays today?

 Do you want the item pages to have the feature to directly embed query
 results, instead of having a one-click distance to the actual query page
 and its results?

 Or is there more to it?




 On Thu Mar 06 2014 at 3:10:44 PM, David Cuenca dacu...@gmail.com wrote:

 I'm not saying that the results yielded by Category:Books by Jean-Paul
 Sartre or Category:Books by J.R.R. Tolkien are or should be the same as
 the result yielded by a corresponding Wikidata query, but the concepts they
 represent, they are the same. Ditto for lists.
 (As a further clarification, I didn't mention anything about changing
 Wikipedia categories or Wikipedia lists either.)

 My question was regarding the functionality of WD items associated with
 Wikipedia categories and Wikipedia lists.
 Conceptually those items represent (or can represent) queries.
 WDQ, the tool by Magnus, already can interpret certain statements as
 queries [1].
 Would it make sense to replicate that functionality on Wikidata?

 Cheers,
 Micru

 [1] http://tools.wmflabs.org/reasonator/?q=6573995


 On Thu, Mar 6, 2014 at 11:16 PM, Denny Vrandečić vrande...@gmail.comwrote:

 But that's simply not the case. The Category:Books by Jean-Paul Sartre
 [1] or Category:Books by J.R.R. Tolkien [2} neither are a complete list of
 books by those authors (e.g. Sartre's fictional books are missing,
 Tolkien's non-fictional *and* Middle earth books are missing), nor are they
 only including books by Tolkien (e.g. they also include templates and other
 categories, which are likely not written by Sartre or Tolkien).

 If the plan is to change the way categories are used in Wikipedia and
 the other Wikimedia wikis, I'd say this is a very different goal, but
 should be discussed on those wikis.

  List-articles often contain much more love and care than a Wikidata
 query result will for a while. I don't think that replacing an article like
 the list of books by David Foster Wallace [3] the List of US Presidents [4]
 with a single simple query is a short-term goal.

 [1] https://en.wikipedia.org/wiki/Category:Books_by_Jean-Paul_Sartre
 [2] https://en.wikipedia.org/wiki/Category:Books_by_J._R._R._Tolkien
 [3] https://en.wikipedia.org/wiki/Books_by_David_Foster_Wallace
 [4] https://en.wikipedia.org/wiki/List_of_US_Presidents


 On Thu Mar 06 2014 at 1:49:54 PM, David Cuenca dacu...@gmail.com
 wrote:

 The point I wanted to make (following your example), is that the
 Wikidata Query All novels by Douglas Adams is equivalent to the item
 Category:Novels by Douglas Adams [1]. In other cases there will be even 3
 items representing the same information: the wd query, the category item,
 and a list of... item. So I'm just wondering if this complexity is really
 needed for structural/technical reasons.

 Maybe there is an easier way instead of linking to the wd query with a
 category's main query property?

 Cheers,
 Micru

 [1] https://www.wikidata.org/wiki/Q8687492


 On Thu, Mar 6, 2014 at 7:24 PM, Lydia Pintscher 
 lydia.pintsc...@wikimedia.de wrote:

 On Tue, Mar 4, 2014 at 9:01 PM, David Cuenca dacu...@gmail.com
 wrote:
  I would like to make you aware of this RFC started by Gerard:
 
 https://www.wikidata.org/wiki/Wikidata:Requests_for_comment/Define_lists_on_both_%22Wikimedia_lists%22_and_%22Wikimedia_categories%22
 
  It is interesting because in the end, what is the difference between
 a list,
  a category, and a query? Not much, really.
 
  I'm curious

Re: [Wikidata-l] Queries - can they be stored as statements in Category/List items?

2014-03-06 Thread David Cuenca

I'm not saying that the results yielded by Category:Books by Jean-Paul
Sartre or Category:Books by J.R.R. Tolkien are or should be the same as
the result yielded by a corresponding Wikidata query, but the concepts they
represent, they are the same. Ditto for lists.
(As a further clarification, I didn't mention anything about changing
Wikipedia categories or Wikipedia lists either.)

My question was regarding the functionality of WD items associated with
Wikipedia categories and Wikipedia lists.
Conceptually those items represent (or can represent) queries.
WDQ, the tool by Magnus, already can interpret certain statements as
queries [1].
Would it make sense to replicate that functionality on Wikidata?

Cheers,
Micru

[1] http://tools.wmflabs.org/reasonator/?q=6573995

On Thu, Mar 6, 2014 at 11:16 PM, Denny Vrandečić vrande...@gmail.comwrote:

But that's simply not the case. The Category:Books by Jean-Paul Sartre [1]
or Category:Books by J.R.R. Tolkien [2} neither are a complete list of
books by those authors (e.g. Sartre's fictional books are missing,
Tolkien's non-fictional *and* Middle earth books are missing), nor are they
only including books by Tolkien (e.g. they also include templates and other
categories, which are likely not written by Sartre or Tolkien).

If the plan is to change the way categories are used in Wikipedia and the
other Wikimedia wikis, I'd say this is a very different goal, but should be
discussed on those wikis.

List-articles often contain much more love and care than a Wikidata query
result will for a while. I don't think that replacing an article like the
list of books by David Foster Wallace [3] the List of US Presidents [4]
with a single simple query is a short-term goal.

[1] https://en.wikipedia.org/wiki/Category:Books_by_Jean-Paul_Sartre
[2] https://en.wikipedia.org/wiki/Category:Books_by_J._R._R._Tolkien
[3] https://en.wikipedia.org/wiki/Books_by_David_Foster_Wallace
[4] https://en.wikipedia.org/wiki/List_of_US_Presidents

On Thu Mar 06 2014 at 1:49:54 PM, David Cuenca dacu...@gmail.com wrote:

The point I wanted to make (following your example), is that the Wikidata
Query All novels by Douglas Adams is equivalent to the item
Category:Novels by Douglas Adams [1]. In other cases there will be even 3
items representing the same information: the wd query, the category item,
and a list of... item. So I'm just wondering if this complexity is really
needed for structural/technical reasons.

Maybe there is an easier way instead of linking to the wd query with a
category's main query property?

Cheers,
Micru

[1] https://www.wikidata.org/wiki/Q8687492

On Thu, Mar 6, 2014 at 7:24 PM, Lydia Pintscher
lydia.pintsc...@wikimedia.de wrote:

On Tue, Mar 4, 2014 at 9:01 PM, David Cuenca dacu...@gmail.com wrote:
I would like to make you aware of this RFC started by Gerard:

https://www.wikidata.org/wiki/Wikidata:Requests_for_comment/Define_lists_on_both_%22Wikimedia_lists%22_and_%22Wikimedia_categories%22

It is interesting because in the end, what is the difference between a
list,
a category, and a query? Not much, really.

I'm curious to know if the approach taken with queries will be the
same as
the WDQ
http://tools.wmflabs.org/reasonator/?q=6573995

Items like List of... or Category: would have some use, but the
development notes don't state if this is the intended path
https://meta.wikimedia.org/wiki/Wikidata/Development/Queries

Any thoughts about it?

I've been trying to understand the RfC 3 times now and still fail. So
I can't answer your questions unfortunately.

The short and simplified version of how complex queries will work:
* someone defines a query on a page in a special query namespace (eg
everything that has author = Douglas Adams)
* the result of the query is a list of items matching the query
* the Wikipedias can include the result of the query and visualize it
in certain ways on a page. (eg the wikitext of the article List of
works by Douglas Adams would have a call to include the query result
from Wikidata)

Cheers
Lydia

--
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata

Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de

Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

--
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

___
Wikidata-l mailing list

[Wikidata-l] Question about progressive loading

2014-02-25 Thread David Cuenca

While reading this [1] and observing how other website load large pages
without freezing my computer, I was just wondering why progressive loading
of items was not an option for Wikidata.

Is it too difficult to implement? Or impossible with mediawiki? Or just
undesirable for some other reasons?

Thanks!
Micru


[1] docforge.com/wiki/Web_application/Progressive_loading
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Question about progressive loading

2014-02-25 Thread David Cuenca

Just noticed about it! Yesterday I took me almost 5min and many
unresposive script errors to load Russia, now it is under a minute of
computer freeze :)
https://www.wikidata.org/wiki/Q159

Not optimal but definitely a big leap forward. Great work!

Still curious about progressive loading (at least for big items).

Cheers,
Micru


On Tue, Feb 25, 2014 at 8:58 PM, Lydia Pintscher 
lydia.pintsc...@wikimedia.de wrote:

 On Tue, Feb 25, 2014 at 8:55 PM, David Cuenca dacu...@gmail.com wrote:
  While reading this [1] and observing how other website load large pages
  without freezing my computer, I was just wondering why progressive
 loading
  of items was not an option for Wikidata.
 
  Is it too difficult to implement? Or impossible with mediawiki? Or just
  undesirable for some other reasons?

 We've just made a huge improvement like 2 minutes ago ;-)
 More will follow.


 Cheers
 Lydia

 --
 Lydia Pintscher - http://about.me/lydia.pintscher
 Product Manager for Wikidata

 Wikimedia Deutschland e.V.
 Tempelhofer Ufer 23-24
 10963 Berlin
 www.wikimedia.de

 Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

 Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
 unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
 Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Wikidata-Freebase mappings published under CC0

2013-11-12 Thread David Cuenca

Hi Denny,

I asked again about this property because there were some concerns when it
was created. From the comments I infer that the general ethos is not
against, but not thrilled either:
https://www.wikidata.org/wiki/Wikidata:Project_chat#Freebase-Wikidata_mappings

I share the feeling, I don't mind linking, but I see little utility for
Wikidata given the different licenses.

Cheers,
Micru


On Mon, Nov 11, 2013 at 11:34 PM, Denny Vrandečić vrande...@google.comwrote:

 Hi Max,

 indeed, I am not suggesting creating a new property. The data release that
 was made today basically provides data for P646.

 Cheers,
 Denny


 Regarding the off-topic part of your mail, I want to point out to my
 personal opinion with regards to data licensing, which was formulated
 before I started my current job: http://simia.net/wiki/Free_data . This
 is solely my opinion though, but it did influence my voice in the
 decision-making process around the license for Wikidata in my previous job.





 On Mon, Nov 11, 2013 at 12:22 PM, Klein,Max kle...@oclc.org wrote:

  Hello All,

  We already do have a freebase ID, similar to the VIAF one- it's 
 P646http://www.wikidata.org/wiki/Property:P646
 .

  I just went back into the email Archives and a discussion started on
 6/14/2013 by Google Developer Relations person Shawn Simister[1] first
 proposed this property. There is large thread there about licensing, ending
 with Micru having an unanswered question to Shawn about how Freebase can
 manage its license since it mixes CC-0 and CC-BY?

  Off-topic: this thread is from before Denny announced that he was going
 to Google, I believe, is that correct? Whatever happens with licensing, and
 regarding Wikidata's CC0, I hope that it wasn't engineered so that Google
 could profit from Wikidata. I concede that this thought is mostly from
 paranoia, and disagreeing with a lot of Google's politics and tactics in
 the past.

  [1] https://plus.google.com/+ShawnSimister


  Maximilian Klein
 Wikipedian in Residence, OCLC
 +17074787023

  --
 *From:* wikidata-l-boun...@lists.wikimedia.org 
 wikidata-l-boun...@lists.wikimedia.org on behalf of Gerard Meijssen 
 gerard.meijs...@gmail.com
 *Sent:* Monday, November 11, 2013 11:27 AM
 *To:* WikiData-l
 *Subject:* Re: [Wikidata-l] Wikidata-Freebase mappings published under
 CC0


 Hoi,
 If I understand you well,  what you propose is adding links that are
 similar to what we already do to VIAF and so many others. Using data from
 Freebase is a tantalising option. Is there agreement and a license that
 allows for this?
 Thanks,
 GerardM
 Op 11 nov. 2013 19:35 schreef Denny Vrandečić vrande...@google.com:

 Hello all,

  as you know, I have recently started a new job. My mission to get more
 free data out to the world has not changed due to that, though.

  Thus, I am happy to point you today to a step in this direction. We
 have created a mapping of Wikidata Qids to Freebase Mids, and are
 publishing it under CC0, so that anyone can use it in any way they want.
 The file contains about 2 Million mappings.

  The download and all further data is available here:
   
 https://developers.google.com/freebase/data#freebase-wikidata-mappings

  In particular, I hope that the data will be uploaded to Wikidata. We
 also plan to upload the data to Freebase. The two-way links will ensure
 that systems using either knowledge base can easily merge data, go from one
 to the other, enrich data with each other, etc. We opted to first publish
 the mappings standalone, to make clear that they are under CC0.

  The mapping is created based on the Wikidata dumps from end of
 October. Wikidata Items are matched with Freebase Topics based on their
 respective links to Wikipedia. Each mapping says that there have been no
 disagreeing Wikipedia-Links and at least two agreeing Wikipedia-Links, so
 the data should be very clean. Furthermore, the mappings are sorted by the
 number of agreeing Wikipedia-Links, so it is kinda sorted by relevance.

  The data is in RDF format. To get the Mids as they are used in
 Wikidata, the replace the namespace
   http://rdf.freebase.com/ns/m.
 with
   /m/
 The Wikidata Qids are using the Wikidata namespace, i.e. the Qids
 themselves are concatenated to
   http://www.wikidata.org/entity/

  If anyone wants to take up the task to upload the data to Wikidata, I
 can offer to help and answer questions and to help.

  Cheers,
 Denny

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non

Re: [Wikidata-l] next sister project: Wikisource

2013-11-08 Thread David Cuenca

The sitelinks are not used directly, they are aggregated by using the work
item as a central hub. Basically the algorithm is:
1.- query all edition-items that are connected to the same work-item using
edition of
2.- make a list of all the sitelinks used on each one of these
edition-items (1 sitelink per edition-item)
3.- for each wikisource page connected to an edition-item, display the list
generated on point 2

Micru


On Fri, Nov 8, 2013 at 6:49 PM, Joe Filceolaire filceola...@gmail.comwrote:

 Actually the problem isn't that you can only have one link from a wikisource 
 work from a wikidata item. We have separate wikidata items for each edition 
 of a work (because these have different metadata) so multiple editions of the 
 same work on a wikisource link to different wikidata items.

 This creates a different problem. Each language edition of a work is a 
 different edition so it links to a different wikidata item which has 
 sitelinks only to that translation of the work. This means you can't use 
 sitelinks to link to translations of a work on other wikisources.


 Does this mean wikidata sitelinks are useless for wikisource?


 filceolaire


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] next sister project: Wikisource

2013-11-06 Thread David Cuenca

On Tue, Nov 5, 2013 at 5:35 PM, Daniel Kinzler
daniel.kinz...@wikimedia.dewrote:

 However, MediaWiki only supports one link per target site in the sidebar.

 Maybe an on-page navigation box could be used instead of proper language
 links?

 With the help of JavaScript, the contents of that nav box could then be
 moved
 into the sidebar. That's a bit hackish, but would work ok, I think.


On English Wikisource they were using this template to allow more than one
link per language:
https://en.wikisource.org/wiki/Template:Interwiki-info

However it seems that is not working now on this page (the interwiki list
should be much larger according to the wikitext):
https://en.wikisource.org/wiki/The_Raven_%28Poe%29



 I think it would be a good idea to always have that, for consistency and
 structural integrity.


It makes sense to create work pages for works with several editions in a
given language (5-10% depending on language) since it is the equivalent of
having a desambiguation page, but having a big number of work pages on
Wikisource for works that only have one edition might be detrimental for
the user experience, unless they are redirects. How far is the development
of using redirect pages as sitelinks?

--Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] next sister project: Wikisource

2013-11-06 Thread David Cuenca

On Wed, Nov 6, 2013 at 4:22 PM, Daniel Kinzler
daniel.kinz...@wikimedia.dewrote:

 MediaWiki used to handle this inconsistently: multiple links for the same
 language where shown in the sidebar, but not recorded in the database. This
 inconsistency was fixed a few months ago - now, only one link per page can
 exist. I suppose that's why the template no longer works.


Someone's bug is someone else's feature... Anyhow, on fr-ws they already
have a custom menu on the left to link with other wikimedia projects
(Compléments) which is loaded by:
https://fr.wikisource.org/wiki/MediaWiki:Common.js
Example: https://fr.wikisource.org/wiki/Victor_Hugo
If you want to add a similar menu for editions then it should be compatible
with:
https://www.mediawiki.org/wiki/Extension:DoubleWiki


I'm not suggesting to always great a work *page* on wikisource, even if
 there is
 only one edition in that language. I'm saying that there should always be
 an
 *item* for the work on wikidata, even if there is only an item for one
 edition.


Sorry, I misunderstood you. Yes, I agree it makes sense to create an item
for the work on wikidata.

Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] next sister project: Wikisource

2013-11-04 Thread David Cuenca

On Mon, Nov 4, 2013 at 4:43 PM, Daniel Kinzler
daniel.kinz...@wikimedia.dewrote:

 Am 04.11.2013 16:10, schrieb Amir Ladsgroup:
  If you're using statement instead of site link who we can trace item from
  article of wikisource to wikidata

 This will very soon be possible with queries.


Actually a query or Lua would be much better solution for Wikisource
instead of sitelinks  (well, author pages can have sitelinks that is no
problem).

According to the data model that we have been defining for Wikisource [1]
there should be a top-level item (work item) representing all the editions
that a text has, then there should be sub-items for each edition (example
of a book with several translations [2]). Each one of those sub-items is
the one that should be connected with a sitelink, although there will be
only of them per item.

Ideally, the script or the query should examine which items are connected
with the property pair edition/edition of, collect the sitelink of each
language and list them all for each one of them.

Is that factible?

Cheers,
Micru


[1] https://www.wikidata.org/wiki/Wikidata:Books_task_force
[2] https://www.wikidata.org/wiki/Q6911
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] next sister project: Wikisource

2013-11-02 Thread David Cuenca

Excellent news! :)

For the author pages it is quite straight-forward. For the bibliographic
metadata the easiest would be to connect wikidata items with the Book:
page generated by the (planned) Book Manager extension. The Book: page is
supposed to provide the book structure and act as a metadata hub for both
books with scans (those with Index: page) and books without scans (there
was no solution for those yet)
Project page: https://meta.wikimedia.org/wiki/Book_management
Bugzilla page: https://bugzilla.wikimedia.org/show_bug.cgi?id=15071
Example:
http://tools.wmflabs.org/bookmanagerv2/mediawiki/index.php?title=Book:The_Interpretation_of_Dreamsaction=edit

Problem, the extension is not finished yet and neither Molly nor Raylton
have time to keep working on it. Some bugs are still open and the fields in
the template would need to be maped to Wikidata properties. All this is not
relevant for phase 1 (if it is done only for books), but it will become
relevant for phase 2.

Is there anyone that could volunteer as an OPW mentor to help a potential
student to finish this project?

Cheers,
Micru

On Sat, Nov 2, 2013 at 4:30 PM, Lydia Pintscher
lydia.pintsc...@wikimedia.de wrote:

Hey everyone,

The next sister project to get language links via Wikidata is
Wikisource. We're currently planning this for January 13.
The coordination is happening at
https://www.wikidata.org/wiki/Wikidata:Wikisource On this page we're
also looking for ambassadors to help spread the messages to the
different language editions of Wikisource. Please help if you can.

Cheers
Lydia

--
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata

Wikimedia Deutschland e.V.
Obentrautstr. 72
10963 Berlin
www.wikimedia.de

Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

--
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] Proposed change to the inclusion synthax

2013-10-20 Thread David Cuenca

Hi,

I would like to know your opinion about having the value in the #property
parser function. Right now we have two options:

 {{#property:P36}}
 {{#property:capital}}

The problem with this model is that editors in wikipedia cannot see the
value unless they render the page (or possibly use the VisualEditor).
Having the value in the property parser itself would allow contributors to
see and edit the value in Wikipedia, which would in turn update the
Wikidata value. Of course updating the value in Wikidata will also update
the text field. It could look like

 {{#property:P36|value=Berlin}}
 {{#property:capital|value=Berlin}}

Or maybe:

 {{#property:P36=Berlin}}
 {{#property:capital=Berlin}}

What is your opinion about it?

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Proposed change to the inclusion synthax

2013-10-20 Thread David Cuenca

@Sven: AFAIK, whenever there is a change in the data, the change is shown
as an edit, that way editors can keep an eye on the changes, even if there
is a continuous stream of edits. In that regard there would stay as it is
now.

@Vito: what do you mean by data's scope?

Micru


On Sun, Oct 20, 2013 at 12:17 PM, Sven Manguard svenmangu...@gmail.comwrote:

 Excuse me in advance, as this isn't an area I am well versed in, but:

 {{#property:P36}} doesn't have to change if the value changes, but an edit
 would have to be made to modify  {{#property:P36|value=Berlin}} if the
 value changed. Now capitals don't change, but if we have a value for Alexa
 rank or annual GDP or population, well those change often. Part of the
 utility of Wikidata is that on smaller projects once you set everything up
 you don't need a continuous stream of edits.

 Sven
 On Oct 20, 2013 3:06 PM, Vito vituzzu.w...@gmail.com wrote:

 Il 20/10/2013 20:30, David Cuenca ha scritto:

 Hi,

 I would like to know your opinion about having the value in the
 #property parser function. Right now we have two options:
   {{#property:P36}}
   {{#property:capital}}

 The problem with this model is that editors in wikipedia cannot see the
 value unless they render the page (or possibly use the VisualEditor).
 Having the value in the property parser itself would allow contributors to
 see and edit the value in Wikipedia, which would in turn update the
 Wikidata value. Of course updating the value in Wikidata will also update
 the text field. It could look like
   {{#property:P36|value=Berlin}}
   {{#property:capital|value=**Berlin}}
 Or maybe:
   {{#property:P36=Berlin}}
   {{#property:capital=Berlin}}

 What is your opinion about it?

 Cheers,
 Micru


 Then what will be data's scope? :/

 Vito

 __**_
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/**mailman/listinfo/wikidata-lhttps://lists.wikimedia.org/mailman/listinfo/wikidata-l


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] Open tabular data

2013-08-27 Thread David Cuenca

Hi Dario,

Thanks for sharing this proposal. As posted in the talk page, I believe
that it would be a great opportunity to partner with dedicated
organizations like Datahub [1] (OKFN/CKAN).

Wikidata could also play an important role in semantically describing how
to interpret the information, for instance  mapping the fields of raw data
to wikidata properties or describing semantically the contents of the file.

The DataNamespace could be used to render selected portions of a large
file, which would be too much to handle in real time.

Cheers,
Micru


[1] http://datahub.io/


On Tue, Aug 27, 2013 at 8:06 PM, Dario Taraborelli 
dtarabore...@wikimedia.org wrote:

 I'd like to hear from the people on this list on a proposal to create a
 dedicated namespace to host open (tabular) data and make these datasets
 persistently identifiable, version controlled and easily embeddable into
 other wikis.

 While this use case is currently not within the scope of Wikidata (and
 could potentially live on other Wikimedia wikis, like Meta or Commons), I'd
 appreciate input from the wikidata community on this draft:

 https://meta.wikimedia.org/wiki/DataNamespace

 Some interesting discussion on the talk page:

 http://meta.wikimedia.org/wiki/Talk:DataNamespace

 Dario

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

[Wikidata-l] DNB 11M bibliographic records as CC0

2013-08-26 Thread David Cuenca

Hi,

Maybe this is interesting as an import source for bibliographic info
http://blogs.ifla.org/bibliography/2013/08/06/german-national-library-offers-over-11-million-marc21-records-under-cc0-open-license/

Cheers,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] [Wikisource-l] DNB 11M bibliographic records as CC0

2013-08-26 Thread David Cuenca

If the problem is to automate bibliographic data importing, a solution is
what you propose, to import everything. Another one is to have an import
tool to automatically import the data for the item that needs it. In WP
they do that, there is a tool to import book/journal info by ISBN/doi. The
same can be done in WD.

Micru


On Mon, Aug 26, 2013 at 9:23 AM, Thomas Douillard 
thomas.douill...@gmail.com wrote:

 If Wikidata has an ambition to be a really reliable database, we should do
 eveything we can to make it easy for users to use any source they want. In
 this perspective, if we got datas with guaranted high quality, it make it
 easy for Wikidatian to find and use these references for users. Entering a
 reference in the database seems to me a highly fastidious, boring, and
 easily automated task.

 With that in mind, any reference that the user will not have to enter by
 hand is something good, and import high quality sources datas should pass
 every Wikidata community barriers easily. If there is no problem for the
 software to handle that many information, I say we really have no reason
 not to do the imports.

 Tom


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] URL datatype needs testing

2013-08-23 Thread David Cuenca

I cannot see it on the new property data type list:
http://test.wikidata.org/wiki/Special:NewProperty

Only item, geocoord, time, string, file...


On Fri, Aug 23, 2013 at 1:21 PM, Lydia Pintscher 
lydia.pintsc...@wikimedia.de wrote:

 On Fri, Aug 23, 2013 at 7:19 PM, Lydia Pintscher
 lydia.pintsc...@wikimedia.de wrote:
  When testing please be aware that URLs are right now not checked
  against the spam blacklist and that there is an issue with too long
  URLs.

 Pressed sent too early... Those obviously are on our list of things to
 fix before deployment.


 Cheers
 Lydia

 --
 Lydia Pintscher - http://about.me/lydia.pintscher
 Community Communications for Technical Projects

 Wikimedia Deutschland e.V.
 Obentrautstr. 72
 10963 Berlin
 www.wikimedia.de

 Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.

 Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
 unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
 Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l




-- 
Etiamsi omnes, ego non
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Re: [Wikidata-l] [Wikimedia-l] Meeting about the support of Wiktionary in Wikidata

2013-08-10 Thread David Cuenca

To add up a couple of comments to what Denny said, from my experience with
Wikisource, reaching out to international, loosely connected communities is
already a big challenge on its own. I would like to invite Wiktionary
contributors to take a look to this Individual Engagement Grant project
that Aubrey and me are doing for Wikisource, because maybe it would make
sense that a group of involved Wiktionarians started a similar initiative
for Wiktionary. The original application can be found here:
http://meta.wikimedia.org/wiki/Grants:IEG/Elaborate_Wikisource_strategic_vision

And the midterm report:
http://meta.wikimedia.org/wiki/Grants:IEG/Elaborate_Wikisource_strategic_vision

If anyone from the Wiktionary community wants to step forward, I would be
more than happy to share experiences and provide advice.

Cheers,
Micru

On Sat, Aug 10, 2013 at 3:30 AM, Denny Vrandečić 
denny.vrande...@wikimedia.de wrote:

 [Sorry for cross-posting]

 Yes, I agree that the OmegaWiki community should be involved in the
 discussions, and I pointed GerardM to our proposals whenever and
 discussions, using him as a liaison. We also looked and keep looking at the
 OmegaWiki data model to see what we are missing.

 Our latest proposal is different from OmegaWiki in two major points:

 * our primary goal is to provide support for structured data in the
 Wiktionaries. We do not plan to be the main resource ourselves, where
 readers come to in order to look up something, we merely provide structured
 data that a Wiktionary may or may not use. This parallels the role of
 Wikidata has with regards to Wikipedia. This also highlights the difference
 between Wikidata and OmegaWiki, since OmegaWiki's goal is to create a
 dictionary of all words of all languages, including lexical, terminological
 and ontological information.

 * a smaller difference is the data model. Wikidata's latest proposal to
 support Wiktionary is centered around lexemes, and we do not assume that
 there is such a things as a language-independent defined meaning. But no
 matter what model we end up with, it is important to ensure that the bulk
 of the data could freely flow between the projects, and even though we
 might disagree on this issue in the modeling, it is ensured that the
 exchange of data is widely possible.

 We tried to keep notes on the discussion we had today: 
 http://epl.wikimedia.org/p/WiktionaryAndWikidata

 My major take home message for me is that:
 * the proposal needs more visual elements, especially a mock-up or sketch
 of how it would look like and how it could be used on the Wiktionaries
 * there is no generally accepted place for a discussion that involves all
 Wiktionary projects. Still, my initial decision to have the discussion on
 the Wikidata wiki was not a good one, and it should and will be moved to
 Meta.

 Having said that, the current proposal for the data model of how to support
 Wiktionary with Wikidata seems to have garnered a lot of support so far. So
 this is what I will continue building upon. Further comments are extremely
 welcomed. You can find it here:

 http://www.wikidata.org/wiki/Wikidata:Wiktionary

 As said, it will be moved to Meta, as soon as the requested mockups and
 extensions are done.

 Cheers,
 Denny





 2013/8/10 Samuel Klein meta...@gmail.com

  Hello,
 
   On Fri, Aug 9, 2013 at 6:13 PM, JP Béland lebo.bel...@gmail.com
 wrote:
   I agree. We also need to include the Omegawiki community.
 
  Agreed.
 
  On Fri, Aug 9, 2013 at 12:22 PM, Laura Hale la...@fanhistory.com
 wrote:
   Why? The question of moving them into the WMF fold was pretty much no,
   because the project has an overlapping purpose with Wiktionary,
 
  This is not actually the case.
  There was overwhelming community support for adopting Omegawiki - at
  least simply providing hosting.  It stalled because the code needed a
  security and style review, and Kip (the lead developer) was going to
  put some time into that.  The OW editors and dev were very interested
  in finding a way forward that involved Wikidata and led to a combined
  project with a single repository of terms, meanings, definitions and
  translations.
 
  Recap: The page describing the OmegaWiki project satisfies all of the
  criteria for requesting WMF adoption.
  * It is well-defined on Meta http://meta.wikimedia.org/wiki/Omegawiki
  * It describes an interesting idea clearly aligned with expanding the
  scope of free knowledge
  * It is not a 'competing' project to Wiktionaries; it is an idea that
  grew out of the Wiktionary community, has been developed for years
  alongside it, and shares many active contributors and linguiaphiles.
  * It started an RfC which garnered 85% support for adoption.
  http://meta.wikimedia.org/wiki/Requests_for_comment/Adopt_OmegaWiki
 
  Even if the current OW code is not used at all for a future Wiktionary
  update -- and this idea was proposed and taken seriously by the OW
  devs -- their community of contributors should be part of

[Wikidata-l] Meeting about the support of Wiktionary in Wikidata

2013-08-08 Thread David Cuenca

 wiktionar...@lists.wikimedia.orgHi,

If there is someone in Wikimania interested in participating in the talks
about the future support of Wiktionary in Wikidata, we will having a
discussion about the several proposals.
http://wikimania2013.wikimedia.org/wiki/Support_of_Wiktionary_in_Wikidata

Date : Saturday, 10 Aug, 11:30 am - 1:00 pm
Place: Y520 (block Y, 5th floor)

See you there,
Micru
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

1 2 >

1 - 100 of 119 matches

Mail list logo