Hoi, Exactly. Thanks, GerardM
On 6 June 2014 20:57, Paul Houle <ontolo...@gmail.com> wrote: > There's another road to ontology of labels which is connected with the > kind of roles that labels play in systems. > > One need is that a system wants to mention something or draw something > and otherwise refer to something and it needs to know what to call it. > Another need is that you have a phrase and you want to find things > with a matching label. Then there's the more general problem that the > user has something in his head and you want to specify it. > > In terms of acceptance of labels you want the system to accept a wide > range of possible names people would use for something (I think in > Wikidata scope) but to make the most of that you need a good estimator > of the probability that a particular surface form used in a particular > context refers to this or that and that is probably out of scope. > > You want to accept labels you wouldn't want to generate. A tendency > to generate ethnic, racial and other kinds of slurs is a showstopper > for any public commercial application. A.I.'s are like people; some > of them are more prone to potty mouth than others, you can't count on > good behavior unless you train your animals. Thus, offensive labels > should be tagged. > > Similar choices appear in different contexts. I live in New York and > if you look at legal documents they always say "New York State" or > "New York City" but if you drive onto the Thruway from Pennsylvania > you will see "Welcome to New York" and then a distance sign that says > New York is 490 miles away. Sometimes you want the latin name of an > organism and sometimes you want the common name. You might want to > speak of pharmaceuticals always using the generic name (Omeprazole) > rather than a brand (Prilosec). Sometimes you want to use > abbreviations (RDF) and other times you want to spell things out > (Resource Description Framework). If you want to make something > visually tight you need to control label length > > http://carpictures.cc/cars/photo/ > > A superhuman system would certainly contain statistical models, but a > lot of the knowledge needed to do the above could be encoded as > properties of the labels. > > ᐧ > > On Fri, Jun 6, 2014 at 1:57 PM, Gerard Meijssen > <gerard.meijs...@gmail.com> wrote: > > Hoi, > > In a different conversation it was put like this: "Wikipedia is what it > is > > and Wikidata is what it is". This was in the context of assumptions. > > Thanks, > > GerardM > > > > > > On 6 June 2014 16:59, Daniel Kinzler <daniel.kinz...@wikimedia.de> > wrote: > >> > >> Am 06.06.2014 15:44, schrieb Gerard Meijssen: > >> > Hoi, > >> > That is exactly the point. Once you assume that they are the same you > >> > ignore the > >> > extend to which they are not. Many, many items have articles pointing > to > >> > items > >> > resulting in labels that are not exactly the same subject. > >> > >> And these are mistakes that should be fixed. So? > >> > >> > >> -- > >> Daniel Kinzler > >> Senior Software Developer > >> > >> Wikimedia Deutschland > >> > >> Gesellschaft zur Förderung Freien Wissens e.V. > >> > >> _______________________________________________ > >> Wikidata-l mailing list > >> Wikidata-l@lists.wikimedia.org > >> https://lists.wikimedia.org/mailman/listinfo/wikidata-l > > > > > > > > _______________________________________________ > > Wikidata-l mailing list > > Wikidata-l@lists.wikimedia.org > > https://lists.wikimedia.org/mailman/listinfo/wikidata-l > > > > > > -- > Paul Houle > Expert on Freebase, DBpedia, Hadoop and RDF > (607) 539 6254 paul.houle on Skype ontolo...@gmail.com > > _______________________________________________ > Wikidata-l mailing list > Wikidata-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikidata-l >
_______________________________________________ Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l