Re: [Wikidata] Wikidata Analyst, a tool to comprehensively analyze quality of Wikidata

2015-12-12 Thread Amir Ladsgroup
Hey,
I made some significant changes based on feedbacks

* Per suggestion of Nemo_bis I added reference-based analysis: Here's

an example
* I added limit parameter which you can get more results if you want (both
for reference-based and property-based analysis) for example:
http://tools.wmflabs.org/wd-analyst/index.php?p=P31&q=&limit=50 (Maximum
acceptable value is 50)
* Per suggestion of André I added a column to the database and results
which gives you number of percentage of unsourced statements. Obviously it
doesn't apply to reference-based analysis. for example
https://tools.wmflabs.org/wd-analyst/index.php?p=P1082&q= shows only 2% of
statements of population are unsourced

For Gerard suggestion. It's definitely a good idea but problem is it's
technically hard because every week it makes the databse twice as big. We
can store only a limited number (e.g. last three weeks) or apply this to a
limited number of value-pair properties. I'm looking to find out which one
is better.

Best


On Thu, Dec 10, 2015 at 12:13 AM André Costa 
wrote:

> Nice tool!
>
> To understand the statistics better.
> If a claim has two sources, one wikipedia and one other, how does that
> show up in the statistics?
>
> The reason I'm wondering is because I would normally care if a claim is
> sourced or not (but not by how many sources) and whether it is sourced by
> only Wikipedias or anything else.
>
> E.g.
> 1) a statment with 10 claims each sourced is "better" than one with 10
> claims where one claim has 10 sources.
> 2) a statement with a wiki source + another source is "better" than on
> with just a wiki source and just as "good" as one without the wiki source.
>
> Also is wiki ref/source Wikipedia only or any Wikimedia project? Whilst
> (last I checked) the others were only 70,000 refs compared to the 21
> million from Wikipedia they might be significant for certain domains and
> are just as "bad".
>
> Cheers,
> André
> On 9 Dec 2015 10:37, "Gerard Meijssen"  wrote:
>
>> Hoi,
>> What would be nice is to have an option to understand progress from one
>> dump to the next like you can with the Statistics by Magnus. Magnus also
>> has data on sources but this is more global.
>> Thanks,
>>  GerardM
>>
>> On 8 December 2015 at 21:41, Markus Krötzsch <
>> mar...@semantic-mediawiki.org> wrote:
>>
>>> Hi Amir,
>>>
>>> Very nice, thanks! I like the general approach of having a stand-alone
>>> tool for analysing the data, and maybe pointing you to issues. Like a
>>> dashboard for Wikidata editors.
>>>
>>> What backend technology are you using to produce these results? Is this
>>> live data or dumped data? One could also get those numbers from the SPARQL
>>> endpoint, but performance might be problematic (since you compute averages
>>> over all items; a custom approach would of course be much faster but then
>>> you have the data update problem).
>>>
>>> An obvious feature request would be to display entity ids as links to
>>> the appropriate page, and maybe with their labels (in a language of your
>>> choice).
>>>
>>> But overall very nice.
>>>
>>> Regards,
>>>
>>> Markus
>>>
>>>
>>> On 08.12.2015 18:48, Amir Ladsgroup wrote:
>>>
 Hey,
 There has been several discussion regarding quality of information in
 Wikidata. I wanted to work on quality of wikidata but we don't have any
 source of good information to see where we are ahead and where we are
 behind. So I thought the best thing I can do is to make something to
 show people how exactly sourced our data is with details. So here we
 have *http://tools.wmflabs.org/wd-analyst/index.php*

 You can give only a property (let's say P31) and it gives you the four
 most used values + analyze of sources and quality in overall (check this
 out )
   and then you can see about ~33% of them are sources which 29.1% of
 them are based on Wikipedia.
 You can give a property and multiple values you want. Let's say you want
 to compare P27:Q183 (Country of citizenship: Germany) and P27:Q30 (US)
 Check this out
 . And
 you can see US biographies are more abundant (300K over 200K) but German
 biographies are more descriptive (3.8 description per item over 3.2
 description over item)

 One important note: Compare P31:Q5 (a trivial statement) 46% of them are
 not sourced at all and 49% of them are based on Wikipedia **but* *get
 this statistics for population properties (P1082
 ) It's not a
 trivial statement and we need to be careful about them. It turns out
 there are slightly more than one reference per statement and only 4% of
 them are based on Wikipedia. So we can relax and enjoy these
 highly-sourced data.

 Requests:

 

Re: [Wikidata] Announcing WikiBrowser - Semantically navigating ALL the knowledge

2015-12-12 Thread Info WorldUniversity
James, Gerard, Romaine and Wikidatans,

Excellent. How are you planning for using voice technologies eventually
(and with SPARQL, I wonder)? How does one plan for engaging all ~300
Wikipedia languages in voice (say in Android, for example) through such a
WikiBrowser? And how would one anticipate adding more languages not yet in
Wikipedia to their countries (and all 7,938 languages?) in voice per
Gerard's question ([Wikidata] missing labels) on December 1?

Best regards,
Scott




On Dec 12, 2015 6:00 AM,  wrote:

> Thanks for your great input, Michael!  My plan is to provide navigation
> using a directed graph as in the following rough sketch:
>
>
> https://bytebucket.org/johanvos/wikibrowser/raw/7ce572aed9995cca4788dd7bedf8c9be278f3a9e/design-docs/research-view.pdf
>
> I also plan to provide the ability to "pin" topics, rendering directed
> graphs that include only those topics.  I'll need more SPARQL-fu than I
> currently possess, so am currently climbing that learning curve.
>
> Regards,
> James Weaver
>
>
> On Sat, Dec 12, 2015, at 04:24 AM, Michael Karpeles wrote:
>
> @GarardM: the top left "hamburger" menu item pulls open a left sidebar
> with language options
>
> best wishes,
>
> - mek
>
> On Sat, Dec 12, 2015 at 12:25 AM, Gerard Meijssen <
> gerard.meijs...@gmail.com> wrote:
>
> Hoi,
> How do I change languages?
> Thanks,
>  GerardM
>
> On 12 December 2015 at 08:10,  wrote:
>
> I'd like to share an application that I'm developing for technology
> demonstrations, entitled WikiBrowser. It is a web application that
> leverages the structure of Wikidata to semantically navigate Wikipedia
> articles. It is being developed in Java using technologies such as
> Spring Boot, Spring Cloud, and Cloud Foundry. This web application is
> live at http://WikiBrowser.io and the code is open source and located in
> my GitHub repository.  There is a brief video that shows features of
> WikiBrowser on my most recent blog post at http://JavaFXpert.com and I
> hope that you'll take WikiBrowser for a spin!
>
> Regards,
> James Weaver
> Developer Advocate
> Pivotal Software
> http://twitter.com/JavaFXpert
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
> *___*
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Announcing WikiBrowser - Semantically navigating ALL the knowledge

2015-12-12 Thread james
Thanks for your great input, Michael!  My plan is to provide navigation using a 
directed graph as in the following rough sketch:

https://bytebucket.org/johanvos/wikibrowser/raw/7ce572aed9995cca4788dd7bedf8c9be278f3a9e/design-docs/research-view.pdf

I also plan to provide the ability to "pin" topics, rendering directed
graphs that include only those topics.  I'll need more SPARQL-fu than I
currently possess, so am currently climbing that learning curve.

Regards, James Weaver


On Sat, Dec 12, 2015, at 04:24 AM, Michael Karpeles wrote:
> @GarardM: the top left "hamburger" menu item pulls open a left sidebar
> with language options
>
> best wishes,
>
> - mek
>
> On Sat, Dec 12, 2015 at 12:25 AM, Gerard Meijssen
>  wrote:
>> Hoi, How do I change languages? Thanks, GerardM
>>
>> On 12 December 2015 at 08:10,   wrote:
>>> I'd like to share an application that I'm developing for technology
>>>
demonstrations, entitled WikiBrowser. It is a web application that
>>>
leverages the structure of Wikidata to semantically navigate Wikipedia
>>>
articles. It is being developed in Java using technologies such as
>>>
Spring Boot, Spring Cloud, and Cloud Foundry. This web application is
>>>
live at http://WikiBrowser.io and the code is open source and located in
>>>
my GitHub repository.  There is a brief video that shows features of
>>>
WikiBrowser on my most recent blog post at http://JavaFXpert.com and I
>>>
hope that you'll take WikiBrowser for a spin!
>>>
>>>
Regards,
>>>
James Weaver
>>>
Developer Advocate
>>>
Pivotal Software
>>> http://twitter.com/JavaFXpert
>>>
>>>
___
>>>
Wikidata mailing list
>>> Wikidata@lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>>
>> ___
>>
Wikidata mailing list
>> Wikidata@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>
> _
> Wikidata mailing list Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Announcing WikiBrowser - Semantically navigating ALL the knowledge

2015-12-12 Thread Michael Karpeles
@GarardM: the top left "hamburger" menu item pulls open a left sidebar with
language options

best wishes,

- mek

On Sat, Dec 12, 2015 at 12:25 AM, Gerard Meijssen  wrote:

> Hoi,
> How do I change languages?
> Thanks,
>  GerardM
>
> On 12 December 2015 at 08:10,  wrote:
>
>> I'd like to share an application that I'm developing for technology
>> demonstrations, entitled WikiBrowser. It is a web application that
>> leverages the structure of Wikidata to semantically navigate Wikipedia
>> articles. It is being developed in Java using technologies such as
>> Spring Boot, Spring Cloud, and Cloud Foundry. This web application is
>> live at http://WikiBrowser.io and the code is open source and located in
>> my GitHub repository.  There is a brief video that shows features of
>> WikiBrowser on my most recent blog post at http://JavaFXpert.com and I
>> hope that you'll take WikiBrowser for a spin!
>>
>> Regards,
>> James Weaver
>> Developer Advocate
>> Pivotal Software
>> http://twitter.com/JavaFXpert
>>
>> ___
>> Wikidata mailing list
>> Wikidata@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>
>
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Announcing WikiBrowser - Semantically navigating ALL the knowledge

2015-12-12 Thread Gerard Meijssen
Hoi,
How do I change languages?
Thanks,
 GerardM

On 12 December 2015 at 08:10,  wrote:

> I'd like to share an application that I'm developing for technology
> demonstrations, entitled WikiBrowser. It is a web application that
> leverages the structure of Wikidata to semantically navigate Wikipedia
> articles. It is being developed in Java using technologies such as
> Spring Boot, Spring Cloud, and Cloud Foundry. This web application is
> live at http://WikiBrowser.io and the code is open source and located in
> my GitHub repository.  There is a brief video that shows features of
> WikiBrowser on my most recent blog post at http://JavaFXpert.com and I
> hope that you'll take WikiBrowser for a spin!
>
> Regards,
> James Weaver
> Developer Advocate
> Pivotal Software
> http://twitter.com/JavaFXpert
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Announcing WikiBrowser - Semantically navigating ALL the knowledge

2015-12-12 Thread Michael Karpeles
Nice work, this is a great way of seeing wikipedia + wikidata side by side
(and having more intimate access to wikidata IDs).

It would be great if the hierarchy (breadcrumbs) + context of your session
were preserved *as* you are navigating. Right now, it appears the value of
this browser is that it augments a wikipedia-type page with wikidata
semantic values within a side bar.

In contrast, http://math.mx (read: mathematics) is an example DAG (directed
acyclic graph of hierarchical math topics, presented as a d3 zoomable
treegraph) which preserves the context of your session (with breadcrumbs)
as you traverse the directed graph of knowledge. A core value proposition
of a browser (e.g. web browser) is that the provenance and history of a
session is preserved; you can go back and revisit links (history), as well
as gain perspective about how your searches were related (provenance). You
session, itself, is essentially a graph (and this graph is critical for
understanding your coverage/level of comprehension of a domain).

On WikiBrowser, one was viewing the WikiBrowser "Earth" page and you click
the inner term "Planet", I would expect the interface to semantically zoom
out of "Earth", into planet (because of the directionality of the
relationship of the two terms: earth, planet). From this screen, I would
expect to be able to go back (close "Planet" and return to "Earth")
allowing my context to be returned/zoomed back to "Earth" (which had
remained on the stack), or visit a new link/direction from within "Planet"
and increase the depth of my stack.

Even with this (linear 1-step browsing w/ a provenance trail) solved, it's
often extremely difficult to get a holistic view (topic depth > 1) of a
topic (i.e. understand all the topics and sub-topics entailed) just by
looking a single wikipedia page. Ideally, one could press a special hotkey
(see "bring and go ") and see an
overlay (similar to your sidebar) with a graph (link depth of ~2 pages)
which shows an entire graph of a page / topic's wikidata or wikipedia
relations, which you can then use to navigate and more efficiently explore
the knowledge space.

On the other hand, if (instead of a "browser") your main goal is augmenting
(and creating a bi-directional mapping between) wikipedia articles and
wikidata entries, have you considered something like a native Wikipedia
Tool, e.g. Navigation Popups? (see:
https://en.wikipedia.org/wiki/Wikipedia:Tools/Navigation_popups -- Under
the "Browsing" section, there is a checkable option for, "Navigation
popups, article previews and editing functions popup when hovering over
links)

I learned the feature existed when I asked on facebook (
https://www.facebook.com/michael.karpeles/posts/10101912216804060)

It would be great if hovering over hyperlinks in Wikipedia resulted in a
> community approved 1-line distilled answer / summary tooltip
>

Just food for thought, to fuel your mission. Good luck!

best wishes,

- mek

On Fri, Dec 11, 2015 at 11:10 PM,  wrote:

> I'd like to share an application that I'm developing for technology
> demonstrations, entitled WikiBrowser. It is a web application that
> leverages the structure of Wikidata to semantically navigate Wikipedia
> articles. It is being developed in Java using technologies such as
> Spring Boot, Spring Cloud, and Cloud Foundry. This web application is
> live at http://WikiBrowser.io and the code is open source and located in
> my GitHub repository.  There is a brief video that shows features of
> WikiBrowser on my most recent blog post at http://JavaFXpert.com and I
> hope that you'll take WikiBrowser for a spin!
>
> Regards,
> James Weaver
> Developer Advocate
> Pivotal Software
> http://twitter.com/JavaFXpert
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata