Hi Lydia, hello Denny,

Thank you so much. I apologise for my delayed response; I got caught up with university lab exams and assignments, as the semester is drawing to an end.

Firstly, Lydia - thanks for the link. The Data model primer really helps. I've been browsing the pages on wikidata.org, and now I realize why this is so important (please read my question in the last paragraph). It'll reduce a lot of repeated labour. I'm currently setting up a MediaWiki instance and my development environment. I'll ask quick questions if I need any help along the way.

Denny - that's just what I wanted to know, clean and crisp. Thanks! I'm browsing the XML for some familiar entries and comparing them to the pages on wikidata.org (e.g. India <http://www.wikidata.org/wiki/Q668>). I'm getting the whole picture now.

I have a question. When someone creates a new statement, I can use collaborative filtering to suggest "properties". An example, explained in the simplest terms: suppose there are X cities in the dataset. The user is adding another city (and writes 'city in Australia' as the short description). The system checks all the other cities, figures out the properties they have in common and suggests them. Cool.

But I can't think of any "exact" ideas off the top of my head that can be used to suggest "values" for the properties. Suppose one of the recommended properties is "population". How can I make the system guess its value? (Am I getting this right?)

Do you have anything in mind regarding this? Please point me in the right direction. :)

Cheers,
Nilesh

On Mon, Apr 22, 2013 at 7:54 PM, Denny Vrandečić <denny.vrande...@wikimedia.de> wrote:

> You can get the data from here:
> http://dumps.wikimedia.org/wikidatawiki/20130417/
>
> All items with all properties and their values are inside the dump. The
> questions would be, based on this data, could we make suggestions for:
>
> * when I create a new statement, suggest a property, then suggest a value
> * suggest qualifier properties, then suggest qualifier values (there is no
>   data yet on qualifiers, but this would change soon)
> * suggest properties for references, and values
>
> Does this help?
>
> Cheers,
> Denny
>
> 2013/4/19 Nilesh Chakraborty <nil...@nileshc.com>
>
> > Hi,
> >
> > I am a 3rd year undergraduate student of computer science, pursuing my
> > B.Tech degree at RCC Institute of Information Technology. I am proficient
> > in Java, PHP and C#.
> >
> > Among the project ideas on the GSoC 2013 ideas page, the one particular
> > idea that seemed really interesting to me is developing an Entity
> > Suggester for Wikidata. I want to work on it.
> >
> > I am passionate about data mining, big data and recommendation engines,
> > therefore this idea naturally appeals to me a lot. I have experience with
> > building music and people recommendation systems, and have worked with
> > Myrrix and Apache Mahout. I recently designed and implemented such a
> > recommendation system and deployed it on a live production site, where I'm
> > interning at, to recommend Facebook users to each other depending upon
> > their interests.
> >
> > The problem is, the documentation for Wikidata and the Wikibase extension
> > seems pretty daunting to me since I have not ever configured a mediawiki
> > instance or actually used it. (I am on my way to try it out following the
> > instructions at
> > http://www.mediawiki.org/wiki/Summer_of_Code_2013#Where_to_start.) I can
> > easily build a recommendation system and create a web-service or REST based
> > API through which the engine can be trained with existing data, and queried
> > and all. This seems to be a collaborative filtering problem (people who
> > bought x also bought y). It'll be easier if I could get some help about the
> > part where/how I need to integrate it with Wikidata. Also, some sample
> > datasets (csv files?) or schemas (just the column names and data types?)
> > would help a lot, for me to figure this out.
> >
> > I have added this email as a comment on the bug report at
> > https://bugzilla.wikimedia.org/show_bug.cgi?id=46555#c1.
> >
> > Please ask me if you have any questions. :-)
> >
> > Thanks,
> > Nilesh
> >
> > --
> > A quest eternal, a life so small! So don't just play the guitar, build one.
> > You can also email me at cont...@nileshc.com or visit my
> > website<http://www.nileshc.com/>
> > _______________________________________________
> > Wikitech-l mailing list
> > Wikitech-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>
> --
> Project director Wikidata
> Wikimedia Deutschland e.V. | Obentrautstr. 72 | 10963 Berlin
> Tel. +49-30-219 158 26-0 | http://wikimedia.de
>
> Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
> Registered in the register of associations of the Amtsgericht
> Berlin-Charlottenburg under number 23855 B. Recognised as a non-profit
> organisation by the Finanzamt für Körperschaften I Berlin, tax number
> 27/681/51985.
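
The property-suggestion idea from the message above (look at the items that resemble the one being edited, then suggest the properties they commonly share) could be sketched roughly as follows. This is only a toy illustration, not the Entity Suggester itself and not how Myrrix or Apache Mahout work: it swaps in a plain neighbourhood-style co-occurrence score weighted by Jaccard similarity, it measures similarity by shared properties rather than by the short description, and the function name `suggest_properties` and the `ITEMS` data are hypothetical.

```python
from collections import Counter


def suggest_properties(items, new_item_props, top_n=3):
    """Rank properties that similar existing items have but the new item lacks.

    items: dict mapping an item label to the set of property names it uses.
    new_item_props: set of property names already present on the new item.
    """
    known = set(new_item_props)
    scores = Counter()
    for props in items.values():
        props = set(props)
        overlap = len(known & props)
        if overlap == 0:
            continue  # this item shares nothing with the new one
        # Jaccard similarity between the existing item and the new item
        weight = overlap / len(known | props)
        for prop in props - known:
            scores[prop] += weight
    # Highest score first; ties broken alphabetically so output is deterministic
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [prop for prop, _ in ranked[:top_n]]


# Hypothetical toy data (assumption): item label -> set of property names.
ITEMS = {
    "Sydney": {"instance of", "country", "population", "coordinate location"},
    "Melbourne": {"instance of", "country", "population"},
    "Perth": {"instance of", "country", "population", "coordinate location"},
    "Douglas Adams": {"instance of", "date of birth", "occupation"},
}

# The user has so far added two properties to the new city item.
suggestions = suggest_properties(ITEMS, {"instance of", "country"})
print(suggestions)
```

With this toy data, "population" ranks first because all three city items carry it. The sketch covers only the property half of the question; guessing a value such as the population figure is left open, as in the message.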