Re: BibClassify with RDF and MySQL store

2010-12-20 Thread Ferran Jorba
Hello, > On Sat, Dec 18, 2010 at 12:33 PM, Samuele Kaplun > wrote: >> Hi Roman, >> >> Il giorno sab, 18/12/2010 alle 12.17 +0100, Roman Chyla ha scritto: >>> I agree this is cool, but something doesn't fit, at least I don't >>> understand how this could be used for the task of bibclassify, the >

Re: BibClassify with RDF and MySQL store

2010-12-18 Thread Roman Chyla
On Sat, Dec 18, 2010 at 12:33 PM, Samuele Kaplun wrote: > Hi Roman, > > Il giorno sab, 18/12/2010 alle 12.17 +0100, Roman Chyla ha scritto: >> I agree this is cool, but something doesn't fit, at least I don't >> understand how this could be used for the task of bibclassify, the >> dict is good if

Re: BibClassify with RDF and MySQL store

2010-12-18 Thread Samuele Kaplun
Hi Roman, Il giorno sab, 18/12/2010 alle 12.17 +0100, Roman Chyla ha scritto: > I agree this is cool, but something doesn't fit, at least I don't > understand how this could be used for the task of bibclassify, the > dict is good if you know (more or less) what you are looking for, but > the task

Re: BibClassify with RDF and MySQL store

2010-12-18 Thread Roman Chyla
Hi, >> >> As the protocol is inherently client-server, the same ontology >> (dictionary) can be (re-)used among different Invenio instances.  It is >> not a toy.  I haven't been able to make any noticeable use in my >> instance even massively querying it.  You can follow part of my >> experiments

Re: BibClassify with RDF and MySQL store

2010-12-18 Thread Samuele Kaplun
Hi Ferran! Il giorno ven, 17/12/2010 alle 13.59 +0100, Ferran Jorba ha scritto: > Blame XML bloat (again). > > For dictionaries and such, that is, a large corpus of data that doesn't > change so much, in other words, that it is not transactional, why don't > you use specialised software? well i

Re: BibClassify with RDF and MySQL store

2010-12-17 Thread Ferran Jorba
Hello Samuele, [warning: I may be way off-road] > I am starting to play a bit with the EuroVoc > > > > ontology in order to integrate it into OpenAIRE Orphan Record > Repository, for automatic keyword extraction for EU documents. > > This ontology is *big*! and multil

Re: BibClassify with RDF and MySQL store

2010-12-17 Thread Roman Chyla
Hi Sam, Even if you manage to store/load it into bibclassify, you will probably wait forever -- my recent tests of seman vs bibclassify on a corpus of 1830 docs shows that bibclassify professes them in 186 mins (~6s per doc) and seman in 15 min. And this is HEP taxonomy, with only 40K entries! How

BibClassify with RDF and MySQL store

2010-12-17 Thread Samuele Kaplun
Hi, I am starting to play a bit with the EuroVoc ontology in order to integrate it into OpenAIRE Orphan Record Repository, for automatic keyword extraction for EU documents. This ontology is *big*! and multilingual. I can't even load it with RDFLIB on my laptop (4GB