Hi.

I was thinking that infoboxes probably follow Zipf's law, and made a
set of pages for mappings by frequency on my user page:
http://mappings.dbpedia.org/index.php/User:Jimregan

I think the way I did it should only take into account the templates
that were missing mappings at the time of the last extraction - I put
the quick and dirty script I used on the page too. I used 50
occurrences as the cutoff point, to not have too much noise, and
there's no filtering to ensure that the templates are infoboxes, but
it should give a rough guide to which templates are the most important
to get the maximum amount of data.

-- 
<Sefam> Are any of the mentors around?
<jimregan> yes, they're the ones trolling you

------------------------------------------------------------------------------
Fulfilling the Lean Software Promise
Lean software platforms are now widely adopted and the benefits have been 
demonstrated beyond question. Learn why your peers are replacing JEE 
containers with lightweight application servers - and what you can gain 
from the move. http://p.sf.net/sfu/vmware-sfemails
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to