Hi. I was thinking that infoboxes probably follow Zipf's law, and made a set of pages for mappings by frequency on my user page: http://mappings.dbpedia.org/index.php/User:Jimregan
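For illustration only, here is a minimal Python sketch of how a frequency ranking like this could be computed. It is not the script from the user page. The function name, the input format (one template name per occurrence), the example data in the usage note, and the reading of the cutoff as "at least N occurrences" are all assumptions.

```python
from collections import Counter


def rank_unmapped_templates(occurrences, mapped, cutoff=50):
    """Rank templates that have no mapping, most frequent first.

    occurrences: iterable of template names, one entry per use (assumed format)
    mapped:      set of template names that already have a mapping
    cutoff:      minimum number of occurrences to keep (assumed to be inclusive)
    """
    counts = Counter(occurrences)
    # most_common() yields (name, count) pairs sorted by descending count.
    return [
        (name, n)
        for name, n in counts.most_common()
        if n >= cutoff and name not in mapped
    ]
```

Called with made-up data such as `rank_unmapped_templates(["A"] * 60 + ["B"] * 10, mapped=set())`, this keeps only the templates at or above the cutoff. As with the real lists, nothing here checks that a template is actually an infobox.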
I think the way I did it should only take into account the templates that were missing mappings at the time of the last extraction. I put the quick and dirty script I used on the page too. I used 50 occurrences as the cutoff point to avoid too much noise. There's no filtering to ensure that the templates are infoboxes, but it should give a rough guide to which templates matter most for getting the maximum amount of data.

--
<Sefam> Are any of the mentors around?
<jimregan> yes, they're the ones trolling you