Apparently the people at Spock are not familiar with our work on name discrimination using SenseClusters, or they would know the problem has been solved. :)
I'm kidding. So, here's yet another challenge open to anyone that resolves around name discrimination, that is solving the problem of telling George Miller the movie director from George Miller the father of WordNet. http://challenge.spock.com/ Looks like fun. :) Here's what they are up to: ----------------- The SPOCK Entity Resolution Problem: A common problem that we face is that there are many people with the same name. Given that, how do we distinguish a document about Michael Jackson the singer from Michael Jackson the football player? With billions of documents and people on the web, we need to identify and cluster web documents accurately to the people they are related to. Mapping these named entities from documents to the correct person is the essence of the Spock Challenge. In order to constrain the problem so that it can be successfully solved by an individual or a small team, we provide you with real world data with ground truth. This data contains 100,000 documents about people, and the challenge is to determine all the distinct people described in the data set. This data can be your training set. Once you've got your basic algorithm working against the training set, we let you further tune your code by running it against a second test data set. We give you instant accuracy feedback in the form of a percentage rank score. The score depends on how many correct unique people you can identify in the data. This way you can continue to refine your work and see how well you are holding up against your competitors. What good is a problem without a little competition? -- Ted Pedersen http://www.d.umn.edu/~tpederse ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ senseclusters-users mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/senseclusters-users
