Apparently the people at Spock are not familiar with our work on name
discrimination using SenseClusters, or they would know the problem has
been solved. :)

I'm kidding.

So, here's yet another challenge, open to anyone, that revolves around
name discrimination, that is, solving the problem of telling George
Miller the movie director from George Miller the father of WordNet.

http://challenge.spock.com/

Looks like fun. :)

Here's what they are up to:

-----------------

The SPOCK Entity Resolution Problem:

A common problem that we face is that there are many people with the
same name. Given that, how do we distinguish a document about Michael
Jackson the singer from Michael Jackson the football player?

With billions of documents and people on the web, we need to identify
and cluster web documents accurately to the people they are related
to. Mapping these named entities from documents to the correct person
is the essence of the Spock Challenge.

In order to constrain the problem so that it can be successfully
solved by an individual or a small team, we provide you with real
world data with ground truth. This data contains 100,000 documents
about people, and the challenge is to determine all the distinct
people described in the data set. This data can be your training set.
Once you've got your basic algorithm working against the training set,
we let you further tune your code by running it against a second test
data set.

We give you instant accuracy feedback in the form of a percentage rank
score. The score depends on how many correct unique people you can
identify in the data. This way you can continue to refine your work
and see how well you are holding up against your competitors.

What good is a problem without a little competition?

-- 
Ted Pedersen
http://www.d.umn.edu/~tpederse

_______________________________________________
senseclusters-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/senseclusters-users
