I'm working on a project that's trying to use data from dbpedia to
disambiguate references to people in text.  One thing we are trying to
figure out is the use of several properties to encode the same
attribute.  For example, we found that the following dbpedia
properties are all used for a person's date of birth:

  1 dateofbirth
  2 dateOfBirth
  3 dateBirth
  4 datebirth
  5 birthdate
  6 birthDate

In some cases, it looks like the information is encoded differently on
the Wikipedia page. Numbered to match the properties above, e.g.:

 1 in the infobox after 'Date of birth' (e.g., Alan_Kirby)
 2 in the text, e.g., 'Born in 1910' (e.g., Elton_Bennett)
 3 in the infobox after 'Born' (with place following date) (e.g., James_Irwin)
 4 in the infobox after 'Date of birth' (e.g., Arthur_Ashe)
 5 in the infobox after 'Birth' (e.g., Ted_Leehane)
 6 in the infobox after 'Born' (e.g., Annabel_Giles)

But {1,4} and {3,6} seem to be encoded using the same string in their Wikipedia
infoboxes.

It might also have something to do with the form of the values for the
properties, but I did not see a definite pattern that explained the use
of these six different properties.

There is also a similar duplication with the properties birthyear, birthYear,
yearOfBirth and yearofbirth.
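
As a stopgap on our side, something like the following sketch could fold
these spelling variants together before we compare values. It is only an
illustration: the token list and the names canonical_key and
group_properties are made up here and are not part of DBpedia, and it
only handles names built from 'birth', 'date', 'year' and an optional 'of'.

```python
from collections import defaultdict

# Hypothetical vocabulary: the word pieces seen in the property names above.
TOKENS = ("birth", "date", "year")


def canonical_key(prop):
    """Map a property name to an order- and case-insensitive key.

    Names made only of known tokens (plus an optional 'of') share a key;
    anything else is kept in a group of its own.
    """
    lowered = prop.lower()
    found = sorted(t for t in TOKENS if t in lowered)
    leftover = lowered
    for token in found:
        leftover = leftover.replace(token, "", 1)
    if leftover not in ("", "of"):
        return (lowered,)  # unrecognized shape, do not merge it
    return tuple(found)


def group_properties(props):
    """Group property names that share a canonical key."""
    groups = defaultdict(list)
    for prop in props:
        groups[canonical_key(prop)].append(prop)
    return dict(groups)


names = [
    "dateofbirth", "dateOfBirth", "dateBirth", "datebirth",
    "birthdate", "birthDate",
    "birthyear", "birthYear", "yearOfBirth", "yearofbirth",
]
groups = group_properties(names)
for key, members in groups.items():
    print(key, members)
```

This merges names purely on spelling, so it says nothing about whether the
values behind the properties have the same form.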

Of course, there are additional properties (e.g. datefbirth) that
link a person to an instance that bundles up the three numbers that
describe a date. 

Can anyone shed light on this?

Tim

-- 
Tim Finin, Computer Science & Electrical Engineering, Univ of Maryland
Baltimore County, 1000 Hilltop Cir, Baltimore MD 21250. [EMAIL PROTECTED]
http://umbc.edu/~finin 410-455-3522 fax:-3969 http://ebiquity.umbc.edu 

_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
