Hi Jason & apologies for the delayed reply I think you run into a dbo vs dbp issue http://wiki.dbpedia.org/faq/whats-dbpedia-owl-and-dbprop-namespaces-inconsistent-use-predicates
in general data under the dbp (http://dbpedia.org/property) namespace are not considered of good quality they are usually extracted using greedy heuristics. Prefer data from the dbo namespace when possible because these are of much higher quality. in case you do not know, you can also improve the data by updating existing mappings at http://mappings.dbpedia.org/index.php/Main_Page e.g. the pages you mention use the http://mappings.dbpedia.org/index.php/Mapping_en:Infobox_NRHP mapping after you improve the mapping you can test it with random pages, e.g. http://mappings.dbpedia.org/server/mappings/en/extractionSamples/Mapping_en:Infobox_NRHP or use your own e.g. http://mappings.dbpedia.org/server/extraction/en/extract?title=David_Ashbridge_Log_House&revid=&format=trix&extractors=mappings since you are working with Live, all the mapping changes will be updated in the data within hours or the latest, in case of errors, 1 month in particular to your example, you are looking at the "built" property which, in the above mapping, is mapped to dbo:yearOfConstruction and exists in all your example resources so try with this query SELECT DISTINCT ?placeName ?sameAs ?built WHERE { ?team a dbo:Place; rdfs:label ?placeName; owl:sameAs ?sameAs; dbo:yearOfConstruction ?built. FILTER regex(?sameAs, ".*freebase.*") FILTER (Lang(?placeName)='en') } limit 50 hope that helps On Sat, Aug 6, 2016 at 2:05 AM, Jason Hart <jason.h...@teamaol.com> wrote: > Hello. I've been writing queries to pull in date information for cities > and other locations. For most of the data, everything is formatted fine but > I have run into some quirks. > > For example, http://live.dbpedia.org/page/Garton_Toy_Company has a > dbp:built > value of 193019421947 (xsd:double) > > Looking at https://en.wikipedia.org/wiki/Garton_Toy_Company, I see that > the value really is 1930, 1942, 1947 > > I've noticed issues mostly when there are multiple dates for an entity. > > Here are some other example with built values: > https://en.wikipedia.org/wiki/Mountain_View_(Chatham,_Virginia) = c. > 1840-1842 > http://live.dbpedia.org/page/Mountain_View_(Chatham,_Virginia) = c. -1842 > (en) > > https://en.wikipedia.org/wiki/Clinton_County_Courthouse_Complex > = 1884-1885, 1889 > http://live.dbpedia.org/page/Clinton_County_Courthouse_Complex = > -18851889 (xsd:integer) > > https://en.wikipedia.org/wiki/David_Ashbridge_Log_House = 1782, 1970 > http://live.dbpedia.org/page/David_Ashbridge_Log_House = 17821970 > (xsd:integer) > > https://en.wikipedia.org/wiki/Brandon_Plantation_(Halifax_ > County,_Virginia) = c. 1800, 1842 > http://live.dbpedia.org/page/Brandon_Plantation_(Halifax_County,_Virginia) > = c. , 1842 (en) > > There will always be some poorly formatted data but it in these cases it > looks like the ingestion process is altering the data. > > Are there changes to the ingestion process that can be made to fix quirks > like this? > > If I come across other data that is not correct, is it helpful for me to > send examples to this listserv? Or is there another way I should be > submitting this? > > Thanks! > Jason > > Here's a query that will return back additional examples. Most are fine > but you will see several that are incorrect. > > SELECT DISTINCT ?placeName ?sameAs ?built WHERE > { > ?team a dbo:Place; > rdfs:label ?placeName; > owl:sameAs ?sameAs; > dbp:built ?built. > FILTER regex(?sameAs, ".*freebase.*") > FILTER (Lang(?placeName)='en') > FILTER (strlen(str(?built))>4) > } limit 50 > > ------------------------------------------------------------ > ------------------ > > _______________________________________________ > DBpedia-discussion mailing list > DBpedia-discussion@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion > > -- Kontokostas Dimitris
------------------------------------------------------------------------------
_______________________________________________ DBpedia-discussion mailing list DBpedia-discussion@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion