Hi Jason & apologies for the delayed reply

I think you run into a dbo vs dbp issue
http://wiki.dbpedia.org/faq/whats-dbpedia-owl-and-dbprop-namespaces-inconsistent-use-predicates

in general data under the dbp (http://dbpedia.org/property) namespace are
not considered of good quality they are usually extracted using greedy
heuristics.
Prefer data from the dbo namespace when possible because these are of much
higher quality.

in case you do not know, you can also improve the data by updating existing
mappings at http://mappings.dbpedia.org/index.php/Main_Page
e.g. the pages you mention use the
http://mappings.dbpedia.org/index.php/Mapping_en:Infobox_NRHP mapping

after you improve the mapping you can test it with random pages, e.g.
http://mappings.dbpedia.org/server/mappings/en/extractionSamples/Mapping_en:Infobox_NRHP
or use your own e.g.
http://mappings.dbpedia.org/server/extraction/en/extract?title=David_Ashbridge_Log_House&revid=&format=trix&extractors=mappings

since you are working with Live, all the mapping changes will be updated in
the data within hours or the latest, in case of errors, 1 month

in particular to your example, you are looking at the "built" property
which, in the above mapping, is mapped to dbo:yearOfConstruction and exists
in all your example resources so try with this query

SELECT DISTINCT ?placeName ?sameAs ?built WHERE
{
  ?team a dbo:Place;
 rdfs:label ?placeName;
     owl:sameAs ?sameAs;
     dbo:yearOfConstruction ?built.
  FILTER regex(?sameAs, ".*freebase.*")
  FILTER (Lang(?placeName)='en')
 } limit 50


hope that helps


On Sat, Aug 6, 2016 at 2:05 AM, Jason Hart <jason.h...@teamaol.com> wrote:

> Hello. I've been writing queries to pull in date information for cities
> and other locations. For most of the data, everything is formatted fine but
> I have run into some quirks.
>
> For example, http://live.dbpedia.org/page/Garton_Toy_Company has a
> dbp:built
> value of 193019421947 (xsd:double)
>
> Looking at https://en.wikipedia.org/wiki/Garton_Toy_Company, I see that
> the value really is  1930, 1942, 1947
>
> I've noticed issues mostly when there are multiple dates for an entity.
>
> Here are some other example with built values:
> https://en.wikipedia.org/wiki/Mountain_View_(Chatham,_Virginia) = c.
> 1840-1842
> http://live.dbpedia.org/page/Mountain_View_(Chatham,_Virginia) = c. -1842
> (en)
>
> https://en.wikipedia.org/wiki/Clinton_County_Courthouse_Complex
> = 1884-1885, 1889
> http://live.dbpedia.org/page/Clinton_County_Courthouse_Complex =
> -18851889 (xsd:integer)
>
> https://en.wikipedia.org/wiki/David_Ashbridge_Log_House = 1782, 1970
> http://live.dbpedia.org/page/David_Ashbridge_Log_House = 17821970
> (xsd:integer)
>
> https://en.wikipedia.org/wiki/Brandon_Plantation_(Halifax_
> County,_Virginia) = c. 1800, 1842
> http://live.dbpedia.org/page/Brandon_Plantation_(Halifax_County,_Virginia)
> = c. , 1842 (en)
>
> There will always be some poorly formatted data but it in these cases it
> looks like the ingestion process is altering the data.
>
> Are there changes to the ingestion process that can be made to fix quirks
> like this?
>
> If I come across other data that is not correct, is it helpful for me to
> send examples to this listserv? Or is there another way I should be
> submitting this?
>
> Thanks!
> Jason
>
> Here's a query that will return back additional examples. Most are fine
> but you will see several that are incorrect.
>
> SELECT DISTINCT ?placeName ?sameAs ?built WHERE
> {
>   ?team a dbo:Place;
>  rdfs:label ?placeName;
>      owl:sameAs ?sameAs;
>      dbp:built ?built.
>   FILTER regex(?sameAs, ".*freebase.*")
>   FILTER (Lang(?placeName)='en')
>   FILTER (strlen(str(?built))>4)
>  } limit 50
>
> ------------------------------------------------------------
> ------------------
>
> _______________________________________________
> DBpedia-discussion mailing list
> DBpedia-discussion@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>


-- 
Kontokostas Dimitris
------------------------------------------------------------------------------
_______________________________________________
DBpedia-discussion mailing list
DBpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to