Hi all,
I tried to extract wikipedia dump for Indonesian.
The example of the result is like this
<http://dbpedia.org/resource/Jawa>
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
<http://www.opengis.net/gml/_Feature> .
My question is about the namespace. Where do I must configure so i can get the
namespace id.dbpedia.org?
I hope I can get the result like this
<http://id.dbpedia.org/resource/Jawa>
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
<http://www.opengis.net/gml/_Feature> .
This is my configuration file.
# download and extraction target dir
base-dir=/home/riko/TA/dbpedia
# Source file. If source file name ends with .gz or .bz2, it is unzipped on the
fly.
# Must exist in the directory xxwiki/yyyymmdd and have the prefix
xxwiki-yyyymmdd-
# where xx is the wiki code and yyyymmdd is the dump date.
# default:
source=pages-articles.xml
# alternatives:
# source=pages-articles.xml.bz2
# source=pages-articles.xml.gz
# use only directories that contain a 'download-complete' file? Default is
false.
require-download-complete=true
# List of languages or article count ranges, e.g. 'en,de,fr' or '10000-20000'
or '10000-', or '@mappings'
languages=id
# unqualified extractor class names are prefixed by
org.dbpedia.extraction.mappings.
extractors=GeoExtractor
# extractors.id=MappingExtractor
# if ontology and mapping files are not given or do not exist, download info
from mappings.dbpedia.org
ontology=../ontology.xml
mappings=../mappings
# URI policies. Allowed flags: uri, generic, xml-safe. Each flag may have one
of the suffixes
# -subjects, -predicates, -objects, -datatype, -context to match only URIs in a
certain position.
# Without a suffix, a flag matches all URI positions.
uri-policy.uri=uri:id; generic:id; xml-safe-predicates:*
uri-policy.iri=generic:id; xml-safe-predicates:*
# File formats. Allowed flags: n-triples, n-quads, turtle-triples,
turtle-quads, trix-triples, trix-quads
# May be followed by a semicolon and a URI policy name. If format name ends
with .gz or .bz2, files
# are zipped on the fly.
# NT is unreadable anyway - might as well use URIs for en
#format.nt.gz=n-triples;uri-policy.uri
#format.nq.gz=n-quads;uri-policy.uri
# Turtle is much more readable - use nice IRIs for all languages
#format.ttl.gz=turtle-triples;uri-policy.iri
format.tql.gz=turtle-quads;uri-policy.iri
Thank you.
Regards,
Riko
------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_mar
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion