[ 
https://issues.apache.org/jira/browse/JENA-1930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17147268#comment-17147268
 ] 

Wolfgang Fahl commented on JENA-1930:
-------------------------------------

You'll find more info now at http://wiki.bitplan.com/index.php/Apache_Jena

> How to load data and then start a fuseki server with that dataset and use it?
> -----------------------------------------------------------------------------
>
>                 Key: JENA-1930
>                 URL: https://issues.apache.org/jira/browse/JENA-1930
>             Project: Apache Jena
>          Issue Type: Question
>    Affects Versions: Jena 3.14.0, Jena 3.15.0
>            Reporter: Wolfgang Fahl
>            Priority: Minor
>         Attachments: fusekiscreenshot2020-06-26.png
>
>
> With my (unfortunately not successful) tdb2.tdbloader experience for wikidata 
> I modified my script to load a smaller dataset like shown below (gnd2jena). 
> The output log seems to be ok.
> {code:java}
> 15:01:25 INFO loader :: Add: 9,500,000 authorities-kongress_lds.ttl (Batch: 
> 28,457 / Avg: 33,975)
> 15:01:29 INFO loader :: Finished: authorities-kongress_lds.ttl: 9,556,155 
> tuples in 282.97s (Avg: 33,771)
> 15:02:06 INFO loader :: Finish - index SPO
> 15:02:12 INFO loader :: Finish - index POS
> 15:02:12 INFO loader :: Finish - index OSP
> 15:02:12 INFO loader :: Time = 326.504 seconds : Triples = 9,556,155 : Rate = 
> 29,268 /s{code}
> Now I am trying to run a fuseki server using the data like this:
> {code:java}
> #!/bin/bash
> # WF 2020-06-25
> # Jena Fuseki server installation
> # see https://jena.apache.org/documentation/fuseki2/fuseki-run.html
> version=3.14.0
> fuseki=apache-jena-fuseki-$version
> if [ ! -d $fuseki ]
> then
>  if [ ! -f $fuseki.tar.gz ]
>  then
>  wget http://archive.apache.org/dist/jena/binaries/$fuseki.tar.gz
>  else
>  echo $fuseki.tar.gz already downloaded
>  fi
>  echo "unpacking $fuseki.tar.gz"
>  tar xvfz $fuseki.tar.gz
> else
>  echo $fuseki already downloaded and unpacked
> fi
> cd $fuseki
> java -jar fuseki-server.jar --tdb2 --loc=../data /gnd{code}
> Please note that I used version 3.14.0 here given that there where reports 
> about 3.15.0 needing some kind of patching that i hoped to avoid by using the 
> previous version assuming that the tdbstore would still be compatible or a 
> message would show if not.
> At the servers port 3030 a user interface shows up showing that the server 
> status is ok.
> !fusekiscreenshot2020-06-26.png!
> When clicking "Manage datasets" i get two options:
>  * existing datasets
>  * add new dataset
> None of the two buttons shows and effect. Clicking does not show any visible 
> reaction. I would have expected I would be able to work with the imported 
> dataset immediately (not even knowing whether you'd call the import a dataset 
> ...)
> I already found some hint that I'd have to change the config.ttl manually to 
> get the desired effect. I find it quite confusing that the web UI does not 
> give any hints on this. I placed this as a question - not knowing whether 
> this would end up as a feature request or bug ...
> *gnd2jena*
>  
> {code:java}
> #!/bin/bash
> # WF 2020-05-10
> # global settings
> jena=apache-jena-3.15.0
> tgz=$jena.tar.gz
> jenaurl=http://mirror.easyname.ch/apache/jena/binaries/$tgz
> base=/hd/luxio/gnd
> data=$base/data
> tdbloader=$jena/bin/tdb2.tdbloader
> getjena() {
> # download
> if [ ! -f $tgz ]
> then
>  echo "downloading $tgz from $jenaurl"
>  wget $jenaurl
> else
>  echo "$tgz already downloaded"
> fi
> # unpack
> if [ ! -d $jena ]
> then
>  echo "unpacking $jena from $tgz"
>  tar xvzf $tgz
> else
>  echo "$jena already unpacked"
> fi
> # create data directory
> if [ ! -d $data ]
> then
>  echo "creating $data directory"
>  mkdir -p $data
> else
>  echo "$data directory already created"
> fi
> }
> #
> # show the given timestamp
> #
> timestamp() {
>  local msg="$1"
>  local ts=$(date -u +"%Y-%m-%dT%H:%M:%SZ")
>  echo "$msg at $ts"
> }
> #
> # load data for the given data dir and input
> #
> loaddata() {
>  local data="$1"
>  local input="$2"
>  timestamp "start loading $input to $data"
>  $tdbloader --loader=parallel --loc "$data" "$input" > tdb2-$phase-out.log 2> 
> tdb2-$phase-err.log
>  timestamp "finished loading $input to $data"
> }
> getjena
> export TMPDIR=$base/tmp
> if [ ! -d $TMPDIR ]
> then
>  echo "creating temporary directory $TMPDIR"
>  mkdir $TMPDIR
> else
>  echo "using temporary directory $TMPDIR"
> fi
> if [ ! -f authorities-kongress_lds.ttl ]
> then
>  wget https://data.dnb.de/opendata/authorities-kongress_lds.ttl.gz
>  gunzip authorities-kongress_lds.ttl.gz
> fi
> loaddata $data authorities-kongress_lds.ttl
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to