[ 
https://issues.apache.org/jira/browse/JENA-1909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonas Sourlier updated JENA-1909:
---------------------------------
    Description: 
This might be related to JENA-1908, but since the stack trace is different, I 
opened a second ticket.

I tried to import the latest Wikidata dump into Apache Jena, using the 
following setup:
 * Ubuntu 20.04 on Windows 10 Subsystem for Linux
 * Apache Jena 3.15.0
 * Intel i7 4770K, 32GB RAM
 * Java version:
{code:java}
openjdk 11.0.7 2020-04-14
OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-3ubuntu1)
OpenJDK 64-Bit Server VM (build 11.0.7+10-post-Ubuntu-3ubuntu1, mixed mode, sharing){code}

These are the commands I have run:
{code:java}
wget -c http://mirror.easyname.ch/apache/jena/binaries/apache-jena-3.15.0.tar.gz
tar -xvzf apache-jena-3.15.0.tar.gz
mkdir data
apache-jena-3.15.0/bin/tdbloader2 --phase data --loc data/ ../latest-all.ttl > tdb1.log 2> tdb2.log &
apache-jena-3.15.0/bin/tdbloader2 --phase index --loc data/ > tdb1.log 2> tdb2.log &

{code}
The data phase ran fine, but the index phase crashed after about 10 hours. The 
stack trace from the error output is in the gist linked here (Jira garbled it 
when I pasted it directly):

[https://gist.github.com/yolpsoftware/31e4892f457df4bd5fd70a7b1e3dae4c]

Here's the standard output:
{code:java}
 08:47:57 INFO -- TDB Bulk Loader Start
 08:47:57 INFO Index Building Phase
 08:47:57 INFO Creating Index SPO
 08:47:58 INFO Sort SPO
 18:26:19 INFO Sort SPO Completed
 18:26:19 INFO Build SPO
{code}
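To check the "about 10 hours" figure against the log, the timestamps above can be converted into durations. The following is a minimal Python sketch, not part of Jena: the helper names `parse_events` and `elapsed` are illustrative, and it assumes every line has the form `HH:MM:SS LEVEL message` and that all timestamps fall on the same day.

```python
from datetime import datetime, timedelta

# Standard output of the index phase, copied from the report above.
LOG = """\
08:47:57 INFO -- TDB Bulk Loader Start
08:47:57 INFO Index Building Phase
08:47:57 INFO Creating Index SPO
08:47:58 INFO Sort SPO
18:26:19 INFO Sort SPO Completed
18:26:19 INFO Build SPO
"""


def parse_events(log):
    """Return a list of (timestamp, message) pairs, one per non-empty line."""
    events = []
    for line in log.splitlines():
        line = line.strip()
        if not line:
            continue
        # Assumed layout: "HH:MM:SS LEVEL message text"
        stamp, _level, message = line.split(maxsplit=2)
        events.append((datetime.strptime(stamp, "%H:%M:%S"), message))
    return events


def elapsed(events, start_msg, end_msg):
    """Time between two log messages (assumes both occur on the same day)."""
    times = {message: stamp for stamp, message in events}
    return times[end_msg] - times[start_msg]


events = parse_events(LOG)
sort_time = elapsed(events, "Sort SPO", "Sort SPO Completed")
total_time = elapsed(events, "-- TDB Bulk Loader Start", "Build SPO")

print("Sort SPO took:", sort_time)
print("Start to last logged step:", total_time)
```

On this log the SPO sort alone accounts for 9 hours 38 minutes, and `Build SPO` is the last step logged before the crash.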
 

  was:
This might be related to JENA-1908, but since the stack trace is different, I 
opened a second ticket.

Tried to import the latest Wikidata dump into Apache Jena, using the following 
setup:
 * Ubuntu 20.04 on Windows 10 Subsystem for Linux
 * Apache Jena 3.15.0
 * Intel i7 4770K, 32GB RAM
 * 
{code:java}
openjdk 11.0.7 2020-04-14
OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-3ubuntu1)
OpenJDK 64-Bit Server VM (build 11.0.7+10-post-Ubuntu-3ubuntu1, mixed mode, sharing){code}

These are the commands I have run:
{code:java}
wget -c http://mirror.easyname.ch/apache/jena/binaries/apache-jena-3.15.0.tar.gz
tar -xvzf apache-jena-3.15.0.tar.gz
mkdir data
apache-jena-3.15.0/bin/tdbloader2 --phase data --loc data/ ../latest-all.ttl > tdb1.log 2> tdb2.log &
apache-jena-3.15.0/bin/tdbloader2 --phase index --loc data/ > tdb1.log 2> tdb2.log &

{code}
The data phase ran fine, but the index phase crashed after about 10 hours. This 
is the stack trace which appears in the error output:
{code:java}
{code}
Here's the standard output:
{code:java}
 08:47:57 INFO -- TDB Bulk Loader Start
 08:47:57 INFO Index Building Phase
 08:47:57 INFO Creating Index SPO
 08:47:58 INFO Sort SPO
 18:26:19 INFO Sort SPO Completed
 18:26:19 INFO Build SPO
{code}
 


> tdb2.tdbloader crashes
> ----------------------
>
>                 Key: JENA-1909
>                 URL: https://issues.apache.org/jira/browse/JENA-1909
>             Project: Apache Jena
>          Issue Type: Bug
>    Affects Versions: Jena 3.15.0
>            Reporter: Jonas Sourlier
>            Priority: Major
>
> This might be related to JENA-1908, but since the stack trace is different, I 
> opened a second ticket.
> Tried to import the latest Wikidata dump into Apache Jena, using the 
> following setup:
>  * Ubuntu 20.04 on Windows 10 Subsystem for Linux
>  * Apache Jena 3.15.0
>  * Intel i7 4770K, 32GB RAM
>  * 
> {code:java}
> openjdk 11.0.7 2020-04-14
> OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-3ubuntu1)
> OpenJDK 64-Bit Server VM (build 11.0.7+10-post-Ubuntu-3ubuntu1, mixed mode, sharing){code}
> These are the commands I have run:
> {code:java}
> wget -c http://mirror.easyname.ch/apache/jena/binaries/apache-jena-3.15.0.tar.gz
> tar -xvzf apache-jena-3.15.0.tar.gz
> mkdir data
> apache-jena-3.15.0/bin/tdbloader2 --phase data --loc data/ ../latest-all.ttl > tdb1.log 2> tdb2.log &
> apache-jena-3.15.0/bin/tdbloader2 --phase index --loc data/ > tdb1.log 2> tdb2.log &
> {code}
> The data phase ran fine, but the index phase crashed after about 10 hours. 
> This is the stack trace which appears in the error output (when pasting this, 
> Jira messed it up, thus I pasted it into a gist):
> [https://gist.github.com/yolpsoftware/31e4892f457df4bd5fd70a7b1e3dae4c]
> Here's the standard output:
> {code:java}
>  08:47:57 INFO -- TDB Bulk Loader Start
>  08:47:57 INFO Index Building Phase
>  08:47:57 INFO Creating Index SPO
>  08:47:58 INFO Sort SPO
>  18:26:19 INFO Sort SPO Completed
>  18:26:19 INFO Build SPO
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
