Lorenz Bühmann created JENA-2225:
------------------------------------
Summary: TDB/TDB2 dataset size stat serialized inccorrectly for
large datasets
Key: JENA-2225
URL: https://issues.apache.org/jira/browse/JENA-2225
Project: Apache Jena
Issue Type: Bug
Components: TDB, TDB2
Affects Versions: Jena 4.3.1
Reporter: Lorenz Bühmann
When computing the TDB/TDB2 stats via CLI the size will be serialized
incorrectly for large datasets.
For example for latest Wikidata Truthy we get
{noformat}
(count -1983667112)){noformat}
This happens because for both the corresponding `Stats.java` class does enforce
an Integer type Node though the value is a long type:
{code:java}
if ( count >= 0 )
addPair(meta.getList(), StatsMatcher.COUNT,
NodeFactoryExtra.intToNode((int)count)) ; {code}
--
This message was sent by Atlassian Jira
(v8.20.1#820001)