[ https://issues.apache.org/jira/browse/JENA-2225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17466529#comment-17466529 ]
ASF subversion and git services commented on JENA-2225:
-------------------------------------------------------
Commit 6696e4efdd0e441a9334d2338ea9fd9ce51a7c04 in jena's branch
refs/heads/main from Andy Seaborne
[ https://gitbox.apache.org/repos/asf?p=jena.git;h=6696e4e ]
JENA-2225: Handle large stats counts
> TDB/TDB2 dataset size stat serialized incorrectly for large datasets
> --------------------------------------------------------------------
>
> Key: JENA-2225
> URL: https://issues.apache.org/jira/browse/JENA-2225
> Project: Apache Jena
> Issue Type: Bug
> Components: TDB, TDB2
> Affects Versions: Jena 4.3.1
> Reporter: Lorenz Bühmann
> Priority: Minor
>
> When computing the TDB/TDB2 stats via the CLI, the size is serialized
> incorrectly for large datasets.
> For example, for the latest Wikidata Truthy we get:
> {noformat}
> (count -1983667112)){noformat}
> This happens because, for both TDB and TDB2, the corresponding `Stats.java`
> class forces an integer-typed Node even though the value is a long:
> {code:java}
> if ( count >= 0 )
>     addPair(meta.getList(), StatsMatcher.COUNT,
>             NodeFactoryExtra.intToNode((int)count)) ;
> {code}
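
To illustrate the narrowing cast described in the report, here is a minimal standalone sketch in plain Java. It does not use Jena. The class name, the method names and the input value 2311300184 are assumptions chosen for illustration: that value is simply the smallest non-negative long whose low 32 bits give the reported -1983667112, and the real triple count may differ by a multiple of 2^32.

```java
public class CountTruncationDemo {
    // Mirrors the (int) cast from the quoted snippet: only the low 32 bits
    // of the long survive, so values above Integer.MAX_VALUE wrap around.
    public static int narrowed(long count) {
        return (int) count;
    }

    // Hypothetical alternative: format the long directly, with no narrowing.
    public static String countItem(long count) {
        return "(count " + count + ")";
    }

    public static void main(String[] args) {
        long count = 2_311_300_184L; // assumed value, for illustration only

        // Prints "(count -1983667112)", the value seen in the report.
        System.out.println("(count " + narrowed(count) + ")");

        // Prints "(count 2311300184)".
        System.out.println(countItem(count));
    }
}
```

Because the wrapped value is negative, the `count >= 0` guard in the quoted snippet (which tests the long before the cast) does not catch it.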