[ https://issues.apache.org/jira/browse/JENA-2225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476636#comment-17476636 ]
ASF subversion and git services commented on JENA-2225:
-------------------------------------------------------
Commit f13c9f8e3268716b6c6efc84952a5c0f65d97163 in jena's branch
refs/heads/main from Andy Seaborne
[ https://gitbox.apache.org/repos/asf?p=jena.git;h=f13c9f8 ]
JENA-2225: All counts to be Long, not Integer
> TDB/TDB2 dataset size stat serialized incorrectly for large datasets
> --------------------------------------------------------------------
>
> Key: JENA-2225
> URL: https://issues.apache.org/jira/browse/JENA-2225
> Project: Apache Jena
> Issue Type: Bug
> Components: TDB, TDB2
> Affects Versions: Jena 4.3.1
> Reporter: Lorenz Bühmann
> Assignee: Andy Seaborne
> Priority: Minor
> Fix For: Jena 4.4.0
>
>
> When computing the TDB/TDB2 stats via the CLI, the dataset size is serialized
> incorrectly for large datasets.
> For example, for the latest Wikidata Truthy we get
> {noformat}
> (count -1983667112)){noformat}
> This happens because, in both TDB and TDB2, the corresponding `Stats.java` class
> forces an Integer-typed Node even though the value is a long:
> {code:java}
> if ( count >= 0 )
> addPair(meta.getList(), StatsMatcher.COUNT,
> NodeFactoryExtra.intToNode((int)count)) ; {code}
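
The effect of the `(int)` cast quoted above can be reproduced in plain Java without any Jena dependency. The sketch below is illustrative only: the class and method names are hypothetical, and the input value is an assumption, namely the smallest non-negative long whose low 32 bits equal the reported count. The real triple count may differ from it by a multiple of 2^32.

```java
public class CountNarrowing {

    // Mirrors the (int) cast in the quoted snippet: a narrowing conversion
    // that keeps only the low 32 bits of the long value.
    static int narrowToInt(long count) {
        return (int) count;
    }

    // Hypothetical guard, not Jena code: Math.toIntExact throws
    // ArithmeticException instead of silently wrapping.
    static int narrowChecked(long count) {
        return Math.toIntExact(count);
    }

    public static void main(String[] args) {
        // Assumed value, chosen so that its low 32 bits match the reported count.
        long count = 2_311_300_184L;

        System.out.println("(count " + narrowToInt(count) + ")"); // (count -1983667112)
        System.out.println("(count " + count + ")");              // (count 2311300184)

        try {
            narrowChecked(count);
        } catch (ArithmeticException e) {
            System.out.println("overflow detected: " + e.getMessage());
        }
    }
}
```

Keeping the value as a long from end to end, as the commit title suggests, avoids the wrap-around entirely. The checked variant only shows how such a narrowing could be made to fail loudly.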
--
This message was sent by Atlassian Jira
(v8.20.1#820001)