This is an automated email from the ASF dual-hosted git repository.
mboehm7 pushed a change to branch main
in repository https://gitbox.apache.org/repos/asf/systemds.git
from 469d158568 [SYSTEMDS-3414] Fix allocation of large, multi-array dense
matrices
new 47ea25f624 [MINOR] Fix warnings, imports, and formatting issues
new 4a62c5214c [SYSTEMDS-3417] Fix integer overflow in
fast-buffered-input-stream
The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
.../java/org/apache/sysds/api/PythonDMLScript.java | 1 -
.../matrix/data/LibMatrixCountDistinct.java | 7 +-
.../transform/tokenize/DocumentRepresentation.java | 41 ++-
.../sysds/runtime/transform/tokenize/Token.java | 184 +++++-----
.../runtime/transform/tokenize/Tokenizer.java | 385 ++++++++++-----------
.../transform/tokenize/TokenizerFactory.java | 169 +++++----
.../tokenize/applier/TokenizerApplier.java | 343 +++++++++---------
.../tokenize/applier/TokenizerApplierCount.java | 168 +++++----
.../tokenize/applier/TokenizerApplierHash.java | 6 -
.../tokenize/applier/TokenizerApplierPosition.java | 2 -
.../tokenize/builder/TokenizerBuilder.java | 100 +++---
.../tokenize/builder/TokenizerBuilderNgram.java | 120 +++----
.../builder/TokenizerBuilderWhitespaceSplit.java | 90 +++--
.../runtime/util/FastBufferedDataInputStream.java | 13 +-
.../builtin/part2/BuiltinTomeklinkTest.java | 1 -
.../transform/TokenizeMultithreadedTest.java | 298 ++++++++--------
16 files changed, 942 insertions(+), 986 deletions(-)