[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-10-06 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 5: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 5
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Sat, 07 Oct 2023 00:28:24 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-10-06 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..

IMPALA-12076: Use ZSTD interfaces with reusable context

For repeated compression/decompression, ZSTD recommends
constructing a context once via ZSTD_createCCtx()/ZSTD_createDCtx()
and using the set of interfaces that passes in the context explicitly
to avoid constructing the context on each call.

This follows the recommendation and allocates the ZSTD context once for
each compressor / decompressor and reuses it for the lifetime of the
compressor / decompressor.

This gets a minor speedup for small-scale ZSTD TPC-H:
+--++-++++
| Workload | File Format| Avg (s) | Delta(Avg) | GeoMean(s) | 
Delta(GeoMean) |
+--++-++++
| TPCH(42) | parquet / zstd / block | 3.55| -1.40% | 2.52   | 
-1.63% |
+--++-++++

Testing:
 - Ran core job
 - Ran a perf-AB-test job

Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Reviewed-on: http://gerrit.cloudera.org:8080/19773
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/util/compress.cc
M be/src/util/compress.h
M be/src/util/decompress.cc
3 files changed, 26 insertions(+), 5 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 6
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: Michael Smith 


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-10-06 Thread Laszlo Gaal (Code Review)
Laszlo Gaal has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 4
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Fri, 06 Oct 2023 20:06:00 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-10-06 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 5:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/9792/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 5
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Fri, 06 Oct 2023 20:06:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-10-06 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 5: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 5
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Fri, 06 Oct 2023 20:06:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-25 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 4: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 4
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Fri, 25 Aug 2023 22:08:27 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-25 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/9636/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 4
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Fri, 25 Aug 2023 17:52:55 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-25 Thread Michael Smith (Code Review)
Michael Smith has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 4: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 4
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Fri, 25 Aug 2023 17:52:41 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-22 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 3: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/9622/


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 3
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Wed, 23 Aug 2023 05:15:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-22 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/9622/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 3
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Wed, 23 Aug 2023 00:59:21 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-22 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 2:

> Ah, we already did this for streaming decompression.

Right, the compressors/decompressors are single threaded and this was something 
that already required for streaming ZSTD.


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Wed, 23 Aug 2023 00:57:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-10 Thread Michael Smith (Code Review)
Michael Smith has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 2:

Ah, we already did this for streaming decompression.


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Thu, 10 Aug 2023 16:20:27 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-10 Thread Michael Smith (Code Review)
Michael Smith has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 2: Code-Review+1

(1 comment)

http://gerrit.cloudera.org:8080/#/c/19773/2/be/src/util/compress.h
File be/src/util/compress.h:

http://gerrit.cloudera.org:8080/#/c/19773/2/be/src/util/compress.h@153
PS2, Line 153:   ZSTD_CCtx* stream_ = nullptr;
Pretty sure instances of these classes aren't shared across threads. Gzip has a 
similar construct.

Main uses are tmp-file-mgr and hdfs-parquet-table-writer, which create an 
instance for their particular use (writing a temporary file block or 
sequentially to a file).



--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Thu, 10 Aug 2023 16:18:45 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-10 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Thu, 10 Aug 2023 07:06:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-09 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/13734/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Thu, 10 Aug 2023 03:17:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-09 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19773 )

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/9583/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Thu, 10 Aug 2023 02:50:51 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-12076: Use ZSTD interfaces with reusable context

2023-08-09 Thread Joe McDonnell (Code Review)
Hello Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/19773

to look at the new patch set (#2).

Change subject: IMPALA-12076: Use ZSTD interfaces with reusable context
..

IMPALA-12076: Use ZSTD interfaces with reusable context

For repeated compression/decompression, ZSTD recommends
constructing a context once via ZSTD_createCCtx()/ZSTD_createDCtx()
and using the set of interfaces that passes in the context explicitly
to avoid constructing the context on each call.

This follows the recommendation and allocates the ZSTD context once for
each compressor / decompressor and reuses it for the lifetime of the
compressor / decompressor.

This gets a minor speedup for small-scale ZSTD TPC-H:
+--++-++++
| Workload | File Format| Avg (s) | Delta(Avg) | GeoMean(s) | 
Delta(GeoMean) |
+--++-++++
| TPCH(42) | parquet / zstd / block | 3.55| -1.40% | 2.52   | 
-1.63% |
+--++-++++

Testing:
 - Ran core job
 - Ran a perf-AB-test job

Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
---
M be/src/util/compress.cc
M be/src/util/compress.h
M be/src/util/decompress.cc
3 files changed, 26 insertions(+), 5 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/73/19773/2
--
To view, visit http://gerrit.cloudera.org:8080/19773
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I5010a56bf8202ccb3f1710425002f81587fd412b
Gerrit-Change-Number: 19773
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins