Re: Fwd: Carbon format performance intro

2018-11-05 Thread manhua
You can see this, but it is an old version(carbon 1.2) https://cwiki.apache.org/confluence/display/CARBONDATA/Performance+-+TPCH+Report+of+CarbonData+%281.2+version%29+and+Parquet+on+Spark+Execution+Engine - Regards Manhua -- Sent from: http://apache-carbondata-dev-mailing-list-archive.11

Re: [ISSUE] carbondata1.5.0 and spark 2.3.2 query plan issue

2018-11-05 Thread xuchuanyin
Hi, aaron. For the wrong pruning information statistics in the query plan, do you execute the queries concurrently? I noticed that the pruning collector is single thread, if you ran queries concurrently, the statistics for pruning will be incorrect. -- Sent from: http://apache-carbondata-dev-ma

RE: Proposal to integrate QATCodec into Carbondata

2018-11-05 Thread Xu, Cheng A
Yes, we have the performance number against Snappy. It's included in our proposal. The performance various depending on workloads. > For the sort workload (input, intermediate data, output are all > compression-enabled, 3TB data scale, 5 workers, 2 replica for data) with Map > Reduce, using QAT

Re: [Discussion] Encryption support for carbondata files

2018-11-05 Thread brijoobopanna
Encryption Algo are CPU intensive, Any analysis to guarantee performance will have no impact , Any other file format already support this and what is the motivational real time use case behind this support -- Sent from: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/

Re: Proposal to integrate QATCodec into Carbondata

2018-11-05 Thread brijoobopanna
Thanks por proposing this QATCodec If any performance benchmarks are already available wrt Snappy or ZSTD -- Sent from: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/