[jira] [Updated] (SPARK-27603) Make ShuffleClient pluggable

2019-04-30 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chenzhao Guo updated SPARK-27603: - Description: ShuffleClient resides in BlockManager, it's the client to read other executors'

[jira] [Updated] (SPARK-27603) Make ShuffleClient pluggable

2019-04-30 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chenzhao Guo updated SPARK-27603: - Description: ShuffleClient resides in BlockManager, it's the client to read other executors'

[jira] [Created] (SPARK-27603) Make ShuffleClient pluggable

2019-04-30 Thread Chenzhao Guo (JIRA)
Chenzhao Guo created SPARK-27603: Summary: Make ShuffleClient pluggable Key: SPARK-27603 URL: https://issues.apache.org/jira/browse/SPARK-27603 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-26268) Decouple shuffle data from Spark deployment

2019-04-25 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16825814#comment-16825814 ] Chenzhao Guo commented on SPARK-26268: -- Actually this can be resolved in SPARK-25299, there is no

[jira] [Created] (SPARK-23667) Scala version check will fail due to launcher directory doesn't exist

2018-03-13 Thread Chenzhao Guo (JIRA)
Chenzhao Guo created SPARK-23667: Summary: Scala version check will fail due to launcher directory doesn't exist Key: SPARK-23667 URL: https://issues.apache.org/jira/browse/SPARK-23667 Project: Spark

[jira] [Created] (SPARK-22671) SortMergeJoin read more data when wholeStageCodegen is off compared with when it is on

2017-12-01 Thread Chenzhao Guo (JIRA)
Chenzhao Guo created SPARK-22671: Summary: SortMergeJoin read more data when wholeStageCodegen is off compared with when it is on Key: SPARK-22671 URL: https://issues.apache.org/jira/browse/SPARK-22671

[jira] [Created] (SPARK-22537) Aggregation of map output statistics on driver faces single point bottleneck

2017-11-15 Thread Chenzhao Guo (JIRA)
Chenzhao Guo created SPARK-22537: Summary: Aggregation of map output statistics on driver faces single point bottleneck Key: SPARK-22537 URL: https://issues.apache.org/jira/browse/SPARK-22537

[jira] [Created] (SPARK-22524) Subquery on UI appear as the same node even if it's not reused

2017-11-15 Thread Chenzhao Guo (JIRA)
Chenzhao Guo created SPARK-22524: Summary: Subquery on UI appear as the same node even if it's not reused Key: SPARK-22524 URL: https://issues.apache.org/jira/browse/SPARK-22524 Project: Spark

[jira] [Created] (SPARK-21412) Reset BufferHolder while initialize an UnsafeRowWriter

2017-07-13 Thread Chenzhao Guo (JIRA)
Chenzhao Guo created SPARK-21412: Summary: Reset BufferHolder while initialize an UnsafeRowWriter Key: SPARK-21412 URL: https://issues.apache.org/jira/browse/SPARK-21412 Project: Spark Issue

[jira] [Commented] (SPARK-20927) Add cache operator to Unsupported Operations in Structured Streaming

2017-06-12 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16046476#comment-16046476 ] Chenzhao Guo commented on SPARK-20927: -- What exactly is 'no-op' ? Does that mean scala

[jira] [Commented] (SPARK-20028) Implement NGrams aggregate function

2017-04-27 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15986646#comment-15986646 ] Chenzhao Guo commented on SPARK-20028: -- N-gram is a popular concept in NLP field, while Spark

[jira] [Updated] (SPARK-20028) Implement NGrams aggregate function

2017-04-27 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chenzhao Guo updated SPARK-20028: - Description: This is the implementation of `ngrams` aggregate expression which is also

[jira] [Updated] (SPARK-20028) Implement NGrams aggregate function

2017-03-20 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chenzhao Guo updated SPARK-20028: - Description: N-grams are subsequences of length N drawn from a longer sequence. The purpose of

[jira] [Created] (SPARK-20028) Implement NGrams aggregate function

2017-03-20 Thread Chenzhao Guo (JIRA)
Chenzhao Guo created SPARK-20028: Summary: Implement NGrams aggregate function Key: SPARK-20028 URL: https://issues.apache.org/jira/browse/SPARK-20028 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-19084) conditional function: field

2017-01-05 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chenzhao Guo updated SPARK-19084: - Target Version/s: 2.2.0 Fix Version/s: (was: 2.2.0) > conditional function: field >

[jira] [Updated] (SPARK-19084) conditional function: field

2017-01-05 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chenzhao Guo updated SPARK-19084: - Summary: conditional function: field (was: misc function: field) > conditional function: field

[jira] [Updated] (SPARK-19084) misc function: field

2017-01-05 Thread Chenzhao Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chenzhao Guo updated SPARK-19084: - Description: field(str, str1, str2, ... ) is a variable-length(>=2) function which returns the

[jira] [Created] (SPARK-19084) misc function: field

2017-01-05 Thread Chenzhao Guo (JIRA)
Chenzhao Guo created SPARK-19084: Summary: misc function: field Key: SPARK-19084 URL: https://issues.apache.org/jira/browse/SPARK-19084 Project: Spark Issue Type: Sub-task