[jira] [Created] (SPARK-11150) Dynamic partition pruning

2015-10-16 Thread Younes (JIRA)
Younes created SPARK-11150: -- Summary: Dynamic partition pruning Key: SPARK-11150 URL: https://issues.apache.org/jira/browse/SPARK-11150 Project: Spark Issue Type: Bug Components: SQL

[jira] [Updated] (SPARK-11152) Streaming UI: Input sizes are 0 for makeup batches started from a checkpoint

2015-10-16 Thread Yongjia Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yongjia Wang updated SPARK-11152: - Description: When a streaming job is resumed from a checkpoint at batch time x, and say the

[jira] [Updated] (SPARK-11153) Turns off Parquet filter push-down for string and binary columns

2015-10-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-11153: --- Description: Due to PARQUET-251, {{BINARY}} columns in existing Parquet files may be written with

[jira] [Resolved] (SPARK-10953) Benchmark codegen vs. hand-written code for univariate statistics

2015-10-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10953. --- Resolution: Done Fix Version/s: 1.6.0 > Benchmark codegen vs. hand-written code for

[jira] [Commented] (SPARK-10994) Local clustering coefficient computation in GraphX

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961000#comment-14961000 ] Apache Spark commented on SPARK-10994: -- User 'SherlockYang' has created a pull request for this

[jira] [Assigned] (SPARK-10994) Local clustering coefficient computation in GraphX

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10994: Assignee: (was: Apache Spark) > Local clustering coefficient computation in GraphX >

[jira] [Assigned] (SPARK-10994) Local clustering coefficient computation in GraphX

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10994: Assignee: Apache Spark > Local clustering coefficient computation in GraphX >

[jira] [Created] (SPARK-11149) Improve performance of primitive types in columnar cache

2015-10-16 Thread Davies Liu (JIRA)
Davies Liu created SPARK-11149: -- Summary: Improve performance of primitive types in columnar cache Key: SPARK-11149 URL: https://issues.apache.org/jira/browse/SPARK-11149 Project: Spark Issue

[jira] [Created] (SPARK-11153) Turns off Parquet filter push-down for string and binary columns

2015-10-16 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-11153: -- Summary: Turns off Parquet filter push-down for string and binary columns Key: SPARK-11153 URL: https://issues.apache.org/jira/browse/SPARK-11153 Project: Spark

[jira] [Updated] (SPARK-11153) Turns off Parquet filter push-down for string and binary columns

2015-10-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-11153: --- Priority: Blocker (was: Critical) > Turns off Parquet filter push-down for string and binary

[jira] [Created] (SPARK-11155) Stage summary json should include stage duration

2015-10-16 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-11155: Summary: Stage summary json should include stage duration Key: SPARK-11155 URL: https://issues.apache.org/jira/browse/SPARK-11155 Project: Spark Issue

[jira] [Updated] (SPARK-10895) Add pushdown string filters for Parquet

2015-10-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10895: --- Assignee: Liang-Chi Hsieh > Add pushdown string filters for Parquet >

[jira] [Commented] (SPARK-10165) Nested Hive UDF resolution fails in Analyzer

2015-10-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961159#comment-14961159 ] Michael Armbrust commented on SPARK-10165: -- That sounds like a different issue. Please open up

[jira] [Created] (SPARK-11154) make specificaition spark.yarn.executor.memoryOverhead consistent with typical JVM options

2015-10-16 Thread Dustin Cote (JIRA)
Dustin Cote created SPARK-11154: --- Summary: make specificaition spark.yarn.executor.memoryOverhead consistent with typical JVM options Key: SPARK-11154 URL: https://issues.apache.org/jira/browse/SPARK-11154

[jira] [Issue Comment Deleted] (SPARK-10994) Local clustering coefficient computation in GraphX

2015-10-16 Thread Yang Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Yang updated SPARK-10994: -- Comment: was deleted (was: Proposed implementation: https://github.com/amplab/graphx/pull/148/) >

[jira] [Created] (SPARK-11152) Streaming UI: Input sizes are 0 for makeup batches started from a checkpoint

2015-10-16 Thread Yongjia Wang (JIRA)
Yongjia Wang created SPARK-11152: Summary: Streaming UI: Input sizes are 0 for makeup batches started from a checkpoint Key: SPARK-11152 URL: https://issues.apache.org/jira/browse/SPARK-11152

[jira] [Updated] (SPARK-11152) Streaming UI: Input sizes are 0 for makeup batches started from a checkpoint

2015-10-16 Thread Yongjia Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yongjia Wang updated SPARK-11152: - Description: When a streaming job starts from a checkpoint at batch time x, and say the current

[jira] [Commented] (SPARK-11149) Improve performance of primitive types in columnar cache

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961003#comment-14961003 ] Apache Spark commented on SPARK-11149: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11149) Improve performance of primitive types in columnar cache

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11149: Assignee: Apache Spark (was: Davies Liu) > Improve performance of primitive types in

[jira] [Assigned] (SPARK-11149) Improve performance of primitive types in columnar cache

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11149: Assignee: Davies Liu (was: Apache Spark) > Improve performance of primitive types in

[jira] [Commented] (SPARK-11153) Turns off Parquet filter push-down for string and binary columns

2015-10-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961147#comment-14961147 ] Michael Armbrust commented on SPARK-11153: -- It's actually corrupted statistics in data that is

[jira] [Commented] (SPARK-10953) Benchmark codegen vs. hand-written code for univariate statistics

2015-10-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961209#comment-14961209 ] Xiangrui Meng commented on SPARK-10953: --- That sounds good. I'm closing this for now since the

[jira] [Created] (SPARK-11151) Use Long internally for DecimalType with precision <= 18

2015-10-16 Thread Davies Liu (JIRA)
Davies Liu created SPARK-11151: -- Summary: Use Long internally for DecimalType with precision <= 18 Key: SPARK-11151 URL: https://issues.apache.org/jira/browse/SPARK-11151 Project: Spark Issue

[jira] [Commented] (SPARK-11147) HTTP 500 if try to access Spark UI in yarn-cluster

2015-10-16 Thread Sebastian YEPES FERNANDEZ (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961088#comment-14961088 ] Sebastian YEPES FERNANDEZ commented on SPARK-11147: --- I don't think it's a networking

[jira] [Resolved] (SPARK-11124) JsonParser/Generator should be closed for resource recycle

2015-10-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-11124. - Resolution: Fixed Assignee: Navis Fix Version/s: 1.6.0 > JsonParser/Generator

[jira] [Commented] (SPARK-11154) make specificaition spark.yarn.executor.memoryOverhead consistent with typical JVM options

2015-10-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961184#comment-14961184 ] Sean Owen commented on SPARK-11154: --- Should be for all similar properties, not just this one. The twist

[jira] [Updated] (SPARK-10641) skewness and kurtosis support

2015-10-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10641: -- Attachment: simpler-moments.pdf I did some calculation offline and got a simpler formula for

[jira] [Created] (SPARK-11157) Allow Spark to be built without assemblies

2015-10-16 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-11157: -- Summary: Allow Spark to be built without assemblies Key: SPARK-11157 URL: https://issues.apache.org/jira/browse/SPARK-11157 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11155) Stage summary json should include stage duration

2015-10-16 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961366#comment-14961366 ] Kay Ousterhout commented on SPARK-11155: [~imranr] where exactly do you mean this is missing? I

[jira] [Resolved] (SPARK-11050) PySpark SparseVector can return wrong index in error message

2015-10-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11050. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9069

[jira] [Assigned] (SPARK-11153) Turns off Parquet filter push-down for string and binary columns

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11153: Assignee: Cheng Lian (was: Apache Spark) > Turns off Parquet filter push-down for string

[jira] [Commented] (SPARK-11153) Turns off Parquet filter push-down for string and binary columns

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961300#comment-14961300 ] Apache Spark commented on SPARK-11153: -- User 'liancheng' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11158) Add more information in Error statment for sql/types _verify_type()

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11158: Assignee: Apache Spark > Add more information in Error statment for sql/types

[jira] [Assigned] (SPARK-11158) Add more information in Error statment for sql/types _verify_type()

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11158: Assignee: (was: Apache Spark) > Add more information in Error statment for sql/types

[jira] [Commented] (SPARK-11158) Add more information in Error statment for sql/types _verify_type()

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961315#comment-14961315 ] Apache Spark commented on SPARK-11158: -- User 'lababidi' has created a pull request for this issue:

[jira] [Resolved] (SPARK-10581) Groups are not resolved in scaladoc for org.apache.spark.sql.Column

2015-10-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-10581. - Resolution: Fixed Fix Version/s: 1.6.0 1.5.2 > Groups are not resolved

[jira] [Comment Edited] (SPARK-10641) skewness and kurtosis support

2015-10-16 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961359#comment-14961359 ] Seth Hendrickson edited comment on SPARK-10641 at 10/16/15 9:07 PM:

[jira] [Updated] (SPARK-10581) Groups are not resolved in scaladoc for org.apache.spark.sql.Column

2015-10-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10581: Assignee: Pravin Vishnu Gadakh > Groups are not resolved in scaladoc for

[jira] [Created] (SPARK-11156) Web UI doesn't count or show info about replicated blocks

2015-10-16 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-11156: - Summary: Web UI doesn't count or show info about replicated blocks Key: SPARK-11156 URL: https://issues.apache.org/jira/browse/SPARK-11156 Project: Spark

[jira] [Resolved] (SPARK-9409) make-distribution.sh should copy all files in conf, so that it's easy to create a distro with custom configuration and property settings

2015-10-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-9409. --- Resolution: Won't Fix > make-distribution.sh should copy all files in conf, so that it's easy to >

[jira] [Commented] (SPARK-11155) Stage summary json should include stage duration

2015-10-16 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961282#comment-14961282 ] Xin Ren commented on SPARK-11155: - Hi, I'd like to have a try on this one. Thanks > Stage summary json

[jira] [Assigned] (SPARK-11127) Upgrade Kinesis Client Library to the latest stable version

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11127: Assignee: Tathagata Das (was: Apache Spark) > Upgrade Kinesis Client Library to the

[jira] [Commented] (SPARK-11127) Upgrade Kinesis Client Library to the latest stable version

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961308#comment-14961308 ] Apache Spark commented on SPARK-11127: -- User 'mengxr' has created a pull request for this issue:

[jira] [Created] (SPARK-11160) CloudPickeSerializer conflicts with xmlrunner

2015-10-16 Thread Gabor Liptak (JIRA)
Gabor Liptak created SPARK-11160: Summary: CloudPickeSerializer conflicts with xmlrunner Key: SPARK-11160 URL: https://issues.apache.org/jira/browse/SPARK-11160 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11153) Turns off Parquet filter push-down for string and binary columns

2015-10-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961281#comment-14961281 ] Cheng Lian commented on SPARK-11153: Yes, it's the statistics information that is corrupted. And yes,

[jira] [Assigned] (SPARK-11153) Turns off Parquet filter push-down for string and binary columns

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11153: Assignee: Apache Spark (was: Cheng Lian) > Turns off Parquet filter push-down for string

[jira] [Commented] (SPARK-9162) Implement code generation for ScalaUDF

2015-10-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961335#comment-14961335 ] Reynold Xin commented on SPARK-9162: [~viirya] can you work on this? We can then close this umbrella

[jira] [Resolved] (SPARK-8100) Make able to refer lost executor log

2015-10-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-8100. --- Resolution: Duplicate This looks like a duplicate of SPARK-7729 > Make able to refer lost executor

[jira] [Updated] (SPARK-11157) Allow Spark to be built without assemblies

2015-10-16 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-11157: --- Attachment: no-assemblies.pdf > Allow Spark to be built without assemblies >

[jira] [Assigned] (SPARK-11127) Upgrade Kinesis Client Library to the latest stable version

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11127: Assignee: Apache Spark (was: Tathagata Das) > Upgrade Kinesis Client Library to the

[jira] [Assigned] (SPARK-11127) Upgrade Kinesis Client Library to the latest stable version

2015-10-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-11127: - Assignee: Xiangrui Meng (was: Tathagata Das) > Upgrade Kinesis Client Library to the

[jira] [Commented] (SPARK-6859) Parquet File Binary column statistics error when reuse byte[] among rows

2015-10-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961318#comment-14961318 ] Cheng Lian commented on SPARK-6859: --- This issue was left unresolved because Parquet filter push-down

[jira] [Updated] (SPARK-8360) Streaming DataFrames

2015-10-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8360: --- Target Version/s: (was: 1.6.0) > Streaming DataFrames > > >

[jira] [Commented] (SPARK-11153) Turns off Parquet filter push-down for string and binary columns

2015-10-16 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961371#comment-14961371 ] Felix Cheung commented on SPARK-11153: -- so the corrupted stats data would still be a problem when

[jira] [Commented] (SPARK-10641) skewness and kurtosis support

2015-10-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961396#comment-14961396 ] Xiangrui Meng commented on SPARK-10641: --- See attached PDF file. > skewness and kurtosis support >

[jira] [Created] (SPARK-11158) Add more information in Error statment for sql/types _verify_type()

2015-10-16 Thread Mahmoud Lababidi (JIRA)
Mahmoud Lababidi created SPARK-11158: Summary: Add more information in Error statment for sql/types _verify_type() Key: SPARK-11158 URL: https://issues.apache.org/jira/browse/SPARK-11158 Project:

[jira] [Resolved] (SPARK-10974) Add progress bar for output operation column and use red dots for failed batches

2015-10-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-10974. --- Resolution: Fixed > Add progress bar for output operation column and use red dots for failed

[jira] [Resolved] (SPARK-11104) A potential deadlock in StreamingContext.stop and stopOnShutdown

2015-10-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-11104. --- Resolution: Fixed Assignee: Shixiong Zhu Fix Version/s: 1.6.0

[jira] [Resolved] (SPARK-11109) move FsHistoryProvider off import org.apache.hadoop.fs.permission.AccessControlException

2015-10-16 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-11109. Resolution: Fixed Assignee: Glenn Weidner Fix Version/s: 1.6.0 > move

[jira] [Commented] (SPARK-10641) skewness and kurtosis support

2015-10-16 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961359#comment-14961359 ] Seth Hendrickson commented on SPARK-10641: -- [~mengxr] I am interested, do you mind providing it

[jira] [Created] (SPARK-11159) Nested SQL UDF raises java.lang.UnsupportedOperationException: Cannot evaluate expression

2015-10-16 Thread Jacob Wellington (JIRA)
Jacob Wellington created SPARK-11159: Summary: Nested SQL UDF raises java.lang.UnsupportedOperationException: Cannot evaluate expression Key: SPARK-11159 URL: https://issues.apache.org/jira/browse/SPARK-11159

[jira] [Commented] (SPARK-11153) Turns off Parquet filter push-down for string and binary columns

2015-10-16 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961374#comment-14961374 ] Felix Cheung commented on SPARK-11153: -- re-read what you said, I think it makes sense. I assume it

[jira] [Updated] (SPARK-11050) PySpark SparseVector can return wrong index in error message

2015-10-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11050: -- Assignee: Bhargav Mangipudi > PySpark SparseVector can return wrong index in error

[jira] [Commented] (SPARK-11087) spark.sql.orc.filterPushdown does not work, No ORC pushdown predicate

2015-10-16 Thread patcharee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960296#comment-14960296 ] patcharee commented on SPARK-11087: --- [~zhazhan] Below is my test. Please check. I tried to change

[jira] [Resolved] (SPARK-10974) Add progress bar for output operation column and use red dots for failed batches

2015-10-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-10974. --- Resolution: Fixed Fix Version/s: 1.6.0 > Add progress bar for output operation column

[jira] [Commented] (SPARK-7271) Redesign shuffle interface for binary processing

2015-10-16 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7271?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960363#comment-14960363 ] Hong Shen commented on SPARK-7271: -- Hi, I have a question: are you planning to redesign the shuffle reader to

[jira] [Assigned] (SPARK-10974) Add progress bar for output operation column and use red dots for failed batches

2015-10-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reassigned SPARK-10974: - Assignee: Tathagata Das > Add progress bar for output operation column and use red dots

[jira] [Updated] (SPARK-10974) Add progress bar for output operation column and use red dots for failed batches

2015-10-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-10974: -- Assignee: Shixiong Zhu (was: Tathagata Das) > Add progress bar for output operation column

[jira] [Resolved] (SPARK-3950) Completed time is blank for some successful tasks

2015-10-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3950. -- Resolution: Cannot Reproduce > Completed time is blank for some successful tasks >

[jira] [Reopened] (SPARK-10974) Add progress bar for output operation column and use red dots for failed batches

2015-10-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reopened SPARK-10974: --- > Add progress bar for output operation column and use red dots for failed > batches >

[jira] [Created] (SPARK-11145) Cannot filter using a partition key and another column

2015-10-16 Thread Julien Buret (JIRA)
Julien Buret created SPARK-11145: Summary: Cannot filter using a partition key and another column Key: SPARK-11145 URL: https://issues.apache.org/jira/browse/SPARK-11145 Project: Spark Issue

[jira] [Commented] (SPARK-11139) Make SparkContext.stop() exception-safe

2015-10-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960378#comment-14960378 ] Sean Owen commented on SPARK-11139: --- Yes please. StreamingContext probably needs a similar treatment:

[jira] [Resolved] (SPARK-11137) Make StreamingContext.stop() exception-safe

2015-10-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-11137. --- Resolution: Duplicate If you don't mind, this is too logically related to SPARK-11139 to make

[jira] [Commented] (SPARK-11143) SparkMesosDispatcher can not launch driver in docker

2015-10-16 Thread Klaus Ma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960420#comment-14960420 ] Klaus Ma commented on SPARK-11143: -- I addressed the issue by a new docker image which is more

[jira] [Resolved] (SPARK-11060) Fix some potential NPEs in DStream transformation

2015-10-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-11060. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9070

[jira] [Updated] (SPARK-11060) Fix some potential NPEs in DStream transformation

2015-10-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11060: -- Assignee: Saisai Shao > Fix some potential NPEs in DStream transformation >

[jira] [Created] (SPARK-11144) Add SparkLauncher for Spark Streaming, Spark SQL, etc

2015-10-16 Thread Yuhang Chen (JIRA)
Yuhang Chen created SPARK-11144: --- Summary: Add SparkLauncher for Spark Streaming, Spark SQL, etc Key: SPARK-11144 URL: https://issues.apache.org/jira/browse/SPARK-11144 Project: Spark Issue

[jira] [Comment Edited] (SPARK-11143) SparkMesosDispatcher can not launch driver in docker

2015-10-16 Thread Klaus Ma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960420#comment-14960420 ] Klaus Ma edited comment on SPARK-11143 at 10/16/15 9:24 AM: I addressed the

[jira] [Assigned] (SPARK-10581) Groups are not resolved in scaladoc for org.apache.spark.sql.Column

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10581: Assignee: (was: Apache Spark) > Groups are not resolved in scaladoc for

[jira] [Commented] (SPARK-10965) Optimize filesEqualRecursive

2015-10-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960515#comment-14960515 ] Sean Owen commented on SPARK-10965: --- I'd like to resolve this, at least for now. I am not sure I see a

[jira] [Assigned] (SPARK-10581) Groups are not resolved in scaladoc for org.apache.spark.sql.Column

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10581: Assignee: Apache Spark > Groups are not resolved in scaladoc for

[jira] [Resolved] (SPARK-11092) Add source URLs to API documentation.

2015-10-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-11092. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9110

[jira] [Commented] (SPARK-10581) Groups are not resolved in scaladoc for org.apache.spark.sql.Column

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960513#comment-14960513 ] Apache Spark commented on SPARK-10581: -- User 'pravingadakh' has created a pull request for this

[jira] [Resolved] (SPARK-11146) missing or invalid dependency detected while loading class file 'RDDOperationScope.class

2015-10-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-11146. --- Resolution: Cannot Reproduce Since all tests are passing, it sounds strongly like a problem local to

[jira] [Updated] (SPARK-11094) Test runner script fails to parse Java version.

2015-10-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11094: -- Assignee: Jakob Odersky > Test runner script fails to parse Java version. >

[jira] [Created] (SPARK-11147) HTTP 500 if try to access Spark UI in yarn-cluster

2015-10-16 Thread Sebastian YEPES FERNANDEZ (JIRA)
Sebastian YEPES FERNANDEZ created SPARK-11147: - Summary: HTTP 500 if try to access Spark UI in yarn-cluster Key: SPARK-11147 URL: https://issues.apache.org/jira/browse/SPARK-11147 Project:

[jira] [Resolved] (SPARK-10965) Optimize filesEqualRecursive

2015-10-16 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Grover resolved SPARK-10965. - Resolution: Won't Fix Thanks Sean. Marking this as Won't Fix since I don't think this is super

[jira] [Commented] (SPARK-10754) table and column name are case sensitive when json Dataframe was registered as tempTable using JavaSparkContext.

2015-10-16 Thread Rick Hillegas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14960802#comment-14960802 ] Rick Hillegas commented on SPARK-10754: --- Note that unquoted identifiers are case-insensitive in the

[jira] [Created] (SPARK-11148) Unable to create views

2015-10-16 Thread Lunen (JIRA)
Lunen created SPARK-11148: - Summary: Unable to create views Key: SPARK-11148 URL: https://issues.apache.org/jira/browse/SPARK-11148 Project: Spark Issue Type: Bug Components: SQL

[jira] [Commented] (SPARK-9999) RDD-like API on top of Catalyst/DataFrame

2015-10-16 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961513#comment-14961513 ] Sandy Ryza commented on SPARK-: --- So ClassTags would work for case classes and Avro specific records,

[jira] [Assigned] (SPARK-11070) Remove older releases on dist.apache.org

2015-10-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reassigned SPARK-11070: --- Assignee: Patrick Wendell > Remove older releases on dist.apache.org >

[jira] [Assigned] (SPARK-11163) Remove unnecessary addPendingTask calls in TaskSetManager.executorLost

2015-10-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11163: Assignee: Apache Spark (was: Kay Ousterhout) > Remove unnecessary addPendingTask calls

[jira] [Resolved] (SPARK-10599) Decrease communication in BlockMatrix multiply and increase performance

2015-10-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-10599. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8757

[jira] [Created] (SPARK-11162) Allow enabling debug logging from the command line

2015-10-16 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-11162: - Summary: Allow enabling debug logging from the command line Key: SPARK-11162 URL: https://issues.apache.org/jira/browse/SPARK-11162 Project: Spark Issue

[jira] [Commented] (SPARK-10877) Assertions fail straightforward DataFrame job due to word alignment

2015-10-16 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961505#comment-14961505 ] Davies Liu commented on SPARK-10877: This is already fixed in master and 1.5 branch. > Assertions

[jira] [Commented] (SPARK-11070) Remove older releases on dist.apache.org

2015-10-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961515#comment-14961515 ] Patrick Wendell commented on SPARK-11070: - I removed them - I did leave 1.5.0 for now, but we can

[jira] [Resolved] (SPARK-11070) Remove older releases on dist.apache.org

2015-10-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-11070. - Resolution: Fixed > Remove older releases on dist.apache.org >

[jira] [Commented] (SPARK-9999) RDD-like API on top of Catalyst/DataFrame

2015-10-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14961518#comment-14961518 ] Michael Armbrust commented on SPARK-: - Yeah, I think tuples are a pretty important use case.

[jira] [Updated] (SPARK-11163) Remove unnecessary addPendingTask calls in TaskSetManager.executorLost

2015-10-16 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-11163: --- Summary: Remove unnecessary addPendingTask calls in TaskSetManager.executorLost (was:

[jira] [Created] (SPARK-11163) Remove unnecessary addPendingTask calls in TaskSetManager

2015-10-16 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-11163: -- Summary: Remove unnecessary addPendingTask calls in TaskSetManager Key: SPARK-11163 URL: https://issues.apache.org/jira/browse/SPARK-11163 Project: Spark
