[jira] [Commented] (SPARK-13337) DataFrame join-on-columns function should support null-safe equal

2016-03-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175204#comment-15175204 ] Xiao Li commented on SPARK-13337: - To get your results, try using left outer join + right out join +

[jira] [Updated] (SPARK-13614) show() trigger memory leak,why?

2016-03-01 Thread chillon_m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chillon_m updated SPARK-13614: -- Description: hot.count()=599147 ghot.size=21844 [bigdata@namenode spark-1.5.2-bin-hadoop2.4]$

[jira] [Assigned] (SPARK-13543) Support for specifying compression codec for Parquet/ORC via option()

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13543: Assignee: (was: Apache Spark) > Support for specifying compression codec for

[jira] [Assigned] (SPARK-13543) Support for specifying compression codec for Parquet/ORC via option()

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13543: Assignee: Apache Spark > Support for specifying compression codec for Parquet/ORC via

[jira] [Commented] (SPARK-13543) Support for specifying compression codec for Parquet/ORC via option()

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175190#comment-15175190 ] Apache Spark commented on SPARK-13543: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Updated] (SPARK-13614) show() trigger memory leak,why?

2016-03-01 Thread chillon_m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chillon_m updated SPARK-13614: -- Attachment: memory leak.png > show() trigger memory leak,why? > --- > >

[jira] [Updated] (SPARK-13614) show() trigger memory leak,why?

2016-03-01 Thread chillon_m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chillon_m updated SPARK-13614: -- Attachment: (was: memory leak.png) > show() trigger memory leak,why? >

[jira] [Updated] (SPARK-13614) show() trigger memory leak,why?

2016-03-01 Thread chillon_m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chillon_m updated SPARK-13614: -- Summary: show() trigger memory leak,why? (was: show() trigger memory leak) > show() trigger memory

[jira] [Updated] (SPARK-13614) show() trigger memory leak

2016-03-01 Thread chillon_m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chillon_m updated SPARK-13614: -- Description: [bigdata@namenode spark-1.5.2-bin-hadoop2.4]$ bin/spark-shell --driver-class-path

[jira] [Assigned] (SPARK-13613) Provide ignored tests to export test dataset into CSV format

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13613: Assignee: Apache Spark > Provide ignored tests to export test dataset into CSV format >

[jira] [Comment Edited] (SPARK-13337) DataFrame join-on-columns function should support null-safe equal

2016-03-01 Thread Zhong Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175154#comment-15175154 ] Zhong Wang edited comment on SPARK-13337 at 3/2/16 6:50 AM: suppose we are

[jira] [Assigned] (SPARK-13613) Provide ignored tests to export test dataset into CSV format

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13613: Assignee: (was: Apache Spark) > Provide ignored tests to export test dataset into CSV

[jira] [Commented] (SPARK-13613) Provide ignored tests to export test dataset into CSV format

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175156#comment-15175156 ] Apache Spark commented on SPARK-13613: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Updated] (SPARK-13614) show() trigger memory leak

2016-03-01 Thread chillon_m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chillon_m updated SPARK-13614: -- Attachment: memory leak.png memory.png > show() trigger memory leak >

[jira] [Commented] (SPARK-13337) DataFrame join-on-columns function should support null-safe equal

2016-03-01 Thread Zhong Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175154#comment-15175154 ] Zhong Wang commented on SPARK-13337: suppose we have two tables: -- TableA ||key1||key2||value1||

[jira] [Comment Edited] (SPARK-13337) DataFrame join-on-columns function should support null-safe equal

2016-03-01 Thread Zhong Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175154#comment-15175154 ] Zhong Wang edited comment on SPARK-13337 at 3/2/16 6:50 AM: suppose we have

[jira] [Created] (SPARK-13614) show() trigger memory leak

2016-03-01 Thread chillon_m (JIRA)
chillon_m created SPARK-13614: - Summary: show() trigger memory leak Key: SPARK-13614 URL: https://issues.apache.org/jira/browse/SPARK-13614 Project: Spark Issue Type: Question

[jira] [Closed] (SPARK-13608) py4j.Py4JException: Method createDirectStream([class org.apache.spark.streaming.api.java.JavaStreamingContext, class java.util.HashMap, class java.util.HashSet, class jav

2016-03-01 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao closed SPARK-13608. --- Resolution: Not A Problem > py4j.Py4JException: Method createDirectStream([class >

[jira] [Commented] (SPARK-13608) py4j.Py4JException: Method createDirectStream([class org.apache.spark.streaming.api.java.JavaStreamingContext, class java.util.HashMap, class java.util.HashSet, class

2016-03-01 Thread Avatar Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175148#comment-15175148 ] Avatar Zhang commented on SPARK-13608: -- i used a bad version spark-streaming-kafka-assembly. thanks.

[jira] [Commented] (SPARK-13608) py4j.Py4JException: Method createDirectStream([class org.apache.spark.streaming.api.java.JavaStreamingContext, class java.util.HashMap, class java.util.HashSet, class

2016-03-01 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175144#comment-15175144 ] Saisai Shao commented on SPARK-13608: - Hi [~avatarzhang] , would you please elaborate your problem,

[jira] [Created] (SPARK-13613) Provide ignored tests to export test dataset into CSV format

2016-03-01 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-13613: --- Summary: Provide ignored tests to export test dataset into CSV format Key: SPARK-13613 URL: https://issues.apache.org/jira/browse/SPARK-13613 Project: Spark

[jira] [Commented] (SPARK-13219) Pushdown predicate propagation in SparkSQL with join

2016-03-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175107#comment-15175107 ] Xiao Li commented on SPARK-13219: - Hi, [~velvia] after a discussion with Michael, he prefers to enhancing

[jira] [Commented] (SPARK-13337) DataFrame join-on-columns function should support null-safe equal

2016-03-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175097#comment-15175097 ] Xiao Li commented on SPARK-13337: - What is the null columns? If you are using full outer joins, all the

[jira] [Commented] (SPARK-12941) Spark-SQL JDBC Oracle dialect fails to map string datatypes to Oracle VARCHAR datatype

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175094#comment-15175094 ] Apache Spark commented on SPARK-12941: -- User 'thomastechs' has created a pull request for this

[jira] [Commented] (SPARK-13393) Column mismatch issue in left_outer join using Spark DataFrame

2016-03-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175095#comment-15175095 ] Xiao Li commented on SPARK-13393: - Thank you! [~adrian-wang] Sorry, [~srinathsmn] I missed your reply.

[jira] [Commented] (SPARK-13573) Open SparkR APIs (R package) to allow better 3rd party usage

2016-03-01 Thread Chip Senkbeil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175093#comment-15175093 ] Chip Senkbeil commented on SPARK-13573: --- I'd gladly create a PR with the changes if needed. We

[jira] [Commented] (SPARK-13573) Open SparkR APIs (R package) to allow better 3rd party usage

2016-03-01 Thread Chip Senkbeil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175089#comment-15175089 ] Chip Senkbeil commented on SPARK-13573: --- In terms of the JVM class whose methods we are invoking,

[jira] [Assigned] (SPARK-13607) Improves compression performance for integer-typed values on cache to reduce GC pressure

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13607: Assignee: (was: Apache Spark) > Improves compression performance for integer-typed

[jira] [Assigned] (SPARK-13607) Improves compression performance for integer-typed values on cache to reduce GC pressure

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13607: Assignee: Apache Spark > Improves compression performance for integer-typed values on

[jira] [Commented] (SPARK-13607) Improves compression performance for integer-typed values on cache to reduce GC pressure

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175085#comment-15175085 ] Apache Spark commented on SPARK-13607: -- User 'maropu' has created a pull request for this issue:

[jira] [Commented] (SPARK-13573) Open SparkR APIs (R package) to allow better 3rd party usage

2016-03-01 Thread Chip Senkbeil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175079#comment-15175079 ] Chip Senkbeil commented on SPARK-13573: --- [~sunrui], IIRC, Toree supported SparkR from 1.4.x and

[jira] [Updated] (SPARK-13435) Add Weighted Cohen's kappa to MulticlassMetrics

2016-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13435: -- Shepherd: (was: Xiangrui Meng) > Add Weighted Cohen's kappa to MulticlassMetrics >

[jira] [Comment Edited] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-03-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175066#comment-15175066 ] Reynold Xin edited comment on SPARK-12177 at 3/2/16 5:32 AM: - This thread is

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-03-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175066#comment-15175066 ] Reynold Xin commented on SPARK-12177: - This thread is getting to long for me to follow, but my

[jira] [Updated] (SPARK-13322) AFTSurvivalRegression should support feature standardization

2016-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13322: -- Shepherd: DB Tsai (was: Xiangrui Meng) > AFTSurvivalRegression should support feature

[jira] [Updated] (SPARK-13010) Survival analysis in SparkR

2016-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13010: -- Shepherd: yuhao yang (was: Xiangrui Meng) > Survival analysis in SparkR >

[jira] [Resolved] (SPARK-13008) Make ML Python package all list have one algorithm per line

2016-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13008. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10927

[jira] [Commented] (SPARK-13393) Column mismatch issue in left_outer join using Spark DataFrame

2016-03-01 Thread Varadharajan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175052#comment-15175052 ] Varadharajan commented on SPARK-13393: -- [~adrian-wang] Thanks a lot :) > Column mismatch issue in

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175045#comment-15175045 ] Jeff Zhang commented on SPARK-13587: spark.pyspark.virtualenv.requirements is a local file (which

[jira] [Created] (SPARK-13612) Multiplication of BigDecimal columns not working as expected

2016-03-01 Thread Varadharajan (JIRA)
Varadharajan created SPARK-13612: Summary: Multiplication of BigDecimal columns not working as expected Key: SPARK-13612 URL: https://issues.apache.org/jira/browse/SPARK-13612 Project: Spark

[jira] [Commented] (SPARK-13393) Column mismatch issue in left_outer join using Spark DataFrame

2016-03-01 Thread Adrian Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175034#comment-15175034 ] Adrian Wang commented on SPARK-13393: - [~srinathsmn] I have identified the issue, and working on

[jira] [Commented] (SPARK-13393) Column mismatch issue in left_outer join using Spark DataFrame

2016-03-01 Thread Varadharajan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175033#comment-15175033 ] Varadharajan commented on SPARK-13393: -- [~rxin] [~marmbrus] Can you share some inputs on this? >

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-01 Thread Mike Sukmanowsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175018#comment-15175018 ] Mike Sukmanowsky commented on SPARK-13587: -- One thought that just occurred to me, does

[jira] [Assigned] (SPARK-13609) Support Column Pruning for MapPartitions

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13609: Assignee: Apache Spark > Support Column Pruning for MapPartitions >

[jira] [Assigned] (SPARK-13609) Support Column Pruning for MapPartitions

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13609: Assignee: (was: Apache Spark) > Support Column Pruning for MapPartitions >

[jira] [Commented] (SPARK-13609) Support Column Pruning for MapPartitions

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175011#comment-15175011 ] Apache Spark commented on SPARK-13609: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Created] (SPARK-13611) import Aggregator doesn't work in Spark Shell

2016-03-01 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-13611: --- Summary: import Aggregator doesn't work in Spark Shell Key: SPARK-13611 URL: https://issues.apache.org/jira/browse/SPARK-13611 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-01 Thread Mike Sukmanowsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175010#comment-15175010 ] Mike Sukmanowsky commented on SPARK-13587: -- Gotcha. I might suggest

[jira] [Commented] (SPARK-13025) Allow user to specify the initial model when training LogisticRegression

2016-03-01 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175006#comment-15175006 ] Gayathri Murali commented on SPARK-13025: - https://github.com/apache/spark/pull/11459 > Allow

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175003#comment-15175003 ] Jeff Zhang commented on SPARK-13587: Thanks for your feedback [~msukmanowsky].

[jira] [Created] (SPARK-13610) Create a Transformer to disassemble vectors in DataFrames

2016-03-01 Thread Andrew MacKinlay (JIRA)
Andrew MacKinlay created SPARK-13610: Summary: Create a Transformer to disassemble vectors in DataFrames Key: SPARK-13610 URL: https://issues.apache.org/jira/browse/SPARK-13610 Project: Spark

[jira] [Created] (SPARK-13609) Support Column Pruning for MapPartitions

2016-03-01 Thread Xiao Li (JIRA)
Xiao Li created SPARK-13609: --- Summary: Support Column Pruning for MapPartitions Key: SPARK-13609 URL: https://issues.apache.org/jira/browse/SPARK-13609 Project: Spark Issue Type: Sub-task

[jira] [Comment Edited] (SPARK-13587) Support virtualenv in PySpark

2016-03-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15173228#comment-15173228 ] Jeff Zhang edited comment on SPARK-13587 at 3/2/16 4:17 AM: This method is

[jira] [Created] (SPARK-13608) py4j.Py4JException: Method createDirectStream([class org.apache.spark.streaming.api.java.JavaStreamingContext, class java.util.HashMap, class java.util.HashSet, class ja

2016-03-01 Thread Avatar Zhang (JIRA)
Avatar Zhang created SPARK-13608: Summary: py4j.Py4JException: Method createDirectStream([class org.apache.spark.streaming.api.java.JavaStreamingContext, class java.util.HashMap, class java.util.HashSet, class java.util.HashMap]) does not exist

[jira] [Comment Edited] (SPARK-13587) Support virtualenv in PySpark

2016-03-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15173228#comment-15173228 ] Jeff Zhang edited comment on SPARK-13587 at 3/2/16 4:12 AM: This method is

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-01 Thread Mike Sukmanowsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174996#comment-15174996 ] Mike Sukmanowsky commented on SPARK-13587: -- Thanks for letting me know about this [~jeffzhang].

[jira] [Issue Comment Deleted] (SPARK-13025) Allow user to specify the initial model when training LogisticRegression

2016-03-01 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gayathri Murali updated SPARK-13025: Comment: was deleted (was: PR : https://github.com/apache/spark/pull/11458) > Allow user

[jira] [Commented] (SPARK-13025) Allow user to specify the initial model when training LogisticRegression

2016-03-01 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174983#comment-15174983 ] Gayathri Murali commented on SPARK-13025: - PR : https://github.com/apache/spark/pull/11458 >

[jira] [Commented] (SPARK-13606) Error from python worker: /usr/local/bin/python2.7: undefined symbol: _PyCodec_LookupTextEncoding

2016-03-01 Thread Avatar Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174953#comment-15174953 ] Avatar Zhang commented on SPARK-13606: -- /usr/local/bin/python2.7 can launch normally.

[jira] [Commented] (SPARK-6764) Add wheel package support for PySpark

2016-03-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174945#comment-15174945 ] Jeff Zhang commented on SPARK-6764: --- [~msukmanowsky] Can SPARK-13587 solve your issue ? I am working on

[jira] [Commented] (SPARK-13606) Error from python worker: /usr/local/bin/python2.7: undefined symbol: _PyCodec_LookupTextEncoding

2016-03-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174941#comment-15174941 ] Jeff Zhang commented on SPARK-13606: This might be python environment issue. Can you launch python on

[jira] [Commented] (SPARK-13073) creating R like summary for logistic Regression in Spark - Scala

2016-03-01 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174943#comment-15174943 ] Gayathri Murali commented on SPARK-13073: - I can work on this, can you please assign it to me? >

[jira] [Created] (SPARK-13607) Improves compression performance for integer-typed values on cache to reduce GC pressure

2016-03-01 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-13607: Summary: Improves compression performance for integer-typed values on cache to reduce GC pressure Key: SPARK-13607 URL: https://issues.apache.org/jira/browse/SPARK-13607

[jira] [Updated] (SPARK-13581) LibSVM throws MatchError

2016-03-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated SPARK-13581: --- Priority: Critical (was: Minor) > LibSVM throws MatchError > > >

[jira] [Resolved] (SPARK-13141) Dataframe created from Hive partitioned tables using HiveContext returns wrong results

2016-03-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-13141. Resolution: Not A Problem Hi, this was a bug in CDH 5.5.0/5.5.1, it was fixed in CDH

[jira] [Commented] (SPARK-13511) Add wholestage codegen for limit

2016-03-01 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174919#comment-15174919 ] Liang-Chi Hsieh commented on SPARK-13511: - [~davies] Can you help update the Assignee field?

[jira] [Commented] (SPARK-13174) Add API and options for csv data sources

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174912#comment-15174912 ] Apache Spark commented on SPARK-13174: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-13174) Add API and options for csv data sources

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13174: Assignee: (was: Apache Spark) > Add API and options for csv data sources >

[jira] [Assigned] (SPARK-13174) Add API and options for csv data sources

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13174: Assignee: Apache Spark > Add API and options for csv data sources >

[jira] [Created] (SPARK-13606) Error from python worker: /usr/local/bin/python2.7: undefined symbol: _PyCodec_LookupTextEncoding

2016-03-01 Thread Avatar Zhang (JIRA)
Avatar Zhang created SPARK-13606: Summary: Error from python worker: /usr/local/bin/python2.7: undefined symbol: _PyCodec_LookupTextEncoding Key: SPARK-13606 URL:

[jira] [Comment Edited] (SPARK-13141) Dataframe created from Hive partitioned tables using HiveContext returns wrong results

2016-03-01 Thread zhichao-li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174895#comment-15174895 ] zhichao-li edited comment on SPARK-13141 at 3/2/16 2:29 AM: Just try, but

[jira] [Commented] (SPARK-6764) Add wheel package support for PySpark

2016-03-01 Thread Mike Sukmanowsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174899#comment-15174899 ] Mike Sukmanowsky commented on SPARK-6764: - Just bumping this issue up. We use Spark (PySpark)

[jira] [Commented] (SPARK-13141) Dataframe created from Hive partitioned tables using HiveContext returns wrong results

2016-03-01 Thread zhichao-li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174895#comment-15174895 ] zhichao-li commented on SPARK-13141: Just try, but this cannot be reproduced from the master version

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-03-01 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174856#comment-15174856 ] Mark Grover commented on SPARK-12177: - Hi [~tdas] and [~rxin], can you help us with your opinion on

[jira] [Comment Edited] (SPARK-7768) Make user-defined type (UDT) API public

2016-03-01 Thread Randall Whitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15169380#comment-15169380 ] Randall Whitman edited comment on SPARK-7768 at 3/2/16 1:47 AM: Am I

[jira] [Resolved] (SPARK-13167) JDBC data source does not include null value partition columns rows in the result.

2016-03-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13167. - Resolution: Fixed Assignee: Suresh Thalamati Fix Version/s: 2.0.0 > JDBC data

[jira] [Commented] (SPARK-13230) HashMap.merged not working properly with Spark

2016-03-01 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-13230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174809#comment-15174809 ] Łukasz Gieroń commented on SPARK-13230: --- [~srowen] Can you please assign me to this ticket? I have

[jira] [Commented] (SPARK-7768) Make user-defined type (UDT) API public

2016-03-01 Thread Jaka Jancar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174807#comment-15174807 ] Jaka Jancar commented on SPARK-7768: [~randallwhitman] UDT, not UDF:

[jira] [Resolved] (SPARK-13598) Remove LeftSemiJoinBNL

2016-03-01 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13598. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11448

[jira] [Created] (SPARK-13605) Bean encoder cannot handle nonbean properties - no way to Encode nonbean Java objects with columns

2016-03-01 Thread Steven Lewis (JIRA)
Steven Lewis created SPARK-13605: Summary: Bean encoder cannot handle nonbean properties - no way to Encode nonbean Java objects with columns Key: SPARK-13605 URL:

[jira] [Commented] (SPARK-13573) Open SparkR APIs (R package) to allow better 3rd party usage

2016-03-01 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174726#comment-15174726 ] Sun Rui commented on SPARK-13573: - [~chipsenkbeil] glad to know Toree is to support SparkR. I tried it

[jira] [Updated] (SPARK-13604) Sync worker's state after registering with master

2016-03-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-13604: - Description: If Master cannot talk with Worker for a while and then network is back, Worker may

[jira] [Assigned] (SPARK-13604) Sync worker's state after registering with master

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13604: Assignee: Apache Spark (was: Shixiong Zhu) > Sync worker's state after registering with

[jira] [Updated] (SPARK-13604) Sync worker's state after registering with master

2016-03-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-13604: - Description: If Master cannot talk with Worker for a while and then network is back, Worker may

[jira] [Commented] (SPARK-13604) Sync worker's state after registering with master

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174724#comment-15174724 ] Apache Spark commented on SPARK-13604: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Closed] (SPARK-13586) add config to skip generate down time batch when restart StreamingContext

2016-03-01 Thread jeanlyn (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jeanlyn closed SPARK-13586. --- Resolution: Invalid > add config to skip generate down time batch when restart StreamingContext >

[jira] [Updated] (SPARK-13604) Sync worker's state after registering with master

2016-03-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-13604: - Description: If Master cannot talk with Worker for a while and then network is back, Worker may

[jira] [Assigned] (SPARK-13604) Sync worker's state after registering with master

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13604: Assignee: Shixiong Zhu (was: Apache Spark) > Sync worker's state after registering with

[jira] [Created] (SPARK-13604) Sync worker's state after registering with master

2016-03-01 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-13604: Summary: Sync worker's state after registering with master Key: SPARK-13604 URL: https://issues.apache.org/jira/browse/SPARK-13604 Project: Spark Issue

[jira] [Commented] (SPARK-13525) SparkR: java.net.SocketTimeoutException: Accept timed out when running any dataframe function

2016-03-01 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174720#comment-15174720 ] Sun Rui commented on SPARK-13525: - the interactive R session is for your driver, Rscript is needed for

[jira] [Commented] (SPARK-13073) creating R like summary for logistic Regression in Spark - Scala

2016-03-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174691#comment-15174691 ] Joseph K. Bradley commented on SPARK-13073: --- It sounds reasonable to provide the same printed

[jira] [Commented] (SPARK-13030) Change OneHotEncoder to Estimator

2016-03-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174685#comment-15174685 ] Joseph K. Bradley commented on SPARK-13030: --- I agree this is an issue, but I think we need to

[jira] [Commented] (SPARK-13574) Improve parquet dictionary decoding for strings

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174656#comment-15174656 ] Apache Spark commented on SPARK-13574: -- User 'nongli' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13603) SQL generation for subquery

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13603: Assignee: Davies Liu (was: Apache Spark) > SQL generation for subquery >

[jira] [Commented] (SPARK-13603) SQL generation for subquery

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174649#comment-15174649 ] Apache Spark commented on SPARK-13603: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13603) SQL generation for subquery

2016-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13603: Assignee: Apache Spark (was: Davies Liu) > SQL generation for subquery >

[jira] [Created] (SPARK-13603) SQL generation for subquery

2016-03-01 Thread Davies Liu (JIRA)
Davies Liu created SPARK-13603: -- Summary: SQL generation for subquery Key: SPARK-13603 URL: https://issues.apache.org/jira/browse/SPARK-13603 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-13596) Move misc top-level build files into appropriate subdirs

2016-03-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174642#comment-15174642 ] Reynold Xin commented on SPARK-13596: - Are those dot files even possible to move? > Move misc

[jira] [Resolved] (SPARK-13548) Move tags and unsafe modules into common

2016-03-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13548. - Resolution: Fixed Fix Version/s: 2/ > Move tags and unsafe modules into common >

[jira] [Updated] (SPARK-13548) Move tags and unsafe modules into common

2016-03-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13548: Fix Version/s: (was: 2/) 2.0.0 > Move tags and unsafe modules into common >

  1   2   3   >