[jira] [Updated] (SPARK-9862) Join: Handling data skew

2016-10-13 Thread wangyuhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangyuhu updated SPARK-9862: Attachment: Handling skew data in join.pdf > Join: Handling data skew > > >

[jira] [Comment Edited] (SPARK-17890) scala.ScalaReflectionException

2016-10-13 Thread Khalid Reid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572217#comment-15572217 ] Khalid Reid edited comment on SPARK-17890 at 10/13/16 3:17 PM: --- Hi Sean,

[jira] [Assigned] (SPARK-13983) HiveThriftServer2 can not get "--hiveconf" or ''--hivevar" variables since 1.6 version (both multi-session and single session)

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13983: Assignee: Apache Spark (was: Cheng Lian) > HiveThriftServer2 can not get "--hiveconf" or

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572108#comment-15572108 ] Yanbo Liang commented on SPARK-17904: - Thanks your comments. Yeah, I agree it's tricky in my example

[jira] [Commented] (SPARK-17906) MulticlassClassificationEvaluator support target label

2016-10-13 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572154#comment-15572154 ] Seth Hendrickson commented on SPARK-17906: -- We are adding model summaries that would expose some

[jira] [Commented] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572189#comment-15572189 ] Sean Owen commented on SPARK-17908: --- Can you provide the error and a self-contained reproduction? I

[jira] [Commented] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Harish (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572278#comment-15572278 ] Harish commented on SPARK-17908: Traceback (most recent call last): File

[jira] [Comment Edited] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Harish (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572278#comment-15572278 ] Harish edited comment on SPARK-17908 at 10/13/16 4:09 PM: -- Traceback (most

[jira] [Comment Edited] (SPARK-17890) scala.ScalaReflectionException

2016-10-13 Thread Khalid Reid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572217#comment-15572217 ] Khalid Reid edited comment on SPARK-17890 at 10/13/16 3:19 PM: --- Hi Sean,

[jira] [Comment Edited] (SPARK-17890) scala.ScalaReflectionException

2016-10-13 Thread Khalid Reid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572217#comment-15572217 ] Khalid Reid edited comment on SPARK-17890 at 10/13/16 3:18 PM: --- Hi Sean,

[jira] [Comment Edited] (SPARK-17890) scala.ScalaReflectionException

2016-10-13 Thread Khalid Reid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572217#comment-15572217 ] Khalid Reid edited comment on SPARK-17890 at 10/13/16 3:30 PM: --- Hi Sean,

[jira] [Created] (SPARK-17911) Scheduler does not messageScheduler for ResubmitFailedStages

2016-10-13 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-17911: Summary: Scheduler does not messageScheduler for ResubmitFailedStages Key: SPARK-17911 URL: https://issues.apache.org/jira/browse/SPARK-17911 Project: Spark

[jira] [Comment Edited] (SPARK-17911) Scheduler does not need messageScheduler for ResubmitFailedStages

2016-10-13 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572289#comment-15572289 ] Imran Rashid edited comment on SPARK-17911 at 10/13/16 3:44 PM: Copying

[jira] [Commented] (SPARK-17890) scala.ScalaReflectionException

2016-10-13 Thread Khalid Reid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572217#comment-15572217 ] Khalid Reid commented on SPARK-17890: - Hi Sean, I've created a small project to reproduce the error

[jira] [Updated] (SPARK-17911) Scheduler does not need messageScheduler for ResubmitFailedStages

2016-10-13 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-17911: - Summary: Scheduler does not need messageScheduler for ResubmitFailedStages (was: Scheduler does

[jira] [Commented] (SPARK-17900) Mark the following Spark SQL APIs as stable

2016-10-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572119#comment-15572119 ] Cody Koeninger commented on SPARK-17900: Thanks for doing this, should make things clearer.

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Created] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Harish (JIRA)
Harish created SPARK-17908: -- Summary: Column names Corrupted in pysaprk dataframe groupBy Key: SPARK-17908 URL: https://issues.apache.org/jira/browse/SPARK-17908 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-17909) we should create table before writing out the data in CTAS

2016-10-13 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-17909: --- Summary: we should create table before writing out the data in CTAS Key: SPARK-17909 URL: https://issues.apache.org/jira/browse/SPARK-17909 Project: Spark

[jira] [Commented] (SPARK-17911) Scheduler does not need messageScheduler for ResubmitFailedStages

2016-10-13 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572289#comment-15572289 ] Imran Rashid commented on SPARK-17911: -- Copying the earlier discussion on the PR here from squito:

[jira] [Comment Edited] (SPARK-17890) scala.ScalaReflectionException

2016-10-13 Thread Khalid Reid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572217#comment-15572217 ] Khalid Reid edited comment on SPARK-17890 at 10/13/16 3:29 PM: --- Hi Sean,

[jira] [Created] (SPARK-17912) Refactor code generation to get data for ColumnVector/ColumnarBatch

2016-10-13 Thread Kazuaki Ishizaki (JIRA)
Kazuaki Ishizaki created SPARK-17912: Summary: Refactor code generation to get data for ColumnVector/ColumnarBatch Key: SPARK-17912 URL: https://issues.apache.org/jira/browse/SPARK-17912 Project:

[jira] [Commented] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572330#comment-15572330 ] Sean Owen commented on SPARK-17908: --- Yes what's your code? This says one DF has just 'columns' as

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Assigned] (SPARK-17909) we should create table before writing out the data in CTAS

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17909: Assignee: Wenchen Fan (was: Apache Spark) > we should create table before writing out

[jira] [Commented] (SPARK-17909) we should create table before writing out the data in CTAS

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572179#comment-15572179 ] Apache Spark commented on SPARK-17909: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17909) we should create table before writing out the data in CTAS

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17909: Assignee: Apache Spark (was: Wenchen Fan) > we should create table before writing out

[jira] [Created] (SPARK-17910) Allow users to update the comment of a column

2016-10-13 Thread Yin Huai (JIRA)
Yin Huai created SPARK-17910: Summary: Allow users to update the comment of a column Key: SPARK-17910 URL: https://issues.apache.org/jira/browse/SPARK-17910 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-13983) HiveThriftServer2 can not get "--hiveconf" or ''--hivevar" variables since 1.6 version (both multi-session and single session)

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13983: Assignee: Cheng Lian (was: Apache Spark) > HiveThriftServer2 can not get "--hiveconf" or

[jira] [Commented] (SPARK-13983) HiveThriftServer2 can not get "--hiveconf" or ''--hivevar" variables since 1.6 version (both multi-session and single session)

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572249#comment-15572249 ] Apache Spark commented on SPARK-13983: -- User 'wangyum' has created a pull request for this issue:

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Updated] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-17904: Description: SparkR provides {{spark.lappy}} to run local R functions in distributed environment,

[jira] [Commented] (SPARK-17911) Scheduler does not need messageScheduler for ResubmitFailedStages

2016-10-13 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572344#comment-15572344 ] Imran Rashid commented on SPARK-17911: -- bq. In other words, handling a ResubmitFailedStages event

[jira] [Commented] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Harish (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572373#comment-15572373 ] Harish commented on SPARK-17908: Sorry.. I didnt put the actual column names of my code in stack trace, i

[jira] [Commented] (SPARK-17192) Issuing an exception when users specify the partitioning columns without a given schema

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572394#comment-15572394 ] Apache Spark commented on SPARK-17192: -- User 'kiszk' has created a pull request for this issue:

[jira] [Commented] (SPARK-12571) AWS credentials not available for read.parquet in SQLContext

2016-10-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572442#comment-15572442 ] Steve Loughran commented on SPARK-12571: Means the credentials aren't at the far end, either in

[jira] [Commented] (SPARK-8437) Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles

2016-10-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572468#comment-15572468 ] Steve Loughran commented on SPARK-8437: --- Just came across by way of comments in the source. This

[jira] [Comment Edited] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Harish (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572478#comment-15572478 ] Harish edited comment on SPARK-17908 at 10/13/16 4:58 PM: -- Yes. Your code

[jira] [Commented] (SPARK-17914) Spark SQL casting to TimestampType with nanosecond results in incorrect timestamp

2016-10-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572730#comment-15572730 ] Sean Owen commented on SPARK-17914: --- I think this is a duplicate of one of a couple possible issues,

[jira] [Commented] (SPARK-17895) Improve documentation of "rowsBetween" and "rangeBetween"

2016-10-13 Thread Weiluo Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572726#comment-15572726 ] Weiluo Ren commented on SPARK-17895: [~junyangq] Could you please help fix the SparkR doc

[jira] [Commented] (SPARK-17902) collect() ignores stringsAsFactors

2016-10-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572472#comment-15572472 ] Shivaram Venkataraman commented on SPARK-17902: --- Good catch - Looks like this was changed

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572541#comment-15572541 ] Felix Cheung commented on SPARK-17904: -- I somewhat disagree, actually. In R, it is very common to

[jira] [Comment Edited] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572541#comment-15572541 ] Felix Cheung edited comment on SPARK-17904 at 10/13/16 5:09 PM: I

[jira] [Comment Edited] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572541#comment-15572541 ] Felix Cheung edited comment on SPARK-17904 at 10/13/16 5:15 PM: I

[jira] [Commented] (SPARK-17912) Refactor code generation to get data for ColumnVector/ColumnarBatch

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572691#comment-15572691 ] Apache Spark commented on SPARK-17912: -- User 'kiszk' has created a pull request for this issue:

[jira] [Commented] (SPARK-17902) collect() ignores stringsAsFactors

2016-10-13 Thread Hossein Falaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572689#comment-15572689 ] Hossein Falaki commented on SPARK-17902: Thanks for the pointer [~shivaram]. I will submit it

[jira] [Assigned] (SPARK-17912) Refactor code generation to get data for ColumnVector/ColumnarBatch

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17912: Assignee: (was: Apache Spark) > Refactor code generation to get data for

[jira] [Assigned] (SPARK-17912) Refactor code generation to get data for ColumnVector/ColumnarBatch

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17912: Assignee: Apache Spark > Refactor code generation to get data for

[jira] [Created] (SPARK-17916) CSV data source treats empty string as null no matter what nullValue option is

2016-10-13 Thread Hossein Falaki (JIRA)
Hossein Falaki created SPARK-17916: -- Summary: CSV data source treats empty string as null no matter what nullValue option is Key: SPARK-17916 URL: https://issues.apache.org/jira/browse/SPARK-17916

[jira] [Commented] (SPARK-15565) The default value of spark.sql.warehouse.dir needs to explicitly point to local filesystem

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572759#comment-15572759 ] Alessio commented on SPARK-15565: - Same problem happened again in Spark 2.0.1. > The default value of

[jira] [Comment Edited] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Harish (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572278#comment-15572278 ] Harish edited comment on SPARK-17908 at 10/13/16 4:13 PM: -- Traceback (most

[jira] [Comment Edited] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Harish (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572278#comment-15572278 ] Harish edited comment on SPARK-17908 at 10/13/16 4:12 PM: -- Traceback (most

[jira] [Commented] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Harish (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572478#comment-15572478 ] Harish commented on SPARK-17908: Yes. You are code structure is same as mine.. But i have 70M records

[jira] [Commented] (SPARK-17895) Improve documentation of "rowsBetween" and "rangeBetween"

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572602#comment-15572602 ] Felix Cheung commented on SPARK-17895: -- would you like to fix this? > Improve documentation of

[jira] [Created] (SPARK-17914) Spark SQL casting to TimestampType with nanosecond results in incorrect timestamp

2016-10-13 Thread Oksana Romankova (JIRA)
Oksana Romankova created SPARK-17914: Summary: Spark SQL casting to TimestampType with nanosecond results in incorrect timestamp Key: SPARK-17914 URL: https://issues.apache.org/jira/browse/SPARK-17914

[jira] [Resolved] (SPARK-17882) RBackendHandler swallowing errors

2016-10-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-17882. --- Resolution: Fixed Fix Version/s: 2.1.0 2.0.2

[jira] [Updated] (SPARK-17882) RBackendHandler swallowing errors

2016-10-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-17882: -- Assignee: James Shuster > RBackendHandler swallowing errors >

[jira] [Created] (SPARK-17915) Prepare ColumnVector implementation for UnsafeData

2016-10-13 Thread Kazuaki Ishizaki (JIRA)
Kazuaki Ishizaki created SPARK-17915: Summary: Prepare ColumnVector implementation for UnsafeData Key: SPARK-17915 URL: https://issues.apache.org/jira/browse/SPARK-17915 Project: Spark

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572377#comment-15572377 ] Shivaram Venkataraman commented on SPARK-17904: --- +1 I think this sounds good [~yanboliang].

[jira] [Commented] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572421#comment-15572421 ] Sean Owen commented on SPARK-17908: --- You must be doing something different than what you show, because

[jira] [Commented] (SPARK-17895) Improve documentation of "rowsBetween" and "rangeBetween"

2016-10-13 Thread Weiluo Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572721#comment-15572721 ] Weiluo Ren commented on SPARK-17895: Sure. Just want to collect some comments on the example to be

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572741#comment-15572741 ] Shivaram Venkataraman commented on SPARK-17904: --- Thanks all - This is a good discussion

[jira] [Assigned] (SPARK-17915) Prepare ColumnVector implementation for UnsafeData

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17915: Assignee: (was: Apache Spark) > Prepare ColumnVector implementation for UnsafeData >

[jira] [Assigned] (SPARK-17915) Prepare ColumnVector implementation for UnsafeData

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17915: Assignee: Apache Spark > Prepare ColumnVector implementation for UnsafeData >

[jira] [Commented] (SPARK-17915) Prepare ColumnVector implementation for UnsafeData

2016-10-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572747#comment-15572747 ] Apache Spark commented on SPARK-17915: -- User 'kiszk' has created a pull request for this issue:

[jira] [Commented] (SPARK-17714) ClassCircularityError is thrown when using org.apache.spark.util.Utils.classForName 

2016-10-13 Thread Weiqing Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572580#comment-15572580 ] Weiqing Yang commented on SPARK-17714: -- Not yet, need to investigate more. Could we pull in people

[jira] [Commented] (SPARK-17914) Spark SQL casting to TimestampType with nanosecond results in incorrect timestamp

2016-10-13 Thread Oksana Romankova (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572758#comment-15572758 ] Oksana Romankova commented on SPARK-17914: -- You are correct. It is related to what has been

[jira] [Commented] (SPARK-14561) History Server does not see new logs in S3

2016-10-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572401#comment-15572401 ] Steve Loughran commented on SPARK-14561: To clarify: it's not changes in existing files that

[jira] [Comment Edited] (SPARK-17908) Column names Corrupted in pysaprk dataframe groupBy

2016-10-13 Thread Harish (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572373#comment-15572373 ] Harish edited comment on SPARK-17908 at 10/13/16 4:17 PM: -- Sorry.. I didnt put

[jira] [Commented] (SPARK-9004) Add s3 bytes read/written metrics

2016-10-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572414#comment-15572414 ] Steve Loughran commented on SPARK-9004: --- HADOOP-13605 added a whole new set of counters for HDFS, S3

[jira] [Commented] (SPARK-17714) ClassCircularityError is thrown when using org.apache.spark.util.Utils.classForName 

2016-10-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572435#comment-15572435 ] Sean Owen commented on SPARK-17714: --- This is resolved now, right? > ClassCircularityError is thrown

[jira] [Commented] (SPARK-12664) Expose raw prediction scores in MultilayerPerceptronClassificationModel

2016-10-13 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572545#comment-15572545 ] Gayathri Murali commented on SPARK-12664: - [~yanboliang] I am not working on this. Please feel

[jira] [Created] (SPARK-17913) Filter/join expressions can return incorrect results when comparing strings to longs

2016-10-13 Thread Ming Beckwith (JIRA)
Ming Beckwith created SPARK-17913: - Summary: Filter/join expressions can return incorrect results when comparing strings to longs Key: SPARK-17913 URL: https://issues.apache.org/jira/browse/SPARK-17913

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572606#comment-15572606 ] Felix Cheung commented on SPARK-17904: -- For reference these are the related PRs for Python for

[jira] [Resolved] (SPARK-17827) StatisticsColumnSuite failures on big endian platforms

2016-10-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-17827. --- Resolution: Fixed Fix Version/s: 2.1.0 > StatisticsColumnSuite failures on

[jira] [Updated] (SPARK-17827) StatisticsColumnSuite failures on big endian platforms

2016-10-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-17827: -- Assignee: Pete Robbins > StatisticsColumnSuite failures on big endian platforms >

[jira] [Created] (SPARK-17918) Default Warehause location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
Alessio created SPARK-17918: --- Summary: Default Warehause location apparently in HDFS Key: SPARK-17918 URL: https://issues.apache.org/jira/browse/SPARK-17918 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-17918) Default Warehouse location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-17918: Summary: Default Warehouse location apparently in HDFS (was: Default Warehause location apparently in

[jira] [Updated] (SPARK-17918) Default Warehouse location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-17918: Description: It seems that the default warehouse location in Spark 2.0.1 not only points at an inexistent

[jira] [Commented] (SPARK-17914) Spark SQL casting to TimestampType with nanosecond results in incorrect timestamp

2016-10-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572885#comment-15572885 ] Sean Owen commented on SPARK-17914: --- Does the ISO8601 format support nanoseconds even? I thought we had

[jira] [Commented] (SPARK-16575) partition calculation mismatch with sc.binaryFiles

2016-10-13 Thread Tarun Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572950#comment-15572950 ] Tarun Kumar commented on SPARK-16575: - [~rxin] I have now added the support of openCostInBytes,

[jira] [Comment Edited] (SPARK-15565) The default value of spark.sql.warehouse.dir needs to explicitly point to local filesystem

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572976#comment-15572976 ] Alessio edited comment on SPARK-15565 at 10/13/16 7:49 PM: --- Yes Sean, indeed in

[jira] [Updated] (SPARK-17918) Default Warehouse location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-17918: Description: It seems that the default warehouse location in Spark 2.0.1 not only points at an inexistent

[jira] [Updated] (SPARK-17918) Default Warehouse location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-17918: Description: It seems that the default warehouse location in Spark 2.0.1 not only points at an inexistent

[jira] [Updated] (SPARK-17918) Default Warehouse location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-17918: Description: It seems that the default warehouse location in Spark 2.0.1 not only points at an inexistent

[jira] [Resolved] (SPARK-17918) Default Warehouse location apparently in HDFS

2016-10-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17918. --- Resolution: Duplicate > Default Warehouse location apparently in HDFS >

[jira] [Closed] (SPARK-15369) Investigate selectively using Jython for parts of PySpark

2016-10-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-15369. --- Resolution: Won't Fix In the spirit of having more explicitly accept/rejects, and given the

[jira] [Created] (SPARK-17919) Make timeout to RBackend configurable in SparkR

2016-10-13 Thread Hossein Falaki (JIRA)
Hossein Falaki created SPARK-17919: -- Summary: Make timeout to RBackend configurable in SparkR Key: SPARK-17919 URL: https://issues.apache.org/jira/browse/SPARK-17919 Project: Spark Issue

[jira] [Updated] (SPARK-17918) Default Warehouse location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-17918: Description: It seems that the default warehouse location in Spark 2.0.1 not only points at an inexistent

[jira] [Commented] (SPARK-15565) The default value of spark.sql.warehouse.dir needs to explicitly point to local filesystem

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572976#comment-15572976 ] Alessio commented on SPARK-15565: - Yes Sean, indeed in my latest issue SPARK-17918 I was referring to

[jira] [Updated] (SPARK-17918) Default Warehause location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-17918: Description: It seems that the default warehouse location in Spark 2.0.1 not only points at an inexistent

[jira] [Updated] (SPARK-17918) Default Warehause location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-17918: Description: It seems that the default warehouse location in Spark 2.0.1 not only points at an inexistent

[jira] [Updated] (SPARK-17918) Default Warehouse location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-17918: Environment: Mac OS X 10.11.6 (was: Macintosh) > Default Warehouse location apparently in HDFS >

[jira] [Commented] (SPARK-15565) The default value of spark.sql.warehouse.dir needs to explicitly point to local filesystem

2016-10-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572961#comment-15572961 ] Sean Owen commented on SPARK-15565: --- Not quite... this was sort of undone by

[jira] [Updated] (SPARK-17920) HiveWriterContainer passes null configuration to serde.initialize, causing NullPointerException in AvroSerde when using avro.schema.url

2016-10-13 Thread James Norvell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Norvell updated SPARK-17920: -- Attachment: avro_data avro.avsc > HiveWriterContainer passes null

[jira] [Updated] (SPARK-17920) HiveWriterContainer passes null configuration to serde.initialize, causing NullPointerException in AvroSerde when using avro.schema.url

2016-10-13 Thread James Norvell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Norvell updated SPARK-17920: -- Attachment: avro_data avro.avsc > HiveWriterContainer passes null

[jira] [Updated] (SPARK-17918) Default Warehouse location apparently in HDFS

2016-10-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-17918: Description: It seems that the default warehouse location in Spark 2.0.1 not only points at an inexistent

[jira] [Updated] (SPARK-17917) Convert 'Initial job has not accepted any resources..' logWarning to a SparkListener event

2016-10-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17917: -- Priority: Minor (was: Major) Maybe, I suppose it will be a little tricky to define what the event is

[jira] [Commented] (SPARK-17812) More granular control of starting offsets (assign)

2016-10-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572922#comment-15572922 ] Cody Koeninger commented on SPARK-17812: Sorry, I didn't see this comment until just now. X

[jira] [Comment Edited] (SPARK-17914) Spark SQL casting to TimestampType with nanosecond results in incorrect timestamp

2016-10-13 Thread Oksana Romankova (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572957#comment-15572957 ] Oksana Romankova edited comment on SPARK-17914 at 10/13/16 7:42 PM:

  1   2   3   >