[jira] [Commented] (SPARK-10928) Spark Mesos finegrain mode with single core CPUs, rendered useless

2015-10-05 Thread Ravi Sanwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943789#comment-14943789 ] Ravi Sanwal commented on SPARK-10928: - I guess its not a major issue because these king of

[jira] [Commented] (SPARK-8654) Analysis exception when using "NULL IN (...)": invalid cast

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943814#comment-14943814 ] Apache Spark commented on SPARK-8654: - User 'dilipbiswal' has created a pull request for this issue:

[jira] [Commented] (SPARK-10474) TungstenAggregation cannot acquire memory for pointer array after switching to sort-based

2015-10-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943865#comment-14943865 ] Reynold Xin commented on SPARK-10474: - I'm fairly sure you are hitting a different issue. [~nadenf]

[jira] [Commented] (SPARK-10792) Spark + YARN – executor is not re-created

2015-10-05 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943709#comment-14943709 ] Steve Loughran commented on SPARK-10792: you can't stop executors from "never dying" —on AWS

[jira] [Commented] (SPARK-9318) Add `merge` as synonym for join

2015-10-05 Thread Narine Kokhlikyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943784#comment-14943784 ] Narine Kokhlikyan commented on SPARK-9318: -- Hi all, [~shivaram], [~falaki], I am working on the

[jira] [Assigned] (SPARK-8654) Analysis exception when using "NULL IN (...)": invalid cast

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8654: --- Assignee: Apache Spark > Analysis exception when using "NULL IN (...)": invalid cast >

[jira] [Comment Edited] (SPARK-10474) TungstenAggregation cannot acquire memory for pointer array after switching to sort-based

2015-10-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943868#comment-14943868 ] Reynold Xin edited comment on SPARK-10474 at 10/5/15 7:21 PM: -- Also please

[jira] [Commented] (SPARK-10927) Spark history uses the application name instead of the ID

2015-10-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943776#comment-14943776 ] Jean-Baptiste Onofré commented on SPARK-10927: -- Sorry about that. Yes, now, I will look for

[jira] [Updated] (SPARK-10927) Spark history uses the application name instead of the ID

2015-10-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jean-Baptiste Onofré updated SPARK-10927: - Description: Setting spark.eventLog.enabled to true, and a folder location for

[jira] [Commented] (SPARK-9318) Add `merge` as synonym for join

2015-10-05 Thread Deborah Siegel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943810#comment-14943810 ] Deborah Siegel commented on SPARK-9318: --- Narine, just want to offer that I haven't replicated that

[jira] [Commented] (SPARK-10914) Incorrect empty join sets

2015-10-05 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943855#comment-14943855 ] Wenchen Fan commented on SPARK-10914: - hi [~benm], I can't reproduce this bug on spark

[jira] [Comment Edited] (SPARK-10474) TungstenAggregation cannot acquire memory for pointer array after switching to sort-based

2015-10-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14939808#comment-14939808 ] Reynold Xin edited comment on SPARK-10474 at 10/5/15 7:05 PM: -- We are

[jira] [Commented] (SPARK-10474) TungstenAggregation cannot acquire memory for pointer array after switching to sort-based

2015-10-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943868#comment-14943868 ] Reynold Xin commented on SPARK-10474: - Also please attach the "explain" of the query plan. Thanks.

[jira] [Created] (SPARK-10928) Spark Mesos finegrain mode with single core CPUs, rendered useless

2015-10-05 Thread Ravi Sanwal (JIRA)
Ravi Sanwal created SPARK-10928: --- Summary: Spark Mesos finegrain mode with single core CPUs, rendered useless Key: SPARK-10928 URL: https://issues.apache.org/jira/browse/SPARK-10928 Project: Spark

[jira] [Assigned] (SPARK-8654) Analysis exception when using "NULL IN (...)": invalid cast

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8654: --- Assignee: (was: Apache Spark) > Analysis exception when using "NULL IN (...)": invalid

[jira] [Created] (SPARK-10943) NullType Column cannot be written to Parquet

2015-10-05 Thread Jason Pohl (JIRA)
Jason Pohl created SPARK-10943: -- Summary: NullType Column cannot be written to Parquet Key: SPARK-10943 URL: https://issues.apache.org/jira/browse/SPARK-10943 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-10382) Make example code in user guide testable

2015-10-05 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944480#comment-14944480 ] Xusen Yin commented on SPARK-10382: --- [~mengxr] I'd love to work on this if no one else keep on doing

[jira] [Commented] (SPARK-10942) Not all cached RDDs are unpersisted

2015-10-05 Thread Nick Pritchard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944478#comment-14944478 ] Nick Pritchard commented on SPARK-10942: Regardless, the documentation for

[jira] [Created] (SPARK-10944) org/slf4j/Logger is not provided in spark-1.5.1-bin-without-hadoop/lib/spark-assembly-1.5.1-hadoop2.2.0.jar

2015-10-05 Thread Pranas Baliuka (JIRA)
Pranas Baliuka created SPARK-10944: -- Summary: org/slf4j/Logger is not provided in spark-1.5.1-bin-without-hadoop/lib/spark-assembly-1.5.1-hadoop2.2.0.jar Key: SPARK-10944 URL:

[jira] [Comment Edited] (SPARK-10944) org/slf4j/Logger is not provided in spark-1.5.1-bin-without-hadoop/lib/spark-assembly-1.5.1-hadoop2.2.0.jar

2015-10-05 Thread Pranas Baliuka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944539#comment-14944539 ] Pranas Baliuka edited comment on SPARK-10944 at 10/6/15 5:42 AM: - If one

[jira] [Commented] (SPARK-10944) org/slf4j/Logger is not provided in spark-1.5.1-bin-without-hadoop/lib/spark-assembly-1.5.1-hadoop2.2.0.jar

2015-10-05 Thread Pranas Baliuka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944539#comment-14944539 ] Pranas Baliuka commented on SPARK-10944: If one wants to deploy Spark without Hadoop it should be

[jira] [Comment Edited] (SPARK-10944) org/slf4j/Logger is not provided in spark-1.5.1-bin-without-hadoop/lib/spark-assembly-1.5.1-hadoop2.2.0.jar

2015-10-05 Thread Pranas Baliuka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944539#comment-14944539 ] Pranas Baliuka edited comment on SPARK-10944 at 10/6/15 5:42 AM: - If one

[jira] [Comment Edited] (SPARK-10942) Not all cached RDDs are unpersisted

2015-10-05 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944500#comment-14944500 ] Rekha Joshi edited comment on SPARK-10942 at 10/6/15 4:50 AM: -- SPARK-10942:

[jira] [Commented] (SPARK-10942) Not all cached RDDs are unpersisted

2015-10-05 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944497#comment-14944497 ] Rekha Joshi commented on SPARK-10942: - Thanks [~pnpritchard] I tried to replicate the issue few times

[jira] [Updated] (SPARK-10942) Not all cached RDDs are unpersisted

2015-10-05 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi updated SPARK-10942: Attachment: SPARK-10942_3.png SPARK-10942_2.png SPARK-10942_1.png

[jira] [Commented] (SPARK-10942) Not all cached RDDs are unpersisted

2015-10-05 Thread Nick Pritchard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944508#comment-14944508 ] Nick Pritchard commented on SPARK-10942: [~rekhajoshm] Thanks for trying to reproduce it. Since

[jira] [Updated] (SPARK-10942) Not all cached RDDs are unpersisted

2015-10-05 Thread Nick Pritchard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pritchard updated SPARK-10942: --- Priority: Minor (was: Major) > Not all cached RDDs are unpersisted >

[jira] [Commented] (SPARK-10641) skewness and kurtosis support

2015-10-05 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944485#comment-14944485 ] Seth Hendrickson commented on SPARK-10641: -- My apologies, I haven't been able to devote much

[jira] [Comment Edited] (SPARK-10942) Not all cached RDDs are unpersisted

2015-10-05 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944497#comment-14944497 ] Rekha Joshi edited comment on SPARK-10942 at 10/6/15 4:48 AM: -- Thanks

[jira] [Resolved] (SPARK-10585) only copy data once when generate unsafe projection

2015-10-05 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10585. Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8747

[jira] [Assigned] (SPARK-10913) Add attach() function for DataFrame

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10913: Assignee: (was: Apache Spark) > Add attach() function for DataFrame >

[jira] [Commented] (SPARK-10913) Add attach() function for DataFrame

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943974#comment-14943974 ] Apache Spark commented on SPARK-10913: -- User 'adrian555' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10913) Add attach() function for DataFrame

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10913: Assignee: Apache Spark > Add attach() function for DataFrame >

[jira] [Created] (SPARK-10931) PySpark ML Models should contain Param values

2015-10-05 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10931: - Summary: PySpark ML Models should contain Param values Key: SPARK-10931 URL: https://issues.apache.org/jira/browse/SPARK-10931 Project: Spark

[jira] [Commented] (SPARK-10863) Method coltypes() to return the R column types of a DataFrame

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943921#comment-14943921 ] Apache Spark commented on SPARK-10863: -- User 'olarayej' has created a pull request for this issue:

[jira] [Created] (SPARK-10929) Tungsten fails to acquire memory writing to HDFS

2015-10-05 Thread Naden Franciscus (JIRA)
Naden Franciscus created SPARK-10929: Summary: Tungsten fails to acquire memory writing to HDFS Key: SPARK-10929 URL: https://issues.apache.org/jira/browse/SPARK-10929 Project: Spark

[jira] [Updated] (SPARK-10929) Tungsten fails to acquire memory writing to HDFS

2015-10-05 Thread Naden Franciscus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Naden Franciscus updated SPARK-10929: - Priority: Blocker (was: Major) > Tungsten fails to acquire memory writing to HDFS >

[jira] [Updated] (SPARK-10929) Tungsten fails to acquire memory writing to HDFS

2015-10-05 Thread Naden Franciscus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Naden Franciscus updated SPARK-10929: - Description: We are executing 20 Spark SQL jobs in parallel using Spark Job Server and

[jira] [Created] (SPARK-10930) History "Stages" page "duration" can be confusing

2015-10-05 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-10930: - Summary: History "Stages" page "duration" can be confusing Key: SPARK-10930 URL: https://issues.apache.org/jira/browse/SPARK-10930 Project: Spark Issue

[jira] [Commented] (SPARK-9318) Add `merge` as synonym for join

2015-10-05 Thread Narine Kokhlikyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943988#comment-14943988 ] Narine Kokhlikyan commented on SPARK-9318: -- printSchema is showing up correctly for me too. Only

[jira] [Comment Edited] (SPARK-9318) Add `merge` as synonym for join

2015-10-05 Thread Deborah Siegel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944007#comment-14944007 ] Deborah Siegel edited comment on SPARK-9318 at 10/5/15 9:00 PM: not sure

[jira] [Commented] (SPARK-10877) Assertions fail straightforward DataFrame job due to word alignment

2015-10-05 Thread Jason C Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943956#comment-14943956 ] Jason C Lee commented on SPARK-10877: - I ran your SparkFilterByKeyTest.scala from spark-shell but did

[jira] [Commented] (SPARK-10877) Assertions fail straightforward DataFrame job due to word alignment

2015-10-05 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944002#comment-14944002 ] Matt Cheah commented on SPARK-10877: Can you turn off assertions when you spawn the shell? Assertions

[jira] [Comment Edited] (SPARK-10877) Assertions fail straightforward DataFrame job due to word alignment

2015-10-05 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944002#comment-14944002 ] Matt Cheah edited comment on SPARK-10877 at 10/5/15 8:48 PM: - Can you turn on

[jira] [Commented] (SPARK-9963) ML RandomForest cleanup: replace predictNodeIndex with predictImpl

2015-10-05 Thread Amey Chaugule (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943904#comment-14943904 ] Amey Chaugule commented on SPARK-9963: -- Is this still WIP or has the PR been abandoned? I could take

[jira] [Updated] (SPARK-10929) Tungsten fails to acquire memory writing to HDFS

2015-10-05 Thread Naden Franciscus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Naden Franciscus updated SPARK-10929: - Description: We are executing 20 Spark SQL jobs in parallel using Spark Job Server and

[jira] [Commented] (SPARK-9318) Add `merge` as synonym for join

2015-10-05 Thread Narine Kokhlikyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943986#comment-14943986 ] Narine Kokhlikyan commented on SPARK-9318: -- Hi [~dsiegel], thanks for checking it. Was there a

[jira] [Commented] (SPARK-10877) Assertions fail straightforward DataFrame job due to word alignment

2015-10-05 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944010#comment-14944010 ] Matt Cheah commented on SPARK-10877: Is it possible that this error is JVM or platform dependent? >

[jira] [Commented] (SPARK-9318) Add `merge` as synonym for join

2015-10-05 Thread Deborah Siegel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944007#comment-14944007 ] Deborah Siegel commented on SPARK-9318: --- not sure about the fix. I tried this on 1.5.0 and 1.5.1,

[jira] [Commented] (SPARK-7129) Add generic boosting algorithm to spark.ml

2015-10-05 Thread Meihua Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14942971#comment-14942971 ] Meihua Wu commented on SPARK-7129: -- Currently I am not aware of a straightforward way to impose the weak

[jira] [Commented] (SPARK-10669) Link to each language's API in codetabs in ML docs: spark.mllib

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14942990#comment-14942990 ] Apache Spark commented on SPARK-10669: -- User 'keypointt' has created a pull request for this issue:

[jira] [Commented] (SPARK-10909) Spark sql jdbc fails for Oracle NUMBER type columns

2015-10-05 Thread Kostas papageorgopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14942994#comment-14942994 ] Kostas papageorgopoulos commented on SPARK-10909: - I will test it in the first chance and

[jira] [Assigned] (SPARK-10856) SQL Server dialect needs to map java.sql.Timestamp to DATETIME instead of TIMESTAMP

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10856: Assignee: Apache Spark > SQL Server dialect needs to map java.sql.Timestamp to DATETIME

[jira] [Commented] (SPARK-10856) SQL Server dialect needs to map java.sql.Timestamp to DATETIME instead of TIMESTAMP

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943017#comment-14943017 ] Apache Spark commented on SPARK-10856: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10856) SQL Server dialect needs to map java.sql.Timestamp to DATETIME instead of TIMESTAMP

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10856: Assignee: (was: Apache Spark) > SQL Server dialect needs to map java.sql.Timestamp to

[jira] [Commented] (SPARK-2344) Add Fuzzy C-Means algorithm to MLlib

2015-10-05 Thread Alex (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943040#comment-14943040 ] Alex commented on SPARK-2344: - Hi, Sorry it took me a lot of time to get back to you. I'm not yet finished

[jira] [Created] (SPARK-10923) Spark handling parallel requests

2015-10-05 Thread Tarek Abouzeid (JIRA)
Tarek Abouzeid created SPARK-10923: -- Summary: Spark handling parallel requests Key: SPARK-10923 URL: https://issues.apache.org/jira/browse/SPARK-10923 Project: Spark Issue Type: Question

[jira] [Resolved] (SPARK-10923) Spark handling parallel requests

2015-10-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10923. --- Resolution: Invalid Hi [~tarek_abouzeid] please have a look at

[jira] [Created] (SPARK-10924) Failed to update accumulators for ShuffleMapTask: Broken pipe

2015-10-05 Thread JIRA
Pau Tallada Crespí created SPARK-10924: -- Summary: Failed to update accumulators for ShuffleMapTask: Broken pipe Key: SPARK-10924 URL: https://issues.apache.org/jira/browse/SPARK-10924 Project:

[jira] [Commented] (SPARK-10924) Failed to update accumulators for ShuffleMapTask: Broken pipe

2015-10-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943107#comment-14943107 ] Sean Owen commented on SPARK-10924: --- I think this means there's a problem with your python worker -- is

[jira] [Commented] (SPARK-10924) Failed to update accumulators for ShuffleMapTask: Broken pipe

2015-10-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943130#comment-14943130 ] Pau Tallada Crespí commented on SPARK-10924: Oh, this one is the first error: 15/10/05

[jira] [Commented] (SPARK-10924) Failed to update accumulators for ShuffleMapTask: Broken pipe

2015-10-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943131#comment-14943131 ] Pau Tallada Crespí commented on SPARK-10924: >From then on, all are "Broken pipe" kind >

[jira] [Updated] (SPARK-10868) monotonicallyIncreasingId() supports offset for indexing

2015-10-05 Thread Martin Senne (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martin Senne updated SPARK-10868: - Description: With SPARK-7135 and https://github.com/apache/spark/pull/5709

[jira] [Updated] (SPARK-10868) monotonicallyIncreasingId() supports offset for indexing

2015-10-05 Thread Martin Senne (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martin Senne updated SPARK-10868: - Description: With SPARK-7135 and https://github.com/apache/spark/pull/5709

[jira] [Updated] (SPARK-10868) monotonicallyIncreasingId() supports offset for indexing

2015-10-05 Thread Martin Senne (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martin Senne updated SPARK-10868: - Description: With SPARK-7135 and https://github.com/apache/spark/pull/5709

[jira] [Commented] (SPARK-6404) Call broadcast() in each interval for spark streaming programs.

2015-10-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14943143#comment-14943143 ] Kåre Blakstad commented on SPARK-6404: -- I do believe there's an issue with this approach. The first

[jira] [Commented] (SPARK-10585) only copy data once when generate unsafe projection

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944337#comment-14944337 ] Apache Spark commented on SPARK-10585: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-9776) Another instance of Derby may have already booted the database

2015-10-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944340#comment-14944340 ] Michael Armbrust commented on SPARK-9776: - You should not create a HiveContext in the spark-shell.

[jira] [Comment Edited] (SPARK-10560) Make StreamingLogisticRegressionWithSGD Python API equals with Scala one

2015-10-05 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944305#comment-14944305 ] Bryan Cutler edited comment on SPARK-10560 at 10/6/15 1:17 AM: --- Hi

[jira] [Reopened] (SPARK-10066) Can't create HiveContext with spark-shell or spark-sql on snapshot

2015-10-05 Thread Robert Beauchemin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Beauchemin reopened SPARK-10066: --- This problem reappears with Spark 1.5.1. Same HDP/Hive setup and config. > Can't create

[jira] [Commented] (SPARK-9318) Add `merge` as synonym for join

2015-10-05 Thread Narine Kokhlikyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944056#comment-14944056 ] Narine Kokhlikyan commented on SPARK-9318: -- I asked other ppl to try this and they all see k1

[jira] [Commented] (SPARK-10932) Port two minor changes to release packaging scripts back into Spark repo

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944069#comment-14944069 ] Apache Spark commented on SPARK-10932: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10932) Port two minor changes to release packaging scripts back into Spark repo

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10932: Assignee: Josh Rosen (was: Apache Spark) > Port two minor changes to release packaging

[jira] [Assigned] (SPARK-10932) Port two minor changes to release packaging scripts back into Spark repo

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10932: Assignee: Apache Spark (was: Josh Rosen) > Port two minor changes to release packaging

[jira] [Commented] (SPARK-10868) monotonicallyIncreasingId() supports offset for indexing

2015-10-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944108#comment-14944108 ] Reynold Xin commented on SPARK-10868: - [~MartinSenne] this makes sense, and shouldn't be too hard to

[jira] [Commented] (SPARK-10934) hashCode of unsafe array may crush

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944122#comment-14944122 ] Apache Spark commented on SPARK-10934: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10934) hashCode of unsafe array may crush

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10934: Assignee: Apache Spark > hashCode of unsafe array may crush >

[jira] [Created] (SPARK-10935) Avito Context Ad Clicks

2015-10-05 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10935: - Summary: Avito Context Ad Clicks Key: SPARK-10935 URL: https://issues.apache.org/jira/browse/SPARK-10935 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-10935) Avito Context Ad Clicks

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10935: -- Description: >From [~kpl...@gmail.com]: > I would love to do Avito Context Ad Clicks - >

[jira] [Updated] (SPARK-10935) Avito Context Ad Clicks

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10935: -- Description: >From [~kpl...@gmail.com]: > I would love to do Avito Context Ad Clicks - >

[jira] [Updated] (SPARK-10384) Univariate statistics as UDAFs

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10384: -- Description: It would be nice to define univariate statistics as UDAFs. This JIRA discusses

[jira] [Resolved] (SPARK-10862) Univariate Statistics: Adding median & quantile support as UDAF

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10862. --- Resolution: Duplicate > Univariate Statistics: Adding median & quantile support as UDAF >

[jira] [Commented] (SPARK-10066) Can't create HiveContext with spark-shell or spark-sql on snapshot

2015-10-05 Thread Robert Beauchemin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944049#comment-14944049 ] Robert Beauchemin commented on SPARK-10066: --- This problem reappeared in Spark 1.5.1, same exact

[jira] [Created] (SPARK-10932) Port two minor changes to release packaging scripts back into Spark repo

2015-10-05 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-10932: -- Summary: Port two minor changes to release packaging scripts back into Spark repo Key: SPARK-10932 URL: https://issues.apache.org/jira/browse/SPARK-10932 Project: Spark

[jira] [Created] (SPARK-10933) Spark SQL Joins should have option to fail query when row multiplication is encountered

2015-10-05 Thread Stephen Link (JIRA)
Stephen Link created SPARK-10933: Summary: Spark SQL Joins should have option to fail query when row multiplication is encountered Key: SPARK-10933 URL: https://issues.apache.org/jira/browse/SPARK-10933

[jira] [Commented] (SPARK-9941) Try ML pipeline API on Kaggle competitions

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944128#comment-14944128 ] Xiangrui Meng commented on SPARK-9941: -- Created https://issues.apache.org/jira/browse/SPARK-10935. I

[jira] [Updated] (SPARK-10935) Avito Context Ad Clicks

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10935: -- Description: >From [~kpl...@gmail.com]: I would love to do Avito Context Ad Clicks -

[jira] [Updated] (SPARK-10384) Univariate statistics as UDAFs

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10384: -- Description: It would be nice to define univariate statistics as UDAFs. This JIRA discusses

[jira] [Updated] (SPARK-10384) Univariate statistics as UDAFs

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10384: -- Description: It would be nice to define univariate statistics as UDAFs. This JIRA discusses

[jira] [Commented] (SPARK-10877) Assertions fail straightforward DataFrame job due to word alignment

2015-10-05 Thread Jason C Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944141#comment-14944141 ] Jason C Lee commented on SPARK-10877: - I enabled assertions by specifying either the following in my

[jira] [Updated] (SPARK-10384) Univariate statistics as UDAFs

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10384: -- Description: It would be nice to define univariate statistics as UDAFs. This JIRA discusses

[jira] [Commented] (SPARK-10685) Misaligned data with RDD.zip and DataFrame.withColumn after repartition

2015-10-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944093#comment-14944093 ] Josh Rosen commented on SPARK-10685: [~jdanbrown], if the zip after repartition problem is still an

[jira] [Commented] (SPARK-10685) Misaligned data with RDD.zip and DataFrame.withColumn after repartition

2015-10-05 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944112#comment-14944112 ] Davies Liu commented on SPARK-10685: [~jdanbrown] The zip after repartition (or shuffling) is another

[jira] [Created] (SPARK-10934) hashCode of unsafe array may crush

2015-10-05 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-10934: --- Summary: hashCode of unsafe array may crush Key: SPARK-10934 URL: https://issues.apache.org/jira/browse/SPARK-10934 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-10513) Springleaf Marketing Response

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14944121#comment-14944121 ] Xiangrui Meng commented on SPARK-10513: --- Any updates? > Springleaf Marketing Response >

[jira] [Assigned] (SPARK-10934) hashCode of unsafe array may crush

2015-10-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10934: Assignee: (was: Apache Spark) > hashCode of unsafe array may crush >

[jira] [Updated] (SPARK-10382) Make example code in user guide testable

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10382: -- Assignee: (was: Xiangrui Meng) > Make example code in user guide testable >

[jira] [Updated] (SPARK-10384) Univariate statistics as UDAFs

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10384: -- Description: It would be nice to define univariate statistics as UDAFs. This JIRA discusses

[jira] [Updated] (SPARK-10384) Univariate statistics as UDAFs

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10384: -- Description: It would be nice to define univariate statistics as UDAFs. This JIRA discusses

[jira] [Updated] (SPARK-10384) Univariate statistics as UDAFs

2015-10-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10384: -- Description: It would be nice to define univariate statistics as UDAFs. This JIRA discusses

  1   2   3   >