[jira] [Updated] (SPARK-11723) Use LibSVM data source rather than MLUtils.loadLibSVMFile to load DataFrame

2015-11-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11723: Summary: Use LibSVM data source rather than MLUtils.loadLibSVMFile to load DataFrame (was: Use

[jira] [Assigned] (SPARK-11723) Use LibSVM data source rather than MLUtils.loadLibSVMFile to load DataFrame

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11723: Assignee: (was: Apache Spark) > Use LibSVM data source rather than

[jira] [Commented] (SPARK-11638) Apache Spark in Docker with Bridge networking / run Spark on Mesos, in Docker with Bridge networking

2015-11-13 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003875#comment-15003875 ] Stavros Kontopoulos commented on SPARK-11638: - Ok it makes senseso executors could run in

[jira] [Commented] (SPARK-11617) MEMORY LEAK: ByteBuf.release() was not called before it's garbage-collected

2015-11-13 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003879#comment-15003879 ] Jacek Lewandowski commented on SPARK-11617: --- This also happens in standalone mode, Netty based

[jira] [Commented] (SPARK-11638) Apache Spark in Docker with Bridge networking / run Spark on Mesos, in Docker with Bridge networking

2015-11-13 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003889#comment-15003889 ] Stavros Kontopoulos commented on SPARK-11638: - Ok so the question from my side is that if ips

[jira] [Commented] (SPARK-11638) Apache Spark in Docker with Bridge networking / run Spark on Mesos, in Docker with Bridge networking

2015-11-13 Thread Radoslaw Gruchalski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003905#comment-15003905 ] Radoslaw Gruchalski commented on SPARK-11638: - Exactly, the only "problematic" thing is how

[jira] [Commented] (SPARK-11638) Apache Spark in Docker with Bridge networking / run Spark on Mesos, in Docker with Bridge networking

2015-11-13 Thread Radoslaw Gruchalski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003878#comment-15003878 ] Radoslaw Gruchalski commented on SPARK-11638: - Indeed, executors can run in docker containers

[jira] [Comment Edited] (SPARK-11638) Apache Spark in Docker with Bridge networking / run Spark on Mesos, in Docker with Bridge networking

2015-11-13 Thread Radoslaw Gruchalski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003905#comment-15003905 ] Radoslaw Gruchalski edited comment on SPARK-11638 at 11/13/15 12:08 PM:

[jira] [Commented] (SPARK-11721) The programming guide for Spark SQL in Spark 1.3.0 needs additional imports to work

2015-11-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003701#comment-15003701 ] Sean Owen commented on SPARK-11721: --- I don't think there will be any further 1.3.x releases so I don't

[jira] [Resolved] (SPARK-11706) Streaming Python tests cannot report failures

2015-11-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-11706. --- Resolution: Fixed Assignee: Shixiong Zhu Fix Version/s: 1.6.0 > Streaming

[jira] [Assigned] (SPARK-11549) Replace example code in mllib-evaluation-metrics.md using include_example

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11549: Assignee: Apache Spark > Replace example code in mllib-evaluation-metrics.md using

[jira] [Commented] (SPARK-11549) Replace example code in mllib-evaluation-metrics.md using include_example

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003789#comment-15003789 ] Apache Spark commented on SPARK-11549: -- User 'vikasnp' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11549) Replace example code in mllib-evaluation-metrics.md using include_example

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11549: Assignee: (was: Apache Spark) > Replace example code in mllib-evaluation-metrics.md

[jira] [Commented] (SPARK-2960) Spark executables fail to start via symlinks

2015-11-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003697#comment-15003697 ] Sean Owen commented on SPARK-2960: -- I don't think that's a problem. If {{SPARK_HOME}} is explicitly set,

[jira] [Closed] (SPARK-11721) The programming guide for Spark SQL in Spark 1.3.0 needs additional imports to work

2015-11-13 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neelesh Srinivas Salian closed SPARK-11721. --- Resolution: Implemented Fix Version/s: 1.3.0 > The programming guide

[jira] [Commented] (SPARK-11601) ML 1.6 QA: API: Binary incompatible changes

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004216#comment-15004216 ] Xiangrui Meng commented on SPARK-11601: --- * LogisticAggregator is a package private class, or inside

[jira] [Updated] (SPARK-11723) Use LibSVM data source rather than MLUtils.loadLibSVMFile to load DataFrame

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11723: -- Target Version/s: 1.6.0 > Use LibSVM data source rather than MLUtils.loadLibSVMFile to load

[jira] [Updated] (SPARK-11723) Use LibSVM data source rather than MLUtils.loadLibSVMFile to load DataFrame

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11723: -- Assignee: Yanbo Liang > Use LibSVM data source rather than MLUtils.loadLibSVMFile to load

[jira] [Resolved] (SPARK-11723) Use LibSVM data source rather than MLUtils.loadLibSVMFile to load DataFrame

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11723. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9690

[jira] [Commented] (SPARK-11721) The programming guide for Spark SQL in Spark 1.3.0 needs additional imports to work

2015-11-13 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004265#comment-15004265 ] Neelesh Srinivas Salian commented on SPARK-11721: - Didn't work when I tried. Explicitly

[jira] [Reopened] (SPARK-11672) Flaky test: ml.JavaDefaultReadWriteSuite

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-11672: --- Saw another one:

[jira] [Commented] (SPARK-11672) Flaky test: ml.JavaDefaultReadWriteSuite

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004290#comment-15004290 ] Apache Spark commented on SPARK-11672: -- User 'mengxr' has created a pull request for this issue:

[jira] [Commented] (SPARK-11721) The programming guide for Spark SQL in Spark 1.3.0 needs additional imports to work

2015-11-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004305#comment-15004305 ] Sean Owen commented on SPARK-11721: --- Hm, what do you mean it hasn't been published? I'm referring to

[jira] [Updated] (SPARK-11678) Partition discovery fail if there is a _SUCCESS file in the table's root dir

2015-11-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-11678: - Fix Version/s: (was: 1.7.0) > Partition discovery fail if there is a _SUCCESS file in the table's

[jira] [Created] (SPARK-11728) Replace example code in ml-ensembles.md using include_example

2015-11-13 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-11728: - Summary: Replace example code in ml-ensembles.md using include_example Key: SPARK-11728 URL: https://issues.apache.org/jira/browse/SPARK-11728 Project: Spark

[jira] [Commented] (SPARK-11720) Return Double.NaN instead of null for Mean and Average when count = 0

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004206#comment-15004206 ] Xiangrui Meng commented on SPARK-11720: --- Computing average in the normal way should be sufficient.

[jira] [Comment Edited] (SPARK-11720) Return Double.NaN instead of null for Mean and Average when count = 0

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004206#comment-15004206 ] Xiangrui Meng edited comment on SPARK-11720 at 11/13/15 4:25 PM: -

[jira] [Commented] (SPARK-11668) R style summary stats in GLM package SparkR

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004228#comment-15004228 ] Xiangrui Meng commented on SPARK-11668: --- [~shubhanshumis...@gmail.com] [~yanboliang] implemented

[jira] [Commented] (SPARK-11727) split ExpressionEncoder into FlatEncoder and ProductEncoder

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004092#comment-15004092 ] Apache Spark commented on SPARK-11727: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11727) split ExpressionEncoder into FlatEncoder and ProductEncoder

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11727: Assignee: Apache Spark > split ExpressionEncoder into FlatEncoder and ProductEncoder >

[jira] [Assigned] (SPARK-11727) split ExpressionEncoder into FlatEncoder and ProductEncoder

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11727: Assignee: (was: Apache Spark) > split ExpressionEncoder into FlatEncoder and

[jira] [Updated] (SPARK-9647) MLlib + SparkR integration for 1.6

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9647: - Description: This is an umbrella JIRA for MLlib + SparkR integration for Spark 1.6, continuing

[jira] [Commented] (SPARK-10673) spark.sql.hive.verifyPartitionPath Attempts to Verify Unregistered Partitions

2015-11-13 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004240#comment-15004240 ] Xin Wu commented on SPARK-10673: I will look into this one. > spark.sql.hive.verifyPartitionPath

[jira] [Comment Edited] (SPARK-11721) The programming guide for Spark SQL in Spark 1.3.0 needs additional imports to work

2015-11-13 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004265#comment-15004265 ] Neelesh Srinivas Salian edited comment on SPARK-11721 at 11/13/15 4:48 PM:

[jira] [Created] (SPARK-11727) split ExpressionEncoder into FlatEncoder and ProductEncoder

2015-11-13 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-11727: --- Summary: split ExpressionEncoder into FlatEncoder and ProductEncoder Key: SPARK-11727 URL: https://issues.apache.org/jira/browse/SPARK-11727 Project: Spark

[jira] [Updated] (SPARK-11720) Return Double.NaN instead of null for Mean and Average when count = 0

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11720: -- Target Version/s: 1.6.0 > Return Double.NaN instead of null for Mean and Average when count =

[jira] [Updated] (SPARK-11720) Return Double.NaN instead of null for Mean and Average when count = 0

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11720: -- Priority: Minor (was: Major) > Return Double.NaN instead of null for Mean and Average when

[jira] [Resolved] (SPARK-11445) Replace example code in mllib-ensembles.md using include_example

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11445. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9407

[jira] [Updated] (SPARK-11678) Partition discovery fail if there is a _SUCCESS file in the table's root dir

2015-11-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-11678: - Labels: releasenotes (was: ) > Partition discovery fail if there is a _SUCCESS file in the table's root

[jira] [Commented] (SPARK-11678) Partition discovery fail if there is a _SUCCESS file in the table's root dir

2015-11-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004316#comment-15004316 ] Yin Huai commented on SPARK-11678: -- We need to document the newly added {{basePath}} option. >

[jira] [Created] (SPARK-11729) Replace example code in ml-linear-methods.md and ml-ann.md using include_example

2015-11-13 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-11729: - Summary: Replace example code in ml-linear-methods.md and ml-ann.md using include_example Key: SPARK-11729 URL: https://issues.apache.org/jira/browse/SPARK-11729 Project:

[jira] [Commented] (SPARK-11720) Return Double.NaN instead of null for Mean and Average when count = 0

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004210#comment-15004210 ] Xiangrui Meng commented on SPARK-11720: --- If we don't have Decimal.NaN implemented, it is okay to

[jira] [Updated] (SPARK-11445) Replace example code in mllib-ensembles.md using include_example

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11445: -- Assignee: Rishabh Bhardwaj > Replace example code in mllib-ensembles.md using include_example

[jira] [Commented] (SPARK-11693) spark kafka direct streaming exception

2015-11-13 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004072#comment-15004072 ] Cody Koeninger commented on SPARK-11693: You've under-provisioned Kafka storage and / or Spark

[jira] [Resolved] (SPARK-9647) MLlib + SparkR integration for 1.6

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-9647. -- Resolution: Fixed Fix Version/s: 1.6.0 Marked this umbrella as done. We will track

[jira] [Resolved] (SPARK-11668) R style summary stats in GLM package SparkR

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11668. --- Resolution: Duplicate > R style summary stats in GLM package SparkR >

[jira] [Commented] (SPARK-8029) ShuffleMapTasks must be robust to concurrent attempts on the same executor

2015-11-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004527#comment-15004527 ] Reynold Xin commented on SPARK-8029: [~davies] can you update the jira ticket description with the

[jira] [Updated] (SPARK-11724) Casting integer types to timestamp has unexpected semantics

2015-11-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-11724: Labels: releasenotes (was: ) > Casting integer types to timestamp has unexpected semantics >

[jira] [Resolved] (SPARK-11727) split ExpressionEncoder into FlatEncoder and ProductEncoder

2015-11-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-11727. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9693

[jira] [Assigned] (SPARK-2344) Add Fuzzy C-Means algorithm to MLlib

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2344: --- Assignee: Apache Spark > Add Fuzzy C-Means algorithm to MLlib >

[jira] [Assigned] (SPARK-11734) Move reference sort into test and standardize on TungstenSort

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11734: Assignee: Apache Spark (was: Reynold Xin) > Move reference sort into test and

[jira] [Commented] (SPARK-11734) Move reference sort into test and standardize on TungstenSort

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004597#comment-15004597 ] Apache Spark commented on SPARK-11734: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11734) Move reference sort into test and standardize on TungstenSort

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11734: Assignee: Reynold Xin (was: Apache Spark) > Move reference sort into test and

[jira] [Commented] (SPARK-6990) Add Java linting script

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004374#comment-15004374 ] Apache Spark commented on SPARK-6990: - User 'dskrvk' has created a pull request for this issue:

[jira] [Commented] (SPARK-11601) ML 1.6 QA: API: Binary incompatible changes

2015-11-13 Thread Tim Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004419#comment-15004419 ] Tim Hunter commented on SPARK-11601: Oh I see, two of them are false positives (SPARK-11732) and I

[jira] [Commented] (SPARK-10863) Method coltypes() to return the R column types of a DataFrame

2015-11-13 Thread Oscar D. Lara Yejas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004363#comment-15004363 ] Oscar D. Lara Yejas commented on SPARK-10863: - [~felixcheung] Let me try to clarify a bit.

[jira] [Created] (SPARK-11732) MiMa excludes miss private classes

2015-11-13 Thread Tim Hunter (JIRA)
Tim Hunter created SPARK-11732: -- Summary: MiMa excludes miss private classes Key: SPARK-11732 URL: https://issues.apache.org/jira/browse/SPARK-11732 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-11732) MiMa excludes miss private classes

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11732: Assignee: Apache Spark > MiMa excludes miss private classes >

[jira] [Commented] (SPARK-11454) DB2 dialect - map DB2 ROWID and TIMESTAMP with TIMEZONE types into valid Spark types

2015-11-13 Thread Suresh Thalamati (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004488#comment-15004488 ] Suresh Thalamati commented on SPARK-11454: -- I am looking into fixing this Jira along with

[jira] [Commented] (SPARK-11731) Enable batching on Driver WriteAheadLog by default

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004356#comment-15004356 ] Apache Spark commented on SPARK-11731: -- User 'brkyvz' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11731) Enable batching on Driver WriteAheadLog by default

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11731: Assignee: (was: Apache Spark) > Enable batching on Driver WriteAheadLog by default >

[jira] [Assigned] (SPARK-11731) Enable batching on Driver WriteAheadLog by default

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11731: Assignee: Apache Spark > Enable batching on Driver WriteAheadLog by default >

[jira] [Comment Edited] (SPARK-10863) Method coltypes() to return the R column types of a DataFrame

2015-11-13 Thread Oscar D. Lara Yejas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004363#comment-15004363 ] Oscar D. Lara Yejas edited comment on SPARK-10863 at 11/13/15 5:58 PM:

[jira] [Created] (SPARK-11731) Enable batching on Driver WriteAheadLog by default

2015-11-13 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-11731: --- Summary: Enable batching on Driver WriteAheadLog by default Key: SPARK-11731 URL: https://issues.apache.org/jira/browse/SPARK-11731 Project: Spark Issue Type:

[jira] [Created] (SPARK-11730) Feature Importance for GBT

2015-11-13 Thread Brian Webb (JIRA)
Brian Webb created SPARK-11730: -- Summary: Feature Importance for GBT Key: SPARK-11730 URL: https://issues.apache.org/jira/browse/SPARK-11730 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-11733) Allow shuffle readers to request data from just one mapper

2015-11-13 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-11733: - Summary: Allow shuffle readers to request data from just one mapper Key: SPARK-11733 URL: https://issues.apache.org/jira/browse/SPARK-11733 Project: Spark

[jira] [Updated] (SPARK-10408) Autoencoder

2015-11-13 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Ulanov updated SPARK-10408: - Description: Goal: Implement various types of autoencoders Requirements: 1)Basic (deep)

[jira] [Updated] (SPARK-11724) Casting integer types to timestamp has unexpected semantics

2015-11-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-11724: Assignee: Nong Li > Casting integer types to timestamp has unexpected semantics >

[jira] [Assigned] (SPARK-2344) Add Fuzzy C-Means algorithm to MLlib

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2344: --- Assignee: (was: Apache Spark) > Add Fuzzy C-Means algorithm to MLlib >

[jira] [Commented] (SPARK-2344) Add Fuzzy C-Means algorithm to MLlib

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004589#comment-15004589 ] Apache Spark commented on SPARK-2344: - User 'acflorea' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-10863) Method coltypes() to return the R column types of a DataFrame

2015-11-13 Thread Oscar D. Lara Yejas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004363#comment-15004363 ] Oscar D. Lara Yejas edited comment on SPARK-10863 at 11/13/15 5:54 PM:

[jira] [Commented] (SPARK-11732) MiMa excludes miss private classes

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004431#comment-15004431 ] Apache Spark commented on SPARK-11732: -- User 'thunterdb' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11732) MiMa excludes miss private classes

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11732: Assignee: (was: Apache Spark) > MiMa excludes miss private classes >

[jira] [Updated] (SPARK-11690) Add pivot to python api

2015-11-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-11690: - Assignee: Andrew Ray > Add pivot to python api > --- > > Key:

[jira] [Resolved] (SPARK-11690) Add pivot to python api

2015-11-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-11690. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9653

[jira] [Created] (SPARK-11734) Move reference sort into test and standardize on TungstenSort

2015-11-13 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-11734: --- Summary: Move reference sort into test and standardize on TungstenSort Key: SPARK-11734 URL: https://issues.apache.org/jira/browse/SPARK-11734 Project: Spark

[jira] [Commented] (SPARK-9762) ALTER TABLE cannot find column

2015-11-13 Thread Shaun A Elliott (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004359#comment-15004359 ] Shaun A Elliott commented on SPARK-9762: Is there a workaround for this at all? > ALTER TABLE

[jira] [Commented] (SPARK-10863) Method coltypes() to return the R column types of a DataFrame

2015-11-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004442#comment-15004442 ] Felix Cheung commented on SPARK-10863: -- I understand but I think this is problematic in several

[jira] [Assigned] (SPARK-11643) inserting date with leading zero inserts null example '0001-12-10'

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11643: Assignee: Apache Spark (was: Davies Liu) > inserting date with leading zero inserts null

[jira] [Assigned] (SPARK-11643) inserting date with leading zero inserts null example '0001-12-10'

2015-11-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-11643: -- Assignee: Davies Liu > inserting date with leading zero inserts null example '0001-12-10' >

[jira] [Commented] (SPARK-11643) inserting date with leading zero inserts null example '0001-12-10'

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004622#comment-15004622 ] Apache Spark commented on SPARK-11643: -- User 'davies' has created a pull request for this issue:

[jira] [Updated] (SPARK-8029) ShuffleMapTasks must be robust to concurrent attempts on the same executor

2015-11-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8029: - Target Version/s: 1.5.3, 1.6.0 (was: 1.5.2, 1.6.0) > ShuffleMapTasks must be robust to concurrent

[jira] [Assigned] (SPARK-11720) Return Double.NaN instead of null for Mean and Average when count = 0

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11720: Assignee: (was: Apache Spark) > Return Double.NaN instead of null for Mean and

[jira] [Commented] (SPARK-11720) Return Double.NaN instead of null for Mean and Average when count = 0

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004791#comment-15004791 ] Apache Spark commented on SPARK-11720: -- User 'JihongMA' has created a pull request for this issue:

[jira] [Updated] (SPARK-7308) Should there be multiple concurrent attempts for one stage?

2015-11-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7308: - Assignee: Davies Liu > Should there be multiple concurrent attempts for one stage? >

[jira] [Created] (SPARK-11740) Fix DStream checkpointing logic to prevent failures during checkpoint recovery

2015-11-13 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-11740: Summary: Fix DStream checkpointing logic to prevent failures during checkpoint recovery Key: SPARK-11740 URL: https://issues.apache.org/jira/browse/SPARK-11740

[jira] [Updated] (SPARK-8029) ShuffleMapTasks must be robust to concurrent attempts on the same executor

2015-11-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8029: - Description: When stages get retried, a task may have more than one attempt running at the same time, on

[jira] [Resolved] (SPARK-7829) SortShuffleWriter writes inconsistent data & index files on stage retry

2015-11-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-7829. -- Resolution: Fixed Assignee: Davies Liu (was: Imran Rashid) Fix Version/s:

[jira] [Updated] (SPARK-8029) ShuffleMapTasks must be robust to concurrent attempts on the same executor

2015-11-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8029: - Description: When stages get retried, a task may have more than one attempt running at the same time, on

[jira] [Commented] (SPARK-7829) SortShuffleWriter writes inconsistent data & index files on stage retry

2015-11-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004780#comment-15004780 ] Andrew Or commented on SPARK-7829: -- I believe this is now fixed due to

[jira] [Commented] (SPARK-10712) JVM crashes with spark.sql.tungsten.enabled = true

2015-11-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004636#comment-15004636 ] Davies Liu commented on SPARK-10712: How is you small table looks like? Does 1.5.2-RC2 still have

[jira] [Commented] (SPARK-11737) String may not be serialized correctly with Kyro

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004734#comment-15004734 ] Apache Spark commented on SPARK-11737: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11737) String may not be serialized correctly with Kyro

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11737: Assignee: Apache Spark (was: Davies Liu) > String may not be serialized correctly with

[jira] [Updated] (SPARK-11336) Include path to the source file in generated example code

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11336: -- Summary: Include path to the source file in generated example code (was: Include a link to

[jira] [Resolved] (SPARK-11336) Include a link to the source file in generated example code

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-11336. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9320

[jira] [Updated] (SPARK-11336) Include path to the source file in generated example code

2015-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11336: -- Description: It would be nice to include -a link- the path to the example source file at the

[jira] [Assigned] (SPARK-11720) Return Double.NaN instead of null for Mean and Average when count = 0

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11720: Assignee: Apache Spark > Return Double.NaN instead of null for Mean and Average when

[jira] [Created] (SPARK-11735) Add a check in the constructor of SqlContext to make sure the SparkContext is not stopped

2015-11-13 Thread Yin Huai (JIRA)
Yin Huai created SPARK-11735: Summary: Add a check in the constructor of SqlContext to make sure the SparkContext is not stopped Key: SPARK-11735 URL: https://issues.apache.org/jira/browse/SPARK-11735

[jira] [Assigned] (SPARK-11736) Add MonotonicallyIncreasingID to function registry

2015-11-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11736: Assignee: Apache Spark (was: Yin Huai) > Add MonotonicallyIncreasingID to function

[jira] [Updated] (SPARK-8029) ShuffleMapTasks must be robust to concurrent attempts on the same executor

2015-11-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8029: - Fix Version/s: (was: 1.5.2) 1.5.3 > ShuffleMapTasks must be robust to concurrent

  1   2   >