[jira] [Commented] (SPARK-15888) Python UDF over aggregate fails

2016-06-10 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325733#comment-15325733 ] Davies Liu commented on SPARK-15888: After some investigation, it turned out to be that the Python

[jira] [Updated] (SPARK-15888) Python UDF over aggregate fails

2016-06-10 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-15888: --- Summary: Python UDF over aggregate fails (was: UDF fails in Python) > Python UDF over aggregate

[jira] [Commented] (SPARK-15894) Add doc to control #partition for input files

2016-06-10 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325729#comment-15325729 ] Takeshi Yamamuro commented on SPARK-15894: -- cc: [~rxin] [~davies] > Add doc to control

[jira] [Commented] (SPARK-15894) Add doc to control #partition for input files

2016-06-10 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325726#comment-15325726 ] Takeshi Yamamuro commented on SPARK-15894: -- The patch is like

[jira] [Created] (SPARK-15894) Add doc to control #partition for input files

2016-06-10 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-15894: Summary: Add doc to control #partition for input files Key: SPARK-15894 URL: https://issues.apache.org/jira/browse/SPARK-15894 Project: Spark Issue

[jira] [Commented] (SPARK-13207) _SUCCESS should not break partition discovery

2016-06-10 Thread Simeon Simeonov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325718#comment-15325718 ] Simeon Simeonov commented on SPARK-13207: - [~yhuai] The PR associated with that ticket explicitly

[jira] [Resolved] (SPARK-15759) Fallback to non-codegen if fail to compile generated code

2016-06-10 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-15759. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13501

[jira] [Assigned] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15639: Assignee: Apache Spark (was: Liang-Chi Hsieh) > Try to push down filter at RowGroups

[jira] [Assigned] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15639: Assignee: Liang-Chi Hsieh (was: Apache Spark) > Try to push down filter at RowGroups

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325686#comment-15325686 ] Apache Spark commented on SPARK-15585: -- User 'maropu' has created a pull request for this issue:

[jira] [Updated] (SPARK-15678) Not use cache on appends and overwrites

2016-06-10 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-15678: --- Assignee: Sameer Agarwal > Not use cache on appends and overwrites >

[jira] [Resolved] (SPARK-15678) Not use cache on appends and overwrites

2016-06-10 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-15678. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13566

[jira] [Reopened] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-06-10 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian reopened SPARK-15639: We've decided to revert the merged PR, so reopening it. > Try to push down filter at RowGroups level

[jira] [Updated] (SPARK-15369) Investigate selectively using Jython for parts of PySpark

2016-06-10 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-15369: Description: Transferring data from the JVM to the Python executor can be a substantial bottleneck. While

[jira] [Commented] (SPARK-12661) Drop Python 2.6 support in PySpark

2016-06-10 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325678#comment-15325678 ] holdenk commented on SPARK-12661: - What are we missing to drop 2.6 support? We could keep the legacy 2.6

[jira] [Updated] (SPARK-15819) Add KMeanSummary in KMeans of PySpark

2016-06-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated SPARK-15819: --- Component/s: PySpark ML > Add KMeanSummary in KMeans of PySpark >

[jira] [Closed] (SPARK-15751) Add generateAssociationRules in fpm in pyspark

2016-06-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang closed SPARK-15751. -- Resolution: Won't Fix > Add generateAssociationRules in fpm in pyspark >

[jira] [Updated] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-06-10 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15639: --- Assignee: Liang-Chi Hsieh > Try to push down filter at RowGroups level for parquet reader >

[jira] [Updated] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-06-10 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15639: --- Affects Version/s: 2.0.0 > Try to push down filter at RowGroups level for parquet reader >

[jira] [Resolved] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-06-10 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15639. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13371

[jira] [Created] (SPARK-15893) spark.createDataFrame raises an exception in Spark 2.0 tests on Windows

2016-06-10 Thread Alexander Ulanov (JIRA)
Alexander Ulanov created SPARK-15893: Summary: spark.createDataFrame raises an exception in Spark 2.0 tests on Windows Key: SPARK-15893 URL: https://issues.apache.org/jira/browse/SPARK-15893

[jira] [Commented] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String

2016-06-10 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325567#comment-15325567 ] Herman van Hovell commented on SPARK-15822: --- [~robbinspg] I have tried to reproduce the problem

[jira] [Created] (SPARK-15892) aft_survival_regression.py example fails in branch-2.0

2016-06-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-15892: - Summary: aft_survival_regression.py example fails in branch-2.0 Key: SPARK-15892 URL: https://issues.apache.org/jira/browse/SPARK-15892 Project: Spark

[jira] [Commented] (SPARK-15888) UDF fails in Python

2016-06-10 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325558#comment-15325558 ] Xiao Li commented on SPARK-15888: - Sure, will do it. Thanks! > UDF fails in Python > ---

[jira] [Updated] (SPARK-15888) UDF fails in Python

2016-06-10 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-15888: Component/s: SQL > UDF fails in Python > --- > > Key: SPARK-15888 >

[jira] [Created] (SPARK-15891) Make YARN logs less noisy

2016-06-10 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-15891: -- Summary: Make YARN logs less noisy Key: SPARK-15891 URL: https://issues.apache.org/jira/browse/SPARK-15891 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-15884) Override stringArgs method in MapPartitionsInR case class in order to avoid Out Of Mermory exceptions when calling toString

2016-06-10 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15884. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13610

[jira] [Updated] (SPARK-15884) Override stringArgs method in MapPartitionsInR case class in order to avoid Out Of Mermory exceptions when calling toString

2016-06-10 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15884: --- Assignee: Narine Kokhlikyan > Override stringArgs method in MapPartitionsInR case class in order to

[jira] [Assigned] (SPARK-15889) Add a unique id to ContinuousQuery

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15889: Assignee: Tathagata Das (was: Apache Spark) > Add a unique id to ContinuousQuery >

[jira] [Assigned] (SPARK-15889) Add a unique id to ContinuousQuery

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15889: Assignee: Apache Spark (was: Tathagata Das) > Add a unique id to ContinuousQuery >

[jira] [Commented] (SPARK-15889) Add a unique id to ContinuousQuery

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325518#comment-15325518 ] Apache Spark commented on SPARK-15889: -- User 'tdas' has created a pull request for this issue:

[jira] [Commented] (SPARK-14501) spark.ml parity for fpm - frequent items

2016-06-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325515#comment-15325515 ] Jeff Zhang commented on SPARK-14501: working on it. > spark.ml parity for fpm - frequent items >

[jira] [Created] (SPARK-15890) Support Stata-like tabulation of values in a single column, optionally with weights

2016-06-10 Thread Shafique Jamal (JIRA)
Shafique Jamal created SPARK-15890: -- Summary: Support Stata-like tabulation of values in a single column, optionally with weights Key: SPARK-15890 URL: https://issues.apache.org/jira/browse/SPARK-15890

[jira] [Created] (SPARK-15889) Add a unique id to ContinuousQuery

2016-06-10 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-15889: - Summary: Add a unique id to ContinuousQuery Key: SPARK-15889 URL: https://issues.apache.org/jira/browse/SPARK-15889 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325470#comment-15325470 ] Apache Spark commented on SPARK-15851: -- User 'avulanov' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15851: Assignee: Apache Spark > Spark 2.0 does not compile in Windows 7 >

[jira] [Assigned] (SPARK-15851) Spark 2.0 does not compile in Windows 7

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15851: Assignee: (was: Apache Spark) > Spark 2.0 does not compile in Windows 7 >

[jira] [Commented] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String

2016-06-10 Thread Adam Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325465#comment-15325465 ] Adam Roberts commented on SPARK-15822: -- I added a link above to the dataset, it's 658mb when

[jira] [Updated] (SPARK-15888) UDF fails in Python

2016-06-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-15888: - Shepherd: Davies Liu > UDF fails in Python > --- > > Key: SPARK-15888 >

[jira] [Commented] (SPARK-15888) UDF fails in Python

2016-06-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325432#comment-15325432 ] Yin Huai commented on SPARK-15888: -- [~davies] I am putting you as the shepherd. > UDF fails in Python >

[jira] [Resolved] (SPARK-15773) Avoid creating local variable `sc` in examples if possible

2016-06-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15773. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > Avoid creating

[jira] [Commented] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String

2016-06-10 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325417#comment-15325417 ] Davies Liu commented on SPARK-15822: The latest stacktrace is different than previous one, it seems

[jira] [Commented] (SPARK-15888) UDF fails in Python

2016-06-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325413#comment-15325413 ] Yin Huai commented on SPARK-15888: -- [~smilegator] anyone from your side has time to take a look at this?

[jira] [Updated] (SPARK-15888) UDF fails in Python

2016-06-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-15888: - Target Version/s: 2.0.0 > UDF fails in Python > --- > > Key: SPARK-15888

[jira] [Created] (SPARK-15888) UDF fails in Python

2016-06-10 Thread Vladimir Feinberg (JIRA)
Vladimir Feinberg created SPARK-15888: - Summary: UDF fails in Python Key: SPARK-15888 URL: https://issues.apache.org/jira/browse/SPARK-15888 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-06-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325402#comment-15325402 ] Reynold Xin commented on SPARK-15581: - Note that there is a big, non-ML factor for breeze. It is a

[jira] [Assigned] (SPARK-15887) Bring back the hive-site.xml support for Spark 2.0

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15887: Assignee: Wenchen Fan (was: Apache Spark) > Bring back the hive-site.xml support for

[jira] [Assigned] (SPARK-15887) Bring back the hive-site.xml support for Spark 2.0

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15887: Assignee: Apache Spark (was: Wenchen Fan) > Bring back the hive-site.xml support for

[jira] [Commented] (SPARK-15887) Bring back the hive-site.xml support for Spark 2.0

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325386#comment-15325386 ] Apache Spark commented on SPARK-15887: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-06-10 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325377#comment-15325377 ] Alexander Ulanov commented on SPARK-15581: -- I would like to comment on Breeze and deep learning

[jira] [Created] (SPARK-15887) Bring back the hive-site.xml support for Spark 2.0

2016-06-10 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-15887: --- Summary: Bring back the hive-site.xml support for Spark 2.0 Key: SPARK-15887 URL: https://issues.apache.org/jira/browse/SPARK-15887 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-15884) Override stringArgs method in MapPartitionsInR case class in order to avoid Out Of Mermory exceptions when calling toString

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15884: Assignee: (was: Apache Spark) > Override stringArgs method in MapPartitionsInR case

[jira] [Commented] (SPARK-15884) Override stringArgs method in MapPartitionsInR case class in order to avoid Out Of Mermory exceptions when calling toString

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325360#comment-15325360 ] Apache Spark commented on SPARK-15884: -- User 'NarineK' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15884) Override stringArgs method in MapPartitionsInR case class in order to avoid Out Of Mermory exceptions when calling toString

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15884: Assignee: Apache Spark > Override stringArgs method in MapPartitionsInR case class in

[jira] [Created] (SPARK-15886) PySpark ML examples should use local linear algebra

2016-06-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-15886: - Summary: PySpark ML examples should use local linear algebra Key: SPARK-15886 URL: https://issues.apache.org/jira/browse/SPARK-15886 Project: Spark

[jira] [Closed] (SPARK-15886) PySpark ML examples should use local linear algebra

2016-06-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-15886. - Resolution: Duplicate Oh, just saw in the PR that you'd send a follow-up since the

[jira] [Updated] (SPARK-15862) Better Error Message When Having Database Name in CACHE TABLE AS SELECT

2016-06-10 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15862: --- Assignee: Xiao Li > Better Error Message When Having Database Name in CACHE TABLE AS SELECT >

[jira] [Created] (SPARK-15885) Provide to executor logs from stage details page in UI

2016-06-10 Thread Tom Magrino (JIRA)
Tom Magrino created SPARK-15885: --- Summary: Provide to executor logs from stage details page in UI Key: SPARK-15885 URL: https://issues.apache.org/jira/browse/SPARK-15885 Project: Spark Issue

[jira] [Updated] (SPARK-15885) Provide links to executor logs from stage details page in UI

2016-06-10 Thread Tom Magrino (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tom Magrino updated SPARK-15885: Summary: Provide links to executor logs from stage details page in UI (was: Provide to executor

[jira] [Commented] (SPARK-13207) _SUCCESS should not break partition discovery

2016-06-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325337#comment-15325337 ] Yin Huai commented on SPARK-13207: -- Hey [~simeons], sorry for late reply. SPARK-15454 has fixed this

[jira] [Created] (SPARK-15884) Override stringArgs method in MapPartitionsInR case class in order to avoid Out Of Mermory exceptions when calling toString

2016-06-10 Thread Narine Kokhlikyan (JIRA)
Narine Kokhlikyan created SPARK-15884: - Summary: Override stringArgs method in MapPartitionsInR case class in order to avoid Out Of Mermory exceptions when calling toString Key: SPARK-15884 URL:

[jira] [Resolved] (SPARK-15688) RelationalGroupedDataset.toDF should not add group by expressions that are already added in the aggregate expressions.

2016-06-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15688. -- Resolution: Won't Fix https://github.com/apache/spark/pull/13483#issuecomment-224758653 >

[jira] [Comment Edited] (SPARK-15822) segmentation violation in o.a.s.unsafe.types.UTF8String

2016-06-10 Thread Adam Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15324475#comment-15324475 ] Adam Roberts edited comment on SPARK-15822 at 6/10/16 9:39 PM: --- Herman,

[jira] [Resolved] (SPARK-15489) Dataset kryo encoder won't load custom user settings

2016-06-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-15489. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13424

[jira] [Updated] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2016-06-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15784: -- Issue Type: New Feature (was: Improvement) > Add Power Iteration Clustering to

[jira] [Resolved] (SPARK-15654) Reading gzipped files results in duplicate rows

2016-06-10 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-15654. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13531

[jira] [Resolved] (SPARK-15825) sort-merge-join gives invalid results when joining on a tupled key

2016-06-10 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-15825. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13589

[jira] [Commented] (SPARK-15790) Audit @Since annotations in ML

2016-06-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325317#comment-15325317 ] Joseph K. Bradley commented on SPARK-15790: --- Linking existing umbrella. Also, I want to note:

[jira] [Assigned] (SPARK-15879) Update logo in UI and docs to add "Apache"

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15879: Assignee: (was: Apache Spark) > Update logo in UI and docs to add "Apache" >

[jira] [Commented] (SPARK-15879) Update logo in UI and docs to add "Apache"

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325313#comment-15325313 ] Apache Spark commented on SPARK-15879: -- User 'srowen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15879) Update logo in UI and docs to add "Apache"

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15879: Assignee: Apache Spark > Update logo in UI and docs to add "Apache" >

[jira] [Commented] (SPARK-15751) Add generateAssociationRules in fpm in pyspark

2016-06-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325307#comment-15325307 ] Joseph K. Bradley commented on SPARK-15751: --- There isn't a JIRA for this AFAIK, but I think we

[jira] [Assigned] (SPARK-15881) Update microbenchmark results

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15881: Assignee: Apache Spark > Update microbenchmark results > - >

[jira] [Commented] (SPARK-15746) SchemaUtils.checkColumnType with VectorUDT prints instance details in error message

2016-06-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325294#comment-15325294 ] Joseph K. Bradley commented on SPARK-15746: --- Either fix seems fine to me. Modifying

[jira] [Commented] (SPARK-15883) Fix broken links on MLLIB documentations

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325295#comment-15325295 ] Apache Spark commented on SPARK-15883: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-15883) Fix broken links on MLLIB documentations

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15883: Assignee: (was: Apache Spark) > Fix broken links on MLLIB documentations >

[jira] [Assigned] (SPARK-15883) Fix broken links on MLLIB documentations

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15883: Assignee: Apache Spark > Fix broken links on MLLIB documentations >

[jira] [Resolved] (SPARK-15628) pyspark.ml.evaluation module

2016-06-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-15628. --- Resolution: Done Assignee: holdenk [~holdenk] OK, so no missing items? I'll

[jira] [Updated] (SPARK-15883) Fix broken links on MLLIB documentations

2016-06-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-15883: -- Description: This issue fixes all broken links on Spark 2.0 preview MLLib documents. Also,

[jira] [Updated] (SPARK-15883) Fix broken links on MLLIB documentations

2016-06-10 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-15883: -- Description: This issue fixes all broken links on Spark 2.0 preview MLLib documents. Also,

[jira] [Created] (SPARK-15883) Fix broken links on MLLIB documentations

2016-06-10 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-15883: - Summary: Fix broken links on MLLIB documentations Key: SPARK-15883 URL: https://issues.apache.org/jira/browse/SPARK-15883 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-15699) Add chi-squared test statistic as a split quality metric for decision trees

2016-06-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325286#comment-15325286 ] Joseph K. Bradley commented on SPARK-15699: --- [~eje] Just a warning: There are a lot of doc

[jira] [Assigned] (SPARK-15881) Update microbenchmark results

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15881: Assignee: (was: Apache Spark) > Update microbenchmark results >

[jira] [Commented] (SPARK-15881) Update microbenchmark results

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325282#comment-15325282 ] Apache Spark commented on SPARK-15881: -- User 'ericl' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-15628) pyspark.ml.evaluation module

2016-06-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325278#comment-15325278 ] Joseph K. Bradley edited comment on SPARK-15628 at 6/10/16 9:05 PM:

[jira] [Created] (SPARK-15882) Discuss distributed linear algebra in spark.ml package

2016-06-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-15882: - Summary: Discuss distributed linear algebra in spark.ml package Key: SPARK-15882 URL: https://issues.apache.org/jira/browse/SPARK-15882 Project: Spark

[jira] [Created] (SPARK-15881) Update microbenchmark results

2016-06-10 Thread Eric Liang (JIRA)
Eric Liang created SPARK-15881: -- Summary: Update microbenchmark results Key: SPARK-15881 URL: https://issues.apache.org/jira/browse/SPARK-15881 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-15086) Update Java API once the Scala one is finalized

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15086: Assignee: Apache Spark > Update Java API once the Scala one is finalized >

[jira] [Assigned] (SPARK-15086) Update Java API once the Scala one is finalized

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15086: Assignee: (was: Apache Spark) > Update Java API once the Scala one is finalized >

[jira] [Commented] (SPARK-15086) Update Java API once the Scala one is finalized

2016-06-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325254#comment-15325254 ] Apache Spark commented on SPARK-15086: -- User 'srowen' has created a pull request for this issue:

[jira] [Resolved] (SPARK-15738) PySpark ml.feature RFormula missing string representation displaying formula

2016-06-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-15738. - Resolution: Fixed Fix Version/s: 2.0.0 > PySpark ml.feature RFormula missing string

[jira] [Updated] (SPARK-15738) PySpark ml.feature RFormula missing string representation displaying formula

2016-06-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15738: Assignee: Bryan Cutler > PySpark ml.feature RFormula missing string representation displaying

[jira] [Updated] (SPARK-15782) --packages doesn't work with the spark-shell

2016-06-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-15782: --- Target Version/s: 2.0.0 Priority: Blocker (was: Major) Component/s:

[jira] [Resolved] (SPARK-15875) Avoid using Seq.length == 0 and Seq.lenth > 0. Use Seq.isEmpty and Seq.nonEmpty instead.

2016-06-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15875. - Resolution: Fixed Assignee: Yang Wang Fix Version/s: 2.0.0 > Avoid using

[jira] [Resolved] (SPARK-6320) Adding new query plan strategy to SQLContext

2016-06-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6320. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13147

[jira] [Resolved] (SPARK-15871) Add assertNotPartitioned check in DataFrameWriter

2016-06-10 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-15871. -- Resolution: Fixed Assignee: Liwei Lin Fix Version/s: 2.0.0 > Add

[jira] [Resolved] (SPARK-14485) Task finished cause fetch failure when its executor has already been removed by driver

2016-06-10 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-14485. Resolution: Won't Fix > Task finished cause fetch failure when its executor has already

[jira] [Commented] (SPARK-14485) Task finished cause fetch failure when its executor has already been removed by driver

2016-06-10 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15325175#comment-15325175 ] Kay Ousterhout commented on SPARK-14485: Reverted this and re-opened the JIRA to mark this as

[jira] [Reopened] (SPARK-14485) Task finished cause fetch failure when its executor has already been removed by driver

2016-06-10 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout reopened SPARK-14485: > Task finished cause fetch failure when its executor has already been removed > by driver >

[jira] [Updated] (SPARK-14485) Task finished cause fetch failure when its executor has already been removed by driver

2016-06-10 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-14485: --- Fix Version/s: (was: 2.0.0) > Task finished cause fetch failure when its executor has

  1   2   3   >