[jira] [Updated] (SPARK-6636) Use public DNS hostname everywhere in spark_ec2.py

2015-04-06 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6636: -- Assignee: Matt Aasted > Use public DNS hostname everywhere in spark_ec2.py > ---

[jira] [Resolved] (SPARK-6636) Use public DNS hostname everywhere in spark_ec2.py

2015-04-06 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6636. --- Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 Issue resolved by pull request

[jira] [Resolved] (SPARK-6716) Change SparkContext.DRIVER_IDENTIFIER from '' to 'driver'

2015-04-06 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6716. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5372 [https://github.com/

[jira] [Commented] (SPARK-6691) Abstract and add a dynamic RateLimiter for Spark Streaming

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482685#comment-14482685 ] Apache Spark commented on SPARK-6691: - User 'jerryshao' has created a pull request for

[jira] [Assigned] (SPARK-6691) Abstract and add a dynamic RateLimiter for Spark Streaming

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6691: --- Assignee: (was: Apache Spark) > Abstract and add a dynamic RateLimiter for Spark Streamin

[jira] [Assigned] (SPARK-6691) Abstract and add a dynamic RateLimiter for Spark Streaming

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6691: --- Assignee: Apache Spark > Abstract and add a dynamic RateLimiter for Spark Streaming > ---

[jira] [Created] (SPARK-6735) Provide options to make maximum executor failure count ( which kills the application ) relative to a window duration or disable it.

2015-04-06 Thread Twinkle Sachdeva (JIRA)
Twinkle Sachdeva created SPARK-6735: --- Summary: Provide options to make maximum executor failure count ( which kills the application ) relative to a window duration or disable it. Key: SPARK-6735 URL: https://iss

[jira] [Assigned] (SPARK-6733) Suppression of usage of Scala existential code should be done

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6733: --- Assignee: Apache Spark > Suppression of usage of Scala existential code should be done >

[jira] [Assigned] (SPARK-6733) Suppression of usage of Scala existential code should be done

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6733: --- Assignee: (was: Apache Spark) > Suppression of usage of Scala existential code should be

[jira] [Commented] (SPARK-6733) Suppression of usage of Scala existential code should be done

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482630#comment-14482630 ] Apache Spark commented on SPARK-6733: - User 'vinodkc' has created a pull request for t

[jira] [Assigned] (SPARK-6734) Support GenericUDTF.close for Generate

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6734: --- Assignee: (was: Apache Spark) > Support GenericUDTF.close for Generate >

[jira] [Assigned] (SPARK-6734) Support GenericUDTF.close for Generate

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6734: --- Assignee: Apache Spark > Support GenericUDTF.close for Generate > ---

[jira] [Commented] (SPARK-6734) Support GenericUDTF.close for Generate

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482567#comment-14482567 ] Apache Spark commented on SPARK-6734: - User 'chenghao-intel' has created a pull reques

[jira] [Created] (SPARK-6734) Support GenericUDTF.close for Generate

2015-04-06 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-6734: Summary: Support GenericUDTF.close for Generate Key: SPARK-6734 URL: https://issues.apache.org/jira/browse/SPARK-6734 Project: Spark Issue Type: Bug Compon

[jira] [Created] (SPARK-6733) Suppression of usage of Scala existential code should be done

2015-04-06 Thread Raymond Tay (JIRA)
Raymond Tay created SPARK-6733: -- Summary: Suppression of usage of Scala existential code should be done Key: SPARK-6733 URL: https://issues.apache.org/jira/browse/SPARK-6733 Project: Spark Issu

[jira] [Created] (SPARK-6732) Scala existentials warning during compilation

2015-04-06 Thread Raymond Tay (JIRA)
Raymond Tay created SPARK-6732: -- Summary: Scala existentials warning during compilation Key: SPARK-6732 URL: https://issues.apache.org/jira/browse/SPARK-6732 Project: Spark Issue Type: Improveme

[jira] [Assigned] (SPARK-6343) Make doc more explicit regarding network connectivity requirements

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6343: --- Assignee: Apache Spark > Make doc more explicit regarding network connectivity requirements >

[jira] [Assigned] (SPARK-6343) Make doc more explicit regarding network connectivity requirements

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6343: --- Assignee: (was: Apache Spark) > Make doc more explicit regarding network connectivity req

[jira] [Commented] (SPARK-6343) Make doc more explicit regarding network connectivity requirements

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482496#comment-14482496 ] Apache Spark commented on SPARK-6343: - User 'parente' has created a pull request for t

[jira] [Commented] (SPARK-6700) flaky test: run Python application in yarn-cluster mode

2015-04-06 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482479#comment-14482479 ] Lianhui Wang commented on SPARK-6700: - I used hadoop2.3.0 to test and that is OK.

[jira] [Commented] (SPARK-6731) Upgrade Apache commons-math3 to 3.4.1

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482460#comment-14482460 ] Apache Spark commented on SPARK-6731: - User 'punya' has created a pull request for thi

[jira] [Assigned] (SPARK-6731) Upgrade Apache commons-math3 to 3.4.1

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6731: --- Assignee: (was: Apache Spark) > Upgrade Apache commons-math3 to 3.4.1 > -

[jira] [Assigned] (SPARK-6731) Upgrade Apache commons-math3 to 3.4.1

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6731: --- Assignee: Apache Spark > Upgrade Apache commons-math3 to 3.4.1 >

[jira] [Created] (SPARK-6731) Upgrade Apache commons-math3 to 3.4.1

2015-04-06 Thread Punya Biswal (JIRA)
Punya Biswal created SPARK-6731: --- Summary: Upgrade Apache commons-math3 to 3.4.1 Key: SPARK-6731 URL: https://issues.apache.org/jira/browse/SPARK-6731 Project: Spark Issue Type: Dependency upgr

[jira] [Commented] (SPARK-6514) For Kinesis Streaming, use the same region for DynamoDB (KCL checkpoints) as the Kinesis stream itself

2015-04-06 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482455#comment-14482455 ] Chris Fregly commented on SPARK-6514: - we may want to inspect the streamURL for the re

[jira] [Created] (SPARK-6730) Can't have table as identifier in OPTIONS

2015-04-06 Thread Alex Liu (JIRA)
Alex Liu created SPARK-6730: --- Summary: Can't have table as identifier in OPTIONS Key: SPARK-6730 URL: https://issues.apache.org/jira/browse/SPARK-6730 Project: Spark Issue Type: Bug Compo

[jira] [Updated] (SPARK-6730) Can't have table as identifier in OPTIONS

2015-04-06 Thread Alex Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Liu updated SPARK-6730: Description: The following query fails because there is an identifier "table" in OPTIONS {code} CREATE TEM

[jira] [Commented] (SPARK-6506) python support yarn cluster mode requires SPARK_HOME to be set

2015-04-06 Thread Kostas Sakellis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482414#comment-14482414 ] Kostas Sakellis commented on SPARK-6506: I ran into this issue too by running: bq.

[jira] [Updated] (SPARK-6514) For Kinesis Streaming, use the same region for DynamoDB (KCL checkpoints) as the Kinesis stream itself

2015-04-06 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Fregly updated SPARK-6514: Description: context: i started the original Kinesis impl with KCL 1.0 (not supported), then finis

[jira] [Updated] (SPARK-6514) For Kinesis Streaming, use the same region for DynamoDB (KCL checkpoints) as the Kinesis stream itself

2015-04-06 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Fregly updated SPARK-6514: Description: context: i started the original Kinesis impl with KCL 1.0 (not supported), then finis

[jira] [Updated] (SPARK-6599) Improve reliability and usability of Kinesis-based Spark Streaming

2015-04-06 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Fregly updated SPARK-6599: Summary: Improve reliability and usability of Kinesis-based Spark Streaming (was: Add Kinesis Direc

[jira] [Updated] (SPARK-2960) Spark executables fail to start via symlinks

2015-04-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-2960: - Component/s: Deploy > Spark executables fail to start via symlinks > -

[jira] [Updated] (SPARK-6514) For Kinesis Streaming, use the same region for DynamoDB (KCL checkpoints) as the Kinesis stream itself

2015-04-06 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Fregly updated SPARK-6514: Target Version/s: 1.4.0 (was: 1.3.1) > For Kinesis Streaming, use the same region for DynamoDB (KCL

[jira] [Commented] (SPARK-6721) IllegalStateException

2015-04-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482367#comment-14482367 ] Sean Owen commented on SPARK-6721: -- (Also "IllegalStateException" isn't a useful JIRA nam

[jira] [Commented] (SPARK-6721) IllegalStateException

2015-04-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482366#comment-14482366 ] Sean Owen commented on SPARK-6721: -- Isn't this an error / config problem in Mongo rather

[jira] [Updated] (SPARK-6729) DriverQuirks get can get OutOfBounds exception is some cases

2015-04-06 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron Davidson updated SPARK-6729: -- Assignee: Volodymyr Lyubinets > DriverQuirks get can get OutOfBounds exception is some cases > -

[jira] [Resolved] (SPARK-6729) DriverQuirks get can get OutOfBounds exception is some cases

2015-04-06 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron Davidson resolved SPARK-6729. --- Resolution: Fixed Fix Version/s: 1.4.0 > DriverQuirks get can get OutOfBounds exception

[jira] [Commented] (SPARK-5281) Registering table on RDD is giving MissingRequirementError

2015-04-06 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482329#comment-14482329 ] Michael Armbrust commented on SPARK-5281: - I'll add that this is the trick we use

[jira] [Commented] (SPARK-3219) K-Means clusterer should support Bregman distance functions

2015-04-06 Thread Sai Nishanth Parepally (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482297#comment-14482297 ] Sai Nishanth Parepally commented on SPARK-3219: --- [~mengxr], is https://githu

[jira] [Commented] (SPARK-5281) Registering table on RDD is giving MissingRequirementError

2015-04-06 Thread William Benton (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482291#comment-14482291 ] William Benton commented on SPARK-5281: --- As [~marmbrus] recently pointed out on the

[jira] [Commented] (SPARK-5281) Registering table on RDD is giving MissingRequirementError

2015-04-06 Thread Patrick Walsh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482244#comment-14482244 ] Patrick Walsh commented on SPARK-5281: -- I also have this issue with spark 1.3.0. Eve

[jira] [Assigned] (SPARK-6729) DriverQuirks get can get OutOfBounds exception is some cases

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6729: --- Assignee: Apache Spark > DriverQuirks get can get OutOfBounds exception is some cases > -

[jira] [Commented] (SPARK-6729) DriverQuirks get can get OutOfBounds exception is some cases

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482193#comment-14482193 ] Apache Spark commented on SPARK-6729: - User 'vlyubin' has created a pull request for t

[jira] [Assigned] (SPARK-6729) DriverQuirks get can get OutOfBounds exception is some cases

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6729: --- Assignee: (was: Apache Spark) > DriverQuirks get can get OutOfBounds exception is some ca

[jira] [Created] (SPARK-6729) DriverQuirks get can get OutOfBounds exception is some cases

2015-04-06 Thread Volodymyr Lyubinets (JIRA)
Volodymyr Lyubinets created SPARK-6729: -- Summary: DriverQuirks get can get OutOfBounds exception is some cases Key: SPARK-6729 URL: https://issues.apache.org/jira/browse/SPARK-6729 Project: Spark

[jira] [Commented] (SPARK-6229) Support SASL encryption in network/common module

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482157#comment-14482157 ] Apache Spark commented on SPARK-6229: - User 'vanzin' has created a pull request for th

[jira] [Assigned] (SPARK-6229) Support SASL encryption in network/common module

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6229: --- Assignee: Apache Spark > Support SASL encryption in network/common module > -

[jira] [Assigned] (SPARK-6229) Support SASL encryption in network/common module

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6229: --- Assignee: (was: Apache Spark) > Support SASL encryption in network/common module > --

[jira] [Updated] (SPARK-6728) Improve performance of py4j for large bytearray

2015-04-06 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6728: Affects Version/s: 1.3.0 > Improve performance of py4j for large bytearray > ---

[jira] [Updated] (SPARK-6728) Improve performance of py4j for large bytearray

2015-04-06 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6728: Priority: Critical (was: Major) Target Version/s: 1.4.0 > Improve performance of py4j for large

[jira] [Created] (SPARK-6728) Improve performance of py4j for large bytearray

2015-04-06 Thread Davies Liu (JIRA)
Davies Liu created SPARK-6728: - Summary: Improve performance of py4j for large bytearray Key: SPARK-6728 URL: https://issues.apache.org/jira/browse/SPARK-6728 Project: Spark Issue Type: Improveme

[jira] [Commented] (SPARK-6710) Wrong initial bias in GraphX SVDPlusPlus

2015-04-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482063#comment-14482063 ] Reynold Xin commented on SPARK-6710: [~michaelmalak] would you like to submit a pull r

[jira] [Commented] (SPARK-6704) integrate SparkR docs build tool into Spark doc build

2015-04-06 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481972#comment-14481972 ] Davies Liu commented on SPARK-6704: --- Great, thanks! > integrate SparkR docs build tool

[jira] [Created] (SPARK-6727) Model export/import for spark.ml: HashingTF

2015-04-06 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6727: Summary: Model export/import for spark.ml: HashingTF Key: SPARK-6727 URL: https://issues.apache.org/jira/browse/SPARK-6727 Project: Spark Issue Type:

[jira] [Created] (SPARK-6726) Model export/import for spark.ml: LogisticRegression

2015-04-06 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6726: Summary: Model export/import for spark.ml: LogisticRegression Key: SPARK-6726 URL: https://issues.apache.org/jira/browse/SPARK-6726 Project: Spark Is

[jira] [Created] (SPARK-6725) Model export/import for Pipeline API

2015-04-06 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6725: Summary: Model export/import for Pipeline API Key: SPARK-6725 URL: https://issues.apache.org/jira/browse/SPARK-6725 Project: Spark Issue Type: New Fe

[jira] [Created] (SPARK-6724) Model import/export for FPGrowth

2015-04-06 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6724: Summary: Model import/export for FPGrowth Key: SPARK-6724 URL: https://issues.apache.org/jira/browse/SPARK-6724 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-6723) Model import/export for ChiSqSelector

2015-04-06 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6723: Summary: Model import/export for ChiSqSelector Key: SPARK-6723 URL: https://issues.apache.org/jira/browse/SPARK-6723 Project: Spark Issue Type: Sub-t

[jira] [Created] (SPARK-6722) Model import/export for StreamingKMeansModel

2015-04-06 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6722: Summary: Model import/export for StreamingKMeansModel Key: SPARK-6722 URL: https://issues.apache.org/jira/browse/SPARK-6722 Project: Spark Issue Type

[jira] [Commented] (SPARK-5988) Model import/export for PowerIterationClusteringModel

2015-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481891#comment-14481891 ] Joseph K. Bradley commented on SPARK-5988: -- Feel free to go ahead! I just assign

[jira] [Updated] (SPARK-5988) Model import/export for PowerIterationClusteringModel

2015-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5988: - Assignee: Xusen Yin > Model import/export for PowerIterationClusteringModel >

[jira] [Closed] (SPARK-6718) Improve the test on normL1/normL2 of summary statistics

2015-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-6718. Resolution: Duplicate > Improve the test on normL1/normL2 of summary statistics > --

[jira] [Updated] (SPARK-6720) PySpark MultivariateStatisticalSummary unit test for normL1 and normL2

2015-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6720: - Component/s: PySpark > PySpark MultivariateStatisticalSummary unit test for normL1 and nor

[jira] [Updated] (SPARK-6720) PySpark MultivariateStatisticalSummary unit test for normL1 and normL2

2015-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6720: - Assignee: Kai Sasaki > PySpark MultivariateStatisticalSummary unit test for normL1 and nor

[jira] [Updated] (SPARK-6720) PySpark MultivariateStatisticalSummary unit test for normL1 and normL2

2015-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6720: - Target Version/s: 1.4.0 Fix Version/s: (was: 1.4.0) > PySpark MultivariateStati

[jira] [Updated] (SPARK-6720) PySpark MultivariateStatisticalSummary unit test for normL1 and normL2

2015-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6720: - Affects Version/s: (was: 1.3.0) 1.4.0 > PySpark MultivariateSta

[jira] [Updated] (SPARK-6720) PySpark MultivariateStatisticalSummary unit test for normL1 and normL2

2015-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6720: - Issue Type: Improvement (was: Bug) > PySpark MultivariateStatisticalSummary unit test for

[jira] [Commented] (SPARK-6407) Streaming ALS for Collaborative Filtering

2015-04-06 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481874#comment-14481874 ] Burak Yavuz commented on SPARK-6407: I actually worked on this over the weekend for fu

[jira] [Updated] (SPARK-6713) Iterators in columnSimilarities to allow flatMap spill

2015-04-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6713: - Assignee: Reza Zadeh > Iterators in columnSimilarities to allow flatMap spill > --

[jira] [Resolved] (SPARK-6713) Iterators in columnSimilarities to allow flatMap spill

2015-04-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6713. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5364 [https://githu

[jira] [Closed] (SPARK-6711) Support parallelized online matrix factorization for Collaborative Filtering

2015-04-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-6711. Resolution: Duplicate > Support parallelized online matrix factorization for Collaborative Filtering

[jira] [Commented] (SPARK-6407) Streaming ALS for Collaborative Filtering

2015-04-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481711#comment-14481711 ] Xiangrui Meng commented on SPARK-6407: -- Attached the comment from Chunnan Yao in SPAR

[jira] [Commented] (SPARK-6606) Accumulator deserialized twice because the NarrowCoGroupSplitDep contains rdd object.

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481639#comment-14481639 ] Apache Spark commented on SPARK-6606: - User 'kayousterhout' has created a pull request

[jira] [Updated] (SPARK-6692) Add an option for client to kill AM when it is killed

2015-04-06 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated SPARK-6692: - Summary: Add an option for client to kill AM when it is killed (was: Make it possible to kill AM

[jira] [Updated] (SPARK-6222) [STREAMING] All data may not be recovered from WAL when driver is killed

2015-04-06 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6222: --- Fix Version/s: 1.4.0 1.3.1 > [STREAMING] All data may not be recovered from

[jira] [Commented] (SPARK-6700) flaky test: run Python application in yarn-cluster mode

2015-04-06 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481534#comment-14481534 ] Davies Liu commented on SPARK-6700: --- There is one failure here: https://amplab.cs.berke

[jira] [Updated] (SPARK-6721) IllegalStateException

2015-04-06 Thread Luis Rodríguez Trejo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luis Rodríguez Trejo updated SPARK-6721: Description: I get the following exception when using saveAsNewAPIHadoopFile: {code}

[jira] [Created] (SPARK-6721) IllegalStateException

2015-04-06 Thread Luis Rodríguez Trejo (JIRA)
Luis Rodríguez Trejo created SPARK-6721: --- Summary: IllegalStateException Key: SPARK-6721 URL: https://issues.apache.org/jira/browse/SPARK-6721 Project: Spark Issue Type: Bug C

[jira] [Commented] (SPARK-3702) Standardize MLlib classes for learners, models

2015-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481464#comment-14481464 ] Joseph K. Bradley commented on SPARK-3702: -- Using Vector types is better since th

[jira] [Commented] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-04-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481455#comment-14481455 ] Joseph K. Bradley commented on SPARK-6682: -- As you're suggesting, a wrapper mecha

[jira] [Commented] (SPARK-6577) SparseMatrix should be supported in PySpark

2015-04-06 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481395#comment-14481395 ] Manoj Kumar commented on SPARK-6577: Let us please take the discussion to the Pull Req

[jira] [Updated] (SPARK-5261) In some cases ,The value of word's vector representation is too big

2015-04-06 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-5261: --- Description: Get data: {code:none} normalize_text() { awk '{print tolower($0);}' | sed -e "s/’/'/g"

[jira] [Updated] (SPARK-5261) In some cases ,The value of word's vector representation is too big

2015-04-06 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-5261: --- Description: Get data: {code:none} normalize_text() { awk '{print tolower($0);}' | sed -e "s/’/'/g"

[jira] [Commented] (SPARK-5261) In some cases ,The value of word's vector representation is too big

2015-04-06 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481378#comment-14481378 ] Guoqiang Li commented on SPARK-5261: I'm sorry, the after one 's mincount is 100 >

[jira] [Updated] (SPARK-5261) In some cases ,The value of word's vector representation is too big

2015-04-06 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-5261: --- Description: Get data: {code:none} normalize_text() { awk '{print tolower($0);}' | sed -e "s/’/'/g"

[jira] [Comment Edited] (SPARK-3702) Standardize MLlib classes for learners, models

2015-04-06 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481342#comment-14481342 ] Peter Rudenko edited comment on SPARK-3702 at 4/6/15 4:06 PM: --

[jira] [Commented] (SPARK-3702) Standardize MLlib classes for learners, models

2015-04-06 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481342#comment-14481342 ] Peter Rudenko commented on SPARK-3702: -- For trees based algorithms curious whether th

[jira] [Reopened] (SPARK-2960) Spark executables fail to start via symlinks

2015-04-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-2960: -- Not sure what happened there -- probably my fault in any event -- but this one is duplicated, rather than i

[jira] [Updated] (SPARK-6205) UISeleniumSuite fails for Hadoop 2.x test with NoClassDefFoundError

2015-04-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6205: - Fix Version/s: 1.3.2 > UISeleniumSuite fails for Hadoop 2.x test with NoClassDefFoundError > -

[jira] [Commented] (SPARK-6431) Couldn't find leader offsets exception when creating KafkaDirectStream

2015-04-06 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481266#comment-14481266 ] Cody Koeninger commented on SPARK-6431: --- I think this got mis-diagnosed on the maili

[jira] [Commented] (SPARK-2960) Spark executables fail to start via symlinks

2015-04-06 Thread Danil Mironov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481228#comment-14481228 ] Danil Mironov commented on SPARK-2960: -- This now formed a loop of three tickets (SPAR

[jira] [Assigned] (SPARK-2991) RDD transforms for scan and scanLeft

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2991: --- Assignee: Erik Erlandson (was: Apache Spark) > RDD transforms for scan and scanLeft > -

[jira] [Assigned] (SPARK-2991) RDD transforms for scan and scanLeft

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2991: --- Assignee: Apache Spark (was: Erik Erlandson) > RDD transforms for scan and scanLeft > -

[jira] [Assigned] (SPARK-6720) PySpark MultivariateStatisticalSummary unit test for normL1 and normL2

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6720: --- Assignee: (was: Apache Spark) > PySpark MultivariateStatisticalSummary unit test for norm

[jira] [Commented] (SPARK-6720) PySpark MultivariateStatisticalSummary unit test for normL1 and normL2

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481176#comment-14481176 ] Apache Spark commented on SPARK-6720: - User 'Lewuathe' has created a pull request for

[jira] [Assigned] (SPARK-6720) PySpark MultivariateStatisticalSummary unit test for normL1 and normL2

2015-04-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6720: --- Assignee: Apache Spark > PySpark MultivariateStatisticalSummary unit test for normL1 and norm

[jira] [Created] (SPARK-6720) PySpark MultivariateStatisticalSummary unit test for normL1 and normL2

2015-04-06 Thread Kai Sasaki (JIRA)
Kai Sasaki created SPARK-6720: - Summary: PySpark MultivariateStatisticalSummary unit test for normL1 and normL2 Key: SPARK-6720 URL: https://issues.apache.org/jira/browse/SPARK-6720 Project: Spark

[jira] [Commented] (SPARK-5261) In some cases ,The value of word's vector representation is too big

2015-04-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481132#comment-14481132 ] Sean Owen commented on SPARK-5261: -- In the new code you pasted, I don't see a difference

[jira] [Resolved] (SPARK-6687) In the hadoop 0.23 profile, hadoop pulls in an older version of netty which conflicts with akka's netty

2015-04-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6687. -- Resolution: Not A Problem I'm not sure what the problem is here, so closing until there's any follow up.

[jira] [Resolved] (SPARK-6630) SparkConf.setIfMissing should only evaluate the assigned value if indeed missing

2015-04-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6630. -- Resolution: Won't Fix Idea was good, just probably can't be reconciled with binary compatibility at thi
