[jira] [Commented] (SPARK-7327) DataFrame show() method doesn't like empty dataframes

2015-05-14 Thread Akhil Thatipamula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543629#comment-14543629 ] Akhil Thatipamula commented on SPARK-7327: -- @Oliver I have checked, but i haven't

[jira] [Assigned] (SPARK-7634) [SQL] thrift server UI optimization

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7634: --- Assignee: Apache Spark [SQL] thrift server UI optimization

[jira] [Created] (SPARK-7634) [SQL] thrift server UI optimization

2015-05-14 Thread Gankun Luo (JIRA)
Gankun Luo created SPARK-7634: - Summary: [SQL] thrift server UI optimization Key: SPARK-7634 URL: https://issues.apache.org/jira/browse/SPARK-7634 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5960) Allow AWS credentials to be passed to KinesisUtils.createStream()

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543586#comment-14543586 ] Apache Spark commented on SPARK-5960: - User 'tdas' has created a pull request for this

[jira] [Commented] (SPARK-6656) Allow the application name to be passed in versus pulling from SparkContext.getAppName()

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543587#comment-14543587 ] Apache Spark commented on SPARK-6656: - User 'tdas' has created a pull request for this

[jira] [Commented] (SPARK-6514) For Kinesis Streaming, use the same region for DynamoDB (KCL checkpoints) as the Kinesis stream itself

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543585#comment-14543585 ] Apache Spark commented on SPARK-6514: - User 'tdas' has created a pull request for this

[jira] [Commented] (SPARK-6743) Join with empty projection on one side produces invalid results

2015-05-14 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543602#comment-14543602 ] Santiago M. Mola commented on SPARK-6743: - This problem only happens for cached

[jira] [Commented] (SPARK-7635) SparkContextSchedulerCreationSuite tests may fail due to unrecognized UnsatisfiedLinkError message.

2015-05-14 Thread Tim Ellison (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543669#comment-14543669 ] Tim Ellison commented on SPARK-7635: Yep, a happy coincidence! Full disclosure: Matt

[jira] [Resolved] (SPARK-7249) Updated Hadoop dependencies due to inconsistency in the versions

2015-05-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7249. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5786

[jira] [Commented] (SPARK-7226) Support math functions in R DataFrame

2015-05-14 Thread Qian Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543605#comment-14543605 ] Qian Huang commented on SPARK-7226: --- start working on this Support math functions in R

[jira] [Created] (SPARK-7635) SparkContextSchedulerCreationSuite tests may fail due to unrecognized UnsatisfiedLinkError message.

2015-05-14 Thread Matthew Brandyberry (JIRA)
Matthew Brandyberry created SPARK-7635: -- Summary: SparkContextSchedulerCreationSuite tests may fail due to unrecognized UnsatisfiedLinkError message. Key: SPARK-7635 URL:

[jira] [Resolved] (SPARK-7635) SparkContextSchedulerCreationSuite tests may fail due to unrecognized UnsatisfiedLinkError message.

2015-05-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7635. -- Resolution: Fixed Fix Version/s: 1.5.0 Assignee: Tim Ellison I think that was literally

[jira] [Commented] (SPARK-7634) [SQL] thrift server UI optimization

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543630#comment-14543630 ] Apache Spark commented on SPARK-7634: - User 'luogankun' has created a pull request for

[jira] [Assigned] (SPARK-7634) [SQL] thrift server UI optimization

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7634: --- Assignee: (was: Apache Spark) [SQL] thrift server UI optimization

[jira] [Updated] (SPARK-7249) Updated Hadoop dependencies due to inconsistency in the versions

2015-05-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7249: - Assignee: Favio Vázquez Updated Hadoop dependencies due to inconsistency in the versions

[jira] [Assigned] (SPARK-7624) Task scheduler delay is increasing time over time in spark local mode

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7624: --- Assignee: Apache Spark (was: Davies Liu) Task scheduler delay is increasing time over time

[jira] [Assigned] (SPARK-7624) Task scheduler delay is increasing time over time in spark local mode

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7624: --- Assignee: Davies Liu (was: Apache Spark) Task scheduler delay is increasing time over time

[jira] [Commented] (SPARK-7624) Task scheduler delay is increasing time over time in spark local mode

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544962#comment-14544962 ] Apache Spark commented on SPARK-7624: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-7624) Task scheduler delay is increasing time over time in spark local mode

2015-05-14 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544978#comment-14544978 ] Davies Liu commented on SPARK-7624: --- In the context of Spark Streaming, there could be

[jira] [Commented] (SPARK-6846) Stage kill URL easy to accidentally trigger and possibility for security issue.

2015-05-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543980#comment-14543980 ] Sean Owen commented on SPARK-6846: -- Since the kill endpoint does a 302 redirect to the

[jira] [Commented] (SPARK-4128) Create instructions on fully building Spark in Intellij

2015-05-14 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544118#comment-14544118 ] Patrick Wendell commented on SPARK-4128: Thanks for bringing this back up

[jira] [Updated] (SPARK-7455) Perf test for LDA (EM/online)

2015-05-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7455: - Assignee: yuhao yang Perf test for LDA (EM/online) -

[jira] [Commented] (SPARK-7642) Missing 1 worker on standalone clusters.

2015-05-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544196#comment-14544196 ] Xiangrui Meng commented on SPARK-7642: -- It seems that I'm using an old slave node,

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544051#comment-14544051 ] Sean Owen commented on SPARK-7640: -- https://issues.apache.org/jira/browse/SPARK-6220

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544077#comment-14544077 ] Brad Willard commented on SPARK-7640: - I'm happy to try, do you know specifically

[jira] [Updated] (SPARK-7541) Check model save/load for MLlib 1.4

2015-05-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7541: - Description: For each model which supports save/load methods, we need to verify: * These

[jira] [Updated] (SPARK-7537) Audit new public Scala APIs for MLlib 1.4

2015-05-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7537: - Description: Audit new public Scala APIs added to MLlib in 1.4. Take note of: *

[jira] [Created] (SPARK-7643) Number of executors and partitions are displayed wrongly in storage tab

2015-05-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7643: Summary: Number of executors and partitions are displayed wrongly in storage tab Key: SPARK-7643 URL: https://issues.apache.org/jira/browse/SPARK-7643 Project: Spark

[jira] [Created] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Brad Willard (JIRA)
Brad Willard created SPARK-7640: --- Summary: Private VPC with default Spark AMI breaks yum Key: SPARK-7640 URL: https://issues.apache.org/jira/browse/SPARK-7640 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-7641) Add subsampling of frequent words for Word2Vec

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7641: --- Assignee: Apache Spark Add subsampling of frequent words for Word2Vec

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544111#comment-14544111 ] Shivaram Venkataraman commented on SPARK-7640: -- Unfortunately I've never

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7536: - Description: For new public APIs added to MLlib, we need to check the generated HTML doc

[jira] [Updated] (SPARK-7537) Audit new public Scala APIs for MLlib 1.4

2015-05-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7537: - Description: Audit new public Scala APIs added to MLlib in 1.4. Take note of: *

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544169#comment-14544169 ] Nicholas Chammas commented on SPARK-7640: - {quote} Switch everything to support

[jira] [Assigned] (SPARK-7642) Missing 1 worker on standalone clusters.

2015-05-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-7642: Assignee: Xiangrui Meng Missing 1 worker on standalone clusters.

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544066#comment-14544066 ] Brad Willard commented on SPARK-7640: - I manually implemented the first one just to

[jira] [Created] (SPARK-7641) Add subsampling of frequent words for Word2Vec

2015-05-14 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-7641: -- Summary: Add subsampling of frequent words for Word2Vec Key: SPARK-7641 URL: https://issues.apache.org/jira/browse/SPARK-7641 Project: Spark Issue Type:

[jira] [Commented] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544135#comment-14544135 ] Joseph K. Bradley commented on SPARK-7536: -- [~yanboliang] Could you please add

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544136#comment-14544136 ] Brad Willard commented on SPARK-7640: - I think this might be working I disabled the

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7536: - Description: For new public APIs added to MLlib, we need to check the generated HTML doc

[jira] [Commented] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544152#comment-14544152 ] Joseph K. Bradley commented on SPARK-7536: -- [~yanboliang] I realized I forgot to

[jira] [Updated] (SPARK-7643) Number of executors and partitions are displayed wrongly in storage tab

2015-05-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7643: - Attachment: 1.4 data distribution.png Number of executors and partitions are displayed wrongly

[jira] [Assigned] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7591: --- Assignee: Cheng Lian (was: Apache Spark) FSBasedRelation interface tweaks

[jira] [Commented] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544009#comment-14544009 ] Apache Spark commented on SPARK-7591: - User 'liancheng' has created a pull request for

[jira] [Assigned] (SPARK-7591) FSBasedRelation interface tweaks

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7591: --- Assignee: Apache Spark (was: Cheng Lian) FSBasedRelation interface tweaks

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544072#comment-14544072 ] Shivaram Venkataraman commented on SPARK-7640: -- Does it work if you add

[jira] [Updated] (SPARK-7642) Missing 1 worker on standalone clusters.

2015-05-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7642: - Attachment: 1.4 data distribution.png 1.3 data distribution.png Attached the

[jira] [Updated] (SPARK-7643) Number of executors and partitions are displayed wrongly in storage tab

2015-05-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7643: - Description: Saw this in the storage tab of an RDD on a 1.4 cluster. An RDD is distributed among

[jira] [Updated] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brad Willard updated SPARK-7640: Description: If you create a spark cluster in a private vpc, the amazon yum repos return 403

[jira] [Assigned] (SPARK-7641) Add subsampling of frequent words for Word2Vec

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7641: --- Assignee: (was: Apache Spark) Add subsampling of frequent words for Word2Vec

[jira] [Commented] (SPARK-7641) Add subsampling of frequent words for Word2Vec

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544081#comment-14544081 ] Apache Spark commented on SPARK-7641: - User 'viirya' has created a pull request for

[jira] [Commented] (SPARK-6444) SQL functions (either built-in or UDF) should check for data types of their arguments

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544137#comment-14544137 ] Apache Spark commented on SPARK-6444: - User 'cloud-fan' has created a pull request for

[jira] [Assigned] (SPARK-6444) SQL functions (either built-in or UDF) should check for data types of their arguments

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6444: --- Assignee: Apache Spark SQL functions (either built-in or UDF) should check for data types

[jira] [Assigned] (SPARK-6444) SQL functions (either built-in or UDF) should check for data types of their arguments

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6444: --- Assignee: (was: Apache Spark) SQL functions (either built-in or UDF) should check for

[jira] [Created] (SPARK-7642) Missing 1 worker on standalone clusters.

2015-05-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7642: Summary: Missing 1 worker on standalone clusters. Key: SPARK-7642 URL: https://issues.apache.org/jira/browse/SPARK-7642 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-7644) Ensure all scoped RDD operations are tested and cleaned

2015-05-14 Thread Andrew Or (JIRA)
Andrew Or created SPARK-7644: Summary: Ensure all scoped RDD operations are tested and cleaned Key: SPARK-7644 URL: https://issues.apache.org/jira/browse/SPARK-7644 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-6154) Support Kafka, JDBC in Scala 2.11

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6154: --- Assignee: (was: Apache Spark) Support Kafka, JDBC in Scala 2.11

[jira] [Assigned] (SPARK-6154) Support Kafka, JDBC in Scala 2.11

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6154: --- Assignee: Apache Spark Support Kafka, JDBC in Scala 2.11 -

[jira] [Updated] (SPARK-7637) StructType.merge slow with large nenormalised tables O(N2)

2015-05-14 Thread Rowan Chattaway (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rowan Chattaway updated SPARK-7637: --- Description: StructType.merge does a linear scan through the left schema and for each

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Commented] (SPARK-6846) Stage kill URL easy to accidentally trigger and possibility for security issue.

2015-05-14 Thread Dev Lakhani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543970#comment-14543970 ] Dev Lakhani commented on SPARK-6846: As a more complex solution would it be possible

[jira] [Created] (SPARK-7636) Significant performance regression with GradientDescent in 1.4

2015-05-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7636: Summary: Significant performance regression with GradientDescent in 1.4 Key: SPARK-7636 URL: https://issues.apache.org/jira/browse/SPARK-7636 Project: Spark

[jira] [Created] (SPARK-7639) Add Python API for Statistics.kernelDensity

2015-05-14 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-7639: -- Summary: Add Python API for Statistics.kernelDensity Key: SPARK-7639 URL: https://issues.apache.org/jira/browse/SPARK-7639 Project: Spark Issue Type: New

[jira] [Created] (SPARK-7637) StructType.merge slow with large nenormalised tables O(N2)

2015-05-14 Thread Rowan Chattaway (JIRA)
Rowan Chattaway created SPARK-7637: -- Summary: StructType.merge slow with large nenormalised tables O(N2) Key: SPARK-7637 URL: https://issues.apache.org/jira/browse/SPARK-7637 Project: Spark

[jira] [Comment Edited] (SPARK-7637) StructType.merge slow with large nenormalised tables O(N2)

2015-05-14 Thread Rowan Chattaway (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543770#comment-14543770 ] Rowan Chattaway edited comment on SPARK-7637 at 5/14/15 3:07 PM:

[jira] [Commented] (SPARK-6154) Support Kafka, JDBC in Scala 2.11

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543831#comment-14543831 ] Apache Spark commented on SPARK-6154: - User 'dragos' has created a pull request for

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Commented] (SPARK-7615) MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero

2015-05-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543981#comment-14543981 ] Sean Owen commented on SPARK-7615: -- Yes the submitter is about to open another PR, as I

[jira] [Created] (SPARK-7638) Python API for pmml.export

2015-05-14 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-7638: -- Summary: Python API for pmml.export Key: SPARK-7638 URL: https://issues.apache.org/jira/browse/SPARK-7638 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-7615) MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero

2015-05-14 Thread Angel Martinez Gonzalez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543977#comment-14543977 ] Angel Martinez Gonzalez commented on SPARK-7615: Hi, is any body working

[jira] [Updated] (SPARK-7637) StructType.merge slow with large nenormalised tables O(N2)

2015-05-14 Thread Rowan Chattaway (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rowan Chattaway updated SPARK-7637: --- Shepherd: Rowan Chattaway StructType.merge slow with large nenormalised tables O(N2)

[jira] [Commented] (SPARK-7637) StructType.merge slow with large nenormalised tables O(N2)

2015-05-14 Thread Rowan Chattaway (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543770#comment-14543770 ] Rowan Chattaway commented on SPARK-7637: I have made the changes and will submit a

[jira] [Updated] (SPARK-7536) Audit MLlib Python API for 1.4

2015-05-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-7536: --- Description: For new public APIs added to MLlib, we need to check the generated HTML doc and compare

[jira] [Assigned] (SPARK-7645) Show milliseconds in the UI if the batch interval 1 second

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7645: --- Assignee: Apache Spark Show milliseconds in the UI if the batch interval 1 second

[jira] [Assigned] (SPARK-7645) Show milliseconds in the UI if the batch interval 1 second

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7645: --- Assignee: (was: Apache Spark) Show milliseconds in the UI if the batch interval 1

[jira] [Commented] (SPARK-7645) Show milliseconds in the UI if the batch interval 1 second

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544245#comment-14544245 ] Apache Spark commented on SPARK-7645: - User 'zsxwing' has created a pull request for

[jira] [Created] (SPARK-7647) Additional methods in JavaModel wrappers

2015-05-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7647: Summary: Additional methods in JavaModel wrappers Key: SPARK-7647 URL: https://issues.apache.org/jira/browse/SPARK-7647 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544295#comment-14544295 ] Brad Willard commented on SPARK-7640: - sounds good. I'm going to try and make a custom

[jira] [Resolved] (SPARK-7297) Make timeline more discoverable

2015-05-14 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7297. Resolution: Fixed Make timeline more discoverable ---

[jira] [Updated] (SPARK-7647) Additional methods in GLM JavaModel wrappers

2015-05-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7647: - Summary: Additional methods in GLM JavaModel wrappers (was: Additional methods in JavaModel

[jira] [Commented] (SPARK-7648) Additional methods in ALS JavaModel wrappers

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544313#comment-14544313 ] Apache Spark commented on SPARK-7648: - User 'mengxr' has created a pull request for

[jira] [Assigned] (SPARK-7648) Additional methods in ALS JavaModel wrappers

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7648: --- Assignee: (was: Apache Spark) Additional methods in ALS JavaModel wrappers

[jira] [Assigned] (SPARK-7648) Additional methods in ALS JavaModel wrappers

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7648: --- Assignee: Apache Spark Additional methods in ALS JavaModel wrappers

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544251#comment-14544251 ] Brad Willard commented on SPARK-7640: - So installing python 27 was just an example to

[jira] [Commented] (SPARK-7563) OutputCommitCoordinator.stop() should only be executed in driver

2015-05-14 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544300#comment-14544300 ] Josh Rosen commented on SPARK-7563: --- Yep, looks like a pretty clear bug. I think that

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544242#comment-14544242 ] Brad Willard commented on SPARK-7640: - So the centos repo doesn't seem to actually

[jira] [Created] (SPARK-7645) Show milliseconds in the UI if the batch interval 1 second

2015-05-14 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-7645: --- Summary: Show milliseconds in the UI if the batch interval 1 second Key: SPARK-7645 URL: https://issues.apache.org/jira/browse/SPARK-7645 Project: Spark

[jira] [Closed] (SPARK-7642) Missing 1 worker on standalone clusters.

2015-05-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-7642. Resolution: Not A Problem I used launch more like this to increase the cluster size to 16. The

[jira] [Created] (SPARK-7646) Create table support to JDBC Datasource

2015-05-14 Thread Venkata Ramana G (JIRA)
Venkata Ramana G created SPARK-7646: --- Summary: Create table support to JDBC Datasource Key: SPARK-7646 URL: https://issues.apache.org/jira/browse/SPARK-7646 Project: Spark Issue Type:

[jira] [Commented] (SPARK-7563) OutputCommitCoordinator.stop() should only be executed in driver

2015-05-14 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544290#comment-14544290 ] Patrick Wendell commented on SPARK-7563: /cc [~joshrosen] I think this is caused

[jira] [Commented] (SPARK-7640) Private VPC with default Spark AMI breaks yum

2015-05-14 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544288#comment-14544288 ] Nicholas Chammas commented on SPARK-7640: - If there is no way around this (like,

[jira] [Created] (SPARK-7648) Additional methods in ALS JavaModel wrappers

2015-05-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7648: Summary: Additional methods in ALS JavaModel wrappers Key: SPARK-7648 URL: https://issues.apache.org/jira/browse/SPARK-7648 Project: Spark Issue Type: New

[jira] [Assigned] (SPARK-7646) Create table support to JDBC Datasource

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7646: --- Assignee: Apache Spark Create table support to JDBC Datasource

[jira] [Commented] (SPARK-7646) Create table support to JDBC Datasource

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544275#comment-14544275 ] Apache Spark commented on SPARK-7646: - User 'gvramana' has created a pull request for

[jira] [Assigned] (SPARK-7646) Create table support to JDBC Datasource

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7646: --- Assignee: (was: Apache Spark) Create table support to JDBC Datasource

[jira] [Commented] (SPARK-1529) Support setting spark.local.dirs to a hadoop FileSystem

2015-05-14 Thread Kannan Rajah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544563#comment-14544563 ] Kannan Rajah commented on SPARK-1529: - Just wanted to check if folks got a chance to

[jira] [Updated] (SPARK-7098) Inconsistent Timestamp behavior when used in WHERE clause

2015-05-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7098: --- Target Version/s: 1.4.0 Assignee: Liang-Chi Hsieh Inconsistent Timestamp behavior when

[jira] [Closed] (SPARK-7453) Perf test for Bernoulli naive Bayes

2015-05-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-7453. Resolution: Done Fix Version/s: 1.4.0 Merged PR

[jira] [Assigned] (SPARK-7649) Use window.localStorage to store the status rather than the url

2015-05-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7649: --- Assignee: Apache Spark Use window.localStorage to store the status rather than the url

  1   2   3   >