[jira] [Comment Edited] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-02-05 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14306688#comment-14306688 ] Manoj Kumar edited comment on SPARK-5016 at 2/5/15 8:09 AM:

[jira] [Commented] (SPARK-5604) Remove setCheckpointDir from LDA and tree Strategy

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14306810#comment-14306810 ] Apache Spark commented on SPARK-5604: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-5081) Shuffle write increases

2015-02-05 Thread Shekhar Bansal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308596#comment-14308596 ] Shekhar Bansal commented on SPARK-5081: --- I faced same problem, moving to lz4

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308598#comment-14308598 ] Andrew Or commented on SPARK-5388: -- [~tigerquoll] I still don't think we should use

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-02-05 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308615#comment-14308615 ] Guoqiang Li commented on SPARK-5556: LightLDA's computational complexity is O(1) The

[jira] [Commented] (SPARK-5598) Model import/export for ALS

2015-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308692#comment-14308692 ] Sean Owen commented on SPARK-5598: -- [~mengxr] No, no other tool could usefully read such

[jira] [Resolved] (SPARK-926) spark_ec2 script when ssh/scp-ing should pipe UserknowHostFile to /dev/null

2015-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-926. - Resolution: Duplicate Going to make this one the duplicate since SPARK-5403 has an active PR. spark_ec2

[jira] [Created] (SPARK-5647) Output metrics do not show up for older hadoop versions ( 2.5)

2015-02-05 Thread Kostas Sakellis (JIRA)
Kostas Sakellis created SPARK-5647: -- Summary: Output metrics do not show up for older hadoop versions ( 2.5) Key: SPARK-5647 URL: https://issues.apache.org/jira/browse/SPARK-5647 Project: Spark

[jira] [Commented] (SPARK-5563) LDA with online variational inference

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308584#comment-14308584 ] Apache Spark commented on SPARK-5563: - User 'hhbyyh' has created a pull request for

[jira] [Commented] (SPARK-4279) Implementing TinkerPop on top of GraphX

2015-02-05 Thread Jianshi Huang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308595#comment-14308595 ] Jianshi Huang commented on SPARK-4279: -- Anyone is working on this? Implementing

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-02-05 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308603#comment-14308603 ] Pedro Rodriguez commented on SPARK-5556: Posting here as a status update. I will

[jira] [Commented] (SPARK-4279) Implementing TinkerPop on top of GraphX

2015-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308651#comment-14308651 ] Sean Owen commented on SPARK-4279: -- This sounds like something that should live outside

[jira] [Resolved] (SPARK-5625) Spark binaries do not incude Spark Core

2015-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5625. -- Resolution: Not a Problem All of these distributions include an assembly JAR with the entire Spark

[jira] [Commented] (SPARK-5635) Allow users to run .scala files directly from spark-submit

2015-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308776#comment-14308776 ] Sean Owen commented on SPARK-5635: -- spark-shell uses spark-submit, and spark-shell is the

[jira] [Created] (SPARK-5645) Track local bytes read for shuffles - update UI

2015-02-05 Thread Kostas Sakellis (JIRA)
Kostas Sakellis created SPARK-5645: -- Summary: Track local bytes read for shuffles - update UI Key: SPARK-5645 URL: https://issues.apache.org/jira/browse/SPARK-5645 Project: Spark Issue

[jira] [Comment Edited] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308599#comment-14308599 ] Andrew Or edited comment on SPARK-5388 at 2/6/15 5:35 AM: -- By the

[jira] [Created] (SPARK-5643) Add a show method to print the content of a DataFrame in columnar format

2015-02-05 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5643: -- Summary: Add a show method to print the content of a DataFrame in columnar format Key: SPARK-5643 URL: https://issues.apache.org/jira/browse/SPARK-5643 Project: Spark

[jira] [Created] (SPARK-5644) Delete tmp dir when sc is stop

2015-02-05 Thread Weizhong (JIRA)
Weizhong created SPARK-5644: --- Summary: Delete tmp dir when sc is stop Key: SPARK-5644 URL: https://issues.apache.org/jira/browse/SPARK-5644 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-5031) ml.LogisticRegression score column should be renamed probability

2015-02-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5031. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3637

[jira] [Resolved] (SPARK-4942) ML Transformers should allow output cols to be turned on,off

2015-02-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4942. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3637

[jira] [Resolved] (SPARK-4789) Standardize ML Prediction APIs

2015-02-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4789. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3637

[jira] [Commented] (SPARK-5616) Add examples for PySpark API

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308555#comment-14308555 ] Apache Spark commented on SPARK-5616: - User 'lazyman500' has created a pull request

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-05 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308599#comment-14308599 ] Andrew Or commented on SPARK-5388: -- By the way for the more specific comments it would be

[jira] [Commented] (SPARK-5081) Shuffle write increases

2015-02-05 Thread Kevin Jung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308608#comment-14308608 ] Kevin Jung commented on SPARK-5081: --- Sorry, I will make an effort to provide another

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-05 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308635#comment-14308635 ] Patrick Wendell commented on SPARK-5388: I think it's reasonable to use DELETE per

[jira] [Updated] (SPARK-5563) LDA with online variational inference

2015-02-05 Thread Jason Dai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Dai updated SPARK-5563: - Assignee: yuhao yang LDA with online variational inference -

[jira] [Commented] (SPARK-5566) Tokenizer for mllib package

2015-02-05 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308733#comment-14308733 ] yuhao yang commented on SPARK-5566: --- I mean only the underlying implementation.

[jira] [Commented] (SPARK-5644) Delete tmp dir when sc is stop

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308719#comment-14308719 ] Apache Spark commented on SPARK-5644: - User 'Sephiroth-Lin' has created a pull request

[jira] [Commented] (SPARK-5081) Shuffle write increases

2015-02-05 Thread Kevin Jung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308620#comment-14308620 ] Kevin Jung commented on SPARK-5081: --- To test under the same condition, I set this to

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-02-05 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308619#comment-14308619 ] Pedro Rodriguez commented on SPARK-5556: I will read that paper, seems

[jira] [Commented] (SPARK-3185) SPARK launch on Hadoop 2 in EC2 throws Tachyon exception when Formatting JOURNAL_FOLDER

2015-02-05 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308644#comment-14308644 ] Florian Verhein commented on SPARK-3185: [~dvohra] Sure, but the exception is

[jira] [Commented] (SPARK-4808) Spark fails to spill with small number of large objects

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308677#comment-14308677 ] Apache Spark commented on SPARK-4808: - User 'mingyukim' has created a pull request for

[jira] [Resolved] (SPARK-5639) Support DataFrame.renameColumn

2015-02-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5639. Resolution: Fixed Fix Version/s: 1.3.0 Support DataFrame.renameColumn

[jira] [Comment Edited] (SPARK-5391) SparkSQL fails to create tables with custom JSON SerDe

2015-02-05 Thread Muthupandi K (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308627#comment-14308627 ] Muthupandi K edited comment on SPARK-5391 at 2/6/15 5:13 AM: -

[jira] [Commented] (SPARK-5391) SparkSQL fails to create tables with custom JSON SerDe

2015-02-05 Thread Muthupandi K (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308627#comment-14308627 ] Muthupandi K commented on SPARK-5391: - Same error occoured when a table is created

[jira] [Resolved] (SPARK-5631) Server IPC version 7 cannot communicate with client version 4

2015-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5631. -- Resolution: Not a Problem The right place to ask questions and discuss this is the mailing list. This

[jira] [Resolved] (SPARK-5531) Spark download .tgz file does not get unpacked

2015-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5531. -- Resolution: Not a Problem Spark download .tgz file does not get unpacked

[jira] [Updated] (SPARK-5645) Track local bytes read for shuffles - update UI

2015-02-05 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-5645: -- Assignee: Kostas Sakellis Track local bytes read for shuffles - update UI

[jira] [Commented] (SPARK-5643) Add a show method to print the content of a DataFrame in columnar format

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308543#comment-14308543 ] Apache Spark commented on SPARK-5643: - User 'rxin' has created a pull request for this

[jira] [Created] (SPARK-5646) Record output metrics for cache

2015-02-05 Thread Kostas Sakellis (JIRA)
Kostas Sakellis created SPARK-5646: -- Summary: Record output metrics for cache Key: SPARK-5646 URL: https://issues.apache.org/jira/browse/SPARK-5646 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-5646) Record output metrics for cache

2015-02-05 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-5646: -- Assignee: Kostas Sakellis Record output metrics for cache ---

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-02-05 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307196#comment-14307196 ] Travis Galoppo commented on SPARK-5021: --- [~MechCoder] It is probably better to get

[jira] [Commented] (SPARK-4705) Driver retries in yarn-cluster mode always fail if event logging is enabled

2015-02-05 Thread Twinkle Sachdeva (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307406#comment-14307406 ] Twinkle Sachdeva commented on SPARK-4705: - Hi [~vanzin], Regarding adding that

[jira] [Commented] (SPARK-4897) Python 3 support

2015-02-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307463#comment-14307463 ] Josh Rosen commented on SPARK-4897: --- Hi [~ianozsvald], Until now, the main motivation

[jira] [Updated] (SPARK-4897) Python 3 support

2015-02-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4897: -- Target Version/s: 1.4.0 (was: 1.3.0) Python 3 support Key:

[jira] [Commented] (SPARK-4897) Python 3 support

2015-02-05 Thread thom neale (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307474#comment-14307474 ] thom neale commented on SPARK-4897: --- I'm still very interested in helping with the 3.4

[jira] [Created] (SPARK-5616) Add examples for PySpark API

2015-02-05 Thread dongxu (JIRA)
dongxu created SPARK-5616: - Summary: Add examples for PySpark API Key: SPARK-5616 URL: https://issues.apache.org/jira/browse/SPARK-5616 Project: Spark Issue Type: Improvement Components:

[jira] [Created] (SPARK-5617) test failure of SQLQuerySuite

2015-02-05 Thread wangfei (JIRA)
wangfei created SPARK-5617: -- Summary: test failure of SQLQuerySuite Key: SPARK-5617 URL: https://issues.apache.org/jira/browse/SPARK-5617 Project: Spark Issue Type: Bug Components: SQL

[jira] [Created] (SPARK-5618) Optimise utility code.

2015-02-05 Thread Makoto Fukuhara (JIRA)
Makoto Fukuhara created SPARK-5618: -- Summary: Optimise utility code. Key: SPARK-5618 URL: https://issues.apache.org/jira/browse/SPARK-5618 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-5631) Server IPC version 7 cannot communicate with client version 4

2015-02-05 Thread DeepakVohra (JIRA)
DeepakVohra created SPARK-5631: -- Summary: Server IPC version 7 cannot communicate with client version 4 Key: SPARK-5631 URL: https://issues.apache.org/jira/browse/SPARK-5631 Project: Spark

[jira] [Resolved] (SPARK-5135) Add support for describe [extended] table to DDL in SQLContext

2015-02-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5135. Resolution: Fixed Add support for describe [extended] table to DDL in SQLContext

[jira] [Created] (SPARK-5615) Fix testPackage in StreamingContextSuite

2015-02-05 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5615: -- Summary: Fix testPackage in StreamingContextSuite Key: SPARK-5615 URL: https://issues.apache.org/jira/browse/SPARK-5615 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-5608) Improve SEO of Spark documentation site to let Google find latest docs

2015-02-05 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-5608. -- Resolution: Fixed Fix Version/s: 1.3.0 Improve SEO of Spark documentation site to let

[jira] [Commented] (SPARK-2808) update kafka to version 0.8.2

2015-02-05 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307254#comment-14307254 ] koert kuipers commented on SPARK-2808: -- what is the motivation for this upgrade?

[jira] [Comment Edited] (SPARK-2808) update kafka to version 0.8.2

2015-02-05 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307254#comment-14307254 ] koert kuipers edited comment on SPARK-2808 at 2/5/15 2:28 PM: --

[jira] [Commented] (SPARK-5480) GraphX pageRank: java.lang.ArrayIndexOutOfBoundsException:

2015-02-05 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307289#comment-14307289 ] Takeshi Yamamuro commented on SPARK-5480: - These codes didn't throw such

[jira] [Created] (SPARK-5632) not able to resolve dot('.') in field name

2015-02-05 Thread Lishu Liu (JIRA)
Lishu Liu created SPARK-5632: Summary: not able to resolve dot('.') in field name Key: SPARK-5632 URL: https://issues.apache.org/jira/browse/SPARK-5632 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-5633) pyspark saveAsTextFile support for compression codec

2015-02-05 Thread Vladimir Vladimirov (JIRA)
Vladimir Vladimirov created SPARK-5633: -- Summary: pyspark saveAsTextFile support for compression codec Key: SPARK-5633 URL: https://issues.apache.org/jira/browse/SPARK-5633 Project: Spark

[jira] [Created] (SPARK-5634) History server shows misleading message when there are no incomplete apps

2015-02-05 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-5634: - Summary: History server shows misleading message when there are no incomplete apps Key: SPARK-5634 URL: https://issues.apache.org/jira/browse/SPARK-5634 Project:

[jira] [Commented] (SPARK-5633) pyspark saveAsTextFile support for compression codec

2015-02-05 Thread Vladimir Vladimirov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308128#comment-14308128 ] Vladimir Vladimirov commented on SPARK-5633: Here is a workaround before

[jira] [Commented] (SPARK-5622) Add connector/handler hive configuration settings to hive-thrift-server

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308141#comment-14308141 ] Apache Spark commented on SPARK-5622: - User 'alexliu68' has created a pull request for

[jira] [Commented] (SPARK-5493) Support proxy users under kerberos

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308131#comment-14308131 ] Apache Spark commented on SPARK-5493: - User 'vanzin' has created a pull request for

[jira] [Commented] (SPARK-5335) Destroying cluster in VPC with --delete-groups fails to remove security groups

2015-02-05 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308133#comment-14308133 ] Nicholas Chammas commented on SPARK-5335: - For the record: [AWS

[jira] [Commented] (SPARK-5604) Remove setCheckpointDir from LDA and tree Strategy

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308166#comment-14308166 ] Apache Spark commented on SPARK-5604: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-5633) pyspark saveAsTextFile support for compression codec

2015-02-05 Thread Vladimir Vladimirov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308077#comment-14308077 ] Vladimir Vladimirov commented on SPARK-5633: Here is pull request that adds

[jira] [Commented] (SPARK-5620) Group methods in generated unidoc

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308084#comment-14308084 ] Apache Spark commented on SPARK-5620: - User 'mengxr' has created a pull request for

[jira] [Updated] (SPARK-5632) not able to resolve dot('.') in field name

2015-02-05 Thread Lishu Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lishu Liu updated SPARK-5632: - Description: My cassandra table task_trace has a field sm.result which contains dot in the name. So SQL

[jira] [Resolved] (SPARK-5528) Support schema merging while reading Parquet files

2015-02-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5528. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4308

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-05 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308263#comment-14308263 ] Marcelo Vanzin commented on SPARK-5388: --- Also, a fun fact about the Jersey

[jira] [Resolved] (SPARK-5509) EqualTo operator doesn't handle binary type properly

2015-02-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5509. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4308

[jira] [Created] (SPARK-5635) Allow users to run .scala files directly from spark-submit

2015-02-05 Thread Grant Henke (JIRA)
Grant Henke created SPARK-5635: -- Summary: Allow users to run .scala files directly from spark-submit Key: SPARK-5635 URL: https://issues.apache.org/jira/browse/SPARK-5635 Project: Spark Issue

[jira] [Created] (SPARK-5637) Expose spark_ec2 as as StarCluster Plugin

2015-02-05 Thread Alex Rothberg (JIRA)
Alex Rothberg created SPARK-5637: Summary: Expose spark_ec2 as as StarCluster Plugin Key: SPARK-5637 URL: https://issues.apache.org/jira/browse/SPARK-5637 Project: Spark Issue Type:

[jira] [Created] (SPARK-5638) Add a config flag to disable eager analysis of DataFrames

2015-02-05 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5638: -- Summary: Add a config flag to disable eager analysis of DataFrames Key: SPARK-5638 URL: https://issues.apache.org/jira/browse/SPARK-5638 Project: Spark Issue

[jira] [Commented] (SPARK-5638) Add a config flag to disable eager analysis of DataFrames

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308286#comment-14308286 ] Apache Spark commented on SPARK-5638: - User 'rxin' has created a pull request for this

[jira] [Created] (SPARK-5639) Support DataFrame.renameColumn

2015-02-05 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5639: -- Summary: Support DataFrame.renameColumn Key: SPARK-5639 URL: https://issues.apache.org/jira/browse/SPARK-5639 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-3454) Expose JSON representation of data shown in WebUI

2015-02-05 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308324#comment-14308324 ] Marcelo Vanzin commented on SPARK-3454: --- Hi [~imranr], There are two ways I can see

[jira] [Commented] (SPARK-5639) Support DataFrame.renameColumn

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308328#comment-14308328 ] Apache Spark commented on SPARK-5639: - User 'rxin' has created a pull request for this

[jira] [Resolved] (SPARK-5620) Group methods in generated unidoc

2015-02-05 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5620. Resolution: Fixed Fix Version/s: 1.3.0 Group methods in generated unidoc

[jira] [Resolved] (SPARK-5604) Remove setCheckpointDir from LDA and tree Strategy

2015-02-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5604. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4390

[jira] [Resolved] (SPARK-5182) Partitioning support for tables created by the data source API

2015-02-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5182. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4308

[jira] [Resolved] (SPARK-3575) Hive Schema is ignored when using convertMetastoreParquet

2015-02-05 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3575. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4308

[jira] [Commented] (SPARK-5624) Can't find new column

2015-02-05 Thread Alex Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308261#comment-14308261 ] Alex Liu commented on SPARK-5624: - Test it on the latest master branch it doesn't have

[jira] [Commented] (SPARK-5388) Provide a stable application submission gateway in standalone cluster mode

2015-02-05 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308260#comment-14308260 ] Marcelo Vanzin commented on SPARK-5388: --- Hi [~andrewor14], Thanks for updating the

[jira] [Created] (SPARK-5636) Lower dynamic allocation add interval

2015-02-05 Thread Andrew Or (JIRA)
Andrew Or created SPARK-5636: Summary: Lower dynamic allocation add interval Key: SPARK-5636 URL: https://issues.apache.org/jira/browse/SPARK-5636 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5557) spark-shell failed to start

2015-02-05 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308346#comment-14308346 ] Patrick Wendell commented on SPARK-5557: I can send a fix for this shortly. It

[jira] [Created] (SPARK-5621) Cannot fetch dependencies for mllib

2015-02-05 Thread Luca Venturini (JIRA)
Luca Venturini created SPARK-5621: - Summary: Cannot fetch dependencies for mllib Key: SPARK-5621 URL: https://issues.apache.org/jira/browse/SPARK-5621 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2015-02-05 Thread Philippe Girolami (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307580#comment-14307580 ] Philippe Girolami commented on SPARK-1867: -- Has anyone figured this out ? I'm

[jira] [Commented] (SPARK-5081) Shuffle write increases

2015-02-05 Thread Kostas Sakellis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307584#comment-14307584 ] Kostas Sakellis commented on SPARK-5081: Can you add a sample of the code too?

[jira] [Created] (SPARK-5622) Add connector/handler hive configuration settings to hive-thrift-server

2015-02-05 Thread Alex Liu (JIRA)
Alex Liu created SPARK-5622: --- Summary: Add connector/handler hive configuration settings to hive-thrift-server Key: SPARK-5622 URL: https://issues.apache.org/jira/browse/SPARK-5622 Project: Spark

[jira] [Created] (SPARK-5623) Replace an obsolete mapReduceTriplets with a new aggregateMessages in GraphSuite

2015-02-05 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-5623: --- Summary: Replace an obsolete mapReduceTriplets with a new aggregateMessages in GraphSuite Key: SPARK-5623 URL: https://issues.apache.org/jira/browse/SPARK-5623

[jira] [Commented] (SPARK-5013) User guide for Gaussian Mixture Model

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307605#comment-14307605 ] Apache Spark commented on SPARK-5013: - User 'tgaloppo' has created a pull request for

[jira] [Commented] (SPARK-5480) GraphX pageRank: java.lang.ArrayIndexOutOfBoundsException:

2015-02-05 Thread Stephane Maarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307628#comment-14307628 ] Stephane Maarek commented on SPARK-5480: It happened once after one of my server

[jira] [Resolved] (SPARK-5621) Cannot fetch dependencies for mllib

2015-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5621. -- Resolution: Not a Problem It certainly exists :

[jira] [Commented] (SPARK-5610) Generate Java docs without package private classes and methods

2015-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307599#comment-14307599 ] Sean Owen commented on SPARK-5610: -- From looking at the Javadoc 8 + unidoc issue, I

[jira] [Resolved] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2015-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1867. -- Resolution: Not a Problem I think there are a number of manifestations of the same basic problem here:

[jira] [Commented] (SPARK-2827) Add DegreeDist function support

2015-02-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307499#comment-14307499 ] Apache Spark commented on SPARK-2827: - User 'maropu' has created a pull request for

[jira] [Created] (SPARK-5620) Group methods in generated unidoc

2015-02-05 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5620: Summary: Group methods in generated unidoc Key: SPARK-5620 URL: https://issues.apache.org/jira/browse/SPARK-5620 Project: Spark Issue Type: Documentation

[jira] [Commented] (SPARK-4897) Python 3 support

2015-02-05 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14307482#comment-14307482 ] Josh Rosen commented on SPARK-4897: --- By the way, it might be nice to see if we can

[jira] [Commented] (SPARK-5557) Servlet API classes now missing after jetty shading

2015-02-05 Thread Kostas Sakellis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308374#comment-14308374 ] Kostas Sakellis commented on SPARK-5557: [~pwendell] recommended this which did

[jira] [Comment Edited] (SPARK-5335) Destroying cluster in VPC with --delete-groups fails to remove security groups

2015-02-05 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308133#comment-14308133 ] Nicholas Chammas edited comment on SPARK-5335 at 2/6/15 1:15 AM:

  1   2   >