[jira] [Closed] (SPARK-6157) Unrolling with MEMORY_AND_DISK should always release memory

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-6157. Assignee: (was: SuYan) > Unrolling with MEMORY_AND_DISK should always release memory >

[jira] [Resolved] (SPARK-6332) compute calibration curve for binary classifiers

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6332. -- Resolution: Won't Fix > compute calibration curve for binary classifiers >

[jira] [Resolved] (SPARK-2426) Quadratic Minimization for MLlib ALS

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2426. -- Resolution: Won't Fix > Quadratic Minimization for MLlib ALS >

[jira] [Resolved] (SPARK-6157) Unrolling with MEMORY_AND_DISK should always release memory

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6157. -- Resolution: Won't Fix > Unrolling with MEMORY_AND_DISK should always release memory >

[jira] [Resolved] (SPARK-4961) Put HadoopRDD.getPartitions forward to reduce DAGScheduler.JobSubmitted processing time

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4961. -- Resolution: Won't Fix > Put HadoopRDD.getPartitions forward to reduce DAGScheduler.JobSubmitted >

[jira] [Commented] (SPARK-12438) Add SQLUserDefinedType support for encoder

2015-12-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15075850#comment-15075850 ] Apache Spark commented on SPARK-12438: -- User 'thomastechs' has created a pull request for this

[jira] [Resolved] (SPARK-12039) HiveSparkSubmitSuite's SPARK-9757 Persist Parquet relation with decimal column is very flaky

2015-12-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12039. - Resolution: Fixed Fix Version/s: 2.0.0 > HiveSparkSubmitSuite's SPARK-9757 Persist

[jira] [Updated] (SPARK-12039) HiveSparkSubmitSuite's SPARK-9757 Persist Parquet relation with decimal column is very flaky

2015-12-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12039: Assignee: Yin Huai > HiveSparkSubmitSuite's SPARK-9757 Persist Parquet relation with decimal >

[jira] [Resolved] (SPARK-1987) More memory-efficient graph construction

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1987. -- Resolution: Won't Fix > More memory-efficient graph construction >

[jira] [Resolved] (SPARK-4675) Find similar products and similar users in MatrixFactorizationModel

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4675. -- Resolution: Won't Fix > Find similar products and similar users in MatrixFactorizationModel >

[jira] [Commented] (SPARK-12537) Add option to accept quoting of all character backslash quoting mechanism

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15075831#comment-15075831 ] Sean Owen commented on SPARK-12537: --- (How about soliciting more opinions here to see where others'

[jira] [Resolved] (SPARK-4086) Fold-style aggregation for VertexRDD

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4086. -- Resolution: Won't Fix > Fold-style aggregation for VertexRDD >

[jira] [Commented] (SPARK-1061) allow Hadoop RDDs to be read w/ a partitioner

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15075854#comment-15075854 ] Sean Owen commented on SPARK-1061: -- Is this still live? > allow Hadoop RDDs to be read w/ a partitioner

[jira] [Resolved] (SPARK-5036) Better support sending partial messages in Pregel API

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5036. -- Resolution: Won't Fix > Better support sending partial messages in Pregel API >

[jira] [Resolved] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5832. -- Resolution: Won't Fix Target Version/s: (was: ) > Add Affinity Propagation clustering

[jira] [Resolved] (SPARK-6105) enhance spark-ganglia to support redundant gmond addresses setting in ganglia unicast mode

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6105. -- Resolution: Won't Fix > enhance spark-ganglia to support redundant gmond addresses setting in ganglia

[jira] [Resolved] (SPARK-7441) Implement microbatch functionality so that Spark Streaming can process a large backlog of existing files discovered in batch in smaller batches

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7441. -- Resolution: Won't Fix > Implement microbatch functionality so that Spark Streaming can process a >

[jira] [Resolved] (SPARK-7995) Remove AkkaRpcEnv and remove Akka from the dependencies of Core

2015-12-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7995. Resolution: Fixed Fix Version/s: 2.0.0 > Remove AkkaRpcEnv and remove Akka from the

[jira] [Resolved] (SPARK-6280) Remove Akka systemName from Spark

2015-12-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6280. Resolution: Fixed Fix Version/s: 2.0.0 > Remove Akka systemName from Spark >

[jira] [Resolved] (SPARK-12590) Inconsistent behavior of randomSplit in YARN mode

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12590. --- Resolution: Not A Problem Yes, I think you've hit it on the head: the issue is that you're

[jira] [Updated] (SPARK-12591) NullPointerException using checkpointed mapWithState with KryoSerializer

2015-12-31 Thread Jan Uyttenhove (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jan Uyttenhove updated SPARK-12591: --- Description: Issue occurred after upgrading to the RC4 of Spark (streaming) 1.6.0 to (re)test

[jira] [Updated] (SPARK-12591) NullPointerException using checkpointed mapWithState with KryoSerializer

2015-12-31 Thread Jan Uyttenhove (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jan Uyttenhove updated SPARK-12591: --- Description: Issue occurred after upgrading to the RC4 of Spark (streaming) 1.6.0 to (re)test

[jira] [Updated] (SPARK-12591) NullPointerException using checkpointed mapWithState with KryoSerializer

2015-12-31 Thread Jan Uyttenhove (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jan Uyttenhove updated SPARK-12591: --- Description: Issue occurred after upgrading to the RC4 of Spark (streaming) 1.6.0 to (re)test

[jira] [Created] (SPARK-12593) Convert resolved logical plans back to SQL query strings

2015-12-31 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-12593: -- Summary: Convert resolved logical plans back to SQL query strings Key: SPARK-12593 URL: https://issues.apache.org/jira/browse/SPARK-12593 Project: Spark Issue

[jira] [Commented] (SPARK-8555) Online Variational Inference for the Hierarchical Dirichlet Process

2015-12-31 Thread Tu Dinh Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15075909#comment-15075909 ] Tu Dinh Nguyen commented on SPARK-8555: --- Hi, I'm Tu from Deakin University. Our team is currently

[jira] [Created] (SPARK-12592) TestHive.reset hides Spark testing logs

2015-12-31 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-12592: -- Summary: TestHive.reset hides Spark testing logs Key: SPARK-12592 URL: https://issues.apache.org/jira/browse/SPARK-12592 Project: Spark Issue Type: Test

[jira] [Assigned] (SPARK-12592) TestHive.reset hides Spark testing logs

2015-12-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12592: Assignee: Apache Spark > TestHive.reset hides Spark testing logs >

[jira] [Assigned] (SPARK-12592) TestHive.reset hides Spark testing logs

2015-12-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12592: Assignee: (was: Apache Spark) > TestHive.reset hides Spark testing logs >

[jira] [Resolved] (SPARK-4902) gap-sampling performance optimization

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4902. -- Resolution: Won't Fix > gap-sampling performance optimization >

[jira] [Commented] (SPARK-12592) TestHive.reset hides Spark testing logs

2015-12-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15075910#comment-15075910 ] Apache Spark commented on SPARK-12592: -- User 'liancheng' has created a pull request for this issue:

[jira] [Commented] (SPARK-12592) TestHive.reset hides Spark testing logs

2015-12-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15075930#comment-15075930 ] Apache Spark commented on SPARK-12592: -- User 'liancheng' has created a pull request for this issue:

[jira] [Resolved] (SPARK-4976) trust region Newton optimizer in mllib

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4976. -- Resolution: Won't Fix > trust region Newton optimizer in mllib >

[jira] [Resolved] (SPARK-4526) Gradient should be added batch computing interface

2015-12-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4526. -- Resolution: Won't Fix > Gradient should be added batch computing interface >

[jira] [Created] (SPARK-12591) NullPointerException using checkpointed mapWithState with KryoSerializer

2015-12-31 Thread Jan Uyttenhove (JIRA)
Jan Uyttenhove created SPARK-12591: -- Summary: NullPointerException using checkpointed mapWithState with KryoSerializer Key: SPARK-12591 URL: https://issues.apache.org/jira/browse/SPARK-12591

[jira] [Commented] (SPARK-12590) Inconsistent behavior of randomSplit in YARN mode

2015-12-31 Thread Gaurav Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15075902#comment-15075902 ] Gaurav Kumar commented on SPARK-12590: -- Thanks [~srowen] for the explanation. I think most users,

[jira] [Created] (SPARK-12594) Join Conversion: Outer to Inner/Left/Right, Right to Inner and Left to Inner

2015-12-31 Thread Xiao Li (JIRA)
Xiao Li created SPARK-12594: --- Summary: Join Conversion: Outer to Inner/Left/Right, Right to Inner and Left to Inner Key: SPARK-12594 URL: https://issues.apache.org/jira/browse/SPARK-12594 Project: Spark

[jira] [Commented] (SPARK-12594) Join Conversion: Outer to Inner/Left/Right, Right to Inner and Left to Inner

2015-12-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15076200#comment-15076200 ] Apache Spark commented on SPARK-12594: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12594) Join Conversion: Outer to Inner/Left/Right, Right to Inner and Left to Inner

2015-12-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12594: Assignee: (was: Apache Spark) > Join Conversion: Outer to Inner/Left/Right, Right to

[jira] [Assigned] (SPARK-12594) Join Conversion: Outer to Inner/Left/Right, Right to Inner and Left to Inner

2015-12-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12594: Assignee: Apache Spark > Join Conversion: Outer to Inner/Left/Right, Right to Inner and

[jira] [Updated] (SPARK-12196) Store/retrieve blocks in different speed storage devices by hierarchy way

2015-12-31 Thread yucai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yucai updated SPARK-12196: -- Description: *Motivation* Nowadays, customers have both SSDs(SATA SSD/PCIe SSD) and HDDs. SSDs have great

[jira] [Commented] (SPARK-10359) Enumerate Spark's dependencies in a file and diff against it for new pull requests

2015-12-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15076234#comment-15076234 ] Apache Spark commented on SPARK-10359: -- User 'JoshRosen' has created a pull request for this issue: