[jira] [Updated] (SPARK-16046) Add Spark SQL Dataset Tutorial

2016-06-18 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pedro Rodriguez updated SPARK-16046: Description: Issue to update the Spark SQL guide to provide more content around using

[jira] [Commented] (SPARK-16046) Add Spark SQL Dataset Tutorial

2016-06-18 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337869#comment-15337869 ] Pedro Rodriguez commented on SPARK-16046: - I would like to take on this issue and will base work

[jira] [Updated] (SPARK-16046) Add Spark SQL Dataset Tutorial

2016-06-18 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pedro Rodriguez updated SPARK-16046: Component/s: SQL Documentation > Add Spark SQL Dataset Tutorial >

[jira] [Created] (SPARK-16046) Add Spark SQL Dataset Tutorial

2016-06-18 Thread Pedro Rodriguez (JIRA)
Pedro Rodriguez created SPARK-16046: --- Summary: Add Spark SQL Dataset Tutorial Key: SPARK-16046 URL: https://issues.apache.org/jira/browse/SPARK-16046 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-08-20 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14704885#comment-14704885 ] Pedro Rodriguez commented on SPARK-5556: That is awesome. I've been a bit busy

[jira] [Commented] (SPARK-8231) complex function: array_contains

2015-07-21 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14635425#comment-14635425 ] Pedro Rodriguez commented on SPARK-8231: I can give this one a shot since I

[jira] [Commented] (SPARK-8231) complex function: array_contains

2015-07-21 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14635797#comment-14635797 ] Pedro Rodriguez commented on SPARK-8231: What should the null behavior of this be?

[jira] [Commented] (SPARK-8231) complex function: array_contains

2015-07-21 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14636004#comment-14636004 ] Pedro Rodriguez commented on SPARK-8231: Looks like the critical points are 1. If

[jira] [Commented] (SPARK-8230) complex function: size

2015-07-17 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630921#comment-14630921 ] Pedro Rodriguez commented on SPARK-8230: [~chenghao], code is ready for review

[jira] [Commented] (SPARK-8230) complex function: size

2015-07-16 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630575#comment-14630575 ] Pedro Rodriguez commented on SPARK-8230: I took a look at that code as well as

[jira] [Commented] (SPARK-8230) complex function: size

2015-07-15 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629079#comment-14629079 ] Pedro Rodriguez commented on SPARK-8230: Moving to here instead of mailing list.

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-07-06 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615616#comment-14615616 ] Pedro Rodriguez commented on SPARK-5556: I am still interested, but was unsure of

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-04-29 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1452#comment-1452 ] Pedro Rodriguez commented on SPARK-5556: What are thoughts on implementation?It

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-04-28 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14518378#comment-14518378 ] Pedro Rodriguez commented on SPARK-5556: I will start working on it again then. It

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-04-28 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14518601#comment-14518601 ] Pedro Rodriguez commented on SPARK-5556: [~gq] is the LDAGibbs line what I

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-04-28 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14518133#comment-14518133 ] Pedro Rodriguez commented on SPARK-5556: With the refactoring done, I can get to

[jira] [Commented] (SPARK-4414) SparkContext.wholeTextFiles Doesn't work with S3 Buckets

2015-03-25 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14381323#comment-14381323 ] Pedro Rodriguez commented on SPARK-4414: I haven't looked at this for a while, so

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-02-26 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14339772#comment-14339772 ] Pedro Rodriguez commented on SPARK-5556: See PR for info, TLDR: contains

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-02-26 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14339849#comment-14339849 ] Pedro Rodriguez commented on SPARK-5556: Based on initial testing, I recall

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-02-05 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308603#comment-14308603 ] Pedro Rodriguez commented on SPARK-5556: Posting here as a status update. I will

[jira] [Commented] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2015-02-05 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14308619#comment-14308619 ] Pedro Rodriguez commented on SPARK-5556: I will read that paper, seems

[jira] [Created] (SPARK-5385) Calling textFile, parallelize, zip, then partitions causes failure on some local[*]

2015-01-23 Thread Pedro Rodriguez (JIRA)
Pedro Rodriguez created SPARK-5385: -- Summary: Calling textFile, parallelize, zip, then partitions causes failure on some local[*] Key: SPARK-5385 URL: https://issues.apache.org/jira/browse/SPARK-5385

[jira] [Closed] (SPARK-5385) Calling textFile, parallelize, zip, then partitions causes failure on some local[*]

2015-01-23 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pedro Rodriguez closed SPARK-5385. -- Resolution: Fixed Indeed, not a bug, fixed by calling textFiles, then passing partitions.size

[jira] [Commented] (SPARK-2823) GraphX jobs throw IllegalArgumentException

2015-01-23 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289499#comment-14289499 ] Pedro Rodriguez commented on SPARK-2823: I looked into this more and it looks like

[jira] [Commented] (SPARK-5385) Calling textFile, parallelize, zip, then partitions causes failure on some local[*]

2015-01-23 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289604#comment-14289604 ] Pedro Rodriguez commented on SPARK-5385: Perhaps its not a bug then, if so, then

[jira] [Commented] (SPARK-5385) Calling textFile, parallelize, zip, then partitions causes failure on some local[*]

2015-01-23 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289623#comment-14289623 ] Pedro Rodriguez commented on SPARK-5385: On your prior comment, I know

[jira] [Comment Edited] (SPARK-2823) GraphX jobs throw IllegalArgumentException

2015-01-19 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283331#comment-14283331 ] Pedro Rodriguez edited comment on SPARK-2823 at 1/20/15 2:46 AM:

[jira] [Commented] (SPARK-2823) GraphX jobs throw IllegalArgumentException

2015-01-19 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283331#comment-14283331 ] Pedro Rodriguez commented on SPARK-2823: I just ran into this bug while testing

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-14 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278381#comment-14278381 ] Pedro Rodriguez commented on SPARK-1405: Worked on some preliminary testing

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-09 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272184#comment-14272184 ] Pedro Rodriguez commented on SPARK-1405: Sounds good Joseph. Have some good news.

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-09 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271717#comment-14271717 ] Pedro Rodriguez commented on SPARK-1405: Second on nice design doc and proposal. I

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-11-24 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223922#comment-14223922 ] Pedro Rodriguez commented on SPARK-1405: Finished an initial implementation of an

[jira] [Comment Edited] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-11-24 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223922#comment-14223922 ] Pedro Rodriguez edited comment on SPARK-1405 at 11/25/14 2:18 AM:

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-11-22 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222030#comment-14222030 ] Pedro Rodriguez commented on SPARK-1405: I don't know of a larger data set, but I

[jira] [Created] (SPARK-4543) Javadoc failure for network-common causes publish-local to fail

2014-11-21 Thread Pedro Rodriguez (JIRA)
Pedro Rodriguez created SPARK-4543: -- Summary: Javadoc failure for network-common causes publish-local to fail Key: SPARK-4543 URL: https://issues.apache.org/jira/browse/SPARK-4543 Project: Spark

[jira] [Updated] (SPARK-4543) Javadoc failure for network-common causes publish-local to fail

2014-11-21 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pedro Rodriguez updated SPARK-4543: --- Description: Javadoc for network-common fails. This causes sbt publish-local to fail, and

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-11-19 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14218907#comment-14218907 ] Pedro Rodriguez commented on SPARK-1405: I am not super familiar with LSA, so

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-11-19 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14219094#comment-14219094 ] Pedro Rodriguez commented on SPARK-1405: I will take a look at those when I get a

[jira] [Created] (SPARK-4408) Behavior difference between spark-submit conf vs cmd line args

2014-11-14 Thread Pedro Rodriguez (JIRA)
Pedro Rodriguez created SPARK-4408: -- Summary: Behavior difference between spark-submit conf vs cmd line args Key: SPARK-4408 URL: https://issues.apache.org/jira/browse/SPARK-4408 Project: Spark

[jira] [Created] (SPARK-4414) SparkContext.wholeTextFiles Doesn't work with S3 Buckets

2014-11-14 Thread Pedro Rodriguez (JIRA)
Pedro Rodriguez created SPARK-4414: -- Summary: SparkContext.wholeTextFiles Doesn't work with S3 Buckets Key: SPARK-4414 URL: https://issues.apache.org/jira/browse/SPARK-4414 Project: Spark

[jira] [Created] (SPARK-3936) Incorrect result in GraphX BytecodeUtils with closures + class/object methods

2014-10-13 Thread Pedro Rodriguez (JIRA)
Pedro Rodriguez created SPARK-3936: -- Summary: Incorrect result in GraphX BytecodeUtils with closures + class/object methods Key: SPARK-3936 URL: https://issues.apache.org/jira/browse/SPARK-3936

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-09-25 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14148555#comment-14148555 ] Pedro Rodriguez commented on SPARK-1405: [~mengxr], definitely a good idea to be

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2014-09-15 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14134445#comment-14134445 ] Pedro Rodriguez commented on SPARK-1405: Hi All. Just wanted to quickly introduce