[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-02-03 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14302952#comment-14302952 ] yuhao yang commented on SPARK-1405: --- Hi everyone, I'm sharing an implementation of

[jira] [Comment Edited] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-02-03 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14302952#comment-14302952 ] yuhao yang edited comment on SPARK-1405 at 2/3/15 8:35 AM: --- Hi

[jira] [Commented] (SPARK-5566) Tokenizer for mllib package

2015-02-05 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14308733#comment-14308733 ] yuhao yang commented on SPARK-5566: --- I mean only the underlying implementation.

[jira] [Comment Edited] (SPARK-5563) LDA with online variational inference

2015-02-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305115#comment-14305115 ] yuhao yang edited comment on SPARK-5563 at 2/4/15 2:22 PM: ---

[jira] [Commented] (SPARK-5563) LDA with online variational inference

2015-02-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305115#comment-14305115 ] yuhao yang commented on SPARK-5563: --- Thanks Joseph for helping create the jira. Paste

[jira] [Comment Edited] (SPARK-5563) LDA with online variational inference

2015-02-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305115#comment-14305115 ] yuhao yang edited comment on SPARK-5563 at 2/4/15 2:23 PM: ---

[jira] [Commented] (SPARK-5563) LDA with online variational inference

2015-02-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305199#comment-14305199 ] yuhao yang commented on SPARK-5563: --- BTW, batch versions of online variational inference

[jira] [Commented] (SPARK-5566) Tokenizer for mllib package

2015-02-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14305172#comment-14305172 ] yuhao yang commented on SPARK-5566: --- Actually, I believe much of the current code, like Word2Vec

[jira] [Closed] (SPARK-5282) RowMatrix easily gets int overflow in the memory size warning

2015-01-19 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-5282. - fixed RowMatrix easily gets int overflow in the memory size warning

[jira] [Commented] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient and fail on SparseVectors with large size

2015-01-16 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280025#comment-14280025 ] yuhao yang commented on SPARK-5186: --- I just updated the PR with a hashCode fix. Please

[jira] [Closed] (SPARK-5234) examples for ml don't have sparkContext.stop

2015-01-16 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-5234. - fixed examples for ml don't have sparkContext.stop

[jira] [Created] (SPARK-5282) RowMatrix easily gets int overflow in the memory size warning

2015-01-16 Thread yuhao yang (JIRA)
yuhao yang created SPARK-5282: - Summary: RowMatrix easily gets int overflow in the memory size warning Key: SPARK-5282 URL: https://issues.apache.org/jira/browse/SPARK-5282 Project: Spark Issue

[jira] [Commented] (SPARK-5282) RowMatrix easily gets int overflow in the memory size warning

2015-01-16 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14280159#comment-14280159 ] yuhao yang commented on SPARK-5282: --- typical wrong message: Row matrix: 17000 columns

[jira] [Closed] (SPARK-5717) add sc.stop to LDA examples

2015-02-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-5717. - merged. Thanks add sc.stop to LDA examples --- Key: SPARK-5717

[jira] [Closed] (SPARK-5384) Vectors.sqdist return inconsistent result for sparse/dense vectors when the vectors have different lengths

2015-01-25 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-5384. - fixed Vectors.sqdist return inconsistent result for sparse/dense vectors when the vectors have different

[jira] [Created] (SPARK-5406) LocalLAPACK mode in RowMatrix.computeSVD should have much smaller upper bound

2015-01-25 Thread yuhao yang (JIRA)
yuhao yang created SPARK-5406: - Summary: LocalLAPACK mode in RowMatrix.computeSVD should have much smaller upper bound Key: SPARK-5406 URL: https://issues.apache.org/jira/browse/SPARK-5406 Project: Spark

[jira] [Updated] (SPARK-5406) LocalLAPACK mode in RowMatrix.computeSVD should have much smaller upper bound

2015-01-26 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-5406: -- Description: In RowMatrix.computeSVD, under LocalLAPACK mode, the code would invoke brzSvd. Yet breeze

[jira] [Updated] (SPARK-5406) LocalLAPACK mode in RowMatrix.computeSVD should have much smaller upper bound

2015-01-26 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-5406: -- Description: In RowMatrix.computeSVD, under LocalLAPACK mode, the code would invoke brzSvd. Yet breeze

[jira] [Closed] (SPARK-5406) LocalLAPACK mode in RowMatrix.computeSVD should have much smaller upper bound

2015-02-01 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-5406. - fix and merged. Thanks LocalLAPACK mode in RowMatrix.computeSVD should have much smaller upper bound

[jira] [Commented] (SPARK-5510) How can I fix the spark-submit script and then running the program on cluster ?

2015-02-01 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14300939#comment-14300939 ] yuhao yang commented on SPARK-5510: --- https://spark.apache.org/community.html check the

[jira] [Created] (SPARK-5234) examples for ml don't have sparkContext.stop

2015-01-13 Thread yuhao yang (JIRA)
yuhao yang created SPARK-5234: - Summary: examples for ml don't have sparkContext.stop Key: SPARK-5234 URL: https://issues.apache.org/jira/browse/SPARK-5234 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-5243) Spark will hang if (driver memory + executor memory) exceeds limit on a 1-worker cluster

2015-01-13 Thread yuhao yang (JIRA)
yuhao yang created SPARK-5243: - Summary: Spark will hang if (driver memory + executor memory) exceeds limit on a 1-worker cluster Key: SPARK-5243 URL: https://issues.apache.org/jira/browse/SPARK-5243

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-09 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14270869#comment-14270869 ] yuhao yang commented on SPARK-1405: --- Great design doc and solid proposal. I noticed

[jira] [Created] (SPARK-5717) add sc.stop to LDA examples

2015-02-10 Thread yuhao yang (JIRA)
yuhao yang created SPARK-5717: - Summary: add sc.stop to LDA examples Key: SPARK-5717 URL: https://issues.apache.org/jira/browse/SPARK-5717 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-5243) Spark will hang if (driver memory + executor memory) exceeds limit on a 1-worker cluster

2015-02-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-5243: -- Description: Spark will hang if calling spark-submit under the conditions: 1. the cluster has only one

[jira] [Updated] (SPARK-5243) Spark will hang if (driver memory + executor memory) exceeds limit on a 1-worker cluster

2015-02-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-5243: -- Description: Spark will hang if calling spark-submit under the conditions: 1. the cluster has only one

[jira] [Commented] (SPARK-5563) LDA with online variational inference

2015-03-16 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14364350#comment-14364350 ] yuhao yang commented on SPARK-5563: --- Matthew Willson. Thanks for the attention and idea.

[jira] [Comment Edited] (SPARK-5563) LDA with online variational inference

2015-03-16 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14364350#comment-14364350 ] yuhao yang edited comment on SPARK-5563 at 3/17/15 1:13 AM:

[jira] [Created] (SPARK-6374) Add getter for GeneralizedLinearAlgorithm

2015-03-16 Thread yuhao yang (JIRA)
yuhao yang created SPARK-6374: - Summary: Add getter for GeneralizedLinearAlgorithm Key: SPARK-6374 URL: https://issues.apache.org/jira/browse/SPARK-6374 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-6177) LDA should check partitions size of the input

2015-03-09 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-6177: -- Description: Add comment to introduce coalesce to LDA example to avoid the possible massive partitions

[jira] [Updated] (SPARK-6177) Add note for

2015-03-09 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-6177: -- Summary: Add note for (was: LDA should check partitions size of the input) Add note for

[jira] [Updated] (SPARK-6177) Add note in LDA example to remind possible coalesce

2015-03-09 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-6177: -- Summary: Add note in LDA example to remind possible coalesce (was: Add note for ) Add note in LDA

[jira] [Commented] (SPARK-6268) KMeans parameter getter methods

2015-03-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356125#comment-14356125 ] yuhao yang commented on SPARK-6268: --- Sure, I'll propose a PR very soon. Thanks! KMeans

[jira] [Comment Edited] (SPARK-6268) KMeans parameter getter methods

2015-03-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356106#comment-14356106 ] yuhao yang edited comment on SPARK-6268 at 3/11/15 2:14 AM: Hi

[jira] [Closed] (SPARK-6177) Add note in LDA example to remind possible coalesce

2015-03-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-6177. - Fix and merged, thanks. Add note in LDA example to remind possible coalesce

[jira] [Created] (SPARK-6177) LDA should check partitions size of the input

2015-03-04 Thread yuhao yang (JIRA)
yuhao yang created SPARK-6177: - Summary: LDA should check partitions size of the input Key: SPARK-6177 URL: https://issues.apache.org/jira/browse/SPARK-6177 Project: Spark Issue Type:

[jira] [Updated] (SPARK-6177) LDA should check partitions size of the input

2015-03-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-6177: -- Description: sc.textFile will create RDD with one partition for each file, and the possible massive

[jira] [Created] (SPARK-5384) Vectors.sqdist return inconsistent result for sparse/dense vectors when the vectors have different lengths

2015-01-23 Thread yuhao yang (JIRA)
yuhao yang created SPARK-5384: - Summary: Vectors.sqdist return inconsistent result for sparse/dense vectors when the vectors have different lengths Key: SPARK-5384 URL:

[jira] [Created] (SPARK-6693) add to string with max lines and width for matrix

2015-04-03 Thread yuhao yang (JIRA)
yuhao yang created SPARK-6693: - Summary: add to string with max lines and width for matrix Key: SPARK-6693 URL: https://issues.apache.org/jira/browse/SPARK-6693 Project: Spark Issue Type:

[jira] [Updated] (SPARK-6693) add toString with max lines and width for matrix

2015-04-03 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-6693: -- Summary: add toString with max lines and width for matrix (was: add to string with max lines and width

[jira] [Closed] (SPARK-6374) Add getter for GeneralizedLinearAlgorithm

2015-04-14 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-6374. - fix merged. Thanks. Add getter for GeneralizedLinearAlgorithm -

[jira] [Closed] (SPARK-6693) add toString with max lines and width for matrix

2015-04-14 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-6693. - Fix merged. Thanks. add toString with max lines and width for matrix

[jira] [Commented] (SPARK-7090) Introduce LDAOptimizer to LDA to further improve extensibility

2015-04-23 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14508907#comment-14508907 ] yuhao yang commented on SPARK-7090: --- Oops, I thought there was something wrong... I'll

[jira] [Closed] (SPARK-7089) Introduce LDAOptimizer to LDA to improve extensibility

2015-04-23 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-7089. - Resolution: Duplicate Sorry for the duplication Introduce LDAOptimizer to LDA to improve extensibility

[jira] [Created] (SPARK-7089) Introduce LDAOptimizer to LDA to improve extensibility

2015-04-23 Thread yuhao yang (JIRA)
yuhao yang created SPARK-7089: - Summary: Introduce LDAOptimizer to LDA to improve extensibility Key: SPARK-7089 URL: https://issues.apache.org/jira/browse/SPARK-7089 Project: Spark Issue Type:

[jira] [Created] (SPARK-7090) Introduce LDAOptimizer to LDA to improve extensibility

2015-04-23 Thread yuhao yang (JIRA)
yuhao yang created SPARK-7090: - Summary: Introduce LDAOptimizer to LDA to improve extensibility Key: SPARK-7090 URL: https://issues.apache.org/jira/browse/SPARK-7090 Project: Spark Issue Type:

[jira] [Updated] (SPARK-7090) Introduce LDAOptimizer to LDA to further improve extensibility

2015-04-23 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-7090: -- Summary: Introduce LDAOptimizer to LDA to further improve extensibility (was: Introduce LDAOptimizer

[jira] [Updated] (SPARK-7090) Introduce LDAOptimizer to LDA to improve extensibility

2015-04-23 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-7090: -- Summary: Introduce LDAOptimizer to LDA to improve extensibility (was: Introduce LDAOptimizer to LDA

[jira] [Reopened] (SPARK-7090) Introduce LDAOptimizer to LDA to further improve extensibility

2015-04-23 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang reopened SPARK-7090: --- Reopen this since 7089 was already closed. Introduce LDAOptimizer to LDA to further improve

[jira] [Comment Edited] (SPARK-7090) Introduce LDAOptimizer to LDA to further improve extensibility

2015-04-23 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14508907#comment-14508907 ] yuhao yang edited comment on SPARK-7090 at 4/23/15 12:00 PM: -

[jira] [Created] (SPARK-7368) add QR decomposition for RowMatrix

2015-05-05 Thread yuhao yang (JIRA)
yuhao yang created SPARK-7368: - Summary: add QR decomposition for RowMatrix Key: SPARK-7368 URL: https://issues.apache.org/jira/browse/SPARK-7368 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-7368) add QR decomposition for RowMatrix

2015-05-05 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14529742#comment-14529742 ] yuhao yang commented on SPARK-7368: --- Oops, I was not aware of the previous effort.

[jira] [Created] (SPARK-7475) adjust ldaExample for online LDA

2015-05-07 Thread yuhao yang (JIRA)
yuhao yang created SPARK-7475: - Summary: adjust ldaExample for online LDA Key: SPARK-7475 URL: https://issues.apache.org/jira/browse/SPARK-7475 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-7514) Add MinMaxScaler to feature transformation

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537602#comment-14537602 ] yuhao yang commented on SPARK-7514: --- Class name has always been MinMaxScaler in the

[jira] [Commented] (SPARK-7514) Add MinMaxScaler to feature transformation

2015-05-11 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537651#comment-14537651 ] yuhao yang commented on SPARK-7514: --- Thanks Joseph, just one concern for using center as

[jira] [Comment Edited] (SPARK-7514) Add MinMaxScaler to feature transformation

2015-05-11 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537651#comment-14537651 ] yuhao yang edited comment on SPARK-7514 at 5/11/15 6:41 AM:

[jira] [Updated] (SPARK-7514) Add MinMaxScaler to feature transformation

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-7514: -- Summary: Add MinMaxScaler to feature transformation (was: Add MinMaxNormalizer to feature

[jira] [Commented] (SPARK-7514) Add MinMaxNormalizer to feature transformation

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537527#comment-14537527 ] yuhao yang commented on SPARK-7514: --- Hi Joseph, that's a good idea. I did a simple google:

[jira] [Commented] (SPARK-7496) Update Programming guide with Online LDA

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537482#comment-14537482 ] yuhao yang commented on SPARK-7496: --- Thanks Joseph. PR sent. Update Programming guide

[jira] [Created] (SPARK-7514) Add MinMaxNormalizer to feature transformation

2015-05-10 Thread yuhao yang (JIRA)
yuhao yang created SPARK-7514: - Summary: Add MinMaxNormalizer to feature transformation Key: SPARK-7514 URL: https://issues.apache.org/jira/browse/SPARK-7514 Project: Spark Issue Type: New

[jira] [Updated] (SPARK-7514) Add MinMaxNormalizer to feature transformation

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-7514: -- Description: Add a new scaling method to feature component, which is commonly known as min-max

[jira] [Updated] (SPARK-7514) Add MinMaxNormalizer to feature transformation

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-7514: -- Description: Add a popular scaling method to feature component, which is commonly known as min-max

[jira] [Updated] (SPARK-7514) Add MinMaxNormalizer to feature transformation

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-7514: -- Description: Add a new scaling method to feature component, which is commonly known as min-max

[jira] [Commented] (SPARK-7496) Update Programming guide with Online LDA

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537114#comment-14537114 ] yuhao yang commented on SPARK-7496: --- Hi Joseph, just something I got for your reference,

[jira] [Closed] (SPARK-7090) Introduce LDAOptimizer to LDA to further improve extensibility

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-7090. - Close the jira as code merged. Thanks for the careful review and important fix. Introduce LDAOptimizer to

[jira] [Comment Edited] (SPARK-7368) add QR decomposition for RowMatrix

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537205#comment-14537205 ] yuhao yang edited comment on SPARK-7368 at 5/10/15 2:54 PM: Hi

[jira] [Commented] (SPARK-7368) add QR decomposition for RowMatrix

2015-05-10 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537205#comment-14537205 ] yuhao yang commented on SPARK-7368: --- Hi Zongheng, since the Amplab version is built upon

[jira] [Commented] (SPARK-7455) Perf test for LDA (EM/online)

2015-05-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14543065#comment-14543065 ] yuhao yang commented on SPARK-7455: --- I'll start to work on this. Any help or suggestion

[jira] [Closed] (SPARK-7496) User guide update for Online LDA

2015-05-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-7496. - Doc updated. Thanks for review. User guide update for Online LDA

[jira] [Commented] (SPARK-7455) Perf test for LDA (EM/online)

2015-05-19 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551624#comment-14551624 ] yuhao yang commented on SPARK-7455: --- work in progress

[jira] [Commented] (SPARK-5567) Add prediction methods to LDA

2015-06-07 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14576358#comment-14576358 ] yuhao yang commented on SPARK-5567: --- I guess the major consideration is proper code

[jira] [Commented] (SPARK-5567) Add prediction methods to LDA

2015-06-09 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14578595#comment-14578595 ] yuhao yang commented on SPARK-5567: --- Hi Joseph, just to be clear. If we're using the MAP

[jira] [Commented] (SPARK-8169) Add StopWordsRemover as a transformer

2015-06-09 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14578444#comment-14578444 ] yuhao yang commented on SPARK-8169: --- This looks useful. I'd like to give it a try if no

[jira] [Commented] (SPARK-7541) Check model save/load for MLlib 1.4

2015-06-03 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14570527#comment-14570527 ] yuhao yang commented on SPARK-7541: --- I find no more issues. Check model save/load for

[jira] [Updated] (SPARK-7983) Add require for one-based indices in loadLibSVMFile

2015-06-03 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-7983: -- Priority: Minor (was: Trivial) Add require for one-based indices in loadLibSVMFile

[jira] [Created] (SPARK-8531) Update ML user guide for MinMaxScaler

2015-06-22 Thread yuhao yang (JIRA)
yuhao yang created SPARK-8531: - Summary: Update ML user guide for MinMaxScaler Key: SPARK-8531 URL: https://issues.apache.org/jira/browse/SPARK-8531 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-8529) Set metadata for MinMaxScaler

2015-06-22 Thread yuhao yang (JIRA)
yuhao yang created SPARK-8529: - Summary: Set metadata for MinMaxScaler Key: SPARK-8529 URL: https://issues.apache.org/jira/browse/SPARK-8529 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-8530) Add Python API for MinMaxScaler

2015-06-22 Thread yuhao yang (JIRA)
yuhao yang created SPARK-8530: - Summary: Add Python API for MinMaxScaler Key: SPARK-8530 URL: https://issues.apache.org/jira/browse/SPARK-8530 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-8547) xgboost exploration

2015-06-22 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14597012#comment-14597012 ] yuhao yang commented on SPARK-8547: --- This is definitely useful with many potential

[jira] [Created] (SPARK-8555) Online Variational Inference for the Hierarchical Dirichlet Process

2015-06-23 Thread yuhao yang (JIRA)
yuhao yang created SPARK-8555: - Summary: Online Variational Inference for the Hierarchical Dirichlet Process Key: SPARK-8555 URL: https://issues.apache.org/jira/browse/SPARK-8555 Project: Spark

[jira] [Updated] (SPARK-8555) Online Variational Inference for the Hierarchical Dirichlet Process

2015-06-23 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-8555: -- Issue Type: New Feature (was: Bug) Online Variational Inference for the Hierarchical Dirichlet

[jira] [Commented] (SPARK-8555) Online Variational Inference for the Hierarchical Dirichlet Process

2015-06-23 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14597237#comment-14597237 ] yuhao yang commented on SPARK-8555: --- A basic implementation on

[jira] [Created] (SPARK-8308) add missing save load for python doc example and tune down MatrixFactorization iterations

2015-06-11 Thread yuhao yang (JIRA)
yuhao yang created SPARK-8308: - Summary: add missing save load for python doc example and tune down MatrixFactorization iterations Key: SPARK-8308 URL: https://issues.apache.org/jira/browse/SPARK-8308

[jira] [Comment Edited] (SPARK-7541) Check model save/load for MLlib 1.4

2015-05-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564263#comment-14564263 ] yuhao yang edited comment on SPARK-7541 at 5/29/15 6:40 AM:

[jira] [Comment Edited] (SPARK-7541) Check model save/load for MLlib 1.4

2015-05-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564296#comment-14564296 ] yuhao yang edited comment on SPARK-7541 at 5/29/15 7:14 AM:

[jira] [Commented] (SPARK-7541) Check model save/load for MLlib 1.4

2015-05-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564263#comment-14564263 ] yuhao yang commented on SPARK-7541: --- ||model||Scala UT ||

[jira] [Commented] (SPARK-7541) Check model save/load for MLlib 1.4

2015-05-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564296#comment-14564296 ] yuhao yang commented on SPARK-7541: --- Oh, checked means I found no python support for

[jira] [Updated] (SPARK-7949) update document with some missing save/load

2015-05-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang updated SPARK-7949: -- Description: As part of 7541, add save load for examples: KMeansModel PowerIterationClusteringModel

[jira] [Created] (SPARK-7949) update document with some missing save/load

2015-05-29 Thread yuhao yang (JIRA)
yuhao yang created SPARK-7949: - Summary: update document with some missing save/load Key: SPARK-7949 URL: https://issues.apache.org/jira/browse/SPARK-7949 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-7983) Add require for one-based indices in loadLibSVMFile

2015-05-31 Thread yuhao yang (JIRA)
yuhao yang created SPARK-7983: - Summary: Add require for one-based indices in loadLibSVMFile Key: SPARK-7983 URL: https://issues.apache.org/jira/browse/SPARK-7983 Project: Spark Issue Type:

[jira] [Commented] (SPARK-7541) Check model save/load for MLlib 1.4

2015-06-02 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14568998#comment-14568998 ] yuhao yang commented on SPARK-7541: --- Oh, I haven't checked through all the examples in

[jira] [Comment Edited] (SPARK-7541) Check model save/load for MLlib 1.4

2015-06-02 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14568998#comment-14568998 ] yuhao yang edited comment on SPARK-7541 at 6/2/15 11:53 AM:

[jira] [Comment Edited] (SPARK-7541) Check model save/load for MLlib 1.4

2015-06-02 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14568998#comment-14568998 ] yuhao yang edited comment on SPARK-7541 at 6/2/15 11:56 AM:

[jira] [Created] (SPARK-8043) update NaiveBayes and SVM examples in doc

2015-06-02 Thread yuhao yang (JIRA)
yuhao yang created SPARK-8043: - Summary: update NaiveBayes and SVM examples in doc Key: SPARK-8043 URL: https://issues.apache.org/jira/browse/SPARK-8043 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-7949) update document with some missing save/load

2015-06-01 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14568339#comment-14568339 ] yuhao yang commented on SPARK-7949: --- Oh thanks, I thought we should close jira when the

[jira] [Closed] (SPARK-7949) update document with some missing save/load

2015-06-01 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-7949. - update document with some missing save/load ---

[jira] [Commented] (SPARK-8744) StringIndexerModel should have public constructor

2015-07-01 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14610558#comment-14610558 ] yuhao yang commented on SPARK-8744: --- Just a reminder: There seem to be more jobs to do

[jira] [Comment Edited] (SPARK-8744) StringIndexerModel should have public constructor

2015-07-01 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14610558#comment-14610558 ] yuhao yang edited comment on SPARK-8744 at 7/1/15 4:10 PM: ---

[jira] [Commented] (SPARK-8703) Add CountVectorizer as a ml transformer to convert document to words count vector

2015-07-01 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14609703#comment-14609703 ] yuhao yang commented on SPARK-8703: --- Thanks Joseph. It's true that CountVectorizer and

[jira] [Created] (SPARK-8703) Add CountVectorizer as a ml transformer to convert document to words count vector

2015-06-29 Thread yuhao yang (JIRA)
yuhao yang created SPARK-8703: - Summary: Add CountVectorizer as a ml transformer to convert document to words count vector Key: SPARK-8703 URL: https://issues.apache.org/jira/browse/SPARK-8703 Project:
