[jira] [Commented] (MAHOUT-940) Clusterdumper - Get rid of map based implementation

2012-04-18 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257303#comment-13257303 ] Paritosh Ranjan commented on MAHOUT-940: I think the best way would be to w

[jira] [Commented] (MAHOUT-940) Clusterdumper - Get rid of map based implementation

2012-04-18 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13257279#comment-13257279 ] Saikat Kanjilal commented on MAHOUT-940: Ok, sorry about the delayed resp

[jira] [Commented] (MAHOUT-940) Clusterdumper - Get rid of map based implementation

2012-04-09 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13249859#comment-13249859 ] Paritosh Ranjan commented on MAHOUT-940: Clustering a large dataset and

[jira] [Commented] (MAHOUT-999) KMeans failing to create correct Clustering Policy

2012-04-09 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13249855#comment-13249855 ] Paritosh Ranjan commented on MAHOUT-999: Will try to fix it soon. Running re

[jira] [Commented] (MAHOUT-940) Clusterdumper - Get rid of map based implementation

2012-04-08 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13249578#comment-13249578 ] Saikat Kanjilal commented on MAHOUT-940: Paritosh, Having some time to wor

[jira] [Commented] (MAHOUT-973) SparseVectorsFromSequenceFiles will not create a proper TFIDF (bug in TFIDFPartialVectorReducer)

2012-04-06 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13248501#comment-13248501 ] Hudson commented on MAHOUT-973: --- Integrated in Mahout-Quality #1427 (See [h

[jira] [Commented] (MAHOUT-973) SparseVectorsFromSequenceFiles will not create a proper TFIDF (bug in TFIDFPartialVectorReducer)

2012-04-06 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13248382#comment-13248382 ] Hudson commented on MAHOUT-973: --- Integrated in Mahout-Quality #1426 (See [h

[jira] [Commented] (MAHOUT-940) Clusterdumper - Get rid of map based implementation

2012-04-04 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13247008#comment-13247008 ] Saikat Kanjilal commented on MAHOUT-940: Ha that's funny, ok time to kee

[jira] [Commented] (MAHOUT-940) Clusterdumper - Get rid of map based implementation

2012-04-04 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13247007#comment-13247007 ] Paritosh Ranjan commented on MAHOUT-940: Even I am not familiar with this

[jira] [Commented] (MAHOUT-940) Clusterdumper - Get rid of map based implementation

2012-04-04 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13247003#comment-13247003 ] Saikat Kanjilal commented on MAHOUT-940: Still reading code to get a de

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-04-03 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13245925#comment-13245925 ] jirapos...@reviews.apache.org commented on MAHOUT

[jira] [Commented] (MAHOUT-990) Convert Dirichlet buildClusters to use new ClusterIterator

2012-04-03 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13245594#comment-13245594 ] Paritosh Ranjan commented on MAHOUT-990: I am a bit confused on whether

[jira] [Commented] (MAHOUT-990) Convert Dirichlet buildClusters to use new ClusterIterator

2012-04-03 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13245590#comment-13245590 ] jirapos...@reviews.apache.org commented on MAHOUT

[jira] [Commented] (MAHOUT-940) Clusterdumper - Get rid of map based implementation

2012-04-02 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13244952#comment-13244952 ] Paritosh Ranjan commented on MAHOUT-940: 1) yes 2) It might be a good idea t

[jira] [Commented] (MAHOUT-940) Clusterdumper - Get rid of map based implementation

2012-04-02 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13244943#comment-13244943 ] Saikat Kanjilal commented on MAHOUT-940: So after researching this some mor

[jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator

2012-04-01 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243683#comment-13243683 ] Hudson commented on MAHOUT-988: --- Integrated in Mahout-Quality #1420 (See [h

[jira] [Commented] (MAHOUT-989) Convert fuzzy-K-means buildClusters to use new ClusterIterator

2012-04-01 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243684#comment-13243684 ] Hudson commented on MAHOUT-989: --- Integrated in Mahout-Quality #1420 (See [h

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-31 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243290#comment-13243290 ] Hudson commented on MAHOUT-984: --- Integrated in Mahout-Quality #1418 (See [h

[jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator

2012-03-31 Thread Jeff Eastman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243282#comment-13243282 ] Jeff Eastman commented on MAHOUT-988: - It will be very interesting to compare

[jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator

2012-03-31 Thread Jeff Eastman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243277#comment-13243277 ] Jeff Eastman commented on MAHOUT-988: - +1 Huge code reduction, eh? This is just

[jira] [Commented] (MAHOUT-989) Convert fuzzy-K-means buildClusters to use new ClusterIterator

2012-03-31 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243253#comment-13243253 ] jirapos...@reviews.apache.org commented on MAHOUT

[jira] [Commented] (MAHOUT-940) Clusterdumper - Get rid of map based implementation

2012-03-31 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243165#comment-13243165 ] Saikat Kanjilal commented on MAHOUT-940: Paritosh, I'm assuming OOM mea

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-31 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243163#comment-13243163 ] Saikat Kanjilal commented on MAHOUT-984: Paritosh, Thanks for the update, s

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-31 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243083#comment-13243083 ] Paritosh Ranjan commented on MAHOUT-984: Saikat, I am picking this up now sin

[jira] [Commented] (MAHOUT-997) Make splitData smart enough to not consider a CSV header to be part of the data

2012-03-30 Thread Lance Norskog (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13242903#comment-13242903 ] Lance Norskog commented on MAHOUT-997: -- This is a general problem, not a split

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-30 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13242585#comment-13242585 ] Paritosh Ranjan commented on MAHOUT-984: Yes, you can debug that code in ecl

[jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator

2012-03-30 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13242569#comment-13242569 ] Paritosh Ranjan commented on MAHOUT-988: Jeff, since this is first case of u

[jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator

2012-03-30 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13242567#comment-13242567 ] jirapos...@reviews.apache.org commented on MAHOUT

[jira] [Commented] (MAHOUT-810) Create EnsembleRecommender

2012-03-28 Thread Emmanouil Amolochitis (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240346#comment-13240346 ] Emmanouil Amolochitis commented on MAHOUT-810: -- Mr. Dogan, are you s

[jira] [Commented] (MAHOUT-976) Implement Multilayer Perceptron

2012-03-27 Thread Christian Herta (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240231#comment-13240231 ] Christian Herta commented on MAHOUT-976: Until now I have implemented

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-27 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240213#comment-13240213 ] Saikat Kanjilal commented on MAHOUT-984: Paritosh, Some updates for you, fin

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-27 Thread tom pierce (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240066#comment-13240066 ] tom pierce commented on MAHOUT-994: --- You bet - happy to re

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-27 Thread Dmitriy Lyubimov (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240045#comment-13240045 ] Dmitriy Lyubimov commented on MAHOUT-994: - @Tom, perhaps you could be a

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-27 Thread Dmitriy Lyubimov (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240040#comment-13240040 ] Dmitriy Lyubimov commented on MAHOUT-994: - Sounds good to me. The only con

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-27 Thread Dmitriy Lyubimov (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240042#comment-13240042 ] Dmitriy Lyubimov commented on MAHOUT-994: - PS review request = i mea

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-27 Thread Roman Shaposhnik (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240031#comment-13240031 ] Roman Shaposhnik commented on MAHOUT-994: - @Dmitriy bq. I am not sure we

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-27 Thread tom pierce (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240015#comment-13240015 ] tom pierce commented on MAHOUT-994: --- It would make a lot of sense to me to unify

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-23 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13237086#comment-13237086 ] Hudson commented on MAHOUT-991: --- Integrated in Mahout-Quality #1408 (See [h

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-23 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13236761#comment-13236761 ] Saikat Kanjilal commented on MAHOUT-984: Great will do for sure, just

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-23 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13236756#comment-13236756 ] Paritosh Ranjan commented on MAHOUT-991: Jeff, thanks for reviewin

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-23 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13236752#comment-13236752 ] Paritosh Ranjan commented on MAHOUT-984: Saikat, I am expecting a patch from

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-23 Thread Jeff Eastman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13236714#comment-13236714 ] Jeff Eastman commented on MAHOUT-991: - +1 Paritosh, the changes look like what I

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-23 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13236640#comment-13236640 ] Saikat Kanjilal commented on MAHOUT-991: Paritosh, Did you already get

[jira] [Commented] (MAHOUT-504) Kmeans clustering error

2012-03-23 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13236517#comment-13236517 ] Paritosh Ranjan commented on MAHOUT-504: The Examples Cluster Reuter

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-22 Thread Lance Norskog (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13236249#comment-13236249 ] Lance Norskog commented on MAHOUT-994: -- Don't the job jars pack up Hadoop

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-22 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235983#comment-13235983 ] Paritosh Ranjan commented on MAHOUT-991: All junit tests run successfully. I

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-22 Thread Shannon Quinn (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235936#comment-13235936 ] Shannon Quinn commented on MAHOUT-991: -- Yes! That's correct. Sorry for the

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-22 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235930#comment-13235930 ] Paritosh Ranjan commented on MAHOUT-991: SpectralKMeansDriver is u

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-22 Thread Shannon Quinn (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235924#comment-13235924 ] Shannon Quinn commented on MAHOUT-991: -- I suspect it would be good to make this

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-22 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235920#comment-13235920 ] jirapos...@reviews.apache.org commented on MAHOUT

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-22 Thread Dmitriy Lyubimov (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235891#comment-13235891 ] Dmitriy Lyubimov commented on MAHOUT-994: - I think it's a l

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-22 Thread Dmitriy Lyubimov (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235879#comment-13235879 ] Dmitriy Lyubimov commented on MAHOUT-994: - But you are right, it looks lik

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-22 Thread Dmitriy Lyubimov (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235871#comment-13235871 ] Dmitriy Lyubimov commented on MAHOUT-994: - I am not sure we use hadoop execut

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-22 Thread Roman Shaposhnik (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235814#comment-13235814 ] Roman Shaposhnik commented on MAHOUT-994: - There's no need to add $HA

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-22 Thread Dmitriy Lyubimov (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235796#comment-13235796 ] Dmitriy Lyubimov commented on MAHOUT-994: - looking at the pig code and

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-22 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235775#comment-13235775 ] Paritosh Ranjan commented on MAHOUT-991: I just figured out it will. So,

[jira] [Commented] (MAHOUT-991) Convert Canopy, MeanShift, K-means, Dirichlet, Fuzzy KMeans and Other Tools to emit ClusterWritable

2012-03-22 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235730#comment-13235730 ] Paritosh Ranjan commented on MAHOUT-991: I am successful in converting all ex

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-22 Thread Roman Shaposhnik (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235684#comment-13235684 ] Roman Shaposhnik commented on MAHOUT-994: - Sure. In fact, let me use Pig

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-22 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235423#comment-13235423 ] Paritosh Ranjan commented on MAHOUT-984: Debugging the issue might help you

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-21 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235364#comment-13235364 ] Saikat Kanjilal commented on MAHOUT-984: Paritosh, I'm running into

[jira] [Commented] (MAHOUT-994) mahout script shouldn't rely on HADOOP_HOME since that was deprecated in all major Hadoop branches

2012-03-21 Thread Dmitriy Lyubimov (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13234521#comment-13234521 ] Dmitriy Lyubimov commented on MAHOUT-994: - What it should be relied on in

[jira] [Commented] (MAHOUT-981) Refactor KMeans Clustering into a separate post process with outlier pruning

2012-03-21 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13234184#comment-13234184 ] Hudson commented on MAHOUT-981: --- Integrated in Mahout-Quality #1405 (See [h

[jira] [Commented] (MAHOUT-716) Implement Boosting

2012-03-20 Thread Hector Yee (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13234013#comment-13234013 ] Hector Yee commented on MAHOUT-716: --- Thanks for the review Isabel. The git used t

[jira] [Commented] (MAHOUT-716) Implement Boosting

2012-03-20 Thread Isabel Drost (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13233818#comment-13233818 ] Isabel Drost commented on MAHOUT-716: - After not much activity - took a brief loo

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-19 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13232997#comment-13232997 ] Saikat Kanjilal commented on MAHOUT-984: Never mind, figured it out, wil

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-18 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13232455#comment-13232455 ] Saikat Kanjilal commented on MAHOUT-984: Paritosh, I am in the middle of

[jira] [Commented] (MAHOUT-981) Refactor KMeans Clustering into a separate post process with outlier pruning

2012-03-18 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13232269#comment-13232269 ] Hudson commented on MAHOUT-981: --- Integrated in Mahout-Quality #1401 (See [h

[jira] [Commented] (MAHOUT-981) Refactor KMeans Clustering into a separate post process with outlier pruning

2012-03-17 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231913#comment-13231913 ] Hudson commented on MAHOUT-981: --- Integrated in Mahout-Quality #1399 (See [h

[jira] [Commented] (MAHOUT-983) Refactor Dirichlet Clustering into a separate post process with outlier pruning

2012-03-16 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231767#comment-13231767 ] Hudson commented on MAHOUT-983: --- Integrated in Mahout-Quality #1398 (See [h

[jira] [Commented] (MAHOUT-981) Refactor KMeans Clustering into a separate post process with outlier pruning

2012-03-16 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231766#comment-13231766 ] Hudson commented on MAHOUT-981: --- Integrated in Mahout-Quality #1398 (See [h

[jira] [Commented] (MAHOUT-981) Refactor KMeans Clustering into a separate post process with outlier pruning

2012-03-16 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231544#comment-13231544 ] Hudson commented on MAHOUT-981: --- Integrated in Mahout-Quality #1397 (See [h

[jira] [Commented] (MAHOUT-983) Refactor Dirichlet Clustering into a separate post process with outlier pruning

2012-03-16 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231545#comment-13231545 ] Hudson commented on MAHOUT-983: --- Integrated in Mahout-Quality #1397 (See [h

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-16 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231414#comment-13231414 ] Saikat Kanjilal commented on MAHOUT-984: Thanks will do and will upload p

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-16 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231406#comment-13231406 ] Paritosh Ranjan commented on MAHOUT-984: The code has been committed now, so

[jira] [Commented] (MAHOUT-944) LuceneIndexToSequenceFiles (lucene2seq) utility

2012-03-16 Thread Frank Scholten (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231038#comment-13231038 ] Frank Scholten commented on MAHOUT-944: --- Grant: do you have some time to re

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-14 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13229904#comment-13229904 ] Saikat Kanjilal commented on MAHOUT-984: so I went through the patch appl

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-14 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13229886#comment-13229886 ] Paritosh Ranjan commented on MAHOUT-984: You can apply the patch on the t

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-14 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13229863#comment-13229863 ] Saikat Kanjilal commented on MAHOUT-984: Paritosh, I'm ready to start w

[jira] [Commented] (MAHOUT-988) Convert K-means buildClusters to use new ClusterIterator

2012-03-14 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13229840#comment-13229840 ] Paritosh Ranjan commented on MAHOUT-988: Jeff, I would like to work on this i

[jira] [Commented] (MAHOUT-993) Some vector dumper flags are expecting arguments.

2012-03-14 Thread Jake Mannix (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13229731#comment-13229731 ] Jake Mannix commented on MAHOUT-993: Yeah, I think I've always done "

[jira] [Commented] (MAHOUT-983) Refactor Dirichlet Clustering into a separate post process with outlier pruning

2012-03-14 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13229214#comment-13229214 ] Paritosh Ranjan commented on MAHOUT-983: The patch is also uploaded on the re

[jira] [Commented] (MAHOUT-981) Refactor KMeans Clustering into a separate post process with outlier pruning

2012-03-14 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13229213#comment-13229213 ] Paritosh Ranjan commented on MAHOUT-981: The patch is also uploaded on the re

[jira] [Commented] (MAHOUT-822) Mahout needs to be made compatible with Hadoop .23 releases

2012-03-13 Thread tom pierce (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228388#comment-13228388 ] tom pierce commented on MAHOUT-822: --- Unfortunately, I can't duplicate this tes

[jira] [Commented] (MAHOUT-822) Mahout needs to be made compatible with Hadoop .23 releases

2012-03-13 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228353#comment-13228353 ] Hudson commented on MAHOUT-822: --- Integrated in Mahout-Quality #1392 (See [h

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-12 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228248#comment-13228248 ] Paritosh Ranjan commented on MAHOUT-984: This refactoring is for cluster

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-12 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228246#comment-13228246 ] Paritosh Ranjan commented on MAHOUT-984: 1) Yes 2) CCD takes a ccThreshold

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-12 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228238#comment-13228238 ] Saikat Kanjilal commented on MAHOUT-984: Paritosh, I've read th

[jira] [Commented] (MAHOUT-984) Refactor Fuzzy K Means Clustering into a separate post process with outlier pruning

2012-03-12 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228179#comment-13228179 ] Saikat Kanjilal commented on MAHOUT-984: I will start researching this i

[jira] [Commented] (MAHOUT-981) Refactor KMeans Clustering into a separate post process with outlier pruning

2012-03-12 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228176#comment-13228176 ] Paritosh Ranjan commented on MAHOUT-981: Since I have already started this i

[jira] [Commented] (MAHOUT-981) Refactor KMeans Clustering into a separate post process with outlier pruning

2012-03-12 Thread Saikat Kanjilal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228156#comment-13228156 ] Saikat Kanjilal commented on MAHOUT-981: Paritosh, It looks like you'v

[jira] [Commented] (MAHOUT-992) Audit DistributedCache use to support EMR

2012-03-12 Thread Matteo Riondato (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13227825#comment-13227825 ] Matteo Riondato commented on MAHOUT-992: I can probably look at this soon, o

[jira] [Commented] (MAHOUT-982) Refactor Canopy Clustering into a separate post process with outlier pruning

2012-03-10 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226906#comment-13226906 ] Hudson commented on MAHOUT-982: --- Integrated in Mahout-Quality #1388 (See [h

[jira] [Commented] (MAHOUT-822) Mahout needs to be made compatible with Hadoop .23 releases

2012-03-09 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226548#comment-13226548 ] jirapos...@reviews.apache.org commented on MAHOUT-822: -- bq.

[jira] [Commented] (MAHOUT-933) Implement mapreduce version of ClusterIterator

2012-03-09 Thread Jeff Eastman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226397#comment-13226397 ] Jeff Eastman commented on MAHOUT-933: - r1298625 made the following changes: MA

[jira] [Commented] (MAHOUT-822) Mahout needs to be made compatible with Hadoop .23 releases

2012-03-08 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13225834#comment-13225834 ] jirapos...@reviews.apache.org commented on MAHOUT-822: -- bq.

[jira] [Commented] (MAHOUT-822) Mahout needs to be made compatible with Hadoop .23 releases

2012-03-08 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13225681#comment-13225681 ] jirapos...@reviews.apache.org commented on MAHOUT

[jira] [Commented] (MAHOUT-982) Refactor Canopy Clustering into a separate post process with outlier pruning

2012-03-08 Thread Jeff Eastman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13225326#comment-13225326 ] Jeff Eastman commented on MAHOUT-982: - +1 I like the way the driver was compre

[jira] [Commented] (MAHOUT-982) Refactor Canopy Clustering into a separate post process with outlier pruning

2012-03-08 Thread Paritosh Ranjan (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13225324#comment-13225324 ] Paritosh Ranjan commented on MAHOUT-982: I plan to commit this in a day or

[jira] [Commented] (MAHOUT-982) Refactor Canopy Clustering into a separate post process with outlier pruning

2012-03-08 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13225319#comment-13225319 ] jirapos...@reviews.apache.org commented on MAHOUT

[jira] [Commented] (MAHOUT-987) Our build is unstable - this should reduce our style warnings by >200

2012-03-07 Thread jirapos...@reviews.apache.org (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13224968#comment-13224968 ] jirapos...@reviews.apache.org commented on MAHOUT
