[jira] Issue Comment Edited: (MAHOUT-323) Classify new data using Decision Forest

2010-05-04 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12853185#action_12853185 ] Deneche A. Hakim edited comment on MAHOUT-323 at 5/4/10 11:25 PM: ---

[jira] Commented: (MAHOUT-323) Classify new data using Decision Forest

2010-04-03 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12853185#action_12853185 ] Deneche A. Hakim commented on MAHOUT-323: - committed a basic mapreduce version of T

[jira] Commented: (MAHOUT-323) Classify new data using Decision Forest

2010-03-26 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12850480#action_12850480 ] Deneche A. Hakim commented on MAHOUT-323: - committed the ability to classify a dire

[jira] Commented: (MAHOUT-323) Classify new data using Decision Forest

2010-03-13 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12844928#action_12844928 ] Deneche A. Hakim commented on MAHOUT-323: - updated [Wiki page|http://cwiki.apache.o

[jira] Commented: (MAHOUT-323) Classify new data using Decision Forest

2010-03-13 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12844858#action_12844858 ] Deneche A. Hakim commented on MAHOUT-323: - just committed the patch. Should work (h

[jira] Commented: (MAHOUT-323) Classify new data using Decision Forest

2010-03-07 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12842482#action_12842482 ] Deneche A. Hakim commented on MAHOUT-323: - yep > Classify new data using Decisio

[jira] Updated: (MAHOUT-323) Classify new data using Decision Forest

2010-03-07 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-323: Attachment: mahout-323.patch here is the patch =P should commit it as soon as possible, alo

[jira] Updated: (MAHOUT-323) Classify new data using Decision Forest

2010-03-06 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-323: Status: Patch Available (was: Open) working patch. Waiting for the end of code freeze to c

[jira] Created: (MAHOUT-323) Classify new data using Decision Forest

2010-03-05 Thread Deneche A. Hakim (JIRA)
Classify new data using Decision Forest --- Key: MAHOUT-323 URL: https://issues.apache.org/jira/browse/MAHOUT-323 Project: Mahout Issue Type: Improvement Components: Classification Affects Ve

[jira] Updated: (MAHOUT-245) Better handling of Categorical attributes when building Decision Forests

2010-01-14 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-245: Status: Patch Available (was: Open) > Better handling of Categorical attributes when build

[jira] Commented: (MAHOUT-245) Better handling of Categorical attributes when building Decision Forests

2010-01-14 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12800156#action_12800156 ] Deneche A. Hakim commented on MAHOUT-245: - I modified the code to not select Catego

[jira] Updated: (MAHOUT-245) Better handling of Categorical attributes when building Decision Forests

2010-01-14 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-245: Attachment: mahout-245.patch > Better handling of Categorical attributes when building Deci

[jira] Created: (MAHOUT-245) Better handling of Categorical attributes when building Decision Forests

2010-01-14 Thread Deneche A. Hakim (JIRA)
Better handling of Categorical attributes when building Decision Forests Key: MAHOUT-245 URL: https://issues.apache.org/jira/browse/MAHOUT-245 Project: Mahout Issue Typ

[jira] Resolved: (MAHOUT-216) Improve the results of MAHOUT-145 by uniformly distributing the classes in the partitioned data

2010-01-14 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim resolved MAHOUT-216. - Resolution: Fixed Done. > Improve the results of MAHOUT-145 by uniformly distributing th

[jira] Issue Comment Edited: (MAHOUT-216) Improve the results of MAHOUT-145 by uniformly distributing the classes in the partitioned data

2010-01-06 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797544#action_12797544 ] Deneche A. Hakim edited comment on MAHOUT-216 at 1/7/10 7:30 AM:

[jira] Commented: (MAHOUT-216) Improve the results of MAHOUT-145 by uniformly distributing the classes in the partitioned data

2010-01-06 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797544#action_12797544 ] Deneche A. Hakim commented on MAHOUT-216: - Here are some results on a 5 slave ec2 c

[jira] Issue Comment Edited: (MAHOUT-216) Improve the results of MAHOUT-145 by uniformly distributing the classes in the partitioned data

2009-12-10 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12788902#action_12788902 ] Deneche A. Hakim edited comment on MAHOUT-216 at 12/10/09 8:32 PM: --

[jira] Commented: (MAHOUT-216) Improve the results of MAHOUT-145 by uniformly distributing the classes in the partitioned data

2009-12-10 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12788902#action_12788902 ] Deneche A. Hakim commented on MAHOUT-216: - the next step is to implement a tool tha

[jira] Commented: (MAHOUT-216) Improve the results of MAHOUT-145 by uniformly distributing the classes in the partitioned data

2009-12-10 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12788878#action_12788878 ] Deneche A. Hakim commented on MAHOUT-216: - First of all I implemented a simple mapr

[jira] Created: (MAHOUT-216) Improve the results of MAHOUT-145 by uniformly distributing the classes in the partitioned data

2009-12-10 Thread Deneche A. Hakim (JIRA)
Improve the results of MAHOUT-145 by uniformly distributing the classes in the partitioned data --- Key: MAHOUT-216 URL: https://issues.apache.org/jira/browse/MAHOUT-216

[jira] Updated: (MAHOUT-177) Fix for "java.lang.ClassNotFoundException Exception"

2009-10-17 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-177: Attachment: cnf_dirichlet_fix.patch modified DirichletMapper in order to avoid the followin

[jira] Updated: (MAHOUT-113) CDInfosToolTest.testGatherInfos failure in Mahout examples

2009-10-11 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-113: Resolution: Fixed Fix Version/s: 0.2 Status: Resolved (was: Patch Availab

[jira] Assigned: (MAHOUT-113) CDInfosToolTest.testGatherInfos failure in Mahout examples

2009-10-10 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim reassigned MAHOUT-113: --- Assignee: Deneche A. Hakim > CDInfosToolTest.testGatherInfos failure in Mahout exampl

[jira] Updated: (MAHOUT-113) CDInfosToolTest.testGatherInfos failure in Mahout examples

2009-10-10 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-113: Attachment: mahout113-patch-update.diff updated the patch to the current trunk > CDInfosTo

[jira] Updated: (MAHOUT-177) Fix for "java.lang.ClassNotFoundException Exception"

2009-10-05 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-177: Resolution: Fixed Assignee: Deneche A. Hakim Status: Resolved (was: Patch Ava

[jira] Commented: (MAHOUT-184) Code tweaks for .df.* code

2009-10-04 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12762089#action_12762089 ] Deneche A. Hakim commented on MAHOUT-184: - .df* and .ga.* changes looks good to me

[jira] Issue Comment Edited: (MAHOUT-184) Code tweaks for .df.* code

2009-10-02 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761819#action_12761819 ] Deneche A. Hakim edited comment on MAHOUT-184 at 10/2/09 11:32 PM: --

[jira] Commented: (MAHOUT-184) Code tweaks for .df.* code

2009-10-02 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761819#action_12761819 ] Deneche A. Hakim commented on MAHOUT-184: - I'm having trouble applying the patch, g

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-09-29 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Resolution: Fixed Status: Resolved (was: Patch Available) > PartialData mapreduce

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-09-29 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_Sep_30.patch committed patch > PartialData mapreduce Random Forests >

[jira] Updated: (MAHOUT-140) In-memory mapreduce Random Forests

2009-09-29 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-140: Resolution: Fixed Status: Resolved (was: Patch Available) > In-memory mapreduce Ra

[jira] Updated: (MAHOUT-140) In-memory mapreduce Random Forests

2009-09-29 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-140: Attachment: inmem_Sep29.patch committed patch > In-memory mapreduce Random Forests > -

[jira] Commented: (MAHOUT-122) Random Forests Reference Implementation

2009-09-28 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12760172#action_12760172 ] Deneche A. Hakim commented on MAHOUT-122: - Added a quick start guide in the wiki:

[jira] Updated: (MAHOUT-122) Random Forests Reference Implementation

2009-09-27 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-122: Resolution: Fixed Status: Resolved (was: Patch Available) > Random Forests Referen

[jira] Updated: (MAHOUT-122) Random Forests Reference Implementation

2009-09-26 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-122: Attachment: refimp_Sep27.patch * updated the patch with the latest changes in the trunk (co

[jira] Updated: (MAHOUT-122) Random Forests Reference Implementation

2009-09-16 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-122: Attachment: refimp_Sep_15.patch This patch contains the reference implementations and also

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-09-15 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Status: Patch Available (was: Open) * This patch also includes [MAHOUT-140|https://issues

[jira] Assigned: (MAHOUT-122) Random Forests Reference Implementation

2009-09-15 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim reassigned MAHOUT-122: --- Assignee: Deneche A. Hakim > Random Forests Reference Implementation > --

[jira] Assigned: (MAHOUT-140) In-memory mapreduce Random Forests

2009-09-15 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim reassigned MAHOUT-140: --- Assignee: Deneche A. Hakim > In-memory mapreduce Random Forests > ---

[jira] Assigned: (MAHOUT-145) PartialData mapreduce Random Forests

2009-09-15 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim reassigned MAHOUT-145: --- Assignee: Deneche A. Hakim > PartialData mapreduce Random Forests > -

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-09-15 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_Sep_15.patch * DONE: no need to load the whole dataset in memory just t

[jira] Updated: (MAHOUT-177) Fix for "java.lang.ClassNotFoundException Exception"

2009-09-13 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-177: Status: Patch Available (was: Open) > Fix for "java.lang.ClassNotFoundException Exception"

[jira] Updated: (MAHOUT-177) Fix for "java.lang.ClassNotFoundException Exception"

2009-09-13 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-177: Attachment: cnf_fix.patch *important* I tested only the SyntheticControl examples > Fix fo

[jira] Created: (MAHOUT-177) Fix for "java.lang.ClassNotFoundException Exception"

2009-09-13 Thread Deneche A. Hakim (JIRA)
Fix for "java.lang.ClassNotFoundException Exception" Key: MAHOUT-177 URL: https://issues.apache.org/jira/browse/MAHOUT-177 Project: Mahout Issue Type: Bug Affects Versions: 0.2

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-09-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Fix Version/s: 0.2 Affects Version/s: 0.2 * Will be committed as part of [MAHOUT-1

[jira] Updated: (MAHOUT-140) In-memory mapreduce Random Forests

2009-09-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-140: Fix Version/s: 0.2 * Will be committed as part of [MAHOUT-145|https://issues.apache.org/ji

[jira] Updated: (MAHOUT-122) Random Forests Reference Implementation

2009-09-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-122: Fix Version/s: 0.2 This issue will be committed as part of [MAHOUT-145|https://issues.apac

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-09-10 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12753572#action_12753572 ] Deneche A. Hakim commented on MAHOUT-145: - bq. What about using the Yahoo 0.20 dist

[jira] Issue Comment Edited: (MAHOUT-145) PartialData mapreduce Random Forests

2009-09-06 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12751842#action_12751842 ] Deneche A. Hakim edited comment on MAHOUT-145 at 9/6/09 2:52 AM:

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-09-06 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12751842#action_12751842 ] Deneche A. Hakim commented on MAHOUT-145: - bq.* TODO: test the code on a Hadoo

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-31 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_August_31.patch * Corrected some bugs in the new code when testing in a

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-29 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_August_27.patch * DONE: convert the partial implementation tests to Had

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-24 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_August_24.patch * DONE: partial implementation that uses Hadoop 0.20.0

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-19 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_August_19.patch *Preparation for mahout 0.2* * moving to Hadoop 0.20.0

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-17 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_August_17.patch *GSoC latest patch* * DONE: move rf.ref.examples.Breima

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-15 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_August_15.patch *Preparing for GSoC deadline* * DONE: move rf.mapred.x

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-13 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_August_13.patch *Preparing the code for GSoC deadline* * DONE: move rf

[jira] Issue Comment Edited: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12742262#action_12742262 ] Deneche A. Hakim edited comment on MAHOUT-145 at 8/12/09 3:39 AM: ---

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12742274#action_12742274 ] Deneche A. Hakim commented on MAHOUT-145: - KDD 50% || Num Map Tasks || Num Trees ||

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12742262#action_12742262 ] Deneche A. Hakim commented on MAHOUT-145: - KDD 25% || Num Map Tasks || Num Trees ||

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-11 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12742000#action_12742000 ] Deneche A. Hakim commented on MAHOUT-145: - How the Partial Mapred builder works: *

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-11 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12741759#action_12741759 ] Deneche A. Hakim commented on MAHOUT-145: - bq. These are confusing numbers. First,

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-10 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12741470#action_12741470 ] Deneche A. Hakim commented on MAHOUT-145: - Here are some results from a 10 nodes cl

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-10 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_August_10.patch *changes* Partial Implementation has been improved to

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-09 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_August_9.patch * resolved a bug in Partial Implementation This patch i

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-08 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12740931#action_12740931 ] Deneche A. Hakim commented on MAHOUT-145: - bq. have demonstrated that partitioning

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-08 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12740876#action_12740876 ] Deneche A. Hakim commented on MAHOUT-145: - Ok here what I did: * Load KDD 10% * pa

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-08 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12740872#action_12740872 ] Deneche A. Hakim commented on MAHOUT-145: - as expected I found a bug and removed it

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-05 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12739910#action_12739910 ] Deneche A. Hakim commented on MAHOUT-145: - bq. What really bugs me is that it is wo

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-05 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12739584#action_12739584 ] Deneche A. Hakim commented on MAHOUT-145: - more tests on my laptop: KDD 10% || Num

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-05 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12739386#action_12739386 ] Deneche A. Hakim commented on MAHOUT-145: - I'm running some tests to compare betwee

[jira] Updated: (MAHOUT-145) PartialData mapreduce Random Forests

2009-08-02 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-145: Attachment: partial_August_2.patch partial-mapred implementation *changes* * abstract cla

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-07-27 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12735703#action_12735703 ] Deneche A. Hakim commented on MAHOUT-145: - to be able to predict the class of an ou

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-07-19 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12732990#action_12732990 ] Deneche A. Hakim commented on MAHOUT-145: - In the partial implementation, the input

[jira] Updated: (MAHOUT-140) In-memory mapreduce Random Forests

2009-07-19 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-140: Attachment: inmem_July19_patch.diff *Changes* * InMemBuilder can now use a seed to be repe

[jira] Issue Comment Edited: (MAHOUT-140) In-memory mapreduce Random Forests

2009-07-18 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12732922#action_12732922 ] Deneche A. Hakim edited comment on MAHOUT-140 at 7/18/09 11:20 AM: --

[jira] Commented: (MAHOUT-140) In-memory mapreduce Random Forests

2009-07-18 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12732922#action_12732922 ] Deneche A. Hakim commented on MAHOUT-140: - * First of all I implemented the *in-mem

[jira] Issue Comment Edited: (MAHOUT-143) Refactor Hadoop deprecations

2009-07-13 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12730297#action_12730297 ] Deneche A. Hakim edited comment on MAHOUT-143 at 7/13/09 9:53 AM: ---

[jira] Commented: (MAHOUT-143) Refactor Hadoop deprecations

2009-07-13 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12730297#action_12730297 ] Deneche A. Hakim commented on MAHOUT-143: - I have two questions: * This is related

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-07-13 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12730294#action_12730294 ] Deneche A. Hakim commented on MAHOUT-145: - bq. What do you think about using a norm

[jira] Commented: (MAHOUT-140) In-memory mapreduce Random Forests

2009-07-13 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12730292#action_12730292 ] Deneche A. Hakim commented on MAHOUT-140: - bq. But these numbers don't seem to show

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

2009-07-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12730065#action_12730065 ] Deneche A. Hakim commented on MAHOUT-145: - A possible implementation is as follows:

[jira] Created: (MAHOUT-145) PartialData mapreduce Random Forests

2009-07-12 Thread Deneche A. Hakim (JIRA)
PartialData mapreduce Random Forests Key: MAHOUT-145 URL: https://issues.apache.org/jira/browse/MAHOUT-145 Project: Mahout Issue Type: New Feature Components: Classification Reporter

[jira] Updated: (MAHOUT-140) In-memory mapreduce Random Forests

2009-07-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-140: Attachment: mapred_jul12.diff *Changes* * The oob error estimation has been rewritten to be

[jira] Updated: (MAHOUT-122) Random Forests Reference Implementation

2009-07-07 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-122: Attachment: refimp_Jul7.diff I did some tests on the "poker hand" dataset from UCI, it cont

[jira] Commented: (MAHOUT-122) Random Forests Reference Implementation

2009-07-07 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12728034#action_12728034 ] Deneche A. Hakim commented on MAHOUT-122: - I forgot to mention that I used Kdd50% i

[jira] Updated: (MAHOUT-122) Random Forests Reference Implementation

2009-07-06 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-122: Attachment: refimp_Jul6.diff *Optimization patch* Just the reference implementation, does n

[jira] Updated: (MAHOUT-140) In-memory mapreduce Random Forests

2009-06-29 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-140: Attachment: mapred_patch.diff org.apache.mahout.rf.mapred To make it simple, MAHOUT-122 (r

[jira] Updated: (MAHOUT-140) In-memory mapreduce Random Forests

2009-06-29 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-140: Status: Patch Available (was: Open) Work in progress... A working implementation, I teste

[jira] Created: (MAHOUT-140) In-memory mapreduce Random Forests

2009-06-29 Thread Deneche A. Hakim (JIRA)
In-memory mapreduce Random Forests -- Key: MAHOUT-140 URL: https://issues.apache.org/jira/browse/MAHOUT-140 Project: Mahout Issue Type: New Feature Components: Classification Affects Versions: 0.

[jira] Updated: (MAHOUT-122) Random Forests Reference Implementation

2009-06-29 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-122: Remaining Estimate: (was: 25h) Original Estimate: (was: 25h) > Random Forests

[jira] Commented: (MAHOUT-122) Random Forests Reference Implementation

2009-06-29 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12725125#action_12725125 ] Deneche A. Hakim commented on MAHOUT-122: - bq. The 450 byte overhead per training i

[jira] Issue Comment Edited: (MAHOUT-122) Random Forests Reference Implementation

2009-06-17 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718771#action_12718771 ] Deneche A. Hakim edited comment on MAHOUT-122 at 6/17/09 2:57 AM: ---

[jira] Updated: (MAHOUT-122) Random Forests Reference Implementation

2009-06-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-122: Attachment: 3w_patch.diff *3rd Week Patch* work in progress... *Changes* * ForestBuilder

[jira] Commented: (MAHOUT-122) Random Forests Reference Implementation

2009-06-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718777#action_12718777 ] Deneche A. Hakim commented on MAHOUT-122: - I did some tests on some of the datasets

[jira] Issue Comment Edited: (MAHOUT-122) Random Forests Reference Implementation

2009-06-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718771#action_12718771 ] Deneche A. Hakim edited comment on MAHOUT-122 at 6/12/09 2:44 AM: ---

[jira] Commented: (MAHOUT-122) Random Forests Reference Implementation

2009-06-12 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718771#action_12718771 ] Deneche A. Hakim commented on MAHOUT-122: - I was wrong about the memory usage of th

[jira] Updated: (MAHOUT-113) CDInfosToolTest.testGatherInfos failure in Mahout examples

2009-06-08 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-113: Attachment: mahout113-patch.diff I think I found the problem, it was caused by a bad random

[jira] Updated: (MAHOUT-113) CDInfosToolTest.testGatherInfos failure in Mahout examples

2009-06-08 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated MAHOUT-113: Affects Version/s: (was: 0.1) Status: Patch Available (was: Open) > CDI

[jira] Commented: (MAHOUT-122) Random Forests Reference Implementation

2009-06-07 Thread Deneche A. Hakim (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12717024#action_12717024 ] Deneche A. Hakim commented on MAHOUT-122: - I've been reading Breiman's paper about

  1   2   >