[jira] Updated: (MAHOUT-76) Singular Value Decomposition for SparseMatrix / DenseMatrix

2008-08-20 Thread Allen Day (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-76?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Day updated MAHOUT-76: Attachment: SVD.patch > Singular Value Decomposition for SparseMatrix / DenseMatrix > --

[jira] Created: (MAHOUT-76) Singular Value Decomposition for SparseMatrix / DenseMatrix

2008-08-20 Thread Allen Day (JIRA)
Singular Value Decomposition for SparseMatrix / DenseMatrix --- Key: MAHOUT-76 URL: https://issues.apache.org/jira/browse/MAHOUT-76 Project: Mahout Issue Type: New Feature Com

[jira] Updated: (MAHOUT-72) Separate out Examples from Core

2008-08-20 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-72?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wettin updated MAHOUT-72: -- Fix Version/s: 0.1 > Separate out Examples from Core > --- > >

[jira] Updated: (MAHOUT-75) asFormatString tests fail

2008-08-20 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-75?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wettin updated MAHOUT-75: -- Attachment: MAHOUT-75.txt This patch sorts the features of a SparseVector and SparseMatrix by their col

[jira] Created: (MAHOUT-75) asFormatString tests fail

2008-08-20 Thread Karl Wettin (JIRA)
asFormatString tests fail - Key: MAHOUT-75 URL: https://issues.apache.org/jira/browse/MAHOUT-75 Project: Mahout Issue Type: Bug Components: Matrix Reporter: Karl Wettin Assignee: Karl

Reminder: Monthly Hadoop User Group Meeting (Bay Area) today

2008-08-20 Thread Ajay Anand
Reminder: The next Hadoop User Group (Bay Area) meeting is scheduled for today, Wednesday, Aug 20th from 6 - 7:30 pm at Yahoo! Mission College, Santa Clara, CA, Building 1, Training Rooms 3&4. Agenda: Pig Update: Olga Natkovich Hadoop 0.18 and post 0.18 - Sameer Paranjpye Registration and

Re: aprior algorithm in MR

2008-08-20 Thread Ted Dunning
Also, I should point out that the rows (or columns) of the association matrix are already an interesting source of more than pairwise groups. In some applications, these row based groups are even more interesting than the transitive closures provided by the other clustering methods I mentioned. O

Re: aprior algorithm in MR

2008-08-20 Thread Ted Dunning
A score like log-likelihood ratio can be used to establish the sparsity pattern for an association matrix whose non-zero elements are a more probabilistically defensible measure of similar occurrence. This matrix can be used for clustering using, inter alia, spectral methods, agglomerative cluster
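
A minimal sketch of the idea above, assuming co-occurrence counts are already available as plain dictionaries. It scores each item pair with a log-likelihood ratio over its 2x2 contingency table and keeps only pairs above a cutoff, which yields the sparsity pattern of the association matrix. The function names, the input layout, and the threshold are illustrative assumptions, not Mahout's API.

```python
import math

def x_log_x(x):
    # x * log(x), with the usual convention that 0 * log(0) = 0
    return x * math.log(x) if x > 0 else 0.0

def entropy(*counts):
    # Unnormalized entropy of a set of counts: N*log(N) - sum(k*log(k))
    return x_log_x(sum(counts)) - sum(x_log_x(k) for k in counts)

def llr(k11, k12, k21, k22):
    # Log-likelihood ratio (G^2) for a 2x2 contingency table:
    #   k11 = both items occur, k12 = only the first, k21 = only the second,
    #   k22 = neither occurs.
    row = entropy(k11 + k12, k21 + k22)
    col = entropy(k11 + k21, k12 + k22)
    mat = entropy(k11, k12, k21, k22)
    # Clamp tiny negative values caused by floating-point rounding.
    return max(0.0, 2.0 * (row + col - mat))

def sparsity_pattern(pair_counts, item_counts, total, threshold):
    # pair_counts: {(a, b): co-occurrence count}   (hypothetical layout)
    # item_counts: {item: occurrence count}, total: number of observations
    kept = {}
    for (a, b), k11 in pair_counts.items():
        k12 = item_counts[a] - k11
        k21 = item_counts[b] - k11
        k22 = total - k11 - k12 - k21
        score = llr(k11, k12, k21, k22)
        if score > threshold:
            kept[(a, b)] = score  # non-zero entry of the association matrix
    return kept
```

A pair that co-occurs exactly as often as independence predicts scores close to zero and is dropped, so only the statistically surprising pairs remain as non-zero entries.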

Re: [jira] Issue Comment Edited: (MAHOUT-69) 0.1 RELEASE TODO

2008-08-20 Thread Grant Ingersoll
From http://wiki.apache.org/lucene-java/ReleaseTodo All Lucene does is: Copy the Maven artifacts to the distribution directory (follow the existing directory structure), to have them pushed to the central Maven repositories: people.apache.org:/www/people.apache.org/repo/m2- ibiblio-rsync-rep

[jira] Issue Comment Edited: (MAHOUT-69) 0.1 RELEASE TODO

2008-08-20 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-69?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12624009#action_12624009 ] karl.wettin edited comment on MAHOUT-69 at 8/20/08 7:57 AM: We n

[jira] Commented: (MAHOUT-69) 0.1 RELEASE TODO

2008-08-20 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-69?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12624009#action_12624009 ] Karl Wettin commented on MAHOUT-69: --- We need to get in touch with the people at repo1.mave

Re: 0.1 Planning

2008-08-20 Thread Karl Wettin
I think it would be nice to get it out ASAP, perhaps even by next weekend? I'll get started on the HowToRelease wiki page right now. I also got a bunch of post 0.1 thoughts: We could post a wishlist/planning for 0.2 in the release of 0.1. This is probably just a link to a currently non exis

Build failed in Hudson: Mahout Trunk #6

2008-08-20 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Mahout%20Trunk/6/changes Changes: [gsingers] lucene libs -- [...truncated 2349 lines...] [junit] at org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.launch(JUnitTestRunner.java:912) [ju

Re: 0.1 Planning

2008-08-20 Thread Grant Ingersoll
Also, we need to set up some Javadocs targets and then we can publish the release javadocs on the website, and also start building nightlies on Hudson. I'm in the process of setting up to run the tests nightly. -Grant On Aug 20, 2008, at 8:59 AM, Grant Ingersoll wrote: Hi Mahouters, I'd l

Build failed in Hudson: Mahout Trunk #5

2008-08-20 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Mahout%20Trunk/5/changes -- [...truncated 463 lines...] AU examples/src/test/java/org/apache/mahout/ga/watchmaker/cd/tool/ToolMapperTest.java AU examples/src/test/java/org/apache/mahout/ga/watchma

0.1 Planning

2008-08-20 Thread Grant Ingersoll
Hi Mahouters, I'd like to suggest we start gearing up for a 0.1 release. Since this is our first one, we're going to have a bit of extra work to get things in the right shape, so any extra time you have would be most appreciated. First and foremost, would be testing, etc. on the current

[jira] Updated: (MAHOUT-74) Fuzzy K-Means clustering

2008-08-20 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-74?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-74: -- Fix Version/s: 0.1 Priority: Minor (was: Major) > Fuzzy K-Means clustering > -

Build failed in Hudson: Mahout Trunk #4

2008-08-20 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Mahout%20Trunk/4/changes -- started Building remotely on lucene.zones.apache.org ERROR: svn: timed out waiting for server svn: OPTIONS request failed on '/repos/asf/lucene/mahout/trunk' org.tmatesoft.svn.core.SVN

Build failed in Hudson: Mahout Trunk #3

2008-08-20 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Mahout%20Trunk/3/changes -- started Building remotely on lucene.zones.apache.org ERROR: svn: timed out waiting for server svn: OPTIONS request failed on '/repos/asf/lucene/mahout/trunk' org.tmatesoft.svn.core.SVN

Re: Hadoop 0.19/0.20

2008-08-20 Thread Karl Wettin
Append was not quite as supported as I hoped for it to be. Just added HADOOP-3977 and it makes very little sense (for me) to upgrade Mahout until it has been reviewed, cleaned up and committed. 19 aug 2008 kl. 16.17 skrev Grant Ingersoll: Hmmm, 1.6. Didn't realize that. What do people

Re: aprior algorithm in MR

2008-08-20 Thread Lukáš Vlček
Hi, Actually, I have a plan to implement something like FP-Growth for Mahout (but due to lack of time the progress is slow so far). As for tree traversal, it is considered one of the most difficult tasks within the MR paradigm (see original Google lecture videos on MR programming). However, o

Re: aprior algorithm in MR

2008-08-20 Thread sej
Hello all, Ted: I'm not quite sure I understand your suggestion. Co-occurrence modeling would be limited to finding the most interesting pairs. If you have a follow up link to elaborate on item sets that extend beyond pairs (cardinality > 2), that would be helpful. All: A related question: I a