[jira] [Commented] (MAHOUT-884) Matrix Concatenate utility

2011-11-13 Thread Dan Brickley (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149483#comment-13149483 ] Dan Brickley commented on MAHOUT-884: My original use case was here: http://www.sear

Re: [jira] [Commented] (MAHOUT-884) Matrix Concatenate utility

2011-11-13 Thread Lance Norskog
It's from this thread: http://www.lucidimagination.com/search/document/117d2f370e925cf9#ca4271b66a19bf9a And somehow I managed to promise to write it. On Sun, Nov 13, 2011 at 10:59 PM, Jake Mannix (Commented) (JIRA) < j...@apache.org> wrote: > >[ > https://issues.apache.org/jira/browse/MAHO

[jira] [Commented] (MAHOUT-884) Matrix Concatenate utility

2011-11-13 Thread Jake Mannix (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149470#comment-13149470 ] Jake Mannix commented on MAHOUT-884: why do we want the part files squished into one?

[jira] [Commented] (MAHOUT-884) Matrix Concatenate utility

2011-11-13 Thread Lance Norskog (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149469#comment-13149469 ] Lance Norskog commented on MAHOUT-884: I forgot about NamedVectors :(

[jira] [Updated] (MAHOUT-884) Matrix Concatenate utility

2011-11-13 Thread Lance Norskog (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lance Norskog updated MAHOUT-884: Attachment: MAHOUT-884.patch > Matrix Concatenate utility

[jira] [Created] (MAHOUT-884) Matrix Concatenate utility

2011-11-13 Thread Lance Norskog (Created) (JIRA)
Matrix Concatenate utility -- Key: MAHOUT-884 URL: https://issues.apache.org/jira/browse/MAHOUT-884 Project: Mahout Issue Type: New Feature Components: Integration Reporter: Lance Norskog

[jira] [Commented] (MAHOUT-833) Make conversion to sequence files map-reduce

2011-11-13 Thread Joe Prasanna Kumar (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149441#comment-13149441 ] Joe Prasanna Kumar commented on MAHOUT-833: Josh, For the SequenceFilesFromDire

[jira] [Commented] (MAHOUT-833) Make conversion to sequence files map-reduce

2011-11-13 Thread Josh Patterson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149411#comment-13149411 ] Josh Patterson commented on MAHOUT-833: What are the most common expectations aroun

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Ted Dunning (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149391#comment-13149391 ] Ted Dunning commented on MAHOUT-881: {quote} avoid what allocating the arrays {quote}

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Grant Ingersoll (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149387#comment-13149387 ] Grant Ingersoll commented on MAHOUT-881: I see some other things we can do, too.

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Ted Dunning (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149385#comment-13149385 ] Ted Dunning commented on MAHOUT-881: {quote} from 90 microseconds to 47 microseconds {

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Grant Ingersoll (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149382#comment-13149382 ] Grant Ingersoll commented on MAHOUT-881: Yeah, tests are good and the evaluator.

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149381#comment-13149381 ] Sean Owen commented on MAHOUT-881: I think the tests should at least be committed. I'd a

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Grant Ingersoll (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149380#comment-13149380 ] Grant Ingersoll commented on MAHOUT-881: Yeah, I did some profiling too and came t

[jira] [Issue Comment Edited] (MAHOUT-845) Make cluster top terms code more reusable

2011-11-13 Thread Lance Norskog (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149373#comment-13149373 ] Lance Norskog edited comment on MAHOUT-845 at 11/13/11 9:45 PM:

[jira] [Commented] (MAHOUT-845) Make cluster top terms code more reusable

2011-11-13 Thread Lance Norskog (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149373#comment-13149373 ] Lance Norskog commented on MAHOUT-845: 1) Is this feature useful in any other code o

[jira] [Commented] (MAHOUT-879) Remove all graph algorithms with the exception of PageRank

2011-11-13 Thread Hudson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149358#comment-13149358 ] Hudson commented on MAHOUT-879: Integrated in Mahout-Quality #1175 (See [https://builds.ap

Fwd: Cluster labeling

2011-11-13 Thread Frank Scholten
Forwarding this to dev. -- Forwarded message -- From: Frank Scholten Date: Tue, Nov 8, 2011 at 11:56 PM Subject: Cluster labeling To: u...@mahout.apache.org Hi all, Sometimes my cluster labels are terms that hardly occur in the combined text of the documents of a cluster. I wou

[jira] [Commented] (MAHOUT-845) Make cluster top terms code more reusable

2011-11-13 Thread Frank Scholten (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149330#comment-13149330 ] Frank Scholten commented on MAHOUT-845: Any feedback on the latest patch?

[jira] [Created] (MAHOUT-883) Add an example that computes PageRank on the wikipedia page link graph

2011-11-13 Thread Sebastian Schelter (Created) (JIRA)
Add an example that computes PageRank on the wikipedia page link graph -- Key: MAHOUT-883 URL: https://issues.apache.org/jira/browse/MAHOUT-883 Project: Mahout Issue Type: T

[jira] [Resolved] (MAHOUT-879) Remove all graph algorithms with the exception of PageRank

2011-11-13 Thread Sebastian Schelter (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-879. Resolution: Fixed Fix Version/s: 0.6 > Remove all graph algorithms with th

[jira] [Updated] (MAHOUT-879) Remove all graph algorithms with the exception of PageRank

2011-11-13 Thread Sebastian Schelter (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-879: Attachment: MAHOUT-879.patch > Remove all graph algorithms with the exception of Pa

[jira] [Updated] (MAHOUT-879) Remove all graph algorithms with the exception of PageRank

2011-11-13 Thread Sebastian Schelter (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-879: Attachment: graph-processing.tar.gz attached tar.gz file containing the removed graph a

Jenkins build is back to normal : Mahout-Quality #1174

2011-11-13 Thread Apache Jenkins Server
See

[jira] [Updated] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-881: Attachment: Call_Tree_2.html Call_Tree.html > Refactor TopItems to use Lucene's Prior

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149270#comment-13149270 ] Sean Owen commented on MAHOUT-881: Since it's easy, I just used jprofiler to observe the