[jira] Commented: (MAHOUT-388) Upgrade Lucene

2010-05-05 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864630#action_12864630 ] Drew Farris commented on MAHOUT-388: Any objections to this? If not I'll plan on commit

[jira] Updated: (MAHOUT-388) Upgrade Lucene

2010-05-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-388: --- Status: Patch Available (was: Open) > Upgrade Lucene > -- > > Key: MAHOU

[jira] Updated: (MAHOUT-388) Upgrade Lucene

2010-05-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-388: --- Attachment: MAHOUT-388.patch Updates to Lucene 3.0.1, created DefaultAnalyzer in mahout-util which ex

[jira] Created: (MAHOUT-373) VectorDumper/VectorHelper doesn't dump values when dictionary is present

2010-04-09 Thread Drew Farris (JIRA)
VectorDumper/VectorHelper doesn't dump values when dictionary is present Key: MAHOUT-373 URL: https://issues.apache.org/jira/browse/MAHOUT-373 Project: Mahout Issue Typ

[jira] Commented: (MAHOUT-361) SLF4J dependency structure leads to unpleasant surproses

2010-04-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852842#action_12852842 ] Drew Farris commented on MAHOUT-361: Sorry, I probably wasn't being clear; I'm not prop

[jira] Commented: (MAHOUT-361) SLF4J dependency structure leads to unpleasant surproses

2010-04-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852792#action_12852792 ] Drew Farris commented on MAHOUT-361: I've run into this too as a result of having the s

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-04-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852360#action_12852360 ] Drew Farris commented on MAHOUT-350: bq. (Incidentally, now might be a good time to plu

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-03-31 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852224#action_12852224 ] Drew Farris commented on MAHOUT-350: {quote} Hmm, I don't understand that. AbstractJob.

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-03-31 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852012#action_12852012 ] Drew Farris commented on MAHOUT-350: Not sure if this is helpful Sean, GenericOptionsPa

[jira] Commented: (MAHOUT-274) Use avro for serialization of structured documents.

2010-03-30 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12851708#action_12851708 ] Drew Farris commented on MAHOUT-274: Tracking some interesting things happening over at

[jira] Commented: (MAHOUT-344) Minhash based clustering

2010-03-30 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12851686#action_12851686 ] Drew Farris commented on MAHOUT-344: Hi Cristi, Sounds like a great start. Answers for

[jira] Commented: (MAHOUT-274) Use avro for serialization of structured documents.

2010-03-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849025#action_12849025 ] Drew Farris commented on MAHOUT-274: pushed to github: http://github.com/drewfarris/mah

[jira] Closed: (MAHOUT-242) LLR Collocation Identifier

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris closed MAHOUT-242. -- > LLR Collocation Identifier > -- > > Key: MAHOUT-242 >

[jira] Closed: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris closed MAHOUT-325. -- > Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release) > -

[jira] Updated: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-325: --- Resolution: Fixed Assignee: Drew Farris Status: Resolved (was: Patch Available) commit

[jira] Closed: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris closed MAHOUT-317. -- > Collocations: Eliminate in-memory frequency calculation > ---

[jira] Updated: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-325: --- Attachment: hadoop-0.20.2.patch Final patch for the hadoop artifact's final resting place in the rele

[jira] Updated: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-325: --- Attachment: hadoop-0.20.2.patch Looks like they've moved the artifact into staging, this patch update

[jira] Commented: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12843572#action_12843572 ] Drew Farris commented on MAHOUT-325: Did it happen to indicate which urls it attempted

[jira] Commented: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12843569#action_12843569 ] Drew Farris commented on MAHOUT-325: What issue did you run into? Error messages, etc?

[jira] Updated: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-09 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-325: --- Attachment: hadoop-0.20.2.patch Updates maven/pom.xml to point at a working repository for hadoop 0.2

[jira] Created: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-09 Thread Drew Farris (JIRA)
Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release) - Key: MAHOUT-325 URL: https://issues.apache.org/jira/browse/MAHOUT-325 Project: Mahout Issue Type: Improvement Aff

[jira] Updated: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-09 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-325: --- Status: Patch Available (was: Open) > Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

[jira] Resolved: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-06 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris resolved MAHOUT-317. Resolution: Fixed Committed in r919798 > Collocations: Eliminate in-memory frequency calculation >

[jira] Assigned: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-06 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris reassigned MAHOUT-317: -- Assignee: Drew Farris > Collocations: Eliminate in-memory frequency calculation > -

[jira] Commented: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-04 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12841220#action_12841220 ] Drew Farris commented on MAHOUT-320: I haven't reviewed the round of patches, when I wr

[jira] Updated: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-03 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-317: --- Attachment: MAHOUT-317.patch re-added missing minSupport, thanks for pointing this out Robin. Fixed

[jira] Commented: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-03 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840617#action_12840617 ] Drew Farris commented on MAHOUT-320: I certainlly can't argure about the space savings.

[jira] Commented: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-03 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840597#action_12840597 ] Drew Farris commented on MAHOUT-317: Thanks for trying it out Robin. I'll take a closer

[jira] Commented: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-03 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840592#action_12840592 ] Drew Farris commented on MAHOUT-320: The big win here is sortability of the binary form

[jira] Updated: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-320: --- Component/s: Clustering > Modify IntPairWritable in LDA implementation to be binary comparable to >

[jira] Updated: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-320: --- Attachment: MAHOUT-320.patch binary comparable implementation plus unit test for get/set, writable me

[jira] Updated: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-320: --- Assignee: Robin Anil Status: Patch Available (was: Open) > Modify IntPairWritable in LDA imple

[jira] Created: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-02 Thread Drew Farris (JIRA)
Modify IntPairWritable in LDA implementation to be binary comparable to improve performance. Key: MAHOUT-320 URL: https://issues.apache.org/jira/browse/MAHOUT-320

[jira] Updated: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-317: --- Attachment: MAHOUT-317.patch Replaced GramTuple with GramKey which achieves the same end in a more ef

[jira] Updated: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-317: --- Attachment: MAHOUT-317.patch oops. Attached is the fixed patch. In the light of day it becomes c

[jira] Updated: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-02-28 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-317: --- Attachment: MAHOUT-317.patch This patch addresses the original problem by using the Partitioner/Outp

[jira] Created: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-02-28 Thread Drew Farris (JIRA)
Collocations: Eliminate in-memory frequency calculation --- Key: MAHOUT-317 URL: https://issues.apache.org/jira/browse/MAHOUT-317 Project: Mahout Issue Type: Improvement Affects Version

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-26 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12838868#action_12838868 ] Drew Farris commented on MAHOUT-301: bq. Can you upload the patch for the maven configs

[jira] Updated: (MAHOUT-311) Update assemblies to include components of launcher script from MAHOUT-301

2010-02-26 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-311: --- Status: Patch Available (was: Open) > Update assemblies to include components of launcher script fro

[jira] Updated: (MAHOUT-311) Update assemblies to include components of launcher script from MAHOUT-301

2010-02-26 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-311: --- Attachment: MAHOUT-311.patch In addition to the goals of this issue, this patch adjusts the way that

[jira] Created: (MAHOUT-311) Update assemblies to include components of launcher script from MAHOUT-301

2010-02-26 Thread Drew Farris (JIRA)
Update assemblies to include components of launcher script from MAHOUT-301 -- Key: MAHOUT-311 URL: https://issues.apache.org/jira/browse/MAHOUT-311 Project: Mahout Issue

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-25 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12838694#action_12838694 ] Drew Farris commented on MAHOUT-301: Had a chance to take this out for a spin tonight.

[jira] Updated: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-24 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-301: --- Attachment: MAHOUT-301-drew.patch Jake, this is looking really great. Here's a partial patch that in

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-24 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837763#action_12837763 ] Drew Farris commented on MAHOUT-301: This sounds great. I will take it for a spin when

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837607#action_12837607 ] Drew Farris commented on MAHOUT-301: It doesn't appear that the following command works

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837477#action_12837477 ] Drew Farris commented on MAHOUT-301: bq. Cool, so why not just check to see if $HADOOP_

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837448#action_12837448 ] Drew Farris commented on MAHOUT-301: {quote} Hmm... ok. I'm a little reticent about run

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837434#action_12837434 ] Drew Farris commented on MAHOUT-301: Jake, the basic idea is that you would always use

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837376#action_12837376 ] Drew Farris commented on MAHOUT-301: {quote} This wasn't a problem with my patch, right

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837243#action_12837243 ] Drew Farris commented on MAHOUT-301: bq. BTW. How is hadoop execution done using shell

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837234#action_12837234 ] Drew Farris commented on MAHOUT-301: bq. including the job jar is much cleaner than add

[jira] Updated: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-22 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-301: --- Attachment: MAHOUT-301-drew.patch Did some testing, here's a patch to clean some of these things up +

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12836268#action_12836268 ] Drew Farris commented on MAHOUT-301: {blockquote} What does GenericOptionsParser do if

[jira] Updated: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-299: --- Resolution: Fixed Status: Resolved (was: Patch Available) resolved in r912189 > Collocation

[jira] Assigned: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris reassigned MAHOUT-299: -- Assignee: Drew Farris > Collocations: improve performance by making Gram BinaryComparable > ---

[jira] Commented: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12836224#action_12836224 ] Drew Farris commented on MAHOUT-299: bq. I'd not throw RuntimeException - IllegalStateE

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12836209#action_12836209 ] Drew Farris commented on MAHOUT-301: This is pretty nice, it gets to the point where re

[jira] Commented: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12836207#action_12836207 ] Drew Farris commented on MAHOUT-299: Thanks for the review Sean, I'll get it committed

[jira] Updated: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-18 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-299: --- Status: Patch Available (was: Open) > Collocations: improve performance by making Gram BinaryCompara

[jira] Updated: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-18 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-299: --- Attachment: MAHOUT-299.patch Patch as described above: Included other cleanups: * Gram is no longer

[jira] Created: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-18 Thread Drew Farris (JIRA)
Collocations: improve performance by making Gram BinaryComparable - Key: MAHOUT-299 URL: https://issues.apache.org/jira/browse/MAHOUT-299 Project: Mahout Issue Type: Improvement

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: (was: mahout-avro-examples.tar.bz) > Use avro for serialization of structured documen

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Comment: was deleted (was: re-added latest tarball with proper extension.) > Use avro for serializat

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: (was: mahout-colloc.tar.gz) > Use avro for serialization of structured documents. > -

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-avro-examples.tar.gz (this is really the right tarball this time, honest) > Use a

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-colloc.tar.gz re-added latest tarball with proper extension. > Use avro for seria

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-avro-examples.tar.bz Status update w/ new tarball which contains a maven project (

[jira] Commented: (MAHOUT-291) Mahout Code Cleanup

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12833806#action_12833806 ] Drew Farris commented on MAHOUT-291: Thanks very much Robin for posting a patch to revi

[jira] Updated: (MAHOUT-285) Wrap up collocation and dictionary vectorizer integration

2010-02-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-285: --- Attachment: MAHOUT-285.patch Robin got the bulk of this done yesterday night, reviewed his changes an

[jira] Updated: (MAHOUT-285) Wrap up collocation and dictionary vectorizer integration

2010-02-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-285: --- Attachment: MAHOUT-285.patch Robin, check out the DocumentProcessor integration here, is this what yo

[jira] Commented: (MAHOUT-285) Wrap up collocation and dictionary vectorizer integration

2010-02-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12832047#action_12832047 ] Drew Farris commented on MAHOUT-285: Yes, I'm very close on this and should be able to

[jira] Updated: (MAHOUT-285) Wrap up collocation and dictionary vectorizer integration

2010-02-09 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-285: --- Attachment: MAHOUT-285.patch First pass at integration patch, this patch includes the following: * I

[jira] Created: (MAHOUT-285) Wrap up collocation and dictionary vectorizer integration

2010-02-09 Thread Drew Farris (JIRA)
Wrap up collocation and dictionary vectorizer integration - Key: MAHOUT-285 URL: https://issues.apache.org/jira/browse/MAHOUT-285 Project: Mahout Issue Type: Improvement Affects Ver

[jira] Updated: (MAHOUT-242) LLR Collocation Identifier

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-242: --- Attachment: MAHOUT-242.patch Moved to utils based on discussion on the dev list. This can be committe

[jira] Updated: (MAHOUT-283) Update assemblies to include mahout-collections for release build

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-283: --- Status: Patch Available (was: Open) > Update assemblies to include mahout-collections for release bu

[jira] Updated: (MAHOUT-283) Update assemblies to include mahout-collections for release build

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-283: --- Attachment: MAHOUT-283.patch simpler than it sounded. verified by performing a build of a src release

[jira] Commented: (MAHOUT-242) LLR Collocation Identifier

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12830971#action_12830971 ] Drew Farris commented on MAHOUT-242: No problem moving this to util, I wasn't sure wher

[jira] Commented: (MAHOUT-282) Remove assembly from core, re-add commons-cli 1.x (no longer exluced from hadoop dependency)

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12830970#action_12830970 ] Drew Farris commented on MAHOUT-282: Mahout doesn't pull commons-cli in directly, rathe

[jira] Created: (MAHOUT-283) Update assemblies to include mahout-collections for release build

2010-02-08 Thread Drew Farris (JIRA)
Update assemblies to include mahout-collections for release build - Key: MAHOUT-283 URL: https://issues.apache.org/jira/browse/MAHOUT-283 Project: Mahout Issue Type: Sub-task

[jira] Updated: (MAHOUT-242) LLR Collocation Identifier

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-242: --- Attachment: MAHOUT-242.patch Updated patch now includes a combiner for pass1 > LLR Collocation Ident

[jira] Updated: (MAHOUT-282) Remove assembly from core, re-add commons-cli 1.x (no longer exluced from hadoop dependency)

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-282: --- Status: Patch Available (was: Open) > Remove assembly from core, re-add commons-cli 1.x (no longer e

[jira] Updated: (MAHOUT-282) Remove assembly from core, re-add commons-cli 1.x (no longer exluced from hadoop dependency)

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-282: --- Attachment: MAHOUT-282.patch > Remove assembly from core, re-add commons-cli 1.x (no longer exluced f

[jira] Created: (MAHOUT-282) Remove assembly from core, re-add commons-cli 1.x (no longer exluced from hadoop dependency)

2010-02-08 Thread Drew Farris (JIRA)
Remove assembly from core, re-add commons-cli 1.x (no longer exluced from hadoop dependency) Key: MAHOUT-282 URL: https://issues.apache.org/jira/browse/MAHOUT-282

[jira] Commented: (MAHOUT-280) Clean some redundant POM declarations

2010-02-07 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12830826#action_12830826 ] Drew Farris commented on MAHOUT-280: Do we really want to go to a source/target level o

[jira] Commented: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-06 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12830630#action_12830630 ] Drew Farris commented on MAHOUT-274: I suspect providing a writable wrapper that implem

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-05 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-avro-examples.tar.gz Very rudimentary exploration of using avro to produce writabl

[jira] Created: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-05 Thread Drew Farris (JIRA)
Use avro for serialization of structured documents. --- Key: MAHOUT-274 URL: https://issues.apache.org/jira/browse/MAHOUT-274 Project: Mahout Issue Type: Improvement Reporter: Drew

[jira] Updated: (MAHOUT-242) LLR Collocation Identifier

2010-02-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-242: --- Attachment: MAHOUT-242.patch Updated patch, removed pom modifications checked in as a part of MAHOUT-

[jira] Updated: (MAHOUT-272) Add licenses for 3rd party jars to mahout binary release and remove additional unused dependencies.

2010-02-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-272: --- Summary: Add licenses for 3rd party jars to mahout binary release and remove additional unused depend

[jira] Updated: (MAHOUT-272) Add licences for 3rd party jars to mahout binary release and remove additional unused dependencies.

2010-02-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-272: --- Status: Patch Available (was: Open) > Add licences for 3rd party jars to mahout binary release and r

[jira] Updated: (MAHOUT-272) Add licences for 3rd party jars to mahout binary release and remove additional unused dependencies.

2010-02-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-272: --- Attachment: MAHOUT-272.patch * Added exclusion for eclipse core to hadoop dependency in maven/pom.xml

[jira] Created: (MAHOUT-272) Add licences for 3rd party jars to mahout binary release and remove additional unused dependencies.

2010-02-02 Thread Drew Farris (JIRA)
Add licences for 3rd party jars to mahout binary release and remove additional unused dependencies. --- Key: MAHOUT-272 URL: https://issues.apache.org/jira/browse/MAHO

[jira] Commented: (MAHOUT-215) Provide jars with mahout release.

2010-02-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12828430#action_12828430 ] Drew Farris commented on MAHOUT-215: It looks like there might have been a problem with

[jira] Assigned: (MAHOUT-215) Provide jars with mahout release.

2010-02-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris reassigned MAHOUT-215: -- Assignee: Jake Mannix (was: Drew Farris) > Provide jars with mahout release. > ---

[jira] Commented: (MAHOUT-242) LLR Collocation Identifier

2010-01-29 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12806357#action_12806357 ] Drew Farris commented on MAHOUT-242: bq. Hey Drew, I'm not much of a maven guy - what's

[jira] Commented: (MAHOUT-215) Provide jars with mahout release.

2010-01-28 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12805942#action_12805942 ] Drew Farris commented on MAHOUT-215: bq. Just an FYI, we need to make sure we can legal

[jira] Assigned: (MAHOUT-215) Provide jars with mahout release.

2010-01-28 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris reassigned MAHOUT-215: -- Assignee: Drew Farris (was: Jake Mannix) > Provide jars with mahout release. > ---

[jira] Commented: (MAHOUT-215) Provide jars with mahout release.

2010-01-28 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12805910#action_12805910 ] Drew Farris commented on MAHOUT-215: Thanks for the review and commit Jake > Provide j

[jira] Closed: (MAHOUT-215) Provide jars with mahout release.

2010-01-28 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris closed MAHOUT-215. -- > Provide jars with mahout release. > - > > Key: MAHOUT-215

  1   2   >