Re: [GSOC] Congrats to all students

2010-04-27 Thread zhao zhendong
Thanks everyone! I am so exciting to be accepted and I will do my best to finish my proposal in time. A shared blog sounds great to me. The GSoC looks like a training, we suppose to share the experience with all who interested in Mahout project. Cheers, Zhendong On Tue, Apr 27, 2010 at 3:22 PM,

[jira] Commented: (MAHOUT-334) Proposal for GSoC2010 (Linear SVM for Mahout)

2010-04-08 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12855104#action_12855104 ] zhao zhendong commented on MAHOUT-334: -- Is there any suggestion or comment on my

Re: [GSOC] Wiki Page Added

2010-03-31 Thread zhao zhendong
Hi Grant, Could you please give us the link of this page? Cheers, Zhendong On Wed, Mar 31, 2010 at 8:53 PM, Grant Ingersoll gsing...@apache.orgwrote: I created a Wiki page on GSOC. I hope everyone considering GSOC reads it. Mentors, please add as you see fit. Would be good to get a Mahout

Re: [GSOC] Wiki Page Added

2010-03-31 Thread zhao zhendong
Ha, thanks. On Wed, Mar 31, 2010 at 9:29 PM, Grant Ingersoll gsing...@apache.orgwrote: D'oh! My bad: http://cwiki.apache.org/MAHOUT/gsoc.html. It's linked from the front wiki page under community. -Grant On Mar 31, 2010, at 9:11 AM, zhao zhendong wrote: Hi Grant, Could you please

[jira] Commented: (MAHOUT-334) Proposal for GSoC2010 (Linear SVM for Mahout)

2010-03-30 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12851727#action_12851727 ] zhao zhendong commented on MAHOUT-334: -- Proposal Title: Linear SVM Package (LIBLINEAR

Re: A mahout logo Revamp

2010-03-14 Thread zhao zhendong
That's great. Thanks Robin. On Mon, Mar 15, 2010 at 10:27 AM, Robin Anil robin.a...@gmail.com wrote: Here is the new mahout logo. Slightly higher contrast and sharper than the RC1 and some subtle shading https://issues.apache.org/jira/secure/attachment/12438777/mahout-logo-200.png see if

Re: A mahout logo Revamp

2010-03-13 Thread zhao zhendong
That's cool. On Sun, Mar 14, 2010 at 12:27 AM, Robin Anil robin.a...@gmail.com wrote: A tweaked logo https://issues.apache.org/jira/secure/attachment/12438686/mahout-100.png https://issues.apache.org/jira/secure/attachment/12438687/mahout-200.png --

Re: A mahout logo Revamp

2010-03-13 Thread zhao zhendong
1. 5 (blue round-figure, yellow elephant with blue text and NO shade) 2. 3 (blue figure, yellow elephant with blue text and NO shade) 3. 8 (blue figure, yellow elephant with blue-yellow text and NO shade) - Zhen-Dong Zhao (Maxim)

[jira] Created: (MAHOUT-334) Proposal for GSoC2010 (Linear SVM for Mahout)

2010-03-12 Thread zhao zhendong (JIRA)
Proposal for GSoC2010 (Linear SVM for Mahout) - Key: MAHOUT-334 URL: https://issues.apache.org/jira/browse/MAHOUT-334 Project: Mahout Issue Type: Task Reporter: zhao zhendong Title

[jira] Commented: (MAHOUT-327) Implement a cool classifier over map/reduce

2010-03-12 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12844622#action_12844622 ] zhao zhendong commented on MAHOUT-327: -- Proposal for GSoC2010 (Linear SVM for Mahout

Re: Have Mahout applied GSOC 2010?

2010-03-09 Thread zhao zhendong
Dunning wrote: Apache is definitely going to participate. If Mahout gets strong candidates, we would probably will get one or more slots. On Mon, Mar 8, 2010 at 10:06 AM, zhao zhendong zhaozhend...@gmail.com wrote: Robin told me Mahout gonna apply GSOC 2010 as a mentor. Can anybody

Have Mahout applied GSOC 2010?

2010-03-08 Thread zhao zhendong
Hi Robin told me Mahout gonna apply GSOC 2010 as a mentor. Can anybody tell me the answer? I really appreciate this chance. Thanks, -- - Zhen-Dong Zhao (Maxim) Department of Computer Science School of Computing National

Re: Need comments on Proposal for linear SVM framework (Google Summer of Code 2010)

2010-02-21 Thread zhao zhendong
://www.bwaldvogel.de/liblinear-java/, I want to port this code to Mahout using Mahout Collections, etc. On Sat, Feb 20, 2010 at 10:00 AM, zhao zhendong zhaozhend...@gmail.com wrote: Hi all, Robin told me such great chance for continuous contributing code here (many thanks to Robin). Because I still

Re: command line interfaces

2010-02-21 Thread zhao zhendong
+1, On Sun, Feb 21, 2010 at 4:22 AM, Ted Dunning ted.dunn...@gmail.com wrote: Related to Zhao's recent summer of code proposal, the MLOSS paper from his group on LIBLINEAR shows just how nice and simple a command line training and testing interface can be: See

Re: Need comments on Proposal for linear SVM framework (Google Summer of Code 2010)

2010-02-21 Thread zhao zhendong
: On Sun, Feb 21, 2010 at 1:49 AM, zhao zhendong zhaozhend...@gmail.com wrote: ... That's true. Do you think whether porting a LIBLINEAR to Mahout is good enough for this proposal, I really don't know How big is big enough:) If Yes, I can move the rest part for the future work

Need comments on Proposal for linear SVM framework (Google Summer of Code 2010)

2010-02-20 Thread zhao zhendong
Hi all, Robin told me such great chance for continuous contributing code here (many thanks to Robin). Because I still work on Sequential SVM (Mahout-232) and I prefer to extend it to a unified framework that incorporates some other state-of-the-art linear SVM classifiers, I propose Linear Support

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-02-18 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Attachment: SVMonMahout0.5.1.patch MapReduce/MapReduceUtil.java should have been mapreduce

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-02-17 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Attachment: SVMDataset.patch SVMonMahout0.5.patch Try using Mahout collections

[jira] Commented: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-02-10 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12832022#action_12832022 ] zhao zhendong commented on MAHOUT-232: -- Hi Sean, For Mahout-232, I suppose

[jira] Commented: (MAHOUT-227) Parallel SVM

2010-02-09 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12831837#action_12831837 ] zhao zhendong commented on MAHOUT-227: -- So far, I didn't work on this parallel Binary

How to get in Mahout-0.3

2010-02-07 Thread zhao zhendong
Hi all, I want to get the SVM package into Mahout-0.3. I have checked the w...@mahout, a little bit confused, do I need to become a committer first? ( I will be very glad to become a committer.) Does here anyone can tell me what shall I do? Cheers, Zhendong --

Re: How to get in Mahout-0.3

2010-02-07 Thread zhao zhendong
, Feb 7, 2010 at 9:33 PM, zhao zhendong zhaozhend...@gmail.com wrote: Hi all, I want to get the SVM package into Mahout-0.3. I have checked the w...@mahout, a little bit confused, do I need to become a committer first? ( I will be very glad to become a committer.) Does here anyone

Re: How to get in Mahout-0.3

2010-02-07 Thread zhao zhendong
improvements, and help you refactor and check it in. About committership, stick around :) put in more code, we may not want to let you go ;) Sure, I very like here and wish to do more. Robin On Mon, Feb 8, 2010 at 11:03 AM, zhao zhendong zhaozhend...@gmail.com wrote: Hi all, I want

Re: Release thinking

2010-01-26 Thread zhao zhendong
Hi all, I will do my best to get this in 0.3 release. {quote} MAHOUT-232 Implementation of sequential SVM solver based on Pegasos This patch looks to be progressing - it would be really nice to get it in. {quote} Cheers, Zhendong --

Re: [jira] Commented: (MAHOUT-238) Further Dependency Cleanup

2010-01-21 Thread zhao zhendong
Hi Drew, I propose to 1) update hbase-0.20.0.jar to hbase-0.20.2.jar due to the later is stable and hbased-platform is based on this version, 2) and add zookeeper-3.2.1.jar. Cheers, Zhendong On Tue, Jan 19, 2010 at 12:36 PM, zhao zhendong zhaozhend...@gmail.comwrote: Hi Drew, Including

Re: [jira] Commented: (MAHOUT-238) Further Dependency Cleanup

2010-01-18 Thread zhao zhendong
, but seems worth it to me. On Fri, Jan 8, 2010 at 4:02 PM, zhao zhendong zhaozhend...@gmail.com wrote: Thanks Drew, +1 for me to maintain a stable hadoop release, such as 0.20.1. The reason is obvious :) Cheers, Zhendong

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-18 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Attachment: SequentialSVM_0.4.patch 1) Supporting sequential multi-classification (both one-vs

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-18 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Description: After discussed with guys in this community, I decided to re-implement

slf4j library lost?

2010-01-09 Thread zhao zhendong
Hi all, When I new a SparseVector, the mahout-core always failed with SLF4J: Failed to load class org.slf4j.impl.StaticLoggerBinder. SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details. Exception in thread main java.lang.NoClassDefFoundError:

Re: slf4j library lost?

2010-01-09 Thread zhao zhendong
locally. However you run this simple program -- the SLF4J binding has to be on the classpath. I would imagine you have always had to do this? On Sat, Jan 9, 2010 at 12:22 PM, zhao zhendong zhaozhend...@gmail.com wrote: Hi all, When I new a SparseVector, the mahout-core always failed

Re: slf4j library lost?

2010-01-09 Thread zhao zhendong
. But Zhendong this is something, I think, you would need to address locally. However you run this simple program -- the SLF4J binding has to be on the classpath. I would imagine you have always had to do this? On Sat, Jan 9, 2010 at 12:22 PM, zhao zhendong zhaozhend...@gmail.com wrote: Hi all

MapReduce Unit Testing

2010-01-08 Thread zhao zhendong
Hi, Does anybody know a out-off-shift Unit testing package for Mapreduce framework? MRUnit is good, but this package only can be found in Cloudera own Hadoop. Cheers, Zhendong -- - Zhen-Dong Zhao (Maxim) Department of Computer

Re: [jira] Commented: (MAHOUT-238) Further Dependency Cleanup

2010-01-08 Thread zhao zhendong
help: http://svn.apache.org/viewvc/hadoop/mapreduce/trunk/ If you look at the release notes, you should be able to discern what made up 20.2 if it is a real release (looking at common made is look like it isn't). On Thu, Jan 7, 2010 at 9:46 PM, zhao zhendong zhaozhend...@gmail.com wrote

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-07 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Description: After discussed with guys in this community, I decided to re-implement

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-07 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Attachment: SequentialSVM_0.3.patch Implementation of sequential SVM solver based on Pegasos

[jira] Commented: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-02 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795852#action_12795852 ] zhao zhendong commented on MAHOUT-232: -- Thanks for Ted's Comments, I will revise

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-01 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Component/s: Classification Description: After discussed with guys in this community, I

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-01 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Attachment: SequentialSVM_0.2.patch Implementation of sequential SVM solver based on Pegasos

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-01 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Description: After discussed with guys in this community, I decided to re-implement

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-01 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Description: After discussed with guys in this community, I decided to re-implement

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-01 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Attachment: (was: SequentialSVM_0.2.patch) Implementation of sequential SVM solver based

[jira] Commented: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2010-01-01 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795832#action_12795832 ] zhao zhendong commented on MAHOUT-232: -- Oops, I forgot to add these files to SVN

Re: [jira] Commented: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2009-12-30 Thread zhao zhendong
Reporter: zhao zhendong Attachments: SequentialSVM_0.1.patch After discussed with guys in this community, I decided to re-implement a Sequential SVM solver based on Pegasos for Mahout platform (mahout command line style, SparseMatrix and SparseVector etc

[jira] Issue Comment Edited: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2009-12-30 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795539#action_12795539 ] zhao zhendong edited comment on MAHOUT-232 at 12/31/09 6:03 AM

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2009-12-29 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Affects Version/s: 0.1 Status: Patch Available (was: Open) Sequential SVM based

[jira] Updated: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2009-12-29 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-232: - Attachment: SequentialSVM_0.1.patch Implementation of sequential SVM solver based on Pegasos

[jira] Issue Comment Edited: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2009-12-29 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12794809#action_12794809 ] zhao zhendong edited comment on MAHOUT-232 at 12/30/09 7:07 AM

[jira] Commented: (MAHOUT-232) Implementation of sequential SVM solver based on Pegasos

2009-12-28 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12794809#action_12794809 ] zhao zhendong commented on MAHOUT-232: -- I still work on it :(. I can attach them

[jira] Commented: (MAHOUT-227) Parallel SVM

2009-12-21 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12793111#action_12793111 ] zhao zhendong commented on MAHOUT-227: -- Thanks for your comments. Sure, actually, I

Re: [jira] Commented: (MAHOUT-227) Parallel SVM

2009-12-21 Thread zhao zhendong
Issue Type: Task Components: Classification Reporter: zhao zhendong Attachments: ParallelPegasos.doc, ParallelPegasos.pdf I wrote a proposal of parallel algorithm for SVM training. Any comment is welcome. -- This message is automatically generated by JIRA

Re: [jira] Commented: (MAHOUT-227) Parallel SVM

2009-12-21 Thread zhao zhendong
: MAHOUT-227 URL: https://issues.apache.org/jira/browse/MAHOUT-227 Project: Mahout Issue Type: Task Components: Classification Reporter: zhao zhendong Attachments: ParallelPegasos.doc, ParallelPegasos.pdf I wrote a proposal

Re: [jira] Commented: (MAHOUT-227) Parallel SVM

2009-12-21 Thread zhao zhendong
libraries have nothing parallel about them yet, but they are all aimed to be able to scale to large data sets. Does this make sense? -jake On Mon, Dec 21, 2009 at 9:21 PM, zhao zhendong zhaozhend...@gmail.com wrote: {quote} k = 1 Otherwise as in the Pegasos article. No parallelism

[jira] Updated: (MAHOUT-227) Parallel SVM

2009-12-20 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-227: - Attachment: ParallelPegasos.pdf ParallelPegasos.doc These are two distinct files

[jira] Updated: (MAHOUT-227) Parallel SVM

2009-12-20 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-227: - Attachment: (was: svmProposal.patch) Parallel SVM Key

[jira] Created: (MAHOUT-227) Parallel SVM

2009-12-19 Thread zhao zhendong (JIRA)
Parallel SVM Key: MAHOUT-227 URL: https://issues.apache.org/jira/browse/MAHOUT-227 Project: Mahout Issue Type: Task Components: Classification Reporter: zhao zhendong I wrote a proposal of parallel

[jira] Updated: (MAHOUT-227) Parallel SVM

2009-12-19 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-227: - Status: Patch Available (was: Open) The patch is a document, say the proposal of parallel

[jira] Updated: (MAHOUT-227) Parallel SVM

2009-12-19 Thread zhao zhendong (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhao zhendong updated MAHOUT-227: - Attachment: svmProposal.patch The patch includes two files with same content (.doc and .pdf

Re: SVM algo, code, etc.

2009-12-18 Thread zhao zhendong
Hi all, I have finished a draft of proposal on Parallel Pegasos SVM solover. I need some comments. If any one interested in this proposal, contact me please. By the way, is it a good idea to attach this proposal in this mail session. Cheers, Zhendong On Thu, Dec 17, 2009 at 1:16 AM, Ted

Re: SVM algo, code, etc.

2009-12-11 Thread zhao zhendong
True, I am still wondering about whether it is valuable to implement a parallel SVM on hadoop? I really wanna join in mike's group. Just like Olivier concerned, some linear version of SVM solvers can handle large-scale data sets ( several seconds for 100K-level samples). It's true that the linear

Re: LDA for multi label classification was: Mahout Book

2009-10-16 Thread zhao zhendong
I have seen the implementation of L-LDA using Java, Stanford Topic Modeling Toolbox http://nlp.stanford.edu/software/tmt/ Does any one know whether they provide the source code or not? Thanks, Maxim On Fri, Oct 16, 2009 at 12:39 PM, David Hall d...@cs.berkeley.edu wrote: Sorry, this slipped out

Re: [jira] Updated: (MAHOUT-124) Online Classification using HBase

2009-07-06 Thread zhao zhendong
Hi, How can I download the Hbase 0.20.jar? Cheers, Zhendong On Mon, Jul 6, 2009 at 5:21 AM, Robin Anil (JIRA) j...@apache.org wrote: [ https://issues.apache.org/jira/browse/MAHOUT-124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel] Robin Anil updated MAHOUT-124: