Re: [VOTE] Simplifying website

2015-02-28 Thread Yexi Jiang
+1

2015-02-28 11:19 GMT-05:00 Martin Illecker :

> +1
>
> > On 28.02.2015 at 15:55, Suraj Menon wrote:
> >
> > +1
> >
> > On Mon, Feb 23, 2015 at 7:30 PM, Edward J. Yoon wrote:
> >
> >>>> As the site is so simple, I think it will be better to see adding a
> >> menu
> >>>> bar in top or left side of the site.
> >>
> >> I'm thinking about one-page website. I'll try to add more information.
> >>
> >>>> Hippo character seems to be afraid. Don't you think that a previous
> >>> version
> >>>> looks better than the new one?
> >>
> >> And,
> >>
> >>> This looks great. Is the logo changed again?
> >>
> >> Nope, logos are still all available.
> >>
> >> --
> >> Best Regards, Edward J. Yoon
> >>
> >> -Original Message-
> >> From: Yexi Jiang [mailto:yexiji...@gmail.com]
> >> Sent: Monday, February 23, 2015 9:50 PM
> >> To: dev@hama.apache.org
> >> Subject: Re: [VOTE] Simplifying website
> >>
> >> This looks great. Is the logo changed again?
> >>
> >> Regards,
> >> Yexi
> >>
> >> 2015-02-23 5:05 GMT-05:00 Andronidis Anastasios <
> andronat_...@hotmail.com
> >>> :
> >>
> >>> I like it!
> >>>
> >>> Anastasis Andronidis
> >>>
> >>> On 23 Feb 2015, at 6:49 AM, 김민호 wrote:
> >>>
> >>>> Hi,
> >>>>
> >>>> That sounds great!
> >>>>
> >>>> As the site is so simple, I think it would be better to add a menu
> >>>> bar at the top or left side of the site.
> >>>>
> >>>> The hippo character seems to be afraid. Don't you think the previous
> >>>> version looks better than the new one?
> >>>>
> >>>> Best Regards,
> >>>> Minho Kim
> >>>>
> >>>> -Original Message-
> >>>> From: Edward J. Yoon [mailto:edward.y...@samsung.com]
> >>>> Sent: Monday, February 23, 2015 2:15 PM
> >>>> To: dev@hama.apache.org
> >>>> Subject: [VOTE] Simplifying website
> >>>>
> >>>> Hey guys,
> >>>>
> >>>> I've made a simplified version[1] of the Hama website. Please review,
> >>>> comment, and vote. I'll switch to this version by lazy consensus if
> >>>> there are no objections within the next 3 days.
> >>>>
> >>>> Thanks!
> >>>>
> >>>> 1. http://hama.apache.org/index2.html
> >>>>
> >>>> --
> >>>> Best Regards, Edward J. Yoon
> >>>>
> >>>>
> >>>
> >>>
> >>
> >>
> >> --
> >> --
> >> Yexi Jiang,
> >> ECS 251,  yjian...@cs.fiu.edu
> >> School of Computing and Information Sciences,
> >> Florida International University
> >> Homepage: http://users.cis.fiu.edu/~yjian004/
> >>
> >>
>
>


-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computing and Information Sciences,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: [VOTE] Simplifying website

2015-02-23 Thread Yexi Jiang
This looks great. Is the logo changed again?

Regards,
Yexi

2015-02-23 5:05 GMT-05:00 Andronidis Anastasios :

> I like it!
>
> Anastasis Andronidis
>
> On 23 Feb 2015, at 6:49 AM, 김민호 wrote:
>
> > Hi,
> >
> > That sounds great!
> >
> > As the site is so simple, I think it would be better to add a menu
> > bar at the top or left side of the site.
> >
> > The hippo character seems to be afraid. Don't you think the previous
> > version looks better than the new one?
> >
> > Best Regards,
> > Minho Kim
> >
> > -Original Message-
> > From: Edward J. Yoon [mailto:edward.y...@samsung.com]
> > Sent: Monday, February 23, 2015 2:15 PM
> > To: dev@hama.apache.org
> > Subject: [VOTE] Simplifying website
> >
> > Hey guys,
> >
> > I've made a simplified version[1] of the Hama website. Please review,
> > comment, and vote. I'll switch to this version by lazy consensus if
> > there are no objections within the next 3 days.
> >
> > Thanks!
> >
> > 1. http://hama.apache.org/index2.html
> >
> > --
> > Best Regards, Edward J. Yoon
> >
> >
>
>


-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computing and Information Sciences,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: Renil Joseph as a new Hama committer

2014-05-23 Thread Yexi Jiang
Welcome!


2014-05-23 1:13 GMT-07:00 Andronidis Anastasios :

> Welcome Renil Joseph!
>
> On 23 May 2014, at 9:50 AM, Chia-Hung Lin wrote:
>
> > Congratulations! Welcome to the Hama community.
> >
> > On 23 May 2014 14:59, Edward J. Yoon  wrote:
> >> The Hama PMC is pleased to announce Renil Joseph as a new committer of
> >> Hama. We look forward to his continuing involvement with Hama.
> >>
> >> Congrats, Renil Joseph!
> >>
> >> --
> >> Best Regards, Edward J. Yoon
> >> CEO at DataSayer Co., Ltd.
>
>


-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computing and Information Sciences,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: [DISCUSS] Rename of ML and Graph modules

2014-04-14 Thread Yexi Jiang
Yes, Mach reminds me of the microkernel.


2014-04-14 4:07 GMT-04:00 Chia-Hung Lin :

> Graph package is clearer for me.
>
> Mach may be confused with CMU's OS microkernel.
>
> Or do we want a code name for each release, like some GNU/Linux distributions?
>
>
>
>
> On 14 April 2014 14:57, Tommaso Teofili  wrote:
> > I think graph is a pretty fine name; it's easy to understand that it's Hama
> > applied to graphs. 'ml' may be a bit too short, so it could mean
> > anything, though I don't think 'mach' improves that.
> >
> > Regards,
> > Tommaso
> >
> >
> > 2014-04-13 17:01 GMT+02:00 Yexi Jiang :
> >
> >> It seems that the old names are better than the new names.
> >>
> >>
> >> 2014-04-13 8:10 GMT-04:00 Andronidis Anastasios <
> andronat_...@hotmail.com
> >> >:
> >>
> >> > hi,
> >> >
> >> > sorry but i don't understand what the new names mean.
> >> >
> >> > what is a b-graph? mach?
> >> >
> >> > kindly,
> >> > anastasis
> >> >
> >> > On 13 Apr 2014, at 1:54 PM, Edward J. Yoon wrote:
> >> >
> >> > > Because they are too ambiguous and unmemorable.
> >> > >
> >> > > On Sun, Apr 13, 2014 at 8:25 PM, Tommaso Teofili
> >> > >  wrote:
> >> > >> why?
> >> > >>
> >> > >> Tommaso
> >> > >>
> >> > >>
> >> > >> 2014-04-13 12:57 GMT+02:00 Edward J. Yoon :
> >> > >>
> >> > >>> I propose that we rename the graph and ml modules:
> >> > >>>
> >> > >>> 1. Hama Graph -> B-Graph
> >> > >>> 2. Hama ML -> Mach
> >> > >>>
> >> > >>> WDYT?
> >> > >>>
> >> > >>> --
> >> > >>> Edward J. Yoon (@eddieyoon)
> >> > >>> Chief Executive Officer
> >> > >>> DataSayer Co., Ltd.
> >> > >>>
> >> > >
> >> > >
> >> > >
> >> > > --
> >> > > Edward J. Yoon (@eddieyoon)
> >> > > Chief Executive Officer
> >> > > DataSayer Co., Ltd.
> >> >
> >> >
> >>
> >>
> >> --
> >> --
> >> Yexi Jiang,
> >> ECS 251,  yjian...@cs.fiu.edu
> >> School of Computer and Information Science,
> >> Florida International University
> >> Homepage: http://users.cis.fiu.edu/~yjian004/
> >>
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: [DISCUSS] Rename of ML and Graph modules

2014-04-13 Thread Yexi Jiang
It seems that the old names are better than the new names.


2014-04-13 8:10 GMT-04:00 Andronidis Anastasios :

> hi,
>
> sorry but i don't understand what the new names mean.
>
> what is a b-graph? mach?
>
> kindly,
> anastasis
>
> On 13 Apr 2014, at 1:54 PM, Edward J. Yoon wrote:
>
> > Because they are too ambiguous and unmemorable.
> >
> > On Sun, Apr 13, 2014 at 8:25 PM, Tommaso Teofili
> >  wrote:
> >> why?
> >>
> >> Tommaso
> >>
> >>
> >> 2014-04-13 12:57 GMT+02:00 Edward J. Yoon :
> >>
> >>> I propose that we rename the graph and ml modules:
> >>>
> >>> 1. Hama Graph -> B-Graph
> >>> 2. Hama ML -> Mach
> >>>
> >>> WDYT?
> >>>
> >>> --
> >>> Edward J. Yoon (@eddieyoon)
> >>> Chief Executive Officer
> >>> DataSayer Co., Ltd.
> >>>
> >
> >
> >
> > --
> > Edward J. Yoon (@eddieyoon)
> > Chief Executive Officer
> > DataSayer Co., Ltd.
>
>


-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


[jira] [Updated] (HAMA-863) Implement SparseVector

2014-04-11 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-863:


Resolution: Fixed
Status: Resolved  (was: Patch Available)

> Implement SparseVector
> --
>
> Key: HAMA-863
> URL: https://issues.apache.org/jira/browse/HAMA-863
> Project: Hama
>  Issue Type: New Feature
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-863-2.patch, HAMA-863.patch
>
>
> Implement the SparseVector that implements the DoubleVector interface.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HAMA-863) Implement SparseVector

2014-04-10 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13966165#comment-13966165
 ] 

Yexi Jiang commented on HAMA-863:
-

Weird, I only added new classes without touching any existing code.

> Implement SparseVector
> --
>
> Key: HAMA-863
> URL: https://issues.apache.org/jira/browse/HAMA-863
> Project: Hama
>  Issue Type: New Feature
>  Components: math
>Affects Versions: 0.7.0
>Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-863-2.patch, HAMA-863.patch
>
>
> Implement the SparseVector that implements the DoubleVector interface.





[jira] [Updated] (HAMA-863) Implement SparseVector

2014-04-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-863:


Attachment: HAMA-863-2.patch

I deleted the commented code.

> Implement SparseVector
> --
>
> Key: HAMA-863
> URL: https://issues.apache.org/jira/browse/HAMA-863
> Project: Hama
>  Issue Type: New Feature
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-863-2.patch, HAMA-863.patch
>
>
> Implement the SparseVector that implements the DoubleVector interface.





[jira] [Commented] (HAMA-863) Implement SparseVector

2014-04-08 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13963077#comment-13963077
 ] 

Yexi Jiang commented on HAMA-863:
-

Thanks for your comments.

I checked the code again and deleted all the commented-out code. I'm not sure 
whether it is appropriate to add debug statements to SparseDoubleVector, since 
the I/O may heavily degrade its performance. I think the developers of the 
other classes in commons.math may have had the same concern, so none of those 
classes includes debug statements. 
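
A minimal sketch of one way to keep debug output cheap while it is switched
off: guard the statement with a level check so that no message is built and no
I/O happens inside the loop. It uses java.util.logging only to stay
self-contained. GuardedDebugSketch, sum, and messagesBuilt are hypothetical
names for illustration; none of this is taken from Hama or from the HAMA-863
patch.

```java
import java.util.logging.Level;
import java.util.logging.Logger;

/** Illustrative only: shows a guarded debug statement in a numeric loop. */
class GuardedDebugSketch {
  private static final Logger LOG =
      Logger.getLogger(GuardedDebugSketch.class.getName());

  // Counts how many debug messages were actually built (for illustration).
  static int messagesBuilt = 0;

  static double sum(double[] values) {
    double total = 0.0;
    for (int i = 0; i < values.length; i++) {
      total += values[i];
      // The guard skips string building and logging I/O
      // unless FINE (debug-level) logging is enabled.
      if (LOG.isLoggable(Level.FINE)) {
        messagesBuilt++;
        LOG.fine("index " + i + " running total " + total);
      }
    }
    return total;
  }
}
```

With the level left at INFO, the loop only pays for the isLoggable check on
each iteration. Whether even that check is acceptable in a vector class is a
separate judgment call.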

> Implement SparseVector
> --
>
> Key: HAMA-863
> URL: https://issues.apache.org/jira/browse/HAMA-863
> Project: Hama
>  Issue Type: New Feature
>  Components: math
>Affects Versions: 0.7.0
>Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-863.patch
>
>
> Implement the SparseVector that implements the DoubleVector interface.





[jira] [Commented] (HAMA-863) Implement SparseVector

2014-04-08 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13963013#comment-13963013
 ] 

Yexi Jiang commented on HAMA-863:
-

[~chl501] I didn't find any out.println statements in SparseDoubleVector or 
TestSparseDoubleVector. Maybe the output comes from somewhere else. Can you tell 
me what the output is?

> Implement SparseVector
> --
>
> Key: HAMA-863
> URL: https://issues.apache.org/jira/browse/HAMA-863
> Project: Hama
>  Issue Type: New Feature
>  Components: math
>Affects Versions: 0.7.0
>Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-863.patch
>
>
> Implement the SparseVector that implements the DoubleVector interface.





[jira] [Updated] (HAMA-863) Implement SparseVector

2014-04-06 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-863:


Attachment: HAMA-863.patch

> Implement SparseVector
> --
>
> Key: HAMA-863
> URL: https://issues.apache.org/jira/browse/HAMA-863
> Project: Hama
>  Issue Type: New Feature
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-863.patch
>
>
> Implement the SparseVector that implements the DoubleVector interface.





[jira] [Updated] (HAMA-863) Implement SparseVector

2014-04-06 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-863:


Status: Patch Available  (was: Open)

If there are no objections, I will commit it this week.

> Implement SparseVector
> --
>
> Key: HAMA-863
> URL: https://issues.apache.org/jira/browse/HAMA-863
> Project: Hama
>  Issue Type: New Feature
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-863.patch
>
>
> Implement the SparseVector that implements the DoubleVector interface.





Re: [jira] [Commented] (HAMA-883) [Research Task] Massive log event aggregation in real time using Apache Hama

2014-04-02 Thread Yexi Jiang
great~


2014-04-02 21:43 GMT-04:00 Edward J. Yoon (JIRA) :

>
> [
> https://issues.apache.org/jira/browse/HAMA-883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13958430#comment-13958430]
>
> Edward J. Yoon commented on HAMA-883:
> -
>
> NOTE: my fellow worker is currently working on this issue -
> https://github.com/garudakang/meerkat
>
> > [Research Task] Massive log event aggregation in real time using Apache
> Hama
> >
> 
> >
> > Key: HAMA-883
> > URL: https://issues.apache.org/jira/browse/HAMA-883
> > Project: Hama
> >  Issue Type: Task
> >Reporter: Edward J. Yoon
> >
> > BSP tasks can be used for aggregating log data streamed in real time.
> With this research task, we might be able to build a platform for this
> kind of processing.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.2#6252)
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: Apache projects based Business

2014-03-26 Thread Yexi Jiang
Looks cool :)


2014-03-26 18:00 GMT-04:00 Edward J. Yoon :

> Hi community,
>
> If there are things that I need to be aware of, please give me a heads up.
>
> -- Forwarded message --
> From: Edward J. Yoon 
> Date: Wed, Mar 26, 2014 at 11:13 AM
> Subject: Apache projects based Business
> To: legal-disc...@apache.org
> Cc: Commons Developers List 
>
>
> Hi,
>
> I recently established my own company called DataSayer, Co, Ltd.[1].
> We'll develop a Big Data product using Apache projects (Apache
> Hadoop, Hama, Spark, MRQL). It's currently located in Seoul, and we also
> plan to move to the USA.
>
> Before announcing the official website and publishing some press on
> the web, please review the web text content and let me know if there
> are things that I need to be aware of.
>
> Thanks!
>
> 1. http://datasayer.com/
>
> --
> Edward J. Yoon (@eddieyoon)
> Chief Executive Officer
> DataSayer, Inc.
>
>
> --
> Edward J. Yoon (@eddieyoon)
> Chief Executive Officer
> DataSayer, Inc.
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: [ANNOUNCE] New Hama PMC Chair - ChiaHung Lin

2014-03-20 Thread Yexi Jiang
Congrats!


2014-03-20 7:02 GMT-04:00 Andronidis Anastasios :

> congrats!!
>
> Anastasis
>
> On 20 Mar 2014, at 2:57 AM, Edward J. Yoon wrote:
>
> > On behalf of the Apache Hama PMC, I'm pleased to announce that Apache
> > Board has approved the nomination of ChiaHung Lin as new Hama PMC
> > Chair. :D
> >
> > Congrats ChiaHung!
> >
> > --
> > Edward J. Yoon (@eddieyoon)
> > Chief Executive Officer
> > DataSayer, Inc.
>
>


-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


[jira] [Updated] (HAMA-888) Add more test cases for DenseDoubleVector

2014-03-15 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-888:


Resolution: Fixed
Status: Resolved  (was: Patch Available)

> Add more test cases for DenseDoubleVector
> -
>
> Key: HAMA-888
> URL: https://issues.apache.org/jira/browse/HAMA-888
> Project: Hama
>  Issue Type: Improvement
>  Components: math
>Affects Versions: 0.6.3
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-888.patch
>
>
> The current number of test cases is too small to cover this class. Add more
> test cases for DenseDoubleVector.





[jira] [Updated] (HAMA-888) Add more test cases for DenseDoubleVector

2014-03-14 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-888:


Attachment: HAMA-888.patch

> Add more test cases for DenseDoubleVector
> -
>
> Key: HAMA-888
> URL: https://issues.apache.org/jira/browse/HAMA-888
> Project: Hama
>  Issue Type: Improvement
>  Components: math
>Affects Versions: 0.6.3
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-888.patch
>
>
> The current number of test cases is too small to cover this class. Add more
> test cases for DenseDoubleVector.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HAMA-888) Add more test cases for DenseDoubleVector

2014-03-14 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-888:


Status: Patch Available  (was: Open)

Almost 100% coverage of all the methods. If no objection, I will commit it 
within 3 days.

> Add more test cases for DenseDoubleVector
> -
>
> Key: HAMA-888
> URL: https://issues.apache.org/jira/browse/HAMA-888
> Project: Hama
>  Issue Type: Improvement
>  Components: math
>Affects Versions: 0.6.3
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-888.patch
>
>
> The current number of test cases is too small to cover this class. Add more
> test cases for DenseDoubleVector.





Re: [ANNOUNCE] New Hama PMC members

2014-03-13 Thread Yexi Jiang
done.


2014-03-13 2:41 GMT-04:00 Edward J. Yoon :

> New PMC members are required to read the branding guidelines,
> http://apache.org/foundation/marks/responsibility.html
>
>
> Plus, please subscribe to the priv...@hama.apache.org list!
>
> On Thu, Mar 13, 2014 at 5:42 AM, Andronidis Anastasios
>  wrote:
> > Thank you a lot!
> >
> > On 12 Mar 2014, at 6:39 PM, Yexi Jiang wrote:
> >
> >> Oh, looks great!
> >>
> >>
> >> 2014-03-11 20:15 GMT-04:00 Edward J. Yoon :
> >>
> >>> On behalf of the Apache Hama PMC, I'm pleased to announce the addition
> >>> of the following new Hama PMC members:
> >>>
> >>> * Yexi Jiang
> >>> * Anastasis Andronidis
> >>>
> >>> Thank you for all of your work on the project and Congratz!
> >>>
> >>> --
> >>> Edward J. Yoon (@eddieyoon)
> >>> Chief Executive Officer
> >>> DataSayer, Inc.
> >>>
> >>
> >>
> >>
> >> --
> >> ------
> >> Yexi Jiang,
> >> ECS 251,  yjian...@cs.fiu.edu
> >> School of Computer and Information Science,
> >> Florida International University
> >> Homepage: http://users.cis.fiu.edu/~yjian004/
> >
>
>
>
> --
> Edward J. Yoon (@eddieyoon)
> Chief Executive Officer
> DataSayer, Inc.
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: [ANNOUNCE] New Hama PMC members

2014-03-12 Thread Yexi Jiang
Oh, looks great!


2014-03-11 20:15 GMT-04:00 Edward J. Yoon :

> On behalf of the Apache Hama PMC, I'm pleased to announce the addition
> of the following new Hama PMC members:
>
> * Yexi Jiang
> * Anastasis Andronidis
>
> Thank you for all of your work on the project and Congratz!
>
> --
> Edward J. Yoon (@eddieyoon)
> Chief Executive Officer
> DataSayer, Inc.
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


[jira] [Updated] (HAMA-889) NonDefaultIterator of DenseDoubleVector never reaches the end

2014-03-09 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-889:


Summary: NonDefaultIterator of DenseDoubleVector never reaches the end  
(was: NonZeroIterator of DenseDoubleVector never reaches the end)

> NonDefaultIterator of DenseDoubleVector never reaches the end
> -
>
> Key: HAMA-889
> URL: https://issues.apache.org/jira/browse/HAMA-889
> Project: Hama
>  Issue Type: Bug
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-889.patch
>
>
> NonZeroIterator of DenseDoubleVector never reaches the end.
> This bug can be checked with the following snippet of code.
> {code:title=TestDenseDoubleVector.java|borderStyle=solid}
>   @Test(timeout=100)
>   public void testIterator() {
>     double[] expectedRes = new double[] {38, 44, 50, 56, 0, 0, 3, 0, 0, 0};
>     DoubleVector vec = new DenseDoubleVector(expectedRes);
>
>     Iterator itrNonZero = vec.iterateNonZero();
>     while (itrNonZero.hasNext()) {
>       itrNonZero.next();
>     }
>   }
> {code}





[jira] [Updated] (HAMA-889) NonZeroIterator of DenseDoubleVector never reaches the end

2014-03-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-889:


Attachment: (was: HAMA-889.patch)

> NonZeroIterator of DenseDoubleVector never reaches the end
> --
>
> Key: HAMA-889
> URL: https://issues.apache.org/jira/browse/HAMA-889
> Project: Hama
>  Issue Type: Bug
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-889.patch
>
>
> NonZeroIterator of DenseDoubleVector never reaches the end.
> This bug can be checked with the following snippet of code.
> {code:title=TestDenseDoubleVector.java|borderStyle=solid}
>   @Test(timeout=100)
>   public void testIterator() {
>     double[] expectedRes = new double[] {38, 44, 50, 56, 0, 0, 3, 0, 0, 0};
>     DoubleVector vec = new DenseDoubleVector(expectedRes);
>
>     Iterator itrNonZero = vec.iterateNonZero();
>     while (itrNonZero.hasNext()) {
>       itrNonZero.next();
>     }
>   }
> {code}





[jira] [Updated] (HAMA-889) NonZeroIterator of DenseDoubleVector never reaches the end

2014-03-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-889:


Attachment: HAMA-889.patch

> NonZeroIterator of DenseDoubleVector never reaches the end
> --
>
> Key: HAMA-889
> URL: https://issues.apache.org/jira/browse/HAMA-889
> Project: Hama
>  Issue Type: Bug
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-889.patch
>
>
> NonZeroIterator of DenseDoubleVector never reaches the end.
> This bug can be checked with the following snippet of code.
> {code:title=TestDenseDoubleVector.java|borderStyle=solid}
>   @Test(timeout=100)
>   public void testIterator() {
>     double[] expectedRes = new double[] {38, 44, 50, 56, 0, 0, 3, 0, 0, 0};
>     DoubleVector vec = new DenseDoubleVector(expectedRes);
>
>     Iterator itrNonZero = vec.iterateNonZero();
>     while (itrNonZero.hasNext()) {
>       itrNonZero.next();
>     }
>   }
> {code}





[jira] [Updated] (HAMA-889) NonZeroIterator of DenseDoubleVector never reaches the end

2014-03-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-889:


Attachment: (was: HAMA-889.patch)

> NonZeroIterator of DenseDoubleVector never reaches the end
> --
>
> Key: HAMA-889
> URL: https://issues.apache.org/jira/browse/HAMA-889
> Project: Hama
>  Issue Type: Bug
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-889.patch
>
>
> NonZeroIterator of DenseDoubleVector never reaches the end.
> This bug can be checked with the following snippet of code.
> {code:title=TestDenseDoubleVector.java|borderStyle=solid}
>   @Test(timeout=100)
>   public void testIterator() {
>     double[] expectedRes = new double[] {38, 44, 50, 56, 0, 0, 3, 0, 0, 0};
>     DoubleVector vec = new DenseDoubleVector(expectedRes);
>
>     Iterator itrNonZero = vec.iterateNonZero();
>     while (itrNonZero.hasNext()) {
>       itrNonZero.next();
>     }
>   }
> {code}





[jira] [Updated] (HAMA-889) NonZeroIterator of DenseDoubleVector never reaches the end

2014-03-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-889:


Attachment: HAMA-889.patch

> NonZeroIterator of DenseDoubleVector never reaches the end
> --
>
> Key: HAMA-889
> URL: https://issues.apache.org/jira/browse/HAMA-889
> Project: Hama
>  Issue Type: Bug
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-889.patch, HAMA-889.patch
>
>
> NonZeroIterator of DenseDoubleVector never reaches the end.
> This bug can be checked with the following snippet of code.
> {code:title=TestDenseDoubleVector.java|borderStyle=solid}
>   @Test(timeout=100)
>   public void testIterator() {
>     double[] expectedRes = new double[] {38, 44, 50, 56, 0, 0, 3, 0, 0, 0};
>     DoubleVector vec = new DenseDoubleVector(expectedRes);
>
>     Iterator itrNonZero = vec.iterateNonZero();
>     while (itrNonZero.hasNext()) {
>       itrNonZero.next();
>     }
>   }
> {code}





[jira] [Updated] (HAMA-889) NonZeroIterator of DenseDoubleVector never reaches the end

2014-03-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-889:


Status: Patch Available  (was: Open)

I will submit it tomorrow if no objection.

> NonZeroIterator of DenseDoubleVector never reaches the end
> --
>
> Key: HAMA-889
> URL: https://issues.apache.org/jira/browse/HAMA-889
> Project: Hama
>  Issue Type: Bug
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-889.patch
>
>
> NonZeroIterator of DenseDoubleVector never reaches the end.
> This bug can be checked with the following snippet of code.
> {code:title=TestDenseDoubleVector.java|borderStyle=solid}
>   @Test(timeout=100)
>   public void testIterator() {
>     double[] expectedRes = new double[] {38, 44, 50, 56, 0, 0, 3, 0, 0, 0};
>     DoubleVector vec = new DenseDoubleVector(expectedRes);
>
>     Iterator itrNonZero = vec.iterateNonZero();
>     while (itrNonZero.hasNext()) {
>       itrNonZero.next();
>     }
>   }
> {code}





[jira] [Updated] (HAMA-889) NonZeroIterator of DenseDoubleVector never reaches the end

2014-03-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-889:


Attachment: HAMA-889.patch

> NonZeroIterator of DenseDoubleVector never reaches the end
> --
>
> Key: HAMA-889
> URL: https://issues.apache.org/jira/browse/HAMA-889
> Project: Hama
>  Issue Type: Bug
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.7.0
>
> Attachments: HAMA-889.patch
>
>
> NonZeroIterator of DenseDoubleVector never reaches the end.
> This bug can be checked with the following snippet of code.
> {code:title=TestDenseDoubleVector.java|borderStyle=solid}
>   @Test(timeout=100)
>   public void testIterator() {
>     double[] expectedRes = new double[] {38, 44, 50, 56, 0, 0, 3, 0, 0, 0};
>     DoubleVector vec = new DenseDoubleVector(expectedRes);
>
>     Iterator itrNonZero = vec.iterateNonZero();
>     while (itrNonZero.hasNext()) {
>       itrNonZero.next();
>     }
>   }
> {code}





[jira] [Created] (HAMA-889) NonZeroIterator of DenseDoubleVector never reaches the end

2014-03-08 Thread Yexi Jiang (JIRA)
Yexi Jiang created HAMA-889:
---

 Summary: NonZeroIterator of DenseDoubleVector never reaches the end
 Key: HAMA-889
 URL: https://issues.apache.org/jira/browse/HAMA-889
 Project: Hama
  Issue Type: Bug
  Components: math
Affects Versions: 0.7.0
Reporter: Yexi Jiang
Assignee: Yexi Jiang
 Fix For: 0.7.0


NonZeroIterator of DenseDoubleVector never reaches the end.

This bug can be checked with the following snippet of code.

{code:title=TestDenseDoubleVector.java|borderStyle=solid}
  @Test(timeout=100)
  public void testIterator() {
    double[] expectedRes = new double[] {38, 44, 50, 56, 0, 0, 3, 0, 0, 0};
    DoubleVector vec = new DenseDoubleVector(expectedRes);

    // If the iterator never reaches the end, this loop does not exit
    // and the 100 ms timeout fails the test.
    Iterator itrNonZero = vec.iterateNonZero();
    while (itrNonZero.hasNext()) {
      itrNonZero.next();
    }
  }
{code}
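
For comparison, here is a minimal sketch of a non-zero iterator that is
guaranteed to terminate, because its cursor is always bounded by the array
length. It is a standalone illustration, not the DenseDoubleVector code and
not the attached patch. NonZeroIndexIterator and advance are hypothetical
names, and the iterator yields indices rather than Hama's element type.

```java
import java.util.Iterator;
import java.util.NoSuchElementException;

/** Illustrative only: iterates over the indices of non-zero entries. */
class NonZeroIndexIterator implements Iterator<Integer> {
  private final double[] values;
  private int next; // index of the next non-zero entry, or values.length

  NonZeroIndexIterator(double[] values) {
    this.values = values;
    this.next = advance(0);
  }

  // Returns the first index >= from that holds a non-zero value,
  // or values.length when no such entry is left.
  private int advance(int from) {
    int i = from;
    while (i < values.length && values[i] == 0.0) {
      i++;
    }
    return i;
  }

  @Override
  public boolean hasNext() {
    return next < values.length;
  }

  @Override
  public Integer next() {
    if (!hasNext()) {
      throw new NoSuchElementException();
    }
    int current = next;
    next = advance(current + 1); // always moves forward, so the loop ends
    return current;
  }
}
```

The key point is that hasNext() compares against a fixed bound and next()
strictly advances the cursor, so trailing zeros such as those in the test
array cannot keep the loop alive.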





[jira] [Created] (HAMA-888) Add more test cases for DenseDoubleVector

2014-03-08 Thread Yexi Jiang (JIRA)
Yexi Jiang created HAMA-888:
---

 Summary: Add more test cases for DenseDoubleVector
 Key: HAMA-888
 URL: https://issues.apache.org/jira/browse/HAMA-888
 Project: Hama
  Issue Type: Improvement
  Components: math
Affects Versions: 0.6.3
Reporter: Yexi Jiang
Assignee: Yexi Jiang
 Fix For: 0.7.0


The current number of test cases is too small to cover this class. Add more
test cases for DenseDoubleVector.





Re: [jira] [Updated] (HAMA-883) [Research Task] Massive log event aggregation in real time using Apache Hama

2014-03-04 Thread Yexi Jiang
Yes, currently Hama does not support streaming input or streaming output.
That's why it is currently not a natural choice for people with real-time
computing needs.

Do we really need to make Hama support real-time computing? In that
case, we would need to compete with Storm...


2014-03-04 22:58 GMT-05:00 Chia-Hung Lin :

> I used Twitter Storm previously. Storm is an excellent framework in
> real time processing.
>
> Considering Hama in real time tasks, the framework in my opinion need
> to decouple io from hdfs so that the source/ input is not restricted
> to just hdfs.
>
> On 5 March 2014 09:30, Yexi Jiang  wrote:
> > Please correct me if I'm wrong. My understanding of aggregating the log
> is
> > the collect the generated from each monitored machine in real time. The
> > collecting procedure is continuous like a data stream and never end.
> >
> > I know how to use Hama to aggregate the logs batch by batch (e.g.
> aggregate
> > the logs incrementally each day), but I cannot immediately make up an
> idea
> > of using Hama to solve this problem in real time approach.
> >
> >
> > 2014-03-04 19:32 GMT-05:00 Edward J. Yoon :
> >
> >> Aggregators of Graph package are doing similar work. Monitoring and
> >> Global communication, ..., etc.
> >>
> >>
> >>
> >> On Tue, Mar 4, 2014 at 10:20 PM, Yexi Jiang 
> wrote:
> >> > I am very interested in this topic since my research area includes
> event
> >> > mining, but can BSP conducts the real time computing?
> >> >
> >> > I once used the message queue based solution to collect the event
> logs.
> >> >
> >> >
> >> > 2014-03-04 1:54 GMT-05:00 Edward J. Yoon (JIRA) :
> >> >
> >> >>
> >> >>  [
> >> >>
> >>
> https://issues.apache.org/jira/browse/HAMA-883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> >> ]
> >> >>
> >> >> Edward J. Yoon updated HAMA-883:
> >> >> 
> >> >>
> >> >> Summary: [Research Task] Massive log event aggregation in real
> time
> >> >> using Apache Hama  (was: [Research Task] Massive log data
> aggregation in
> >> >> real time using Apache Hama)
> >> >>
> >> >> > [Research Task] Massive log event aggregation in real time using
> >> Apache
> >> >> Hama
> >> >> >
> >> >>
> >>
> 
> >> >> >
> >> >> >     Key: HAMA-883
> >> >> > URL:
> https://issues.apache.org/jira/browse/HAMA-883
> >> >> > Project: Hama
> >> >> >  Issue Type: Task
> >> >> >Reporter: Edward J. Yoon
> >> >> >
> >> >> > BSP tasks can be used for aggregating log data streamed in real
> time.
> >> >> With this research task, we might able to platformization these kind
> of
> >> >> processing.
> >> >>
> >> >>
> >> >>
> >> >> --
> >> >> This message was sent by Atlassian JIRA
> >> >> (v6.2#6252)
> >> >>
> >> >
> >> >
> >> >
> >> > --
> >> > --
> >> > Yexi Jiang,
> >> > ECS 251,  yjian...@cs.fiu.edu
> >> > School of Computer and Information Science,
> >> > Florida International University
> >> > Homepage: http://users.cis.fiu.edu/~yjian004/
> >>
> >>
> >>
> >> --
> >> Edward J. Yoon (@eddieyoon)
> >> Chief Executive Officer
> >> DataSayer, Inc.
> >>
> >
> >
> >
> > --
> > --
> > Yexi Jiang,
> > ECS 251,  yjian...@cs.fiu.edu
> > School of Computer and Information Science,
> > Florida International University
> > Homepage: http://users.cis.fiu.edu/~yjian004/
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: [jira] [Updated] (HAMA-883) [Research Task] Massive log event aggregation in real time using Apache Hama

2014-03-04 Thread Yexi Jiang
I once implemented a system monitor/log collector using ActiveMQ and a
real-time anomaly detection algorithm on top of Twitter's Storm. I think
people like me would naturally choose such a streaming computing framework to
handle this scenario.

For real-time computation, what unique characteristics of Hama would
make people choose it over Storm? In my humble opinion, one unique
characteristic of Hama is that it provides a general BSP computing
framework (compared with Giraph, which provides BSP only for graph
computing). No one else has such an ability.


2014-03-04 21:02 GMT-05:00 Edward J. Yoon :

> The final goal can be a real-time event processing framework for
> distributed event detection, filtering, and aggregation. I guess that
> can be done with only 3 components:
>
>  * Event processing job configuration interface.
>  * User-defined function that handles the stream input.
>  * Master Aggregator(s) and its client library.
>
> I expect this can be applied such as web clickstream log analysis
> (large scale web servers), finding hot search keywords, detecting
> system errors in real time, and user will be able to program them in
> few minutes.
>
>
> On Wed, Mar 5, 2014 at 10:30 AM, Yexi Jiang  wrote:
> > Please correct me if I'm wrong. My understanding of aggregating the log
> is
> > the collect the generated from each monitored machine in real time. The
> > collecting procedure is continuous like a data stream and never end.
> >
> > I know how to use Hama to aggregate the logs batch by batch (e.g.
> aggregate
> > the logs incrementally each day), but I cannot immediately make up an
> idea
> > of using Hama to solve this problem in real time approach.
> >
> >
> > 2014-03-04 19:32 GMT-05:00 Edward J. Yoon :
> >
> >> Aggregators of Graph package are doing similar work. Monitoring and
> >> Global communication, ..., etc.
> >>
> >>
> >>
> >> On Tue, Mar 4, 2014 at 10:20 PM, Yexi Jiang 
> wrote:
> >> > I am very interested in this topic since my research area includes
> event
> >> > mining, but can BSP conducts the real time computing?
> >> >
> >> > I once used the message queue based solution to collect the event
> logs.
> >> >
> >> >
> >> > 2014-03-04 1:54 GMT-05:00 Edward J. Yoon (JIRA) :
> >> >
> >> >>
> >> >>  [
> >> >>
> >>
> https://issues.apache.org/jira/browse/HAMA-883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> >> ]
> >> >>
> >> >> Edward J. Yoon updated HAMA-883:
> >> >> 
> >> >>
> >> >> Summary: [Research Task] Massive log event aggregation in real
> time
> >> >> using Apache Hama  (was: [Research Task] Massive log data
> aggregation in
> >> >> real time using Apache Hama)
> >> >>
> >> >> > [Research Task] Massive log event aggregation in real time using
> >> Apache
> >> >> Hama
> >> >> >
> >> >>
> >>
> 
> >> >> >
> >> >> > Key: HAMA-883
> >> >> > URL:
> https://issues.apache.org/jira/browse/HAMA-883
> >> >> > Project: Hama
> >> >> >  Issue Type: Task
> >> >> >Reporter: Edward J. Yoon
> >> >> >
> >> >> > BSP tasks can be used for aggregating log data streamed in real
> time.
> >> >> With this research task, we might able to platformization these kind
> of
> >> >> processing.
> >> >>
> >> >>
> >> >>
> >> >> --
> >> >> This message was sent by Atlassian JIRA
> >> >> (v6.2#6252)
> >> >>
> >> >
> >> >
> >> >
> >> > --
> >> > --
> >> > Yexi Jiang,
> >> > ECS 251,  yjian...@cs.fiu.edu
> >> > School of Computer and Information Science,
> >> > Florida International University
> >> > Homepage: http://users.cis.fiu.edu/~yjian004/
> >>
> >>
> >>
> >> --
> >> Edward J. Yoon (@eddieyoon)
> >> Chief Executive Officer
> >> DataSayer, Inc.
> >>
> >
> >
> >
> > --
> > --
> > Yexi Jiang,
> > ECS 251,  yjian...@cs.fiu.edu
> > School of Computer and Information Science,
> > Florida International University
> > Homepage: http://users.cis.fiu.edu/~yjian004/
>
>
>
> --
> Edward J. Yoon (@eddieyoon)
> Chief Executive Officer
> DataSayer, Inc.
>





Re: [jira] [Updated] (HAMA-883) [Research Task] Massive log event aggregation in real time using Apache Hama

2014-03-04 Thread Yexi Jiang
Please correct me if I'm wrong. My understanding of log aggregation is
that it collects the logs generated by each monitored machine in real time. The
collection procedure is continuous, like a data stream, and never ends.

I know how to use Hama to aggregate the logs batch by batch (e.g. aggregate
the logs incrementally each day), but I cannot immediately think of a way
to use Hama to solve this problem in real time.


2014-03-04 19:32 GMT-05:00 Edward J. Yoon :

> Aggregators of Graph package are doing similar work. Monitoring and
> Global communication, ..., etc.
>
>
>
> On Tue, Mar 4, 2014 at 10:20 PM, Yexi Jiang  wrote:
> > I am very interested in this topic since my research area includes event
> > mining, but can BSP conducts the real time computing?
> >
> > I once used the message queue based solution to collect the event logs.
> >
> >
> > 2014-03-04 1:54 GMT-05:00 Edward J. Yoon (JIRA) :
> >
> >>
> >>  [
> >>
> https://issues.apache.org/jira/browse/HAMA-883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
> ]
> >>
> >> Edward J. Yoon updated HAMA-883:
> >> 
> >>
> >> Summary: [Research Task] Massive log event aggregation in real time
> >> using Apache Hama  (was: [Research Task] Massive log data aggregation in
> >> real time using Apache Hama)
> >>
> >> > [Research Task] Massive log event aggregation in real time using
> Apache
> >> Hama
> >> >
> >>
> 
> >> >
> >> > Key: HAMA-883
> >> > URL: https://issues.apache.org/jira/browse/HAMA-883
> >> > Project: Hama
> >> >  Issue Type: Task
> >> >Reporter: Edward J. Yoon
> >> >
> >> > BSP tasks can be used for aggregating log data streamed in real time.
> >> With this research task, we might able to platformization these kind of
> >> processing.
> >>
> >>
> >>
> >> --
> >> This message was sent by Atlassian JIRA
> >> (v6.2#6252)
> >>
> >
> >
> >
> > --
> > --
> > Yexi Jiang,
> > ECS 251,  yjian...@cs.fiu.edu
> > School of Computer and Information Science,
> > Florida International University
> > Homepage: http://users.cis.fiu.edu/~yjian004/
>
>
>
> --
> Edward J. Yoon (@eddieyoon)
> Chief Executive Officer
> DataSayer, Inc.
>





Re: [jira] [Updated] (HAMA-883) [Research Task] Massive log event aggregation in real time using Apache Hama

2014-03-04 Thread Yexi Jiang
I am very interested in this topic since my research area includes event
mining, but can BSP perform real-time computing?

I once used a message-queue-based solution to collect event logs.


2014-03-04 1:54 GMT-05:00 Edward J. Yoon (JIRA) :

>
>  [
> https://issues.apache.org/jira/browse/HAMA-883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel]
>
> Edward J. Yoon updated HAMA-883:
> 
>
> Summary: [Research Task] Massive log event aggregation in real time
> using Apache Hama  (was: [Research Task] Massive log data aggregation in
> real time using Apache Hama)
>
> > [Research Task] Massive log event aggregation in real time using Apache
> Hama
> >
> 
> >
> > Key: HAMA-883
> > URL: https://issues.apache.org/jira/browse/HAMA-883
> > Project: Hama
> >  Issue Type: Task
> >Reporter: Edward J. Yoon
> >
> > BSP tasks can be used for aggregating log data streamed in real time.
> With this research task, we might able to platformization these kind of
> processing.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.2#6252)
>





Re: Implementation of DoubleVector/DenseDoubleVector/SparseDoubleVector

2014-03-03 Thread Yexi Jiang
> I think SparseDoubleVector can be represented by index/value pairs.
> And, toArray can be implemented like:

> public double[] toArray() {
>   double[] arr = new double[size];
>   for (Element e : vector) {
>     arr[e.getIndex()] = e.getValue();
>   }
>   return arr;
> }


The above piece of code would still waste a lot of space. Suppose the
dimension (size) is 10^6 and only a couple of entries are set: this piece
of code would still return an array with one million entries.

If we do not care about the space cost, this method is fine.
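To make the space concern concrete, here is a minimal sketch, assuming the sparse vector keeps only its non-zero entries in an index/value map. SparseSketch and its methods are hypothetical names for illustration and are not the HAMA-863 implementation.

```java
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical sparse vector for illustration only; not Hama's SparseDoubleVector. */
class SparseSketch {
  // Logical dimension of the vector.
  private final int size;
  // Only non-zero entries are stored, keyed by index.
  private final Map<Integer, Double> entries = new TreeMap<>();

  SparseSketch(int size) {
    this.size = size;
  }

  void set(int index, double value) {
    if (index < 0 || index >= size) {
      throw new IndexOutOfBoundsException("index " + index);
    }
    if (value == 0.0) {
      entries.remove(index);
    } else {
      entries.put(index, value);
    }
  }

  // Number of entries actually held in memory.
  int storedEntries() {
    return entries.size();
  }

  // Dense copy: always allocates `size` doubles, however few entries are set.
  double[] toArray() {
    double[] arr = new double[size];
    for (Map.Entry<Integer, Double> e : entries.entrySet()) {
      arr[e.getKey()] = e.getValue();
    }
    return arr;
  }
}
```

With a dimension of 10^6 and two entries set, storedEntries() is 2 while toArray() still allocates one million doubles (roughly 8 MB), which is the cost I am worried about.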





2014-03-03 0:44 GMT-05:00 Edward J. Yoon :

> > 4. Is the toArray method available for SparseDoubleVector? In my opinion,
> > it is better not to do that.
>
> I think SparseDoubleVector can be represented by index/value pairs.
> And, toArray can be implemented like:
>
> public double[] toArray() {
>   double[] arr = new double[size];
>   for(Element e : vector) {
>     arr[e.getIndex()] = e.getValue();
>   }
>   return arr;
> }
>
> For bit vector, java.util.BitSet can be used.
>
> On Tue, Feb 25, 2014 at 6:14 AM, Yexi Jiang  wrote:
> > Hi, All,
> >
> > I am currently working on the SparseDoubleVector (HAMA-863) and found
> some
> > unclear places about the vector implementation.
> >
> > 1. What is the definition for a vector? According to the implementation,
> it
> > is implemented as elementwise sqrt. In such a case, problem will occur if
> > the one of the entry is negative.
> >
> > 2. Most of the operators are conducted on a copy of the current object.
> Do
> > we also need to provide a set of operators that directly modify the
> current
> > object itself? e.g. addOriginal, subtractOriginal, etc.
> >
> > 3. When a DenseDoubleVector operates with a SparseDoubleVector, what will
> > be the concrete type of the result object? A simple implementation is to
> > always return a SparseDoubleVector, even if it is dense. A complex
> > implementation is we maintain a sparsity ratio (the ratio of non-default
> > entries), if the ratio exceed a threshold, a DenseDoubleVector will be
> > returned.
> >
> > 4. Is the toArray method available for SparseDoubleVector? In my opinion,
> > it is better not to do that.
> >
> >
> > Regards,
> > Yexi
>
>
>
> --
> Edward J. Yoon (@eddieyoon)
> Chief Executive Officer
> DataSayer, Inc.
>





[jira] [Commented] (HAMA-877) Support MapReduce job in Hama

2014-02-26 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13913842#comment-13913842
 ] 

Yexi Jiang commented on HAMA-877:
-

What's the relationship of this issue to 
[HAMA-535|https://issues.apache.org/jira/browse/HAMA-535]?

> Support MapReduce job in Hama
> -
>
> Key: HAMA-877
> URL: https://issues.apache.org/jira/browse/HAMA-877
> Project: Hama
>  Issue Type: New Feature
>Affects Versions: 0.6.3
>Reporter: Edward J. Yoon
>Assignee: Edward J. Yoon
>Priority: Blocker
> Fix For: 0.7.0
>
>
> Support MapReduce job in Hama



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


Implementation of DoubleVector/DenseDoubleVector/SparseDoubleVector

2014-02-24 Thread Yexi Jiang
Hi, All,

I am currently working on the SparseDoubleVector (HAMA-863) and found some
unclear places about the vector implementation.

1. What is the definition of sqrt for a vector? According to the implementation, it
is implemented as elementwise sqrt. In such a case, a problem will occur if
one of the entries is negative.

2. Most of the operators are conducted on a copy of the current object. Do
we also need to provide a set of operators that directly modify the current
object itself? e.g. addOriginal, subtractOriginal, etc.

3. When a DenseDoubleVector operates with a SparseDoubleVector, what will
be the concrete type of the result object? A simple implementation is to
always return a SparseDoubleVector, even if the result is dense. A more complex
implementation is to maintain a sparsity ratio (the ratio of non-default
entries); if the ratio exceeds a threshold, a DenseDoubleVector is
returned.

4. Is the toArray method available for SparseDoubleVector? In my opinion,
it is better not to do that.
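For question 3, the sparsity-ratio idea could be sketched roughly as below. This is only an illustration under my own assumptions: the class ResultTypeChooser, its method names, and the 0.5 threshold are all hypothetical, and zero is taken as the default entry value.

```java
/** Hypothetical helper; names and the 0.5 threshold are assumptions, not Hama API. */
class ResultTypeChooser {
  // Assumed cut-off: above this ratio the result would be stored densely.
  static final double DENSE_THRESHOLD = 0.5;

  // Ratio of non-default (here: non-zero) entries to the dimension.
  static double sparsityRatio(double[] values) {
    if (values.length == 0) {
      return 0.0;
    }
    int nonZero = 0;
    for (double v : values) {
      if (v != 0.0) {
        nonZero++;
      }
    }
    return (double) nonZero / values.length;
  }

  // True when the result of an operation should be returned as a dense vector.
  static boolean shouldBeDense(double[] values) {
    return sparsityRatio(values) > DENSE_THRESHOLD;
  }
}
```

The trade-off is that the ratio has to be maintained or recomputed after every operation, which is why always returning a sparse vector is the simpler option.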


Regards,
Yexi


Re: GSoC 2014

2014-02-12 Thread Yexi Jiang
Have you already done that? It seems the deadline is tomorrow.


2014-02-12 20:30 GMT-05:00 Edward J. Yoon :

> ASF Members and committers can volunteer to mentor.
>
> If you have an idea and want to mentor it, Please feel free to open
> the JIRA ticket. ;-)
>
> On Thu, Feb 13, 2014 at 1:31 AM, Anastasis Andronidis
>  wrote:
> > Are we going to participate in GSoC 2014?
> >
> > Cheers,
> > Anastasis
>
>
>
> --
> Best Regards, Edward J. Yoon
> @eddieyoon
>





Re: Comparison between 0.6.3 and TRUNK

2014-02-12 Thread Yexi Jiang
Looks pretty good. Can we try even larger scale?


2014-02-11 3:18 GMT-05:00 Edward J. Yoon :

> See
> https://wiki.apache.org/hama/Benchmarks#PageRank_Performance_0.7.0-SNAPSHOT_vs_0.6.3
>
>
>
> On Mon, Jan 27, 2014 at 5:09 PM, Edward J. Yoon 
> wrote:
> > FYI, this result shows the improvement obtained by removing message
> > bundling overheads.
> >
> > 
> > PageRank with 3 tasks on single machine
> > bsp.child.java.opts: -Xmx1524m
> > hama.graph.vertices.info: ListVerticesInfo
> >
> > * Input vertices: 2 max edges per vertex: 100
> > Hama-0.6.3: 87.895 seconds
> > Hama-TRUNK: 99.689 seconds
> >
> >  * Input vertices: 4 max edges per vertex: 200
> > Hama-0.6.3: 340.094 seconds
> > Hama-TRUNK: 420.992 seconds
> >
> >  * Input vertices: 4 max edges per vertex: 300
> > Hama-0.6.3: 556.07 seconds
> > Hama-TRUNK: 583.098 seconds
> >
> >  * Input vertices: 4 max edges per vertex: 400
> > Hama-0.6.3: 733.408 seconds
> > Hama-TRUNK: 739.116 seconds
> >
> >  * Input vertices: 4 max edges per vertex: 500
> > Hama-0.6.3: java.lang.OutOfMemoryError: Java heap space
> > Hama-TRUNK: 1207.854 seconds
> >
> >  * Input vertices: 4 max edges per vertex: 600
> > Hama-0.6.3: java.lang.OutOfMemoryError: Java heap space
> > Hama-TRUNK: java.lang.OutOfMemoryError: Java heap space
> > Hama-TRUNK (DiskVerticesInfo): java.lang.OutOfMemoryError: Java heap
> space
> >
> > --
> > Best Regards, Edward J. Yoon
> > @eddieyoon
>
>
>
> --
> Best Regards, Edward J. Yoon
> @eddieyoon
>





Re: [jira] [Commented] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-02-03 Thread Yexi Jiang
What is this?

Test Result <https://builds.apache.org/job/Hama-Nightly-for-Hadoop-1.x/1169/testReport/> (1 failure / +1)

   - org.apache.hama.bsp.TestBSPTaskFaults.testPingOnTaskSetupFailure <https://builds.apache.org/job/Hama-Nightly-for-Hadoop-1.x/1169/testReport/junit/org.apache.hama.bsp/TestBSPTaskFaults/testPingOnTaskSetupFailure/>



2014-02-03 Hudson (JIRA) :

>
> [
> https://issues.apache.org/jira/browse/HAMA-859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13890240#comment-13890240]
>
> Hudson commented on HAMA-859:
> -
>
> FAILURE: Integrated in Hama-Nightly-for-Hadoop-1.x #1169 (See [
> https://builds.apache.org/job/Hama-Nightly-for-Hadoop-1.x/1169/])
> HAMA-859: change the version of commons-cli (edwardyoon: rev 1563867)
> * /hama/trunk/core/pom.xml
> * /hama/trunk/pom.xml
>
>
> > Leverage commons cli2 to parse the input argument for NeuralNetwork
> Example
> >
> ---
> >
> > Key: HAMA-859
> > URL: https://issues.apache.org/jira/browse/HAMA-859
> > Project: Hama
> >  Issue Type: Improvement
> >  Components: examples
> >Affects Versions: 0.7.0
> >Reporter: Yexi Jiang
> >Assignee: Yexi Jiang
> >  Labels: examples
> > Fix For: 0.7.0
> >
> > Attachments: HAMA-859.patch
> >
> >
> > Current parsing is handled manually, which is not flexible and
> error-prone. By replacing the parsing with commons cli2, the user can have
> more flexibility to set the input parameters.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.1.5#6160)
>





Re: [jira] [Comment Edited] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-02-03 Thread Yexi Jiang
According to the log, it seems you already did it. And it seems the version
of cli is still not correct.


2014-02-03 Edward J. Yoon (JIRA) :

>
> [
> https://issues.apache.org/jira/browse/HAMA-859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889481#comment-13889481]
>
> Edward J. Yoon edited comment on HAMA-859 at 2/3/14 1:50 PM:
> -
>
> Moreover, maven release doesn't allow snapshot dependency.
>
>
> was (Author: udanax):
> Moreover, maven doesn't allow snapshot dependency.
>
> > Leverage commons cli2 to parse the input argument for NeuralNetwork
> Example
> >
> ---
> >
> > Key: HAMA-859
> > URL: https://issues.apache.org/jira/browse/HAMA-859
> > Project: Hama
> >  Issue Type: Improvement
> >  Components: examples
> >Affects Versions: 0.7.0
> >Reporter: Yexi Jiang
> >Assignee: Yexi Jiang
> >  Labels: examples
> > Fix For: 0.7.0
> >
> > Attachments: HAMA-859.patch
> >
> >
> > Current parsing is handled manually, which is not flexible and
> error-prone. By replacing the parsing with commons cli2, the user can have
> more flexibility to set the input parameters.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.1.5#6160)
>





[jira] [Commented] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-02-03 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889461#comment-13889461
 ] 

Yexi Jiang commented on HAMA-859:
-

It seems that the ParseException class in the original cli package cannot be 
found.

> Leverage commons cli2 to parse the input argument for NeuralNetwork Example
> ---
>
> Key: HAMA-859
> URL: https://issues.apache.org/jira/browse/HAMA-859
> Project: Hama
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.7.0
>Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: examples
> Fix For: 0.7.0
>
> Attachments: HAMA-859.patch
>
>
> Current parsing is handled manually, which is not flexible and error-prone. 
> By replacing the parsing with commons cli2, the user can have more 
> flexibility to set the input parameters.





[jira] [Updated] (HAMA-864) Fix/improve DoubleVector and DenseDoubleVector

2014-01-29 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-864:


Status: Patch Available  (was: Open)

Without objection, I will commit this patch this week.

> Fix/improve DoubleVector and DenseDoubleVector
> --
>
> Key: HAMA-864
> URL: https://issues.apache.org/jira/browse/HAMA-864
> Project: Hama
>  Issue Type: Improvement
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.6.3
>
> Attachments: HAMA-864.patch
>
>
> Including remove deprecated methods, improve the descriptions, and fix bugs.





[jira] [Updated] (HAMA-864) Fix/improve DoubleVector and DenseDoubleVector

2014-01-29 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-864:


Attachment: HAMA-864.patch

> Fix/improve DoubleVector and DenseDoubleVector
> --
>
> Key: HAMA-864
> URL: https://issues.apache.org/jira/browse/HAMA-864
> Project: Hama
>  Issue Type: Improvement
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.6.3
>
> Attachments: HAMA-864.patch
>
>
> Including remove deprecated methods, improve the descriptions, and fix bugs.





[jira] [Updated] (HAMA-864) Fix/improve DoubleVector and DenseDoubleVector

2014-01-29 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-864:


Description: Including remove deprecated methods, improve the descriptions, 
and fix bugs.  (was: Current implementation subtractFrom(int scalar) is 
implemented exactly the same as subtract(int scalar). 

The subtract(int scalar) is implemented as (vector - scalar). However, the 
subtractFrom should be implemented as (scalar - vector).)

> Fix/improve DoubleVector and DenseDoubleVector
> --
>
> Key: HAMA-864
> URL: https://issues.apache.org/jira/browse/HAMA-864
> Project: Hama
>  Issue Type: Bug
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.6.3
>
>
> Including remove deprecated methods, improve the descriptions, and fix bugs.





[jira] [Updated] (HAMA-864) Fix/improve DoubleVector and DenseDoubleVector

2014-01-29 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-864:


Issue Type: Improvement  (was: Bug)

> Fix/improve DoubleVector and DenseDoubleVector
> --
>
> Key: HAMA-864
> URL: https://issues.apache.org/jira/browse/HAMA-864
> Project: Hama
>  Issue Type: Improvement
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.6.3
>
>
> Including remove deprecated methods, improve the descriptions, and fix bugs.





[jira] [Updated] (HAMA-864) Fix/improve DoubleVector and DenseDoubleVector

2014-01-29 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-864:


Summary: Fix/improve DoubleVector and DenseDoubleVector  (was: Fix the 
subtractFrom() method of DoubleVector and DenseDoubleVector)

> Fix/improve DoubleVector and DenseDoubleVector
> --
>
> Key: HAMA-864
> URL: https://issues.apache.org/jira/browse/HAMA-864
> Project: Hama
>  Issue Type: Bug
>  Components: math
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: math
> Fix For: 0.6.3
>
>
> Current implementation subtractFrom(int scalar) is implemented exactly the 
> same as subtract(int scalar). 
> The subtract(int scalar) is implemented as (vector - scalar). However, the 
> subtractFrom should be implemented as (scalar - vector).





[jira] [Created] (HAMA-864) Fix the subtractFrom() method of DoubleVector and DenseDoubleVector

2014-01-29 Thread Yexi Jiang (JIRA)
Yexi Jiang created HAMA-864:
---

 Summary: Fix the subtractFrom() method of DoubleVector and 
DenseDoubleVector
 Key: HAMA-864
 URL: https://issues.apache.org/jira/browse/HAMA-864
 Project: Hama
  Issue Type: Bug
  Components: math
Affects Versions: 0.7.0
Reporter: Yexi Jiang
Assignee: Yexi Jiang
 Fix For: 0.6.3


In the current implementation, subtractFrom(int scalar) is implemented exactly the same 
as subtract(int scalar). 

subtract(int scalar) is implemented as (vector - scalar). However, 
subtractFrom should be implemented as (scalar - vector).
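
The intended difference can be sketched as follows. This is a minimal illustration on plain double arrays, not the DoubleVector code itself; the class ScalarSubtraction is hypothetical, and the scalar is typed as double here for simplicity.

```java
/** Hypothetical illustration of the two operations; not the DoubleVector code. */
class ScalarSubtraction {
  // vector - scalar, element by element; returns a new array.
  static double[] subtract(double[] vector, double scalar) {
    double[] result = new double[vector.length];
    for (int i = 0; i < vector.length; i++) {
      result[i] = vector[i] - scalar;
    }
    return result;
  }

  // scalar - vector, element by element; returns a new array.
  static double[] subtractFrom(double[] vector, double scalar) {
    double[] result = new double[vector.length];
    for (int i = 0; i < vector.length; i++) {
      result[i] = scalar - vector[i];
    }
    return result;
  }
}
```

For the vector (1, 2, 3) and the scalar 10, subtract gives (-9, -8, -7) while subtractFrom gives (9, 8, 7).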





About the DoubleVector/DoubleMath

2014-01-29 Thread Yexi Jiang
Hi, all,

It seems that Apache Mahout already has good implementations of
Vector/Matrix. Do we also need to define our own, or can we just reuse theirs?

If we define our own, we will need to maintain them in the future. If we reuse
theirs, we will need to do a significant amount of refactoring across the whole ml
package.

Regards,
Yexi



[jira] [Created] (HAMA-863) Implement SparseVector

2014-01-29 Thread Yexi Jiang (JIRA)
Yexi Jiang created HAMA-863:
---

 Summary: Implement SparseVector
 Key: HAMA-863
 URL: https://issues.apache.org/jira/browse/HAMA-863
 Project: Hama
  Issue Type: New Feature
  Components: math
Affects Versions: 0.7.0
Reporter: Yexi Jiang
Assignee: Yexi Jiang
 Fix For: 0.7.0


Implement the SparseVector that implements the DoubleVector interface.





[jira] [Updated] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-01-28 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-859:


Resolution: Implemented
Status: Resolved  (was: Patch Available)

> Leverage commons cli2 to parse the input argument for NeuralNetwork Example
> ---
>
> Key: HAMA-859
> URL: https://issues.apache.org/jira/browse/HAMA-859
> Project: Hama
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: examples
> Fix For: 0.7.0
>
> Attachments: HAMA-859.patch
>
>
> Current parsing is handled manually, which is not flexible and error-prone. 
> By replacing the parsing with commons cli2, the user can have more 
> flexibility to set the input parameters.





[jira] [Commented] (HAMA-861) Add statistical report for random data generator tools

2014-01-26 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13882529#comment-13882529
 ] 

Yexi Jiang commented on HAMA-861:
-

Besides mean and variance, what else is needed? 

> Add statistical report for random data generator tools
> --
>
> Key: HAMA-861
> URL: https://issues.apache.org/jira/browse/HAMA-861
> Project: Hama
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.6.3
>Reporter: Edward J. Yoon
>Assignee: Edward J. Yoon
> Fix For: 0.7.0
>
>
> Currently, random data generator doesn't print any report. 
> {code}
> edward@edward-VirtualBox:~/workspace/hama-trunk$ bin/hama jar 
> examples/target/hama-examples-0.7.0-SNAPSHOT.jar gen fastgen 8 400 
> trunk/8-400 3
> 14/01/27 10:57:45 INFO bsp.BSPJobClient: Running job: job_201401271056_0002
> 14/01/27 10:57:48 INFO bsp.BSPJobClient: Current supersteps number: 0
> 14/01/27 10:58:30 INFO bsp.BSPJobClient: The total number of supersteps: 0
> 14/01/27 10:58:30 INFO bsp.BSPJobClient: Counters: 3
> 14/01/27 10:58:30 INFO bsp.BSPJobClient:   
> org.apache.hama.bsp.JobInProgress$JobCounter
> 14/01/27 10:58:30 INFO bsp.BSPJobClient: SUPERSTEPS=0
> 14/01/27 10:58:30 INFO bsp.BSPJobClient: LAUNCHED_TASKS=3
> 14/01/27 10:58:30 INFO bsp.BSPJobClient:   
> org.apache.hama.bsp.BSPPeerImpl$PeerCounter
> 14/01/27 10:58:30 INFO bsp.BSPJobClient: SUPERSTEP_SUM=0
> Job Finished in 45.683 seconds
> {code}





[jira] [Updated] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-01-25 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-859:


Labels: examples  (was: )
Status: Patch Available  (was: Open)

Without objection, I will submit the changes within the next week.

> Leverage commons cli2 to parse the input argument for NeuralNetwork Example
> ---
>
> Key: HAMA-859
> URL: https://issues.apache.org/jira/browse/HAMA-859
> Project: Hama
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: examples
> Fix For: 0.7.0
>
> Attachments: HAMA-859.patch
>
>
> Currently the input arguments are parsed manually, which is inflexible and error-prone.
> Replacing the manual parsing with commons cli2 gives the user more
> flexibility in setting the input parameters.
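
To illustrate the difference between positional and named arguments described above, here is a minimal sketch in plain Java. It does not use the commons cli2 API and is not taken from HAMA-859.patch; the class name NamedOptionSketch and the option names (mode, input, iterations) are assumptions made up for this example. It only shows why "--name value" pairs with defaults are harder to get wrong than arguments read by position.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class NamedOptionSketch {

  /** Parses "--name value" pairs and rejects anything not in that shape. */
  static Map<String, String> parse(String[] args) {
    Map<String, String> options = new LinkedHashMap<>();
    for (int i = 0; i < args.length; i += 2) {
      String key = args[i];
      if (!key.startsWith("--") || key.length() == 2) {
        throw new IllegalArgumentException("Expected an option name, got: " + key);
      }
      if (i + 1 >= args.length) {
        throw new IllegalArgumentException("Missing value for option: " + key);
      }
      options.put(key.substring(2), args[i + 1]);
    }
    return options;
  }

  /** Returns the option value, or the default when the option was not given. */
  static String get(Map<String, String> options, String name, String defaultValue) {
    return options.getOrDefault(name, defaultValue);
  }

  public static void main(String[] args) {
    // Hypothetical option names, chosen only for illustration.
    String[] sample = {"--mode", "train", "--input", "/tmp/train.data"};
    Map<String, String> options = parse(sample);
    System.out.println("mode = " + get(options, "mode", "train"));
    System.out.println("input = " + get(options, "input", ""));
    System.out.println("iterations = " + get(options, "iterations", "1000"));
  }
}
```

With named options the order of the arguments no longer matters and optional parameters can fall back to defaults, which is the kind of flexibility a CLI library would provide along with help output and validation.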





[jira] [Updated] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-01-25 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-859:


Attachment: HAMA-859.patch

Changes:
1. Import commons-cli2 library.
2. Leverage commons cli2 to parse the input argument for NeuralNetwork Example.



> Leverage commons cli2 to parse the input argument for NeuralNetwork Example
> ---
>
> Key: HAMA-859
> URL: https://issues.apache.org/jira/browse/HAMA-859
> Project: Hama
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
> Fix For: 0.7.0
>
> Attachments: HAMA-859.patch
>
>
> Currently the input arguments are parsed manually, which is inflexible and error-prone.
> Replacing the manual parsing with commons cli2 gives the user more
> flexibility in setting the input parameters.





[jira] [Comment Edited] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-01-25 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13882032#comment-13882032
 ] 

Yexi Jiang edited comment on HAMA-859 at 1/25/14 8:27 PM:
--

Changes:
1. Import commons-cli2 library. Update the hama parent pom.xml and hama-core 
pom.xml.
2. Leverage commons cli2 to parse the input argument for NeuralNetwork Example.




was (Author: yxjiang):
Changes:
1. Import commons-cli2 library.
2. Leverage commons cli2 to parse the input argument for NeuralNetwork Example.



> Leverage commons cli2 to parse the input argument for NeuralNetwork Example
> ---
>
> Key: HAMA-859
> URL: https://issues.apache.org/jira/browse/HAMA-859
> Project: Hama
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.7.0
>Reporter: Yexi Jiang
>Assignee: Yexi Jiang
> Fix For: 0.7.0
>
> Attachments: HAMA-859.patch
>
>
> Currently the input arguments are parsed manually, which is inflexible and error-prone.
> Replacing the manual parsing with commons cli2 gives the user more
> flexibility in setting the input parameters.





[jira] [Commented] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-01-23 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13880492#comment-13880492
 ] 

Yexi Jiang commented on HAMA-859:
-

[~udanax] I found that commons cli1 has already been set as a dependency.
Can I also import the cli2 library?

> Leverage commons cli2 to parse the input argument for NeuralNetwork Example
> ---
>
> Key: HAMA-859
> URL: https://issues.apache.org/jira/browse/HAMA-859
> Project: Hama
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.7.0
>Reporter: Yexi Jiang
>Assignee: Yexi Jiang
> Fix For: 0.7.0
>
>
> Currently the input arguments are parsed manually, which is inflexible and error-prone.
> Replacing the manual parsing with commons cli2 gives the user more
> flexibility in setting the input parameters.





[jira] [Updated] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-01-23 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-859:


  Component/s: examples
Affects Version/s: 0.7.0
Fix Version/s: 0.7.0

> Leverage commons cli2 to parse the input argument for NeuralNetwork Example
> ---
>
> Key: HAMA-859
> URL: https://issues.apache.org/jira/browse/HAMA-859
> Project: Hama
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
> Fix For: 0.7.0
>
>
> Currently the input arguments are parsed manually, which is inflexible and error-prone.
> Replacing the manual parsing with commons cli2 gives the user more
> flexibility in setting the input parameters.





[jira] [Updated] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-01-23 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-859:


Description: Currently the input arguments are parsed manually, which is inflexible and
error-prone. Replacing the manual parsing with commons cli2 gives the user more
flexibility in setting the input parameters.

> Leverage commons cli2 to parse the input argument for NeuralNetwork Example
> ---
>
> Key: HAMA-859
> URL: https://issues.apache.org/jira/browse/HAMA-859
> Project: Hama
>  Issue Type: Improvement
>  Components: examples
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
> Fix For: 0.7.0
>
>
> Currently the input arguments are parsed manually, which is inflexible and error-prone.
> Replacing the manual parsing with commons cli2 gives the user more
> flexibility in setting the input parameters.





[jira] [Created] (HAMA-859) Leverage commons cli2 to parse the input argument for NeuralNetwork Example

2014-01-23 Thread Yexi Jiang (JIRA)
Yexi Jiang created HAMA-859:
---

 Summary: Leverage commons cli2 to parse the input argument for 
NeuralNetwork Example
 Key: HAMA-859
 URL: https://issues.apache.org/jira/browse/HAMA-859
 Project: Hama
  Issue Type: Improvement
Reporter: Yexi Jiang
Assignee: Yexi Jiang








Hama Jira does not work

2014-01-22 Thread Yexi Jiang
It seems that the Hama JIRA is not working properly. I cannot view the issues.
Moreover, when I click Change Log, it shows the following error.



Hama - Version 0.6.3

   - Release date: 10/Oct/13


   - An error occurred whilst rendering this message. Please contact the
   administrators, and inform them of this bug. Details: ---
   org.apache.velocity.exception.MethodInvocationException: Invocation of
   method 'getIssueCountByFixForVersion' in class
   com.atlassian.jira.plugin.projectpanel.impl.VersionDrillDownRenderer$RoadMapGraphingBean
   threw exception com.atlassian.jira.issue.index.SearchUnavailableException:
   com.atlassian.util.concurrent.LazyReference$InitializationException:
   com.atlassian.util.concurrent.LazyReference$InitializationException:
   com.atlassian.jira.util.RuntimeIOException: java.io.FileNotFoundException:
   /x1/jira/jira-main/jira-data/caches/indexes/issues/_5y3x.cfs (No such file
   or directory) at templates/plugins/jira/projectpanels/roadmap-panel.vm[line
   94, column 37]
   at org.apache.velocity.runtime.parser.node.ASTMethod.handleInvocationException(ASTMethod.java:337)
   at org.apache.velocity.runtime.parser.node.ASTMethod.execute(ASTMethod.java:284)
   at org.apache.velocity.runtime.parser.node.ASTReference.execute(ASTReference.java:262)
   at org.apache.velocity.runtime.parser.node.ASTReference.value(ASTReference.java:507)
   at org.apache.velocity.runtime.parser.node.ASTExpression.value(ASTExpression.java:71)
   at org.apache.velocity.runtime.parser.node.ASTSetDirective.render(ASTSetDirective.java:142)
   at org.apache.velocity.runtime.parser.node.ASTBlock.render(ASTBlock.java:72)
   at org.apache.velocity.runtime.directive.VelocimacroProxy.render(VelocimacroProxy.java:212)
   at org.apache.velocity.runtime.directive.RuntimeMacro.render(RuntimeMacro.java:247)
   at org.apache.velocity.runtime.parser.node.ASTDirective.render(ASTDirective.java:175)
   at org.apache.velocity.runtime.parser.node.ASTBlock.render(ASTBlock.java:72)
   at org.apache.velocity.runtime.directive.Foreach.performIteration(Foreach.java:393)
   at org.apache.velocity.runtime.directive.Foreach.render(Foreach.java:316)
   at org.apache.velocity.runtime.parser.node.ASTDirective.render(ASTDirective.java:175)
   at org.apache.velocity.runtime.parser.node.ASTBlock.render(ASTBlock.java:72)
   at org.apache.velocity.runtime.parser.node.ASTIfStatement.render(ASTIfStatement.java:87)
   at org.apache.velocity.runtime.parser.node.ASTBlock.render(ASTBlock.java:72)
   at org.apache.velocity.runtime.parser.node.SimpleNode.render(SimpleNode.java:336)
   at org.apache.velocity.runtime.parser.node.ASTIfStatement.render(ASTIfStatement.java:106)
   at org.apache.velocity.runtime.parser.node.SimpleNode.render(SimpleNode.java:336)
   at org.apache.velocity.Template.merge(Template.java:328)
   at org.apache.velocity.Template.merge(Template.java:235)
   at org.apache.velocity.app.VelocityEngine.mergeTemplate(VelocityEngine.java:381)
   at com.atlassian.velocity.DefaultVelocityManager.writeEncodedBodyImpl(DefaultVelocityManager.java:117)
   at com.atlassian.velocity.DefaultVelocityManager.writeEncodedBody(DefaultVelocityManager.java:97)
   at com.atlassian.jira.template.velocity.DefaultVelocityTemplatingEngine$DefaultRenderRequest.toWriterImpl(DefaultVelocityTemplatingEngine.java:146)
   at com.atlassian.jira.template.velocity.DefaultVelocityTemplatingEngine$DefaultRenderRequest.asHtml(DefaultVelocityTemplatingEngine.java:134)
   at com.atlassian.jira.template.velocity.DefaultVelocityTemplatingEngine$DefaultRenderRequest$2.with(DefaultVelocityTemplatingEngine.java:120)
   at com.atlassian.jira.template.velocity.DefaultVelocityTemplatingEngine$DefaultRenderRequest$StringRepresentation.toString(DefaultVelocityTemplatingEngine.java:88)
   at com.atlassian.jira.template.velocity.DefaultVelocityTemplatingEngine$DefaultRenderRequest.asHtml(DefaultVelocityTemplatingEngine.java:116)
   at com.atlassian.jira.plugin.projectpanel.impl.VersionDrillDownRenderer.getHtml(VersionDrillDownRenderer.java:125)
   at com.atlassian.jira.plugin.projectpanel.impl.VersionDrillDownRenderer.getHtml(VersionDrillDownRenderer.java:108)
   at com.atlassian.jira.plugin.projectpanel.impl.ChangeLogProjectTabPanel.getHtml(ChangeLogProjectTabPanel.java:41)
   at com.atlassian.jira.web.action.browser.BrowseProject.getTabHtml(BrowseProject.java:224)
   at


Re: New logo on website

2014-01-20 Thread Yexi Jiang
The new logo seems really nice :)

It seems that the main color of most projects in the Hadoop ecosystem is
yellow, e.g. Hadoop, Mahout, Hive, Giraph, Oozie, and the original Hama
logo. Only Pig and HBase have a different style.

In my humble opinion, the original logo has been well recognized by the
community. I'm not sure whether it is good to replace the original one.


2014/1/20 Tommaso Teofili 

> +1
>
> just remember we have to include also the ASF logo on the website, maybe on
> the right.
>
> Regards,
> Tommaso
>
>
> 2014/1/20 Chia-Hung Lin 
>
> > + one hama
> >
> > On 20 January 2014 20:18, Martin Illecker  wrote:
> > > Nice ;-)
> > >
> > >
> > > 2014/1/20 Edward J. Yoon 
> > >
> > >> Hi,
> > >>
> > >> http://people.apache.org/~edwardyoon/site/index.html
> > >>
> > >> Do you like this new logo?
> > >>
> > >> If no objections arise, I'd like to commit this!
> > >>
> > >> --
> > >> Best Regards, Edward J. Yoon
> > >> @eddieyoon
> > >>
> >
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: Website Update

2014-01-03 Thread Yexi Jiang
I intended to fix the bugs you mentioned, but you have already fixed them.


2014/1/3 Edward J. Yoon 

> > Are there any details about the test results?
>
> As described there, an input of 6M points and Oracle BDA were used. Hama
> finishes KMeans in a few seconds. Mahout takes almost 500 ~ 1000 secs.
> See also http://lambda.uta.edu/mrql-bsp.pdf.
>
> Do you need more?
>
> On Fri, Jan 3, 2014 at 11:03 AM, Yexi Jiang  wrote:
> > Are there any details about the test results?
> >
> >
> > 2014/1/2 Edward J. Yoon 
> >
> >> Input was too small :/ So, I'll update the website using confident benchmarks
> >> soon.
> >>
> >> https://twitter.com/tjungblut/status/414717432293363712
> >>
> >> After I saw this tweet, I thought we need to update the website. Many
> >> people seem to think that Hama is a graph processing framework. If you
> >> have any good ideas, please let me know.
> >>
> >> P.S., I recently tested Hama examples and 80% of them didn't work.
> >> Let's fix them all in the 0.7 release and update the website clearly.
> >>
> >>
> >> On Thu, Jan 2, 2014 at 11:55 PM, Tommaso Teofili
> >>  wrote:
> >> > Hi all,
> >> >
> >> > I just noticed that the graph on our homepage [0] looks very similar
> to
> >> the
> >> > one on Spark homepage [1] so I wonder if we could at least make it a
> bit
> >> > clearer either by writing the benchmarks results in a table near it
> (it
> >> > seems Hama always takes ~0) or something else I cannot think of right
> >> now.
> >> >
> >> > The reference to the benchmarks wiki page is ok but I cannot find the
> >> entry
> >> > for the comparison with Mahout, maybe I'm missing something ...
> >> > Regards and happy new year everyone.
> >> > Tommaso
> >> >
> >> > [0] : http://hama.apache.org/images/mahout_vs_hama.png
> >> > [1] : http://spark.incubator.apache.org/images/spark-lr.png
> >> >
> >> >
> >> > 2013/12/20 Edward J. Yoon 
> >> >
> >> >> Hi all,
> >> >>
> >> >> I published the new website for our community. If you have other ideas,
> >> please
> >> >> feel free to share your comments or file a JIRA ticket.
> >> >>
> >> >>
> >> >> On Fri, Dec 20, 2013 at 11:29 AM, Edward J. Yoon <
> edwardy...@apache.org
> >> >> >wrote:
> >> >>
> >> >> > Thanks, I will.
> >> >> >
> >> >> > Sent from my iPhone
> >> >> >
> >> >> > > On 2013. 12. 19., at 10:41 PM, Yexi Jiang
> >> wrote:
> >> >> > >
> >> >> > > Hi,
> >> >> > >
> >> >> > > It looks nice. Is it possible to add more description to the figure? When
> >> >> > > people first see it, they may not know what the x axis is (the number of
> >> >> > > cores or the number of groom servers?). Moreover, it would be
> >> >> > > better to give the reader some specs of the dataset used.
> >> >> > >
> >> >> > > Regards,
> >> >> > > Yexi
> >> >> > >
> >> >> > >
> >> >> > > 2013/12/19 Edward J. Yoon 
> >> >> > >
> >> >> > >> Thank you so much!
> >> >> > >>
> >> >> > >> Sent from my iPhone
> >> >> > >>
> >> >> > >>>> On 2013. 12. 19., at 4:45 PM, Tommaso Teofili <
> >> >> > tommaso.teof...@gmail.com>
> >> >> > >>> wrote:
> >> >> > >>>
> >> >> > >>> Hi Edward,
> >> >> > >>>
> >> >> > >>> I think it generally looks better than the current one, I would
> >> just
> >> >> > >> change
> >> >> > >>> this:
> >> >> > >>>
> >> >> > >>> Many data analysis techniques such as machine learning and
> graph
> >> >> > >> algorithms
> >> >> > >>> require iterative computations but MapReduce model doesn't fit
> for
> >> >

Re: Website Update

2014-01-02 Thread Yexi Jiang
Are there any details about the test results?


2014/1/2 Edward J. Yoon 

> Input was too small :/ So, I'll update the website using confident benchmarks
> soon.
>
> https://twitter.com/tjungblut/status/414717432293363712
>
> After I saw this tweet, I thought we need to update the website. Many
> people seem to think that Hama is a graph processing framework. If you
> have any good ideas, please let me know.
>
> P.S., I recently tested Hama examples and 80% of them didn't work.
> Let's fix them all in the 0.7 release and update the website clearly.
>
>
> On Thu, Jan 2, 2014 at 11:55 PM, Tommaso Teofili
>  wrote:
> > Hi all,
> >
> > I just noticed that the graph on our homepage [0] looks very similar to
> the
> > one on Spark homepage [1] so I wonder if we could at least make it a bit
> > clearer either by writing the benchmarks results in a table near it (it
> > seems Hama always takes ~0) or something else I cannot think of right
> now.
> >
> > The reference to the benchmarks wiki page is ok but I cannot find the
> entry
> > for the comparison with Mahout, maybe I'm missing something ...
> > Regards and happy new year everyone.
> > Tommaso
> >
> > [0] : http://hama.apache.org/images/mahout_vs_hama.png
> > [1] : http://spark.incubator.apache.org/images/spark-lr.png
> >
> >
> > 2013/12/20 Edward J. Yoon 
> >
> >> Hi all,
> >>
> >> I published the new website for our community. If you have other ideas,
> please
> >> feel free to share your comments or file a JIRA ticket.
> >>
> >>
> >> On Fri, Dec 20, 2013 at 11:29 AM, Edward J. Yoon wrote:
> >>
> >> > Thanks, I will.
> >> >
> >> > Sent from my iPhone
> >> >
> >> > > On 2013. 12. 19., at 10:41 PM, Yexi Jiang
> wrote:
> >> > >
> >> > > Hi,
> >> > >
> >> > > It looks nice. Is it possible to add more description to the figure? When
> >> > > people first see it, they may not know what the x axis is (the number of
> >> > > cores or the number of groom servers?). Moreover, it would be
> >> > > better to give the reader some specs of the dataset used.
> >> > >
> >> > > Regards,
> >> > > Yexi
> >> > >
> >> > >
> >> > > 2013/12/19 Edward J. Yoon 
> >> > >
> >> > >> Thank you so much!
> >> > >>
> >> > >> Sent from my iPhone
> >> > >>
> >> > >>>> On 2013. 12. 19., at 4:45 PM, Tommaso Teofili <
> >> > tommaso.teof...@gmail.com>
> >> > >>> wrote:
> >> > >>>
> >> > >>> Hi Edward,
> >> > >>>
> >> > >>> I think it generally looks better than the current one, I would
> just
> >> > >> change
> >> > >>> this:
> >> > >>>
> >> > >>> Many data analysis techniques such as machine learning and graph
> >> > >> algorithms
> >> > >>> require iterative computations but MapReduce model doesn't fit for
> >> > these
> >> > >>> iterative data analysis applications. To run these iterative data
> >> > >> analysis
> >> > >>> applications more efficiently, Hama offers pure Bulk Synchronous
> >> > Parallel
> >> > >>> computing engine.
> >> > >>>
> >> > >>>
> >> > >>> to something like this:
> >> > >>>
> >> > >>> Many data analysis techniques such as machine learning and graph
> >> > >> algorithms
> >> > >>> require iterative computations, this is where Bulk Synchronous
> >> Parallel
> >> > >>> model can be more effective than "plain" MapReduce. Therefore to
> run
> >> > such
> >> > >>> iterative data analysis applications more efficiently, Hama offers
> >> pure
> >> > >>> Bulk Synchronous Parallel computing engine.
> >> > >>>
> >> > >>>
> >> > >>> As I wouldn't say MR is inherently not good for iterative computations,
> >> > >>> just BSP can be a better / more performant alternative.
> >> > >>> My 2 cents,
> >> > >>> Tommaso
> >> > >>>
> >> > >>>
> >> > >>> 2013/12/19 Edward J. Yoon 
> >> > >>>
> >> > >>>> Hi,
> >> > >>>>
> >> > >>>> I've made some changes to our website -
> >> > >>>> http://people.apache.org/~edwardyoon/site/ - Please review and give
> >> > >>>> feedback here.
> >> > >>>>
> >> > >>>> --
> >> > >>>> Best Regards, Edward J. Yoon
> >> > >>>> @eddieyoon
> >> > >
> >> > >
> >> > >
> >> > > --
> >> > > --
> >> > > Yexi Jiang,
> >> > > ECS 251,  yjian...@cs.fiu.edu
> >> > > School of Computer and Information Science,
> >> > > Florida International University
> >> > > Homepage: http://users.cis.fiu.edu/~yjian004/
> >> >
> >>
> >>
> >>
> >> --
> >> Best Regards, Edward J. Yoon
> >> @eddieyoon
> >>
>
>
>
> --
> Best Regards, Edward J. Yoon
> @eddieyoon
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Typo in hama web page

2014-01-02 Thread Yexi Jiang
Hi,

I found a typo on this page:
http://hama.apache.org/hama_graph_tutorial.html.

In the section "Vertex API", the parameter of the compute method should be
an Iterable instead of an Iterator. The same applies to the section
"Example: PageRankVertex".

Regards,
Yexi
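
The practical difference between the two parameter types reported above can be sketched as follows. This is a standalone illustration, not the tutorial's code and not Hama's actual Vertex class: SketchVertex and SummingVertex are hypothetical names, and the message type Double is an assumption. The point is only that a compute method taking an Iterable can consume its messages with a for-each loop, which an Iterator parameter would not allow.

```java
import java.util.Arrays;
import java.util.List;

public class IterableComputeSketch {

  /** Hypothetical stand-in for a vertex; not Hama's actual Vertex class. */
  abstract static class SketchVertex<M> {
    abstract void compute(Iterable<M> messages);
  }

  /** Sums incoming messages, in the spirit of a PageRank-style update. */
  static class SummingVertex extends SketchVertex<Double> {
    double sum;

    @Override
    void compute(Iterable<Double> messages) {
      sum = 0.0;
      // An Iterable works directly in a for-each loop; an Iterator would not.
      for (double message : messages) {
        sum += message;
      }
    }
  }

  public static void main(String[] args) {
    List<Double> incoming = Arrays.asList(0.25, 0.5, 0.25);
    SummingVertex vertex = new SummingVertex();
    vertex.compute(incoming);
    System.out.println(vertex.sum);
  }
}
```

Any collection (a List, a Set, and so on) can be passed as the Iterable, so the caller does not need to create an iterator by hand.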


Re: Website Update

2013-12-19 Thread Yexi Jiang
Hi,

It looks nice. Is it possible to add more description to the figure? When
people first see it, they may not know what the x axis is (the number of
cores or the number of groom servers?). Moreover, it would be
better to give the reader some specs of the dataset used.

Regards,
Yexi


2013/12/19 Edward J. Yoon 

> Thank you so much!
>
> Sent from my iPhone
>
> > On 2013. 12. 19., at 4:45 PM, Tommaso Teofili
> wrote:
> >
> > Hi Edward,
> >
> > I think it generally looks better than the current one, I would just
> change
> > this:
> >
> > Many data analysis techniques such as machine learning and graph
> algorithms
> > require iterative computations but MapReduce model doesn't fit for these
> > iterative data analysis applications. To run these iterative data
> analysis
> > applications more efficiently, Hama offers pure Bulk Synchronous Parallel
> > computing engine.
> >
> >
> > to something like this:
> >
> > Many data analysis techniques such as machine learning and graph
> algorithms
> > require iterative computations, this is where Bulk Synchronous Parallel
> > model can be more effective than "plain" MapReduce. Therefore to run such
> > iterative data analysis applications more efficiently, Hama offers pure
> > Bulk Synchronous Parallel computing engine.
> >
> >
> > As I wouldn't say MR is inherently not good for iterative computations,
> > just BSP can be a better / more performant alternative.
> > My 2 cents,
> > Tommaso
> >
> >
> > 2013/12/19 Edward J. Yoon 
> >
> >> Hi,
> >>
> >> I've made some changes to our website -
> >> http://people.apache.org/~edwardyoon/site/ - Please review and give feedback
> >> here.
> >>
> >> --
> >> Best Regards, Edward J. Yoon
> >> @eddieyoon
> >>
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


[jira] [Updated] (HAMA-828) Improve code, fix typo and modify unclear comment in org.apache.hama.ml.ann package

2013-12-11 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-828:


Resolution: Fixed
Status: Resolved  (was: Patch Available)

> Improve code, fix typo and modify unclear comment in org.apache.hama.ml.ann 
> package
> ---
>
> Key: HAMA-828
> URL: https://issues.apache.org/jira/browse/HAMA-828
> Project: Hama
>  Issue Type: Improvement
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: machine_learning
> Fix For: 0.7.0
>
> Attachments: HAMA-828.patch
>
>
> Improve code, fix the typo and unclear comments.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)


[jira] [Updated] (HAMA-828) Improve code, fix typo and modify unclear comment in org.apache.hama.ml.ann package

2013-12-09 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-828:


Status: Patch Available  (was: Open)

Please check the patch. If there is no problem, I will submit it within two
days.

> Improve code, fix typo and modify unclear comment in org.apache.hama.ml.ann 
> package
> ---
>
> Key: HAMA-828
> URL: https://issues.apache.org/jira/browse/HAMA-828
> Project: Hama
>  Issue Type: Improvement
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: machine_learning
> Fix For: 0.7.0
>
> Attachments: HAMA-828.patch
>
>
> Improve code, fix the typo and unclear comments.





[jira] [Updated] (HAMA-828) Improve code, fix typo and modify unclear comment in org.apache.hama.ml.ann package

2013-12-09 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-828:


Attachment: HAMA-828.patch

> Improve code, fix typo and modify unclear comment in org.apache.hama.ml.ann 
> package
> ---
>
> Key: HAMA-828
> URL: https://issues.apache.org/jira/browse/HAMA-828
> Project: Hama
>  Issue Type: Improvement
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: machine_learning
> Fix For: 0.7.0
>
> Attachments: HAMA-828.patch
>
>
> Improve code, fix the typo and unclear comments.





[jira] [Updated] (HAMA-828) Improve code, fix typo and modify unclear comment in org.apache.hama.ml.ann package

2013-12-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-828:


Description: Improve code, fix the typo and unclear comments.  (was: Fix 
the typo and unclear comments.)

> Improve code, fix typo and modify unclear comment in org.apache.hama.ml.ann 
> package
> ---
>
> Key: HAMA-828
> URL: https://issues.apache.org/jira/browse/HAMA-828
> Project: Hama
>  Issue Type: Improvement
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: machine_learning
> Fix For: 0.7.0
>
>
> Improve code, fix the typo and unclear comments.





[jira] [Updated] (HAMA-828) Improve code, fix typo and modify unclear comment in org.apache.hama.ml.ann package

2013-12-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-828:


Summary: Improve code, fix typo and modify unclear comment in 
org.apache.hama.ml.ann package  (was: Fix typo and unclear comment in 
org.apache.hama.ml.ann package)

> Improve code, fix typo and modify unclear comment in org.apache.hama.ml.ann 
> package
> ---
>
> Key: HAMA-828
> URL: https://issues.apache.org/jira/browse/HAMA-828
> Project: Hama
>  Issue Type: Improvement
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: machine_learning
> Fix For: 0.7.0
>
>
> Fix the typo and unclear comments.





[jira] [Updated] (HAMA-828) Fix typo and unclear comment in org.apache.hama.ml.ann package

2013-12-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-828:


Issue Type: Improvement  (was: Bug)

> Fix typo and unclear comment in org.apache.hama.ml.ann package
> --
>
> Key: HAMA-828
> URL: https://issues.apache.org/jira/browse/HAMA-828
> Project: Hama
>  Issue Type: Improvement
>Affects Versions: 0.7.0
>    Reporter: Yexi Jiang
>    Assignee: Yexi Jiang
>  Labels: machine_learning
> Fix For: 0.7.0
>
>
> Fix the typo and unclear comments.





[jira] [Created] (HAMA-828) Fix typo and unclear comment in org.apache.hama.ml.ann package

2013-12-08 Thread Yexi Jiang (JIRA)
Yexi Jiang created HAMA-828:
---

 Summary: Fix typo and unclear comment in org.apache.hama.ml.ann 
package
 Key: HAMA-828
 URL: https://issues.apache.org/jira/browse/HAMA-828
 Project: Hama
  Issue Type: Bug
Affects Versions: 0.7.0
Reporter: Yexi Jiang
Assignee: Yexi Jiang
 Fix For: 0.7.0


Fix the typo and unclear comments.





[jira] [Updated] (HAMA-822) Add feature transformer interface to improve the power and flexibility of existing machine learning model

2013-12-08 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-822:


Resolution: Fixed
Status: Resolved  (was: Patch Available)

> Add feature transformer interface to improve the power and flexibility of 
> existing machine learning model
> -
>
> Key: HAMA-822
> URL: https://issues.apache.org/jira/browse/HAMA-822
> Project: Hama
>  Issue Type: New Feature
>  Components: machine learning
>Affects Versions: 0.7.0
>Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: machine_learning
> Fix For: 0.7.0
>
> Attachments: HAMA-822.patch
>
>
> Add the feature transformer interface to improve the power of the existing
> learning model.
> If the user knows how to transform and extract the features, the
> user can implement a concrete feature transformer to extract the useful
> information from the original features.
> The implemented feature transformer can reduce the dimension of the original
> features to improve the generalization of the model, or it can map
> features that are not linearly separable into a linearly separable
> space.
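
As a rough illustration of the idea in the description above, the sketch below maps two-dimensional points to their squared coordinates, so that a circular decision boundary in the original space becomes a straight line in the transformed space. The interface name, its double[] signature, and the SquaringTransformer class are assumptions made for this example; they are not taken from HAMA-822.patch.

```java
public class FeatureTransformerSketch {

  /** Hypothetical interface; the name and signature are assumptions. */
  interface FeatureTransformer {
    double[] transform(double[] originalFeatures);
  }

  /** Maps (x1, x2) to (x1^2, x2^2), so a circular boundary becomes a line. */
  static class SquaringTransformer implements FeatureTransformer {
    @Override
    public double[] transform(double[] originalFeatures) {
      double[] squared = new double[originalFeatures.length];
      for (int i = 0; i < originalFeatures.length; i++) {
        squared[i] = originalFeatures[i] * originalFeatures[i];
      }
      return squared;
    }
  }

  /** Linear rule in the transformed space: inside the unit circle if z1 + z2 < 1. */
  static boolean insideUnitCircle(double[] point, FeatureTransformer transformer) {
    double[] z = transformer.transform(point);
    return z[0] + z[1] < 1.0;
  }

  public static void main(String[] args) {
    FeatureTransformer transformer = new SquaringTransformer();
    System.out.println(insideUnitCircle(new double[] {0.5, 0.5}, transformer));
    System.out.println(insideUnitCircle(new double[] {1.0, 1.0}, transformer));
  }
}
```

A linear model trained on the transformed features only needs to learn the rule z1 + z2 < 1, which it could not express on the raw coordinates.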





Stack-overflow is offering free open source adverts

2013-12-07 Thread Yexi Jiang
Hi, All,

It seems that Stack Overflow is offering free open source adverts. The
details can be found here:
http://meta.stackoverflow.com/questions/210389/open-source-advertising-sidebar-1h-2014

-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: [jira] [Updated] (HAMA-827) Add NamedVector

2013-12-07 Thread Yexi Jiang
Yes, +1.


2013/12/7 Edward J. Yoon (JIRA) 

>
>  [
> https://issues.apache.org/jira/browse/HAMA-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel]
>
> Edward J. Yoon updated HAMA-827:
> 
>
> Attachment: HAMA-827_v04.patch
>
> Oh, ok. Apply methods are now deprecated.
>
> > Add NamedVector
> > ---
> >
> > Key: HAMA-827
> > URL: https://issues.apache.org/jira/browse/HAMA-827
> > Project: Hama
> >  Issue Type: Improvement
> >  Components: machine learning, math
> >Reporter: Edward J. Yoon
> >Assignee: Edward J. Yoon
> > Fix For: 0.7.0
> >
> > Attachments: HAMA-827_final.patch, HAMA-827_v03.patch,
> HAMA-827_v04.patch, patch.txt
> >
> >
> > Add a named vector that contains a string name field, like Mahout's NamedVector.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.1#6144)
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


[jira] [Commented] (HAMA-827) Add NamedVector

2013-12-06 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13841419#comment-13841419
 ] 

Yexi Jiang commented on HAMA-827:
-

Oh, what I mean is that the 'apply' method would be deprecated in the future.

> Add NamedVector
> ---
>
> Key: HAMA-827
> URL: https://issues.apache.org/jira/browse/HAMA-827
> Project: Hama
>  Issue Type: Improvement
>  Components: machine learning, math
>Reporter: Edward J. Yoon
>Assignee: Edward J. Yoon
> Fix For: 0.7.0
>
> Attachments: HAMA-827_final.patch, HAMA-827_v03.patch, patch.txt
>
>
> Add a named vector that contains a string name field, like Mahout's NamedVector.
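
For readers unfamiliar with the concept, a named vector is simply a vector that carries a string label with it. The following is a minimal sketch under that reading; the class and its methods (getName, get, getDimension) are hypothetical and do not reproduce the patch attached to HAMA-827 or Mahout's NamedVector.

```java
import java.util.Arrays;

public class NamedVectorSketch {

  /** Hypothetical named vector: a name plus a defensive copy of the values. */
  static final class NamedVector {
    private final String name;
    private final double[] values;

    NamedVector(String name, double[] values) {
      this.name = name;
      this.values = values.clone();
    }

    String getName() {
      return name;
    }

    double get(int index) {
      return values[index];
    }

    int getDimension() {
      return values.length;
    }

    @Override
    public String toString() {
      return name + ": " + Arrays.toString(values);
    }
  }

  public static void main(String[] args) {
    NamedVector v = new NamedVector("sample-1", new double[] {1.0, 2.0});
    System.out.println(v);
  }
}
```

Keeping the name next to the values lets a training or clustering job report results per labeled record without maintaining a separate index-to-name lookup.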





[jira] [Commented] (HAMA-827) Add NamedVector

2013-12-05 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13840887#comment-13840887
 ] 

Yexi Jiang commented on HAMA-827:
-

Can't the NamedVector have the 'apply' methods?

> Add NamedVector
> ---
>
> Key: HAMA-827
> URL: https://issues.apache.org/jira/browse/HAMA-827
> Project: Hama
>  Issue Type: Improvement
>  Components: machine learning, math
>Reporter: Edward J. Yoon
>Assignee: Edward J. Yoon
> Attachments: patch.txt
>
>
> Add a named vector that contains a string name field, like Mahout's NamedVector.





[jira] [Updated] (HAMA-822) Add feature transformer interface to improve the power and flexibility of existing machine learning model

2013-12-03 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-822:


Status: Patch Available  (was: Open)

If there are no objections, I will commit it this week.


> Add feature transformer interface to improve the power and flexibility of 
> existing machine learning model
> -
>
> Key: HAMA-822
> URL: https://issues.apache.org/jira/browse/HAMA-822
> Project: Hama
>  Issue Type: New Feature
>  Components: machine learning
>Affects Versions: 0.7.0
>Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: machine_learning
> Attachments: HAMA-822.patch
>
>
> Add the feature transformer interface to improve the power of the existing 
> learning model.
> If the user already knows how to transform and extract the features, the
> user can implement a concrete feature transformer to extract the useful
> information from the original features.
> The implemented feature transformer can reduce the dimension of the original
> features to improve the generalization of the model, or it can transform
> features that are not linearly separable by mapping them to a linearly
> separable space.





[jira] [Updated] (HAMA-822) Add feature transformer interface to improve the power and flexibility of existing machine learning model

2013-12-03 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-822:


Attachment: HAMA-822.patch

> Add feature transformer interface to improve the power and flexibility of 
> existing machine learning model
> -
>
> Key: HAMA-822
> URL: https://issues.apache.org/jira/browse/HAMA-822
> Project: Hama
>  Issue Type: New Feature
>  Components: machine learning
>Affects Versions: 0.7.0
>Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: machine_learning
> Attachments: HAMA-822.patch
>
>
> Add the feature transformer interface to improve the power of the existing 
> learning model.
> If the user already knows how to transform and extract the features, the
> user can implement a concrete feature transformer to extract the useful
> information from the original features.
> The implemented feature transformer can reduce the dimension of the original
> features to improve the generalization of the model, or it can transform
> features that are not linearly separable by mapping them to a linearly
> separable space.





Re: Run multiple hama jobs with LocalBSPRunner simultaneously

2013-12-02 Thread Yexi Jiang
Are even concurrent reads not allowed?

I have two test cases that read the same file.


2013/12/2 Edward J. Yoon 

> Jobs can run but accessing one file concurrently is impossible.
>
> On Mon, Dec 2, 2013 at 12:15 PM, Yexi Jiang  wrote:
> > Hi, All,
> >
> > I am currently writing the test cases for HAMA-822. I need to launch hama
> > jobs in two of my test cases (methods). I found that when the test cases are
> > executed in parallel, an error occurs. When I comment out either one of the
> > test cases, the other one works properly.
> >
> > Is it true that the LocalBSPRunner cannot simultaneously execute multiple
> > jobs?
> >
> > The error message looks like the following:
> >
> -
> > 13/12/01 22:01:53 WARN fs.FSInputChecker: Problem opening checksum file:
> > file:/tmp/hama-parts/job_localrunner_0001/part-0.  Ignoring
> exception:
> > java.io.EOFException
> >
> > 13/12/01 22:01:53 ERROR bsp.LocalBSPRunner: Exception during BSP
> execution!
> >
> > java.io.EOFException
> >
> >
> > 13/12/01 22:01:56 ERROR bsp.BSPJobClient: Error partitioning the input
> path.
> >
> > java.io.IOException: Runtime partition failed for the job.
> >
> 
>
>
>
> --
> Best Regards, Edward J. Yoon
> @eddieyoon
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Run multiple hama jobs with LocalBSPRunner simultaneously

2013-12-01 Thread Yexi Jiang
Hi, All,

I am currently writing the test cases for HAMA-822. I need to launch hama
jobs in two of my test cases (methods). I found that when the test cases are
executed in parallel, an error occurs. When I comment out either one of the
test cases, the other one works properly.

Is it true that the LocalBSPRunner cannot simultaneously execute multiple
jobs?

The error message looks like the following:
-
13/12/01 22:01:53 WARN fs.FSInputChecker: Problem opening checksum file:
file:/tmp/hama-parts/job_localrunner_0001/part-0.  Ignoring exception:
java.io.EOFException

13/12/01 22:01:53 ERROR bsp.LocalBSPRunner: Exception during BSP execution!

java.io.EOFException


13/12/01 22:01:56 ERROR bsp.BSPJobClient: Error partitioning the input path.

java.io.IOException: Runtime partition failed for the job.
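
One possible workaround on the test side, sketched here under assumptions, is to serialize the sections of the test cases that launch a job, so that two local jobs never touch the same files at the same time. The thread does not confirm that this is what was done for HAMA-822; the class SerializedJobs and its method are hypothetical helpers, not part of Hama.

```java
import java.util.concurrent.locks.ReentrantLock;
import java.util.function.Supplier;

/** Hypothetical test helper: lets only one job-launching section run at a time. */
final class SerializedJobs {
  // One lock shared by every test in the JVM that goes through this helper.
  private static final ReentrantLock LOCK = new ReentrantLock();

  private SerializedJobs() {
  }

  /** Runs the given job while holding the shared lock and returns its result. */
  static <T> T runExclusively(Supplier<T> job) {
    LOCK.lock();
    try {
      return job.get();
    } finally {
      LOCK.unlock(); // always release, even if the job throws
    }
  }
}
```

Each test would wrap its job submission in runExclusively, which keeps the tests independent of execution order while avoiding overlapping access.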



[jira] [Commented] (HAMA-809) Underflow in LogisticRegressionModel

2013-11-27 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13834497#comment-13834497
 ] 

Yexi Jiang commented on HAMA-809:
-

Is this still open?

> Underflow in LogisticRegressionModel
> 
>
> Key: HAMA-809
> URL: https://issues.apache.org/jira/browse/HAMA-809
> Project: Hama
>  Issue Type: Bug
>  Components: machine learning
>Affects Versions: 0.6.1, 0.6.2
>Reporter: Tommaso Teofili
>Assignee: Tommaso Teofili
> Fix For: 0.7.0
>
>
> There's an underflow bug in _LogisticRegressionModel_ when calculating the 
> hypothesis output due to _double_ limitations.
> Therefore we should leverage _BigDecimals_ for such operations as they allow 
> for more precise calculation.
> Also the log algorithm in LRM is wrong and needs to be fixed.
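
To illustrate the kind of underflow described above, here is a small sketch. It does not show the actual LogisticRegressionModel code, and it does not use the BigDecimal approach proposed in the issue; instead it shows a different, commonly used technique, namely rearranging the computation so the logarithm of the sigmoid is evaluated without first forming a value that rounds to zero. The class and method names are hypothetical.

```java
/** Hypothetical sketch of log(sigmoid(z)) with and without underflow. */
final class StableLogistic {
  private StableLogistic() {
  }

  /** Naive form: sigmoid(z) rounds to 0.0 for very negative z, so the log is -Infinity. */
  static double naiveLogSigmoid(double z) {
    return Math.log(1.0 / (1.0 + Math.exp(-z)));
  }

  /**
   * Rearranged form of the same quantity:
   * for z >= 0, log(sigmoid(z)) = -log(1 + e^-z);
   * for z < 0,  log(sigmoid(z)) = z - log(1 + e^z).
   * Math.exp is only ever called on a non-positive argument, so it cannot overflow.
   */
  static double logSigmoid(double z) {
    return z >= 0 ? -Math.log1p(Math.exp(-z)) : z - Math.log1p(Math.exp(z));
  }
}
```

For z = -800 the naive form returns negative infinity, while the rearranged form returns a finite value close to -800.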





Re: Hama book

2013-11-26 Thread Yexi Jiang
Me too, that's pretty interesting.


2013/11/26 Suraj Menon 

> I can help too. What is the timeline?
>
>
> On Tue, Nov 26, 2013 at 5:32 AM, Anastasis Andronidis <
> andronat_...@hotmail.com> wrote:
>
> > I am interested in the Graph API, if you want.
> >
> > Anastasis
> >
> > On 26 Nov 2013, at 8:02 AM, Edward J. Yoon
> > wrote:
> >
> > > Anyone?
> > >
> > > On Thu, Nov 21, 2013 at 8:34 PM, Edward J. Yoon  >
> > wrote:
> > >> Hi folks,
> > >>
> > >> I talked briefly with Manning’s publisher and started writing a book
> > >> proposal. Comment below if you're interested in being a co-author.
> > >
> > >
> > >
> > > --
> > > Best Regards, Edward J. Yoon
> > > @eddieyoon
> > >
> >
> >
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


[jira] [Commented] (HAMA-821) K-Means writes only k records as a output

2013-11-25 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13832319#comment-13832319
 ] 

Yexi Jiang commented on HAMA-821:
-

+1

> K-Means writes only k records as a output
> -
>
> Key: HAMA-821
> URL: https://issues.apache.org/jira/browse/HAMA-821
> Project: Hama
>  Issue Type: Bug
>  Components: machine learning
>Reporter: Edward J. Yoon
>Assignee: Edward J. Yoon
> Fix For: 0.7.0
>
> Attachments: HAMA-821.patch
>
>
> KMeans writes only k records, because line 276 of KMeansBSP overwrites the
> value for the key. I'm sure that was not intended.
> Also, many people ask me about the meaning of the input and output of KMeans.
> We need to make the K-Means example output lines more readable, like:
> {code}
> 13/11/25 17:34:04 INFO kmeans.KMeansBSP: Finished! Writing the results...
> [5.1, 3.5, 1.4, 0.2] belongs to cluster 2
> [4.9, 3.0, 1.4, 0.2] belongs to cluster 2
> [4.7, 3.2, 1.3, 0.2] belongs to cluster 2
> [4.6, 3.1, 1.5, 0.2] belongs to cluster 2
> [5.0, 3.6, 1.4, 0.2] belongs to cluster 2
> 
> {code}
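
The effect described in the issue can be sketched as follows. This is not the KMeansBSP code; it only assumes, for illustration, that results were keyed by cluster id, so each write for the same key replaces the previous one. The class KMeansOutputSketch and its methods are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch: why keying output by cluster id leaves only k records. */
final class KMeansOutputSketch {
  private KMeansOutputSketch() {
  }

  /** Keyed by cluster id: each put replaces the previous vector, so at most k entries survive. */
  static Map<Integer, double[]> keyedByCluster(double[][] vectors, int[] assignment) {
    Map<Integer, double[]> out = new HashMap<>();
    for (int i = 0; i < vectors.length; i++) {
      out.put(assignment[i], vectors[i]);
    }
    return out;
  }

  /** One readable line per input vector, in the format proposed in the issue. */
  static List<String> onePerVector(double[][] vectors, int[] assignment) {
    List<String> lines = new ArrayList<>();
    for (int i = 0; i < vectors.length; i++) {
      lines.add(Arrays.toString(vectors[i]) + " belongs to cluster " + assignment[i]);
    }
    return lines;
  }
}
```

With three input vectors assigned to two clusters, the keyed version keeps two records while the per-vector version keeps all three.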





[jira] [Created] (HAMA-822) Add feature transformer interface to improve the power and flexibility of existing machine learning model

2013-11-25 Thread Yexi Jiang (JIRA)
Yexi Jiang created HAMA-822:
---

 Summary: Add feature transformer interface to improve the power 
and flexibility of existing machine learning model
 Key: HAMA-822
 URL: https://issues.apache.org/jira/browse/HAMA-822
 Project: Hama
  Issue Type: New Feature
  Components: machine learning
Affects Versions: 0.7.0
Reporter: Yexi Jiang
Assignee: Yexi Jiang


Add the feature transformer interface to improve the power of the existing 
learning model.

If the user already knows how to transform and extract the features, the user
can implement a concrete feature transformer to extract the useful
information from the original features.

The implemented feature transformer can reduce the dimension of the original
features to improve the generalization of the model, or it can transform
features that are not linearly separable by mapping them to a linearly
separable space.
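
As a rough illustration of the idea (not the interface from the HAMA-822 patch, whose exact shape is not shown in this thread), a feature transformer could look like the sketch below. All names are hypothetical. The first implementation adds a product term so that XOR-like data becomes linearly separable; the second reduces the dimension by keeping only the first k features.

```java
import java.util.Arrays;

/** Hypothetical interface: maps an original feature vector to a transformed one. */
interface FeatureTransformerSketch {
  double[] transform(double[] originalFeatures);
}

/** Maps (x1, x2) to (x1, x2, x1 * x2); XOR is linear in these three features. */
final class ProductFeatureTransformer implements FeatureTransformerSketch {
  @Override
  public double[] transform(double[] f) {
    return new double[] {f[0], f[1], f[0] * f[1]};
  }
}

/** Dimension reduction: keeps only the first k features. */
final class PrefixFeatureTransformer implements FeatureTransformerSketch {
  private final int k;

  PrefixFeatureTransformer(int k) {
    this.k = k;
  }

  @Override
  public double[] transform(double[] f) {
    return Arrays.copyOf(f, Math.min(k, f.length));
  }
}
```

With the product feature, the linear function x1 + x2 - 2 * (x1 * x2) reproduces XOR exactly on the four binary inputs, which no linear function of x1 and x2 alone can do.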







[jira] [Commented] (HAMA-818) Remove useless comments in GroomServer

2013-11-25 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13831471#comment-13831471
 ] 

Yexi Jiang commented on HAMA-818:
-

+1, it looks clearer.

> Remove useless comments in GroomServer
> --
>
> Key: HAMA-818
> URL: https://issues.apache.org/jira/browse/HAMA-818
> Project: Hama
>  Issue Type: Improvement
>Reporter: Edward J. Yoon
>Assignee: Edward J. Yoon
>Priority: Minor
> Fix For: 0.7.0
>
> Attachments: patch.txt
>
>
> http://markmail.org/message/lr4xlyegr3y7qssr





Question about the workflow of GroomServer

2013-11-20 Thread Yexi Jiang
Hi,

I have recently been reading the code of the GroomServer and trying to figure out
the workflow. The following is my understanding; please correct me if I am
wrong.

I already know the following steps:
1. When initializing, the GroomServer creates an instructor, which
buffers the incoming directive instances (a wrapper of an action that can
launch, recover, or kill a task?).
2. The instance is handled by the DispatchTasksHandler.
3. If the action is a launch, then the following calls are made in
order: DispatchTasksHandler.startNewTask -> GroomServer.localizeJob ->
GroomServer.launchTaskForJob -> TaskInProgress.launchTask.
I haven't figured out why several distinct BSPJob instances are created
in these steps.
4. A TaskRunner is created to run the task. Further, a BSPChildRunner
instance is created. When the BSPChildRunner instance (a thread) is
scheduled and executed, it creates a process to execute
the BSPPeerChild.
5. BSPChildRunner creates a BSPPeerImpl, which is executed by a
BSPTask (BSPTask.run -> BSPTask.runBSP).
6. BSPPeerImpl executes its setup, bsp, and cleanup methods, which are
implemented by the end user.
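
The buffering and dispatching pattern in steps 1, 2, and 6 can be pictured with the toy model below. It is only a sketch of the general pattern (a queue of directives drained by a handler); none of the types are Hama classes, and the names DirectiveFlowSketch, Directive, and Action are hypothetical. Process creation and the recover action are left out.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

/** Toy model of "buffer directives, then dispatch them"; not Hama code. */
final class DirectiveFlowSketch {
  enum Action { LAUNCH, KILL }

  /** A directive wraps one action for one task (hypothetical shape). */
  static final class Directive {
    final Action action;
    final String taskId;

    Directive(Action action, String taskId) {
      this.action = action;
      this.taskId = taskId;
    }
  }

  private final BlockingQueue<Directive> buffer = new LinkedBlockingQueue<>();
  final List<String> log = new ArrayList<>(); // records what the handler did

  /** Step 1: incoming directives are buffered. */
  void put(Directive d) {
    buffer.add(d);
  }

  /** Steps 2 and 6: drain the buffer; a launched task runs setup, bsp, cleanup in order. */
  void dispatchAll() {
    Directive d;
    while ((d = buffer.poll()) != null) {
      if (d.action == Action.LAUNCH) {
        log.add(d.taskId + ":setup");
        log.add(d.taskId + ":bsp");
        log.add(d.taskId + ":cleanup");
      } else {
        log.add(d.taskId + ":killed");
      }
    }
  }
}
```

In the real system the handler would run on its own thread and the task would execute in a child process, as the steps above describe; the model collapses both into a single method call.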

Regards,
Yexi


Re: Hama is in the Bossie awards 2013

2013-10-30 Thread Yexi Jiang
Cool! It says that Hama mainly focuses on matrix computation, but it seems
that the current implementation does not have much support for it.


2013/10/30 Edward J. Yoon 

>
> http://www.infoworld.com/d/open-source-software/bossies-2013-the-best-of-open-source-software-awards-226971
>
> Congratz :-)
>
> --
> Best Regards, Edward J. Yoon
> @eddieyoon
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: Combine all Writables into a new package

2013-10-21 Thread Yexi Jiang
datePeerAction.java
> >
> > ClusterStatus.java
> >
> >
> > -- bsp.groom --
> >
> > GroomServer.java
> >
> > GroomServerManager.java
> >
> > GroomServerStatus.java
> >
> >
> > -- bsp.job --
> >
> > JobInProgress.java
> >
> > JobInProgressListener.java
> >
> > JobProfile.java
> >
> > JobStatus.java
> >
> > RunningJob.java
> >
> >
> > -- bsp.scheduler --
> >
> > Schedulable.java
> >
> > SimpleTaskScheduler.java
> >
> > TaskScheduler.java
> >
> > FCFSQueue.java
> >
> > Queue.java
> >
> > QueueManager.java
> >
> >
> > -- bsp.superstep --
> >
> > Superstep.java
> >
> > SuperstepBSP.java
> >
> >
> > Thanks!
> >
> >
> > 2013/10/21 Chia-Hung Lin 
> >
> >> +1 for hama-io or hama-commons
> >>
> >>
> >> On 21 October 2013 21:35, Tommaso Teofili 
> >> wrote:
> >> > what about creating a module for that (Writables and InputFormats for
> >> now)
> >> > hama-io / hama-commons that can be used by both (containing math
> stuff as
> >> > well) ?
> >> >
> >> > Tommaso
> >> >
> >> >
> >> > 2013/10/21 Martin Illecker 
> >> >
> >> >> VectorWritable and MatrixWritable have both some dependencies
> >> >> to org.apache.hama.ml.math.
> >> >> (DenseDoubleVector, DoubleVector and DenseDoubleMatrix, DoubleMatrix)
> >> >>
> >> >> If we move VectorWritable and MatrixWritable to core (e.g.,
> >> >> org.apache.hama.io.writable), we have to move
> org.apache.hama.ml.math as
> >> >> well.
> >> >> I think that's not possible because of other classes in hama-ml
> >> depending
> >> >> on ml.math.
> >> >>
> >> >> Temporarily, I will have to copy VectorWritable to the core to use it in a
> >> >> test case.
> >> >>
> >> >>
> >> >> 2013/10/21 Tommaso Teofili 
> >> >>
> >> >> > 2013/10/21 Martin Illecker 
> >> >> >
> >> >> > > Hello,
> >> >> > >
> >> >> > > regarding my Hama Pipes test case [1], I want to use VectorWritable
> >> >> > > inside the hama-core module.
> >> >> > > Therefore I would need a dependency to hama-ml but this will
> cause a
> >> >> > cyclic
> >> >> > > dependency.
> >> >> > >
> >> >> > > So is it possible to move both writables, VectorWritable and
> >> >> > > MatrixWritable, from org.apache.hama.ml.writable into a new
> package?
> >> >> > > e.g., org.apache.hama.io.writable based on [2]
> >> >> > >
> >> >> >
> >> >> > I think this really makes sense.
> >> >> >
> >> >> >
> >> >> > >
> >> >> > > Regarding [3], we can also move TextArrayWritable
> >> >> > > from org.apache.hama.bsp into this new package.
> >> >> > >
> >> >> > > Do you think we can move the writables of
> >> org.apache.hama.ml.writable
> >> >> to
> >> >> > > the core module?
> >> >> > >
> >> >> >
> >> >> > +1
> >> >> >
> >> >> >
> >> >> > > And can we do the package refactoring [2] of org.apache.hama.bsp
> >> >> > submitted
> >> >> > > by Suraj?
> >> >> > >
> >> >> >
> >> >> > +1 here too.
> >> >> >
> >> >> > Tommaso
> >> >> >
> >> >> >
> >> >> > >
> >> >> > > Thanks!
> >> >> > >
> >> >> > > Martin
> >> >> > >
> >> >> > > [1] https://issues.apache.org/jira/browse/HAMA-808
> >> >> > > [2]
> >> >> >
> https://issues.apache.org/jira/secure/attachment/12609417/bsplist.txt
> >> >> > > [3] https://issues.apache.org/jira/browse/HAMA-727
> >> >> > >
> >> >> >
> >> >>
> >>
>
>
>
> --
> Best Regards, Edward J. Yoon
> @eddieyoon
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Does Vertex assume that the edges of a vertex can fit in memory?

2013-10-18 Thread Yexi Jiang
Hi,

I recently read the source code for graph processing. I found that
the edge information of a vertex is stored in an ArrayList, which implicitly
assumes that the edges of a vertex fit in memory. Will such an
implementation cause problems for some extremely large graphs?

I also read the source code of Apache Giraph; it seems that they have a
similar implementation (storing the edges in an in-memory data structure).
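
A back-of-the-envelope check makes the concern concrete. The sketch below multiplies the number of edges of one vertex by an assumed per-edge cost and compares the result with the JVM's maximum heap. The per-edge byte count is a parameter supplied by the caller, not a measured figure for Hama or Giraph, and the class name is hypothetical.

```java
/** Hypothetical helper: rough memory needed to keep one vertex's edges in a list. */
final class EdgeMemoryEstimate {
  private EdgeMemoryEstimate() {
  }

  /** Edge count times assumed bytes per edge; throws ArithmeticException on long overflow. */
  static long estimateBytes(long edgeCount, long bytesPerEdge) {
    return Math.multiplyExact(edgeCount, bytesPerEdge);
  }

  /** True if the estimate is below the maximum heap this JVM may use. */
  static boolean fitsInHeap(long edgeCount, long bytesPerEdge) {
    return estimateBytes(edgeCount, bytesPerEdge) < Runtime.getRuntime().maxMemory();
  }
}
```

For example, a single vertex with one billion edges at an assumed 32 bytes per edge would need about 32 GB for its edge list alone.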

Regards,
Yexi


[jira] [Commented] (HAMA-809) Underflow in LogisticRegressionModel

2013-10-09 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13791089#comment-13791089
 ] 

Yexi Jiang commented on HAMA-809:
-

Sorry, I'm super busy these days, so I can't help fix this bug.

> Underflow in LogisticRegressionModel
> 
>
> Key: HAMA-809
> URL: https://issues.apache.org/jira/browse/HAMA-809
> Project: Hama
>  Issue Type: Bug
>  Components: machine learning
>Reporter: Tommaso Teofili
>Assignee: Tommaso Teofili
>
> There's an underflow bug in _LogisticRegressionModel_ when calculating the 
> hypothesis output due to _double_ limitations.
> Therefore we should leverage _BigDecimal_s for such operations as they allow 
> for more precise calculation.
> Also the log algorithm in LRM is wrong and needs to be fixed.





[jira] [Commented] (HAMA-809) Underflow in LogisticRegressionModel

2013-10-09 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13790446#comment-13790446
 ] 

Yexi Jiang commented on HAMA-809:
-

[~teofili] What is the testing data?

> Underflow in LogisticRegressionModel
> 
>
> Key: HAMA-809
> URL: https://issues.apache.org/jira/browse/HAMA-809
> Project: Hama
>  Issue Type: Bug
>  Components: machine learning
>Reporter: Tommaso Teofili
>Assignee: Tommaso Teofili
>
> There's an underflow bug in _LogisticRegressionModel_ when calculating the 
> hypothesis output due to _double_ limitations.
> Therefore we should leverage _BigDecimal_s for such operations as they allow 
> for more precise calculation.
> Also the log algorithm in LRM is wrong and needs to be fixed.





Re: [VOTE] Release Hama 0.6.3 (RC3)

2013-10-09 Thread Yexi Jiang
+1



2013/10/9 Anastasis Andronidis 

> +1
>
> Anastasis
>
> On 9 Oct 2013, at 12:43 PM, Tommaso Teofili
> wrote:
>
> > +1
> >
> > Tommaso
> >
> >
> > 2013/10/6 Edward J. Yoon 
> >
> >> Hi,
> >>
> >> I've created RC3 for Hama 0.6.3 release.
> >>
> >> Artifacts and Signatures:
> >> http://people.apache.org/~edwardyoon/dist/0.6.3-RC3/
> >>
> >> SVN Tags: http://svn.apache.org/repos/asf/hama/tags/0.6.3-RC3/
> >>
> >> KEYS file: https://dist.apache.org/repos/dist/release/hama/KEYS
> >>
> >> Please try it on both hadoop1 and hadoop2, run the tests, check the doc,
> >> etc.
> >>
> >> [ ] +1 Release the packages as Apache Hama 0.6.3
> >> [ ] -1 Do not release the packages because...
> >>
> >> Thank you!
> >>
> >> --
> >> Best Regards, Edward J. Yoon
> >> @eddieyoon
> >>
>
>


-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


[jira] [Updated] (HAMA-806) Make the description of NeuralNetwork example more clear

2013-09-27 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-806:


Attachment: HAMA-806.patch

> Make the description of NeuralNetwork example more clear
> 
>
> Key: HAMA-806
> URL: https://issues.apache.org/jira/browse/HAMA-806
> Project: Hama
>  Issue Type: Improvement
>  Components: examples, machine learning
>Affects Versions: 0.6.3
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>Priority: Trivial
>  Labels: machine_learning
> Fix For: 0.6.3
>
> Attachments: HAMA-806.patch
>
>
> Due to the complexity of using the NeuralNetwork, the current description of
> how to use it is not clear. It needs to be made clearer.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HAMA-806) Make the description of NeuralNetwork example more clear

2013-09-27 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-806:


Status: Patch Available  (was: Open)

> Make the description of NeuralNetwork example more clear
> 
>
> Key: HAMA-806
> URL: https://issues.apache.org/jira/browse/HAMA-806
> Project: Hama
>  Issue Type: Improvement
>  Components: examples, machine learning
>Affects Versions: 0.6.3
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>Priority: Trivial
>  Labels: machine_learning
> Fix For: 0.6.3
>
> Attachments: HAMA-806.patch
>
>
> Due to the complexity of using the NeuralNetwork, the current description of
> how to use it is not clear. It needs to be made clearer.



[jira] [Created] (HAMA-806) Make the description of NeuralNetwork example more clear

2013-09-27 Thread Yexi Jiang (JIRA)
Yexi Jiang created HAMA-806:
---

 Summary: Make the description of NeuralNetwork example more clear
 Key: HAMA-806
 URL: https://issues.apache.org/jira/browse/HAMA-806
 Project: Hama
  Issue Type: Improvement
  Components: examples, machine learning
Affects Versions: 0.6.3
Reporter: Yexi Jiang
Assignee: Yexi Jiang
Priority: Trivial
 Fix For: 0.6.3


Due to the complexity of using the NeuralNetwork, the current description of
how to use it is not clear. It needs to be made clearer.



Re: [jira] [Commented] (HAMA-804) Create NeuralNetwork Example

2013-09-27 Thread Yexi Jiang
It seems that the @Test annotation has been removed from the test case.


2013/9/26 Edward J. Yoon (JIRA) 

>
> [
> https://issues.apache.org/jira/browse/HAMA-804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13779629#comment-13779629]
>
> Edward J. Yoon commented on HAMA-804:
> -
>
> Sorry, I introduced a bug.
>
> I've changed this.testNeuralNetworkTraining(); to
> this.neuralNetworkTraining(); and moved the delete() calls to a finally
> block.
>
> } finally {
>   fs.delete(new Path(RESULT_PATH), true);
>   fs.delete(new Path(MODEL_PATH), true);
>   fs.delete(new Path(SEQTRAIN_DATA), true);
> }
>
>
>
>
> --
> Best Regards, Edward J. Yoon
> @eddieyoon
>
>
> > Create NeuralNetwork Example
> > 
> >
> > Key: HAMA-804
> > URL: https://issues.apache.org/jira/browse/HAMA-804
> >     Project: Hama
> >  Issue Type: New Feature
> >  Components: machine learning
> >Reporter: Yexi Jiang
> >Assignee: Yexi Jiang
> >  Labels: machine_learning
> > Fix For: 0.6.3
> >
> > Attachments: HAMA-804.patch
> >
> >
> > Add a NeuralNetwork example to the example package.
>
> --
> This message is automatically generated by JIRA.
> If you think it was sent incorrectly, please contact your JIRA
> administrators
> For more information on JIRA, see: http://www.atlassian.com/software/jira
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Re: [jira] [Commented] (HAMA-804) Create NeuralNetwork Example

2013-09-25 Thread Yexi Jiang
It looks weird. The error indicates that the DoubleVector is not defined,
but it is indeed defined in the ml package. Moreover, it passes all the
tests.


2013/9/25 Edward J. Yoon (JIRA) 

>
> [
> https://issues.apache.org/jira/browse/HAMA-804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13778378#comment-13778378]
>
> Edward J. Yoon commented on HAMA-804:
> -
>
> I don't know why yet, but there seems to be some bug.
>
> > Create NeuralNetwork Example
> > 
> >
> > Key: HAMA-804
> > URL: https://issues.apache.org/jira/browse/HAMA-804
> > Project: Hama
> >  Issue Type: New Feature
> >  Components: machine learning
> >Reporter: Yexi Jiang
> >Assignee: Yexi Jiang
> >  Labels: machine_learning
> > Fix For: 0.6.3
> >
> > Attachments: HAMA-804.patch
> >
> >
> > Add a NeuralNetwork example to the example package.
>
> --
> This message is automatically generated by JIRA.
> If you think it was sent incorrectly, please contact your JIRA
> administrators
> For more information on JIRA, see: http://www.atlassian.com/software/jira
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


[jira] [Commented] (HAMA-804) Create NeuralNetwork Example

2013-09-25 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HAMA-804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13778274#comment-13778274
 ] 

Yexi Jiang commented on HAMA-804:
-

[~udanax] Is it true that the classes defined in ml cannot be used?

> Create NeuralNetwork Example
> 
>
> Key: HAMA-804
> URL: https://issues.apache.org/jira/browse/HAMA-804
> Project: Hama
>  Issue Type: New Feature
>  Components: machine learning
>    Reporter: Yexi Jiang
>Assignee: Yexi Jiang
>  Labels: machine_learning
> Fix For: 0.6.3
>
> Attachments: HAMA-804.patch
>
>
> Add a NeuralNetwork example to the example package.



[jira] [Updated] (HAMA-804) Create NeuralNetwork Example

2013-09-25 Thread Yexi Jiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HAMA-804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yexi Jiang updated HAMA-804:


Resolution: Fixed
Status: Resolved  (was: Patch Available)

> Create NeuralNetwork Example
> 
>
> Key: HAMA-804
> URL: https://issues.apache.org/jira/browse/HAMA-804
> Project: Hama
>  Issue Type: New Feature
>  Components: machine learning
>    Reporter: Yexi Jiang
>    Assignee: Yexi Jiang
>  Labels: machine_learning
> Fix For: 0.6.3
>
> Attachments: HAMA-804.patch
>
>
> Add a NeuralNetwork example to the example package.



Re: [jira] [Commented] (HAMA-804) Create NeuralNetwork Example

2013-09-25 Thread Yexi Jiang
Hi, Edward,

Thank you for solving this problem!

Regards,
Yexi


2013/9/25 Hudson (JIRA) 

>
> [
> https://issues.apache.org/jira/browse/HAMA-804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13777441#comment-13777441]
>
> Hudson commented on HAMA-804:
> -
>
> SUCCESS: Integrated in Hama-trunk #179 (See [
> https://builds.apache.org/job/Hama-trunk/179/])
> HAMA-804: Fix rat check fail (edwardyoon: rev 1526167)
> *
> /hama/trunk/examples/src/test/java/org/apache/hama/examples/NeuralNetworkTest.java
>
>
> > Create NeuralNetwork Example
> > 
> >
> > Key: HAMA-804
> > URL: https://issues.apache.org/jira/browse/HAMA-804
> > Project: Hama
> >  Issue Type: New Feature
> >  Components: machine learning
> >Reporter: Yexi Jiang
> >Assignee: Yexi Jiang
> >  Labels: machine_learning
> > Fix For: 0.6.3
> >
> > Attachments: HAMA-804.patch
> >
> >
> > Add a NeuralNetwork example to the example package.
>
> --
> This message is automatically generated by JIRA.
> If you think it was sent incorrectly, please contact your JIRA
> administrators
> For more information on JIRA, see: http://www.atlassian.com/software/jira
>



-- 
--
Yexi Jiang,
ECS 251,  yjian...@cs.fiu.edu
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/

