Hi,

I added support for the-state-of-the-art classifiers (those are not yet supported in Mahout) and Hivemall's cute(!?) logo as well in Hivemall 0.1-rc3.

Newly supported classifiers include
- Confidence Weighted (CW)
- Adaptive Regularization of Weight Vectors (AROW)
- Soft Confidence Weighted (SCW1, SCW2)

Those classifiers are much smart comparing to the standard SGD-based or passive aggressive classifiers. Please check it out by yourself.

Thanks,
Makoto

(2013/10/11 4:28), Clark Yang (杨卓荦) wrote:
I looks really cool, I think I will try it on.

Cheers,
Zhuoluo (Clark) Yang


2013/10/5 Makoto YUI <yuin...@gmail.com <mailto:yuin...@gmail.com>>

    Hi Edward,

    Thank you for your interst.

    Hivemall project does not have a plan to have a specific mailing
    list, I will answer following questions/comments on twitter or
    through Github issues (with a question label).

    BTW, I just added a CTR (Click-Through-Rate) prediction example that is
    provided by a commercial search engine provider for the KDDCup 2012
    track 2.
    
https://github.com/myui/__hivemall/wiki/KDDCup-2012-__track-2-CTR-prediction-dataset
    
<https://github.com/myui/hivemall/wiki/KDDCup-2012-track-2-CTR-prediction-dataset>

    I guess many of you working on ad CTR/CVR predictions. This example
    might be some help understanding how to do it only within Hive.

    Thanks,
    Makoto @myui


    (2013/10/04 23:02), Edward Capriolo wrote:

        Looks cool im already starting to play with it.

        On Friday, October 4, 2013, Makoto Yui <yuin...@gmail.com
        <mailto:yuin...@gmail.com>
        <mailto:yuin...@gmail.com <mailto:yuin...@gmail.com>>> wrote:
          > Hi Dean,
          >
          > Thank you for your interest in Hivemall.
          >
          > Twitter's paper actually influenced me in developing
        Hivemall and I
          > initially implemented such functionality as Pig UDFs.
          >
          > Though my Pig ML library is not released, you can find a similar
          > attempt for Pig in
          > https://github.com/y-tag/java-__pig-MyUDFs
        <https://github.com/y-tag/java-pig-MyUDFs>
          >
          > Thanks,
          > Makoto
          >
          > 2013/10/3 Dean Wampler <deanwamp...@gmail.com
        <mailto:deanwamp...@gmail.com>
        <mailto:deanwamp...@gmail.com <mailto:deanwamp...@gmail.com>>__>:

          >> This is great news! I know that Twitter has done something
        similar
        with UDFs
          >> for Pig, as described in this paper:
          >>
        
http://www.umiacs.umd.edu/~__jimmylin/publications/Lin___Kolcz_SIGMOD2012.pdf
        
<http://www.umiacs.umd.edu/%7Ejimmylin/publications/Lin_Kolcz_SIGMOD2012.pdf>
        
<http://www.umiacs.umd.edu/%__7Ejimmylin/publications/Lin___Kolcz_SIGMOD2012.pdf
        
<http://www.umiacs.umd.edu/%7Ejimmylin/publications/Lin_Kolcz_SIGMOD2012.pdf>>

          >>
          >> I'm glad to see the same thing start with Hive.
          >>
          >> Dean
          >>
          >>
          >> On Wed, Oct 2, 2013 at 10:21 AM, Makoto YUI
        <yuin...@gmail.com <mailto:yuin...@gmail.com>
        <mailto:yuin...@gmail.com <mailto:yuin...@gmail.com>>> wrote:
          >>>
          >>> Hello all,
          >>>
          >>> My employer, AIST, has given the thumbs up to open source
        our machine
          >>> learning library, named Hivemall.
          >>>
          >>> Hivemall is a scalable machine learning library running on
        Hive/Hadoop,
          >>> licensed under the LGPL 2.1.
          >>>
          >>> https://github.com/myui/__hivemall
        <https://github.com/myui/hivemall>
          >>>
          >>> Hivemall provides machine learning functionality as well
        as feature
          >>> engineering functions through UDFs/UDAFs/UDTFs of Hive. It
        is designed
          >>> to be scalable to the number of training instances as well
        as the
        number
          >>> of training features.
          >>>
          >>> Hivemall is very easy to use as every machine learning
        step is done
          >>> within HiveQL.
          >>>
          >>> -- Installation is just as follows:
          >>> add jar /tmp/hivemall.jar;
          >>> source /tmp/define-all.hive;
          >>>
          >>> -- Logistic regression is performed by a query.
          >>> SELECT
          >>>   feature,
          >>>   avg(weight) as weight
          >>> FROM
          >>>  (SELECT logress(features,label) as (feature,weight) FROM
          >>> training_features) t
          >>> GROUP BY feature;
          >>>
          >>> You can find detailed examples on our wiki pages.
          >>> https://github.com/myui/__hivemall/wiki/_pages
        <https://github.com/myui/hivemall/wiki/_pages>
          >>>
          >>> Though we consider that Hivemall is much easier to use and
        more
        scalable
          >>> than Mahout for classification/regression tasks, please
        check it by
          >>> yourself. If you have a Hive environment, you can evaluate
        Hivemall
          >>> within 5 minutes or so.
          >>>
          >>> Hope you enjoy the release! Feedback (and pull request) is
        always
        welcome.
          >>>
          >>> Thank you,
          >>> Makoto
          >>
          >>
          >>
          >>
          >> --
          >> Dean Wampler, Ph.D.
          >> @deanwampler
          >> http://polyglotprogramming.com
          >




Reply via email to