On Sun, Jul 7, 2013 at 11:50 PM, <
[email protected]> wrote:
> Send Scikit-learn-general mailing list submissions to
> [email protected]
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
> or, via email, send a message with subject or body 'help' to
> [email protected]
>
> You can reach the person managing the list at
> [email protected]
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Scikit-learn-general digest..."
>
>
> Today's Topics:
>
> 1. Data Compression (Anubhab Baksi)
> 2. Re: Data Compression (Olivier Grisel)
> 3. Questions for plot_forest_iris.py and AdaBoost (Ian Ozsvald)
> 4. AdaBoost and strong vs weak classifiers (Ian Ozsvald)
> 5. RandomForests - where do we select a subset of features
> during fitting? (Ian Ozsvald)
> 6. Defining a Density Estimation Interface (Jacob Vanderplas)
> 7. Re: AdaBoost and strong vs weak classifiers (Olivier Grisel)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Sun, 7 Jul 2013 21:16:47 +0530
> From: Anubhab Baksi <[email protected]>
> Subject: [Scikit-learn-general] Data Compression
> To: [email protected]
> Message-ID:
> <
> canhk5kyfmnqq-yhp5kyxqda5eyvuaedxpkswm1glyy4zigl...@mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Hi,
> I searched for several Data Compression codes in Scikit learn, but I could
> not find it.
>
> Now, can anybody please tell me, are these really implemented?
>
>
> Regards.
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 2
> Date: Sun, 7 Jul 2013 17:56:32 +0200
> From: Olivier Grisel <[email protected]>
> Subject: Re: [Scikit-learn-general] Data Compression
> To: scikit-learn-general <[email protected]>
> Message-ID:
> <
> cafve7k4epy7azru-6se+n8tfy7sakl16ggrs+hle8zqskyu...@mail.gmail.com>
> Content-Type: text/plain; charset=UTF-8
>
> 2013/7/7 Anubhab Baksi <[email protected]>:
> > Hi,
> > I searched for several Data Compression codes in Scikit learn, but I
> could
> > not find it.
> >
> > Now, can anybody please tell me, are these really implemented?
>
> I am not sure what you really mean by such a generic term as "Data
> Compression" but if this is a name for a specific machine learning
> algorithm then it is not implemented in scikit-learn. BTW most machine
> learning models can be interpreted as doing some sort of lossy
> training set compression but the purpose is generally not to be able
> to "uncompress" the model later to recover the training set but rather
> to use the statistical summary of the training set to be able to make
> useful predictions on any future test set assuming they share the same
> statistical distribution.
>
> If you are interested in lossless compression algorithms in Python
> then you should rather use the gzip or bz2 modules of the standard
> lib:
>
> http://docs.python.org/2/library/archiving.html
>
> --
> Olivier
> http://twitter.com/ogrisel - http://github.com/ogrisel
>
> Actually, I meant algorithms like Huffman Coding, LZW Coding etc. These
are not directly related to machine learning though.
------------------------------------------------------------------------------
See everything from the browser to the database with AppDynamics
Get end-to-end visibility with application monitoring from AppDynamics
Isolate bottlenecks and diagnose root cause in seconds.
Start your free trial of AppDynamics Pro today!
http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general