If we are talking about the same thing, you are returning clusters of samples and features together (i.e. rows and columns). So where K-means returns a 1D array of cluster labels, here the output would be two arrays, one of shape (n_samples,) and one of shape (n_features,). An alternative would be a list of length `n_clusters` where each element is a pair of lists: the row indices and the column indices of that cluster. But I believe the first option is uniform enough with respect to our current API.
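To make the two candidate output formats concrete, here is a minimal sketch using NumPy. The label values, the 4x5 shape, and the helper name `labels_to_index_pairs` are made up for illustration and are not part of any existing scikit-learn API.

```python
import numpy as np

# Option 1: two label arrays, analogous to the 1D labels K-means returns.
# Hypothetical labels for a data matrix with 4 samples and 5 features.
row_labels = np.array([0, 1, 0, 1])        # shape (n_samples,)
column_labels = np.array([0, 0, 1, 1, 0])  # shape (n_features,)


def labels_to_index_pairs(row_labels, column_labels, n_clusters):
    """Convert option 1 into option 2.

    Returns a list of length n_clusters; element k is a pair
    (row indices, column indices) for cluster k.
    """
    return [
        (
            np.flatnonzero(row_labels == k).tolist(),
            np.flatnonzero(column_labels == k).tolist(),
        )
        for k in range(n_clusters)
    ]


# Option 2: one (rows, columns) pair of index lists per cluster.
pairs = labels_to_index_pairs(row_labels, column_labels, n_clusters=2)
print(pairs)
```

One thing the sketch makes visible: the two-array form assigns every row and every column to exactly one cluster, while the list-of-pairs form could also describe overlapping clusters.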

On Thu, Apr 25, 2013 at 9:56 AM, Mathieu Blondel <[email protected]> wrote:
> Could you elaborate why it would require a new API?
>
> Mathieu
>
> On Apr 25, 2013 9:08 AM, "Vlad Niculae" <[email protected]> wrote:
>>
>> The Baader-Meinhof phenomenon in action -- only 2 days ago I saw a
>> talk about information-theoretic biclustering (aka co-clustering)
>> applied to opinion mining of video game reviews and the method raised
>> my attention. An efficient implementation would be very nice, but it
>> will definitely require a new API.
>>
>> On Thu, Apr 25, 2013 at 8:56 AM, Mathieu Blondel <[email protected]>
>> wrote:
>> > Hi Kemal,
>> >
>> > On Thu, Apr 25, 2013 at 6:56 AM, Kemal Eren <[email protected]> wrote:
>> >>
>> >> If you are looking for biclustering algorithms I could certainly do
>> >> that. I did my Master's thesis on it and wrote this software:
>> >> http://bmi.osu.edu/hpc/software/bibench/. Its biclustering algorithms
>> >> are wrappers to existing tools. It would be really nice to have
>> >> Python/Cython implementations in scikit-learn.
>> >
>> > The biclustering project would be my personal favorite. It's nice that
>> > you have a code base to start from. I will try to see if it's not too
>> > late to register as a mentor. Also, I see that you already have a few
>> > pull-requests under review. This is nice, since this is a requirement
>> > of the PSF for eligibility to the GSOC.
>> >
>> > What algorithms do you have in mind? If you decide to go for
>> > biclustering, you can send us a proposal draft on the mailing-list
>> > (in another thread).
>> >
>> > Thanks,
>> > Mathieu
>> >
>> > _______________________________________________
>> > Scikit-learn-general mailing list
>> > [email protected]
>> > https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
