Dimension reduction is addressed with PCA which is an option of SSVD
method.
However, if you can research/offer parallelization strategy for ICA, i'd be
all ears.

there's also ongoing push to create a DSL environment for mahout
distributed matrices to Spark which i personally think is one of the most
promising recent developments. It is also an treasure chest (or a can of
worms depending on how you view it) for new people to chime in. DSL
environment issue is MAHOUT-1346, with everything else pretty much
dependent on it

-d




On Tue, Nov 26, 2013 at 9:19 AM, Олександр Ольгашко <
alexandrolg...@gmail.com> wrote:

> Hello,
>
> I am a student, interested in data analysis, also i have chosen this theme
> for my diploma work. As mentioned here
> https://cwiki.apache.org/confluence/display/MAHOUT/Algorithms, there are
> some open algorithms, for example, in Dimension reduction section.
>
> So, how can i start develop them? I have some theoretical background, but i
> think, there may be some unknown problems. Mb somebody is working on these
> algorithms. Can you give some tips for start?
>
> I searched in JIRA for Independent Component Analysis, found nothing.
>
> Thanks in advance.
>

Reply via email to