Dimension reduction is addressed with PCA, which is an option of the SSVD method. However, if you can research or offer a parallelization strategy for ICA, I'd be all ears.
There's also an ongoing push to create a DSL environment for Mahout distributed matrices on Spark, which I personally think is one of the most promising recent developments. It is also a treasure chest (or a can of worms, depending on how you view it) for new people to chime in. The DSL environment issue is MAHOUT-1346, with pretty much everything else dependent on it.

-d

On Tue, Nov 26, 2013 at 9:19 AM, Олександр Ольгашко <alexandrolg...@gmail.com> wrote:

> Hello,
>
> I am a student interested in data analysis, and I have chosen this theme
> for my diploma work. As mentioned here
> https://cwiki.apache.org/confluence/display/MAHOUT/Algorithms, there are
> some open algorithms, for example, in the Dimension reduction section.
>
> So, how can I start developing them? I have some theoretical background, but I
> think there may be some unknown problems. Maybe somebody is already working on
> these algorithms. Can you give some tips for getting started?
>
> I searched in JIRA for Independent Component Analysis and found nothing.
>
> Thanks in advance.
>