On Thu, Aug 1, 2013 at 7:08 AM, Sebastian Schelter <s...@apache.org> wrote:

> IIRC the main reasons for deprecating Lanczos was that in contrast to
> SSVD, it does not use a constant number of MapReduce jobs and that our
> implementation has the constraint that all the resulting vectors have to
> fit into the memory of the driver machine.
>

While it's true that Lanczos does not use a constant number of MR
iterations,
the phrase "our implementation" is key in saying we have to hold all the
output
vectors in memory.  This wasn't even a very integral part of our impl.
 It's fairly
simple to implement the linear combinations of the Ritz vectors after
iterations
are complete as an operation keeping only 3 vectors in memory at a time, we
just never made that optimization.


>
> Best,
> Sebastian
>
> On 01.08.2013 12:15, Fernando Fernández wrote:
> > Hi everyone,
> >
> > Sorry if I duplicate the question but I've been looking for an answer
> and I
> > haven't found an explanation other than it's not being used (together
> with
> > some other algorithms). If it's been discussed in depth before maybe you
> > can point me to some link with the discussion.
> >
> > I have successfully used Lanczos in several projects and it's been a
> > surprise to me finding that the main reason (according to what I've read
> > that might not be the full story) is that it's not being used. At the
> > begining I supposed it was because SSVD is supposed to be much faster
> with
> > similar results, but after making some tests I have found that running
> > times are similar or even worse than lanczos for some configurations (I
> > have tried several combinations of parameters, given child processes
> enough
> > memory, etc. and had no success in running SSVD at least in 3/4 of time
> > Lanczos runs, thouh they might be some combinations of parameters I have
> > still not tried). It seems to be quite tricky to find a good combination
> of
> > parameters for SSVD and I have seen also a precision loss in some
> examples
> > that makes me not confident in migrating Lanczos to SSVD from now on (How
> > far can I trust results from a combination of parameters that runs in
> > significant less time, or at least a good time?).
> >
> > Can someone convince me that SSVD is actually a better option than
> Lanczos?
> > (I'm totally willing to be convinced... :) )
> >
> > Thank you very much in advance.
> >
> > Fernando.
> >
>
>


-- 

  -jake

Reply via email to