[
https://issues.apache.org/jira/browse/MAHOUT-524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13132810#comment-13132810
]
Jeff Eastman commented on MAHOUT-524:
-------------------------------------
All of this is buried inside DistributedLanczosSolver (DLS). Either the problem
resides in there and should impact all users of DLS, or it is in the
SpectralKMeansDriver setup that invokes the DLS. It turns out that the
DLS.runJob(...) method employed (line 65) is only called by spectral clustering
(KMeans and Eigencuts). Its one other caller, the DLS.runJob(...) at line 80,
is itself never called.
Just looking at the invocation site (SpectralKMeansDriver.run(), line 155), I
see two file paths being passed into DLS.runJob(...): the lanczosSeqFiles path
is output/calculations/eigenvectors-17, the desired output path, and the
LanczosState is constructed with L, a DRM with inputPath
examples/output/calculations/laplacian-89. The latter is the input path that is
failing in getFileStatus and causing the exception. Both of these look
reasonable to me.
There are, however, several different Configuration objects being manipulated
by SpectralKMeansDriver (SKMD). I'm suspicious there is something horked in one
of them which is causing the file-not-found failure in DLS.
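
To illustrate the kind of failure I have in mind (this is only a hypothesis, not
a confirmed diagnosis): if two contexts resolve the same relative path against
different base directories, a file written through one will not be found through
the other. The sketch below uses plain java.nio paths as a stand-in, not the
Hadoop Configuration/FileSystem API, and the two base directories are made up
for the illustration.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

public class PathResolutionSketch {

    // Qualify a relative path against a base directory, roughly the way a
    // filesystem client turns a relative path into an absolute one before
    // looking it up. This is an analogy, not the Hadoop implementation.
    public static Path qualify(String baseDir, String relative) {
        return Paths.get(baseDir).resolve(relative).normalize();
    }

    public static void main(String[] args) {
        // The relative input path quoted in the comment above.
        String input = "examples/output/calculations/laplacian-89";

        // Hypothetical base directories standing in for two differently
        // configured contexts.
        Path seenByWriter = qualify("/user/someone", input);
        Path seenByReader = qualify("/tmp/elsewhere", input);

        System.out.println("writer resolves to: " + seenByWriter);
        System.out.println("reader resolves to: " + seenByReader);
        System.out.println("same location? " + seenByWriter.equals(seenByReader));
    }
}
```

If something like this is happening, printing the fully qualified form of the
input path in SKMD and again in DLS, just before the getFileStatus call, should
show whether the two sides agree on where the file lives.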
> DisplaySpectralKMeans example fails
> -----------------------------------
>
> Key: MAHOUT-524
> URL: https://issues.apache.org/jira/browse/MAHOUT-524
> Project: Mahout
> Issue Type: Bug
> Components: Clustering
> Affects Versions: 0.4, 0.5
> Reporter: Jeff Eastman
> Assignee: Shannon Quinn
> Labels: clustering, k-means, visualization
> Fix For: 0.6
>
> Attachments: EclipseLog_20110918.txt,
> SpectralKMeans_fail_20110919.txt, aff.txt, raw.txt, spectralkmeans.png
>
>
> I've committed a new display example that attempts to push the standard
> mixture of models data set through spectral k-means. After some tweaking of
> configuration arguments and a bug fix in EigenCleanupJob it runs spectral
> k-means to completion. The display example is expecting 2-d clustered points
> and the example is producing 5-d points. Additional I/O work is needed before
> this will play with the rest of the clustering algorithms.