[ 
https://issues.apache.org/jira/browse/MAHOUT-524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13132810#comment-13132810
 ] 

Jeff Eastman commented on MAHOUT-524:
-------------------------------------

All of this is buried inside of DistributedLanczosSolver. Either the problem 
resides in there and should impact all users of DLS or it is in the 
SpectralKMeansDriver setup which invokes the DLS. Turns out the DLS.runJob(...) 
method employed (line 65) is only called by spectral clustering (KMeans and 
Eigencuts). The one other caller, DLS.runJob(...) (line 80) is itself never 
called.

Just looking at the invocation site (SpectralKMeansDriver.run() line 155, I see 
two file paths being passed into DLS.runJob(...): the lanczosSeqFiles path is 
output/calculations/eigenvectors-17, the desired output path, and the 
LanczosState is constructed with L, a DRM with inputPath 
examples/output/calculations/laplacian-89. This is the input path which is 
failing in getFileStatus and causing the exception. Both of these look 
reasonable to me.

There are; however, several different Configuration objects being manipulated 
by SKMD. I'm suspicious there is something horked in one of them which is 
causing the DLS file not found.
                
> DisplaySpectralKMeans example fails
> -----------------------------------
>
>                 Key: MAHOUT-524
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-524
>             Project: Mahout
>          Issue Type: Bug
>          Components: Clustering
>    Affects Versions: 0.4, 0.5
>            Reporter: Jeff Eastman
>            Assignee: Shannon Quinn
>              Labels: clustering, k-means, visualization
>             Fix For: 0.6
>
>         Attachments: EclipseLog_20110918.txt, 
> SpectralKMeans_fail_20110919.txt, aff.txt, raw.txt, spectralkmeans.png
>
>
> I've committed a new display example that attempts to push the standard 
> mixture of models data set through spectral k-means. After some tweaking of 
> configuration arguments and a bug fix in EigenCleanupJob it runs spectral 
> k-means to completion. The display example is expecting 2-d clustered points 
> and the example is producing 5-d points. Additional I/O work is needed before 
> this will play with the rest of the clustering algorithms. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to