[ https://issues.apache.org/jira/browse/MAHOUT-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13640512#comment-13640512 ]
Saikat Kanjilal commented on MAHOUT-1047: ----------------------------------------- My apologies, I've been out on vacation, just getting back today, I tried but cant seem to repro it locally with small dictionary files, will try to use your patch and larger disctionary files. Also I noticed that in trunk I am running into errors when trying a clean install with maven, the only way to get past this is skipping tests, have you gotten all tests to run successfully? > CVB hangs after completion > -------------------------- > > Key: MAHOUT-1047 > URL: https://issues.apache.org/jira/browse/MAHOUT-1047 > Project: Mahout > Issue Type: Bug > Components: Clustering > Affects Versions: 0.7 > Environment: Ubuntu > Reporter: seth boyles > Priority: Minor > Labels: cvb, lda > Fix For: 0.7, 0.8 > > Attachments: MAHOUT-1047-Show-Leak.patch > > > After running the new LDA CVB implementation, it hangs and does not terminate > the process like every other time I run Mahout > Terminal output: > 12/07/19 11:38:49 INFO mapred.LocalJobRunner: > 12/07/19 11:38:49 INFO mapred.Task: Task 'attempt_local_0022_m_000000_0' done. > 12/07/19 11:38:49 INFO mapred.JobClient: map 100% reduce 0% > 12/07/19 11:38:49 INFO mapred.JobClient: Job complete: job_local_0022 > 12/07/19 11:38:49 INFO mapred.JobClient: Counters: 8 > 12/07/19 11:38:49 INFO mapred.JobClient: File Output Format Counters > 12/07/19 11:38:49 INFO mapred.JobClient: Bytes Written=2247793 > 12/07/19 11:38:49 INFO mapred.JobClient: File Input Format Counters > 12/07/19 11:38:49 INFO mapred.JobClient: Bytes Read=1920337 > 12/07/19 11:38:49 INFO mapred.JobClient: FileSystemCounters > 12/07/19 11:38:49 INFO mapred.JobClient: FILE_BYTES_READ=1342812616 > 12/07/19 11:38:49 INFO mapred.JobClient: FILE_BYTES_WRITTEN=1326092302 > 12/07/19 11:38:49 INFO mapred.JobClient: Map-Reduce Framework > 12/07/19 11:38:49 INFO mapred.JobClient: Map input records=2772 > 12/07/19 11:38:49 INFO mapred.JobClient: Spilled Records=0 > 12/07/19 11:38:49 INFO mapred.JobClient: SPLIT_RAW_BYTES=140 > 12/07/19 11:38:49 INFO mapred.JobClient: Map output records=2772 > 12/07/19 11:38:49 INFO driver.MahoutDriver: Program took 4089950 ms (Minutes: > 68.16583333333334) > $MAHOUT_HOME/mahout cvb -i > /home/seth/Scripted/mahout_data/vectors/vectors/vectors-for-cvb/ -o > /home/seth/Scripted/mahout_data/clusters/ -ow -k 90 -dt > /home/seth/Scripted/mahout_data/distributions -dict > /home/seth/Scripted/mahout_data/vectors/vectors/dictionary.file-0 -mt > /home/seth/Scripted/mahout_data/temp/ -x 20 -cd 0.05 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira