Seq2sparse outputs final vectors to different directories depending upon the
TF/TFIDF weight switch. This is confusing to users.
--------------------------------------------------------------------------------------------------------------------------------
Key: MAHOUT-398
URL: https://issues.apache.org/jira/browse/MAHOUT-398
Project: Mahout
Issue Type: Improvement
Components: Utils
Affects Versions: 0.3
Reporter: Jeff Eastman
Fix For: 0.4
In TF mode, seq2sparse puts the output vectors into <output>vectors. In TFIDF
mode; however, it puts the output vectors into <output>/tfidf/vectors. Even
worse, in TFIDF mode the TFIDF converter reuses the <output>/vector/ directory
for its intermediate calculations. Seems like both modes ought to output to the
same directory so changing the switch does not cause downstream user changes
that are error-prone and confusing.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.