Re: [jira] [Created] (MAHOUT-1549) Extracting tfidf-vectors by key
I'm not sure I understand your question correctly. If you know the keys, you could put them into a file, write a Map-only Job that loads the keys from the file and filters the data to only retain the key-values pairs where the key is contained in your list. Does that make sense? --sebastian
[jira] [Created] (MAHOUT-1549) Extracting tfidf-vectors by key
Richard Scharrer created MAHOUT-1549: Summary: Extracting tfidf-vectors by key Key: MAHOUT-1549 URL: https://issues.apache.org/jira/browse/MAHOUT-1549 Project: Mahout Issue Type: Question