Re: [jira] [Created] (MAHOUT-1549) Extracting tfidf-vectors by key

2014-05-15 Thread Sebastian Schelter
I'm not sure I understand your question correctly. If you know the keys, you could put them into a file, write a Map-only Job that loads the keys from the file and filters the data to only retain the key-values pairs where the key is contained in your list. Does that make sense? --sebastian

[jira] [Created] (MAHOUT-1549) Extracting tfidf-vectors by key

2014-05-07 Thread Richard Scharrer (JIRA)
Richard Scharrer created MAHOUT-1549: Summary: Extracting tfidf-vectors by key Key: MAHOUT-1549 URL: https://issues.apache.org/jira/browse/MAHOUT-1549 Project: Mahout Issue Type: Question