Hi,
can you post the exact parameters you used to call the job? Please also
take another look at your error logs: I suspect that something else had
already gone wrong before the exception you posted occurred.
--sebastian
On 20.12.2010 08:50, Gayatri Rao wrote:
Hi,
I have been trying to run the Hadoop item-based collaborative filtering job
as described in
https://cwiki.apache.org/confluence/display/MAHOUT/TasteCommandLine
A few MR jobs run successfully
(RecommenderJob-ItemIDIndexMapper-ItemIDIndexReduce,
RecommenderJob-ToItemPrefsMapper-ToUserVectorReduc,
RecommenderJob-CountUsersMapper-CountUsersReducer,
RecommenderJob-MaybePruneRowsMapper-ToItemVectorsR),
after which the job dies with an exception:
Exception in thread "main" org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: /data/temp/similarityMatrix
    at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.listStatus(FileInputFormat.java:224)
    at org.apache.hadoop.mapreduce.lib.input.SequenceFileInputFormat.listStatus(SequenceFileInputFormat.java:55)
    at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.getSplits(FileInputFormat.java:241)
    at org.apache.hadoop.mapred.JobClient.writeNewSplits(JobClient.java:885)
    at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:779)
    at org.apache.hadoop.mapreduce.Job.submit(Job.java:432)
    at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:447)
    at org.apache.mahout.cf.taste.hadoop.item.RecommenderJob.run(RecommenderJob.java:234)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.mahout.cf.taste.hadoop.item.RecommenderJob.main(RecommenderJob.java:328)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
I find the following directories in /data/temp:
[gaya...@e1aeu046110d mahout-distribution-0.4]$ hadoop dfs -ls /data/temp
Found 4 items
drwxr-xr-x - gayatri supergroup 0 2010-12-17 16:53 /data/temp/countUsers
drwxr-xr-x - gayatri supergroup 0 2010-12-17 16:51 /data/temp/itemIDIndex
drwxr-xr-x - gayatri supergroup 0 2010-12-17 16:54 /data/temp/itemUserMatrix
drwxr-xr-x - gayatri supergroup 0 2010-12-17 16:52 /data/temp/userVectors
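
The listing above can be compared mechanically with the path named in the exception. This is only a rough sketch: the helper names listed_paths and missing_paths are made up for illustration and are not part of Hadoop or Mahout, and the listing text is simply pasted in as a string rather than read from the cluster.

```python
def listed_paths(ls_output):
    """Collect every absolute path token found in `hadoop dfs -ls` text output."""
    paths = set()
    for line in ls_output.splitlines():
        for token in line.split():
            # In the listing, only the path column starts with a slash.
            if token.startswith("/"):
                paths.add(token)
    return paths


def missing_paths(ls_output, expected):
    """Return the expected paths that do not appear in the listing, in order."""
    present = listed_paths(ls_output)
    return [path for path in expected if path not in present]


# Listing copied from the output above.
LISTING = """Found 4 items
drwxr-xr-x - gayatri supergroup 0 2010-12-17 16:53 /data/temp/countUsers
drwxr-xr-x - gayatri supergroup 0 2010-12-17 16:51 /data/temp/itemIDIndex
drwxr-xr-x - gayatri supergroup 0 2010-12-17 16:54 /data/temp/itemUserMatrix
drwxr-xr-x - gayatri supergroup 0 2010-12-17 16:52 /data/temp/userVectors
"""

if __name__ == "__main__":
    # The second path is the one the InvalidInputException complains about.
    expected = ["/data/temp/itemUserMatrix", "/data/temp/similarityMatrix"]
    print(missing_paths(LISTING, expected))
```

Run against this listing, the check reports only /data/temp/similarityMatrix as missing, so whatever step was supposed to write that directory never produced it.
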
Is this a configuration issue? I cannot work out what the error might
be.
Thanks
Gayatri