Hello Everyone, I have products and there are certain associated tags to each product. So to find similar products I am using mahout spark-rowsimilarity algorithm in following manner.
$MAHOUT_HOME/mahout spark-rowsimilarity -i hdfs://0.0.0.0:9000/wtrousers -o hdfs://0.0.0.0:9000/s_trousers_out1/ -D:spark.io.compression.=lzf -ma spark://0.0.0.0:7077 To run this command I need to pull data from database to flat file. Is there anyway I can use this command / write java code directly to work on database? -- Thanks & Regards, *Rohit Jain* Web developer | Consultant Mob +91 8097283931