Re: spark sql query optimization , and decision tree building

2014-10-27 Thread Yanbo Liang
kumar [mailto:sanath1...@gmail.com] *Sent:* Wednesday, October 22, 2014 12:58 PM *To:* user@spark.apache.org *Subject:* spark sql query optimization , and decision tree building Hi all, I have a large data set in text files (1,000,000 lines). Each line has 128 columns. Here each line

RE: spark sql query optimization , and decision tree building

2014-10-22 Thread Cheng, Hao
not sure how the kd tree is used in MLlib, but keep in mind that a SchemaRDD is just a normal RDD. Cheng Hao From: sanath kumar [mailto:sanath1...@gmail.com] Sent: Wednesday, October 22, 2014 12:58 PM To: user@spark.apache.org Subject: spark sql query optimization , and decision tree building Hi all

Re: spark sql query optimization , and decision tree building

2014-10-22 Thread sanath kumar
kumar [mailto:sanath1...@gmail.com] *Sent:* Wednesday, October 22, 2014 12:58 PM *To:* user@spark.apache.org *Subject:* spark sql query optimization , and decision tree building Hi all, I have a large data set in text files (1,000,000 lines). Each line has 128 columns. Here each line

spark sql query optimization , and decision tree building

2014-10-21 Thread sanath kumar
Hi all, I have a large data set in text files (1,000,000 lines). Each line has 128 columns. Here each line is a feature and each column is a dimension. I have converted the text files to JSON format and am able to run SQL queries on the JSON files using Spark. Now I am trying to build a k dimension
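
For readers following the thread, here is a minimal, single-machine sketch of what building a kd tree over such rows could look like. The snippets above do not show how the poster intended to build the tree, so everything here is an assumption: the median-split strategy, the cycling of the split axis by depth, and the names `KDNode` and `build_kdtree` are illustrative only and are not part of Spark or MLlib. The demo rows use 3 columns in place of the 128 mentioned in the post.

```python
from typing import NamedTuple, Optional, Sequence


class KDNode(NamedTuple):
    """One node of the tree (hypothetical helper type, not a Spark class)."""
    point: Sequence[float]        # the row stored at this node
    axis: int                     # column index this node splits on
    left: Optional["KDNode"]      # rows with a smaller value on `axis`
    right: Optional["KDNode"]     # rows with a larger-or-equal value on `axis`


def build_kdtree(points, depth=0):
    """Recursively build a kd tree by splitting on the median row.

    Assumption: every row has the same number of columns (dimensions).
    The split axis cycles through the columns as depth increases.
    """
    if not points:
        return None
    k = len(points[0])                      # number of dimensions per row
    axis = depth % k                        # cycle through the columns
    ordered = sorted(points, key=lambda p: p[axis])
    mid = len(ordered) // 2                 # index of the median row
    return KDNode(
        point=ordered[mid],
        axis=axis,
        left=build_kdtree(ordered[:mid], depth + 1),
        right=build_kdtree(ordered[mid + 1:], depth + 1),
    )


# Tiny demo: each row is one feature, each column one dimension.
rows = [(1.0, 5.0, 2.0), (4.0, 1.0, 9.0), (3.0, 3.0, 3.0), (7.0, 2.0, 6.0)]
tree = build_kdtree(rows)
```

This sorts at every level, which is fine for a sketch but costs O(n log^2 n) overall; for 1,000,000 rows a distributed or selection-based build would likely be needed, and that is outside what this sketch covers.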