There are various techniques, but the right one depends on what you are trying to do: the kind of input data and the nature of the algorithm. You can browse through https://www.analyticsvidhya.com/blog/2015/11/easy-methods-deal-categorical-variables-predictive-modeling/ which should give you a starting point.

On Dec 20, 2016, at 3:05 PM, big data <bigdatab...@outlook.com> wrote:
Our source data are string-based, like this:

col1  col2  col3  ...
aaa   bbb   ccc
aa2   bb2   cc2
aa3   bb3   cc3
...   ...   ...

How can we convert all of this data to double so that we can apply MLlib's algorithms?

Thanks.
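
For string columns like the ones in the question, one common starting point is to give each distinct string a numeric index and, where the algorithm should not read an order into those indices, expand each index into a one-hot vector. The sketch below shows the idea in plain Python on the three sample rows. It does not use Spark; the helper names (fit_string_index, one_hot) are hypothetical, and the frequency-then-alphabetical ordering is an assumption made for this illustration. Spark ML's StringIndexer and OneHotEncoder feature transformers are built around the same idea, so treat this as a way to see what the conversion produces rather than as the recommended implementation.

```python
from collections import Counter


def fit_string_index(values):
    """Map each distinct string to a double index (hypothetical helper).

    The most frequent label gets 0.0; ties are broken alphabetically
    so the mapping is deterministic. This ordering is an assumption.
    """
    counts = Counter(values)
    ordered = sorted(counts, key=lambda label: (-counts[label], label))
    return {label: float(i) for i, label in enumerate(ordered)}


def one_hot(index, size):
    """Return a list of `size` doubles with 1.0 at position `index`."""
    vec = [0.0] * size
    vec[int(index)] = 1.0
    return vec


# Sample rows taken from the question: three string columns.
rows = [
    ("aaa", "bbb", "ccc"),
    ("aa2", "bb2", "cc2"),
    ("aa3", "bb3", "cc3"),
]

# Fit one mapping per column, since each column has its own categories.
columns = list(zip(*rows))
mappings = [fit_string_index(col) for col in columns]

# Replace every string with its double index.
indexed = [[mappings[j][value] for j, value in enumerate(row)] for row in rows]

# Optionally expand each index into a one-hot block and concatenate the blocks.
encoded = [
    [x for j, idx in enumerate(row) for x in one_hot(idx, len(mappings[j]))]
    for row in indexed
]

for row in indexed:
    print(row)
```

Plain indices are usually fine for tree-based models, while linear models tend to need the one-hot form, because an index such as 2.0 would otherwise be treated as "twice" the category at 1.0. If the strings are free text rather than categories, a different technique (for example, hashing or TF-IDF features) would be a better fit.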