Re: Binning for numerical dataset

2014-02-04 Thread unmesha sreeveni
To do binning in MapReduce, we need to find the min and max in the mapper and let the mapper pass the min and max values to the reducer. The reducer then calculates the buckets. Is that the best way?

--
*Thanks & Regards*
Unmesha Sreeveni U.B
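A minimal sketch of the min/max idea described above, written in plain Python rather than against the Hadoop API. It assumes equal-width bins, which the thread does not specify; the function names, the sample splits, and the bin count are all hypothetical. Each "mapper" emits the local min and max of its input split, a single "reducer" combines them into global bounds, and a second pass over the data assigns each value to a bucket once the bounds are known.

```python
from typing import Iterable, List, Tuple


def map_min_max(values: Iterable[float]) -> Tuple[float, float]:
    """Mapper side (hypothetical): local min and max of one input split."""
    values = list(values)
    return min(values), max(values)


def reduce_min_max(partials: Iterable[Tuple[float, float]]) -> Tuple[float, float]:
    """Reducer side (hypothetical): combine local (min, max) pairs into global bounds."""
    lows, highs = zip(*partials)
    return min(lows), max(highs)


def bin_index(value: float, lo: float, hi: float, num_bins: int) -> int:
    """Equal-width bucket index in [0, num_bins - 1]; an assumed binning scheme."""
    if hi == lo:
        return 0  # all values identical, so everything falls in one bucket
    width = (hi - lo) / num_bins
    # Clamp so that value == hi lands in the last bucket instead of overflowing.
    return min(int((value - lo) / width), num_bins - 1)


# Made-up input splits standing in for one numeric column of a dataset.
splits: List[List[float]] = [[1.0, 2.0, 9.0], [4.0, 6.0], [0.0, 10.0]]

# Pass 1: per-split min/max, then a single reduce to get global bounds.
lo, hi = reduce_min_max(map_min_max(s) for s in splits)

# Pass 2: with the global bounds known, assign every value to a bucket.
bins = [bin_index(v, lo, hi, num_bins=5) for s in splits for v in s]
print(lo, hi, bins)
```

The second pass is needed because no bucket can be assigned until the global minimum and maximum are known. In a real job, that would probably mean passing the bounds to a second map phase through the job configuration or a similar mechanism.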

Binning for numerical dataset

2014-02-04 Thread unmesha sreeveni
I am able to normalize given data, say

100,1:2:3
101,2:3:4

into

100 1
100 2
100 3
101 2
101 3
101 4

How do I do binning for numerical data, say iris.csv? I worked out the maths behind it.

Iris DataSet: http://archive.ics.uci.edu/ml/datasets/Iris

1. find out the minimum and maximum values of each