On Sun, 16 Dec 2018 at 09:49, Vincent Davis <[email protected]> wrote: > > Why not start with a histogram. > > Vincent > > On Sat, Dec 15, 2018 at 6:46 PM Marc Lucke <[email protected]> wrote: > > > hey guys, > > > > I have a hobby project that sorts my email automatically for me & I want > > to improve it. There's data science and statistical info that I'm > > missing, & I always enjoy reading about the pythonic way to do things too. > > > > I have a list of percentage scores:
[clipped for brevity] > > That algorithm is entirely untested & I think it could work, it's just I > > don't want to reinvent the wheel. Any ideas kindly appreciated. > > > > > > -- > > https://mail.python.org/mailman/listinfo/python-list > > > -- > https://mail.python.org/mailman/listinfo/python-list +1 for k means certainly. Also k means in 1D will be like a simple distance comparison and assignment. A quick Google will give you the exact codes for doing so. It will be you yourself who will decide how many clusters you want, as Avi has rightly pointed out. -- /Shakti -- https://mail.python.org/mailman/listinfo/python-list
