Re: [Scikit-learn-general] SVM classifier.

2013-10-18 Thread Nigel Legg
Thanks Olivier and Lars, I'll take another look.
Cheers,
Nigel
07914 740972
On 18 October 2013 09:16, Olivier Grisel wrote:
> 2013/10/18 Lars Buitinck :
> > 2013/10/18 Nigel Legg :
> >> What am I doing wrong here?
> >
> > Could be lots of things. In any case, using an untuned SVC for this
> >

Re: [Scikit-learn-general] SVM classifier.

2013-10-18 Thread Olivier Grisel
2013/10/18 Lars Buitinck :
> 2013/10/18 Nigel Legg :
>> What am I doing wrong here?
>
> Could be lots of things. In any case, using an untuned SVC for this
> task is a bad idea because (a) you need to tune it and (b) it's an
> SVC. Better try LinearSVC or SGDClassifier.
Indeed, SVC is using an RBF

Re: [Scikit-learn-general] SVM classifier.

2013-10-18 Thread Lars Buitinck
2013/10/18 Nigel Legg :
> What am I doing wrong here?
Could be lots of things. In any case, using an untuned SVC for this task is a bad idea because (a) you need to tune it and (b) it's an SVC. Better try LinearSVC or SGDClassifier.
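To make the LinearSVC suggestion concrete, here is a minimal sketch of a linear SVM trained on character n-gram TF-IDF features. The toy tweets, the label names, and the n-gram range are all assumptions made for illustration; none of them come from the original data or code.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Hypothetical toy data; the real training set was hand-labelled tweets.
texts = [
    "the weather is lovely today",
    "I am going to the shop later",
    "what are you doing this weekend",
    "le temps est magnifique aujourd'hui",
    "je vais au magasin plus tard",
    "que fais-tu ce week-end",
]
labels = ["en", "en", "en", "other", "other", "other"]

# Character n-grams are an assumed feature choice for language identification.
model = make_pipeline(
    TfidfVectorizer(analyzer="char", ngram_range=(1, 3)),
    LinearSVC(C=1.0),
)
model.fit(texts, labels)

# Predict the language label of two unseen strings.
predictions = model.predict(["where is the station", "où est la gare"])
print(predictions)
```

Even with a linear model, the regularisation parameter C would still need tuning by cross-validation on the real data.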

Re: [Scikit-learn-general] SVM classifier.

2013-10-18 Thread Gilles Louppe
Hi Nigel,
What is the proportion of English versus non-English tweets in your data? It may be the case that your dataset is unbalanced.
Gilles
On 18 October 2013 09:32, Nigel Legg wrote:
> I have a set of tweets, and I am trying to use an SVM classifier to class
> them as being English or ano
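One quick way to answer the question about class proportions is to count the labels before training. The sketch below uses only the standard library; the 90/10 split and the label names are hypothetical stand-ins for the hand-classified training set.

```python
from collections import Counter

# Hypothetical labels; in practice these come from the hand-classified training set.
labels = ["en"] * 90 + ["other"] * 10

counts = Counter(labels)
total = sum(counts.values())
proportions = {label: n / total for label, n in counts.items()}

for label, share in proportions.items():
    print(f"{label}: {counts[label]} tweets ({share:.0%})")

# Accuracy obtained by always predicting the most common class.
majority_baseline = max(proportions.values())
print(f"majority-class baseline accuracy: {majority_baseline:.0%}")
```

If one class dominates, a classifier that always predicts it can look accurate while learning nothing, so the majority-class baseline is a useful number to compare against. LinearSVC also accepts a class_weight parameter that may help with skewed data.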

[Scikit-learn-general] SVM classifier.

2013-10-18 Thread Nigel Legg
I have a set of tweets, and I am trying to use an SVM classifier to class them as being English or another language. I have a training set which has been classified by hand. My code:
    train_set = open(train_set)
    corpus = []
    i = 0
    for line in train_set:
        x = line.find(',')
        text = line[:x]
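The code above is cut off in the archive, so the layout of the training file is not known. Purely as an assumption, suppose each line is the tweet text followed by a comma and a label. In that case, splitting on the first comma with line.find(',') would cut off any tweet that itself contains a comma. A hedged sketch that splits on the last comma instead (the sample lines and label names are invented):

```python
import io

# Hypothetical file contents; assumes each line is "<tweet text>,<label>".
sample = io.StringIO(
    "hello, how are you today,en\n"
    "bonjour tout le monde,other\n"
)

corpus = []
labels = []
for line in sample:
    line = line.rstrip("\n")
    if not line:
        continue
    # Split on the LAST comma so commas inside the tweet stay in the text.
    text, _, label = line.rpartition(",")
    corpus.append(text)
    labels.append(label)

print(corpus)
print(labels)
```

If the real file is proper CSV with quoted fields, the standard csv module would be the safer choice.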