Thanks Olivier and Lars, I'll take another look.
Cheers, Nigel
07914 740972
On 18 October 2013 09:16, Olivier Grisel wrote:
> 2013/10/18 Lars Buitinck :
> > 2013/10/18 Nigel Legg :
> >> What am I doing wrong here?
> >
> > Could be lots of things. In any case, using an untuned SVC for this
> >
2013/10/18 Lars Buitinck :
> 2013/10/18 Nigel Legg :
>> What am I doing wrong here?
>
> Could be lots of things. In any case, using an untuned SVC for this
> task is a bad idea because (a) you need to tune it and (b) it's an
> SVC. Better try LinearSVC or SGDClassifier.
Indeed, SVC is using a RBF
2013/10/18 Nigel Legg :
> What am I doing wrong here?
Could be lots of things. In any case, using an untuned SVC for this
task is a bad idea because (a) you need to tune it and (b) it's an
SVC. Better try LinearSVC or SGDClassifier.
Hi Nigel,
What is the proportion of English versus non-English tweets in your data?
It may be the case that your dataset is unbalanced.
Gilles
On 18 October 2013 09:32, Nigel Legg wrote:
> I have a set of tweets, and I am trying to use an SVM classifier to class
> them as being English or ano
I have a set of tweets, and I am trying to use an SVM classifier to class
them as being English or another language. I have a training set which has
been classified by hand.
My code:
train_set = open(train_set)
corpus = []
i = 0
for line in train_set:
x = line.find(',')
text = line[:x]