Re: [R] Factor levels in training set

2016-06-15 Thread PIKAL Petr
Hi Elahe

I get slightly different error when using scale to nonnumeric data so I am not 
sure if you use the scale function from base package.

> scale(raman[1:20,])
Error in colMeans(x, na.rm = TRUE) : 'x' must be numeric

Anyway, how do you expect scaling shall be done when you have nonumeric 
variable. What shall be the output of

scale(iris$Species)

The only workaround is either to scale only numeric variables from your data 
and add nonnumeric in folowing step or to change all factor variable to numeric 
before scaling (which I would not recommend).

If your data are supposed to be numeric you can check if they really are by

str(df)

Cheers
Petr

> -Original Message-
> From: R-help [mailto:r-help-boun...@r-project.org] On Behalf Of ch.elahe
> via R-help
> Sent: Tuesday, June 14, 2016 5:29 PM
> To: R-help Mailing List <r-help@r-project.org>
> Subject: [R] Factor levels in training set
>
>
>  Hi all,
> I want to use Supervised Self organizing Maps from Kohonen package for my
> data. I need to divide my df into training set and test set, but a part of my 
> df
> contains column with factor levels and I don't know how to bring them into
> my training set. Currently I use the following command for my training set:
>
> dt=sort(sample(nrow(df),nrow(df)*.7))
> training=m[dt,]
> till here I get no error but in the next step which I need to bring my 
> training
> set in a matrix I face this error:
>
> scale(df[training,])
> error: 'x' should be numeric
> Does anyone know how should I include column with factor levels in my df so
> that I don't get this error?
> Thanks for any help,
> Elahe
>
> __
> R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-
> guide.html
> and provide commented, minimal, self-contained, reproducible code.


Tento e-mail a jakékoliv k němu připojené dokumenty jsou důvěrné a jsou určeny 
pouze jeho adresátům.
Jestliže jste obdržel(a) tento e-mail omylem, informujte laskavě neprodleně 
jeho odesílatele. Obsah tohoto emailu i s přílohami a jeho kopie vymažte ze 
svého systému.
Nejste-li zamýšleným adresátem tohoto emailu, nejste oprávněni tento email 
jakkoliv užívat, rozšiřovat, kopírovat či zveřejňovat.
Odesílatel e-mailu neodpovídá za eventuální škodu způsobenou modifikacemi či 
zpožděním přenosu e-mailu.

V případě, že je tento e-mail součástí obchodního jednání:
- vyhrazuje si odesílatel právo ukončit kdykoliv jednání o uzavření smlouvy, a 
to z jakéhokoliv důvodu i bez uvedení důvodu.
- a obsahuje-li nabídku, je adresát oprávněn nabídku bezodkladně přijmout; 
Odesílatel tohoto e-mailu (nabídky) vylučuje přijetí nabídky ze strany příjemce 
s dodatkem či odchylkou.
- trvá odesílatel na tom, že příslušná smlouva je uzavřena teprve výslovným 
dosažením shody na všech jejích náležitostech.
- odesílatel tohoto emailu informuje, že není oprávněn uzavírat za společnost 
žádné smlouvy s výjimkou případů, kdy k tomu byl písemně zmocněn nebo písemně 
pověřen a takové pověření nebo plná moc byly adresátovi tohoto emailu případně 
osobě, kterou adresát zastupuje, předloženy nebo jejich existence je adresátovi 
či osobě jím zastoupené známá.

This e-mail and any documents attached to it may be confidential and are 
intended only for its intended recipients.
If you received this e-mail by mistake, please immediately inform its sender. 
Delete the contents of this e-mail with all attachments and its copies from 
your system.
If you are not the intended recipient of this e-mail, you are not authorized to 
use, disseminate, copy or disclose this e-mail in any manner.
The sender of this e-mail shall not be liable for any possible damage caused by 
modifications of the e-mail or by delay with transfer of the email.

In case that this e-mail forms part of business dealings:
- the sender reserves the right to end negotiations about entering into a 
contract in any time, for any reason, and without stating any reasoning.
- if the e-mail contains an offer, the recipient is entitled to immediately 
accept such offer; The sender of this e-mail (offer) excludes any acceptance of 
the offer on the part of the recipient containing any amendment or variation.
- the sender insists on that the respective contract is concluded only upon an 
express mutual agreement on all its aspects.
- the sender of this e-mail informs that he/she is not authorized to enter into 
any contracts on behalf of the company except for cases in which he/she is 
expressly authorized to do so in writing, and such authorization or power of 
attorney is submitted to the recipient or the person represented by the 
recipient, or the existence of such authorization is known to the recipient of 
the person

[R] Factor levels in training set

2016-06-14 Thread ch.elahe via R-help
Hi all, 
I want to use Supervised Self organizing Maps from Kohonen package for my data. 
I need to divide my df into training set and test set, but a part of my df 
contains column with factor levels and I don't know how to bring them into my 
training set. Currently I use the following command for my training set:
 
dt=sort(sample(nrow(df),nrow(df)*.7))
training=m[dt,]
till here I get no error but in the next step which I need to bring my training 
set in a matrix I face this error:

scale(df[training,])
error: 'x' should be numeric
Does anyone know how should I include column with factor levels in my df so 
that I don't get this error?
Thanks for any help,
Elahe

__
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Factor levels in training set

2016-06-14 Thread ch.elahe via R-help

 Hi all, 
I want to use Supervised Self organizing Maps from Kohonen package for my data. 
I need to divide my df into training set and test set, but a part of my df 
contains column with factor levels and I don't know how to bring them into my 
training set. Currently I use the following command for my training set:

dt=sort(sample(nrow(df),nrow(df)*.7))
training=m[dt,]
till here I get no error but in the next step which I need to bring my training 
set in a matrix I face this error:

scale(df[training,])
error: 'x' should be numeric
Does anyone know how should I include column with factor levels in my df so 
that I don't get this error?
Thanks for any help,
Elahe

__
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.