Re: [R] about data problem

2016-09-21 Thread Martin Maechler
> Joe Ceradini > on Tue, 20 Sep 2016 17:06:17 -0600 writes: > read.csv("your_data.csv", stringsAsFactors=FALSE) > (I'm just reiterating Jianling said...) If you do not have very many columns, and want to become more efficient and knowledgeable, I

Re: [R] about data problem

2016-09-20 Thread Jeff Newmiller
You can use the latter IF you know there are no problems with the input data. If you need to troubleshoot then you need separate columns so you can compare them. -- Sent from my phone. Please excuse my brevity. On September 20, 2016 4:22:41 PM PDT, lily li wrote:

Re: [R] about data problem

2016-09-20 Thread lily li
Thanks. The former method works. I confused character with factor. Besides, I should use: dta$DischargeNum <- as.numeric( dta$Discharge ) instead of: dta$Discharge <- as.numeric( dta$Discharge ) On Tue, Sep 20, 2016 at 5:18 PM, Jeff Newmiller wrote: > Which means it

Re: [R] about data problem

2016-09-20 Thread Jeff Newmiller
Which means it avoided converting to factor... Success! Note that the column apparently has garbage characters in one or more of the rows, which should be evident when you LOOK AT THE CHARACTERS in the column. They should all be numeric symbols, plus or minus, and perhaps decimal points. If

Re: [R] about data problem

2016-09-20 Thread lily li
Yes, I tried to add this statement when reading the dataset. But when I use summary(df), it shows: Discharge Length: Class :character Mode :character On Tue, Sep 20, 2016 at 5:06 PM, Joe Ceradini wrote: > read.csv("your_data.csv", stringsAsFactors=FALSE) > (I'm just

Re: [R] about data problem

2016-09-20 Thread Jeff Newmiller
Find the offending data. One approach is to look at the input data with your image sensors and neural pattern processor (eyes and brain). One way to reduce the load on those told is to read in the data with the stringsAsFactors=TRUE argument and try manually converting the resulting character

Re: [R] about data problem

2016-09-20 Thread Joe Ceradini
read.csv("your_data.csv", stringsAsFactors=FALSE) (I'm just reiterating Jianling said...) Joe On Tue, Sep 20, 2016 at 4:56 PM, lily li wrote: > Is there a function in read.csv that I can use to avoid converting numeric > to factor? Thanks a lot. > > > > On Tue, Sep 20,

Re: [R] about data problem

2016-09-20 Thread lily li
Is there a function in read.csv that I can use to avoid converting numeric to factor? Thanks a lot. On Tue, Sep 20, 2016 at 4:42 PM, lily li wrote: > Thanks. Then what should I do to solve the problem? > > On Tue, Sep 20, 2016 at 4:30 PM, Jeff Newmiller

Re: [R] about data problem

2016-09-20 Thread lily li
Thanks. Then what should I do to solve the problem? On Tue, Sep 20, 2016 at 4:30 PM, Jeff Newmiller wrote: > I suppose you can do what works for your data, but I wouldn't recommend > na.rm=TRUE because it hides problems rather than clarifying them. > > If in fact your

Re: [R] about data problem

2016-09-20 Thread Jeff Newmiller
I suppose you can do what works for your data, but I wouldn't recommend na.rm=TRUE because it hides problems rather than clarifying them. If in fact your data includes true NA values (the letters NA or simply nothing between the commas are typical ways this information may be indicated), then

Re: [R] about data problem

2016-09-20 Thread lily li
I reread the data, and use 'na.rm = T' when reading the data. This time it has no such problem. It seems that the existence of NAs convert the integer to factor. Thanks for your help. On Tue, Sep 20, 2016 at 4:09 PM, Jianling Fan wrote: > Add the "stringsAsFactors = F"

Re: [R] about data problem

2016-09-20 Thread Jianling Fan
Add the "stringsAsFactors = F" when you read the data, and then convert them to numeric. On 20 September 2016 at 16:00, lily li wrote: > Yes, it is stored as factor. I can't check out any problem in the original > data. Reread data doesn't help either. I use read.csv to

Re: [R] about data problem

2016-09-20 Thread lily li
Yes, it is stored as factor. I can't check out any problem in the original data. Reread data doesn't help either. I use read.csv to read in the data, do you think it is better to use read.table? Thanks again. On Tue, Sep 20, 2016 at 3:55 PM, Greg Snow <538...@gmail.com> wrote: > This indicates

Re: [R] about data problem

2016-09-20 Thread Greg Snow
This indicates that your Discharge column has been stored/converted as a factor (run str(df) to verify and check other columns). This usually happens when functions like read.table are left to try to figure out what each column is and it finds something in that column that cannot be converted to

[R] about data problem

2016-09-20 Thread lily li
Hi R users, I have a problem in reading data. For example, part of my dataframe is like this: df month day year Discharge 31 20106.4 32 2010 7.58 33 2010 6.82 34 2010 8.63 3