Assuming the problem is that the database program has embedded
extra characters in your input file, you could try extracting
just the printable text using the UNIX strings program:

   strings myfile.csv > myfile.txt

and see if myfile.txt works with R. If it does not, check
how it differs from the .csv file.
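
One way to look for the offending bytes directly is sketched below. This is
only a rough sketch, not a tested recipe for FileMakerPro output: the sample
file contents are made up for illustration (a single byte 0xB3 after
"Draszt 0"), and the variable names csv and suspect_count are my own. It
assumes the stray characters are bytes outside plain printable ASCII, which
may or may not be what your file actually contains.

```shell
#!/bin/sh
# Sketch: count and display the bytes in a file that are not plain ASCII text.
# The sample data is hypothetical; point csv at your own file instead.
tmpdir=$(mktemp -d)
csv="$tmpdir/myfile.csv"
printf 'name\nDraszt 0\263\nPlain 03\n' > "$csv"   # \263 is octal for byte 0xB3

# Delete tab, newline, carriage return and printable ASCII (octal 40-176);
# whatever is left over is suspect. LC_ALL=C makes tr work on raw bytes.
suspect_count=$(LC_ALL=C tr -d '\11\12\15\40-\176' < "$csv" | wc -c | tr -d ' ')
echo "suspect bytes: $suspect_count"

# Show the suspect bytes themselves, one octal value per byte.
LC_ALL=C tr -d '\11\12\15\40-\176' < "$csv" | od -An -b

# List the lines that contain them, with line numbers. The -a option
# (GNU and BSD grep) forces text mode; note this pattern also flags tabs.
LC_ALL=C grep -a -n '[^ -~]' "$csv"
```

For the made-up sample this reports one suspect byte, octal 263. If your real
file shows only a few such bytes, their values should hint at which encoding
the database program used when it wrote the file.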

Date:   Thu, 14 Oct 2004 11:31:33 -0700 
From:   Scott Waichler <[EMAIL PROTECTED]>
To:   <[EMAIL PROTECTED]> 
Subject:   [R] Problem with number characters 

 
I am trying to process text fields scanned in from a csv file that is
output from the Windows database program FileMakerPro. The characters
onscreen look like regular text, but R does not like their underlying binary form.
For example, one of the text fields contains a name and a number, but
R recognizes the number as something other than what it appears
to be in plain text. The character string "Draszt 03" after being
read into R using scan and ="" becomes "Draszt 03" where the 3 is 
displayed in my R session as a superscript. Here is the result pasted
into this email I'm composing in emacs: "Draszt 0%/1€Œiso8859-15³"
Another clue for the knowledgeable: when I try to display the vector element
causing trouble, I get
<CHARSXP: "Draszt 0%/1€Œiso8859-15³">
where again the superscript part is just "3" in my R session. I'm working in
Linux, R version 1.9.1, 2004-06-21. Your help will be much appreciated.

Scott Waichler
Pacific Northwest National Laboratory
[EMAIL PROTECTED]

______________________________________________
[EMAIL PROTECTED] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
