The source documentation of the files states ASCII (ISO/IEC 8859-1) The file I am referring to can be found here: http://www.ars.usda.gov/SP2UserFiles/Place/12354500/Data/SR27/asc/NUTR_DEF.txt
There are fields marked as micro grams using that character. ~µg~ On Fri, Sep 19, 2014 at 10:32 PM, Ted Dunning <[email protected]> wrote: > What is the encoding of your file? UTF-8? Or ISO-latin-1? > > > > On Fri, Sep 19, 2014 at 9:47 AM, Jim Scott <[email protected]> wrote: > > > I have a delimited file and it is blowing up when I try to trim > characters > > from a field containing the lowercase greek Mu character. > > > > Failure while running fragment. Unexpected byte 0xb5 at position 1014 > > encountered while decoding UTF8 string > > > > I'm kind of confused by this, as it is properly identifying the > character, > > so I'm not sure how it could be unexpected. > > > > > -- *Jim Scott* Director, Enterprise Strategy & Architecture +1 (347) 746-9281 <http://www.mapr.com/> [image: MapR Technologies] <http://www.mapr.com>
