The source documentation of the files states ASCII (ISO/IEC 8859-1)
The file I am referring to can be found here:
http://www.ars.usda.gov/SP2UserFiles/Place/12354500/Data/SR27/asc/NUTR_DEF.txt

There are fields marked as micro grams using that character.
~µg~


On Fri, Sep 19, 2014 at 10:32 PM, Ted Dunning <[email protected]> wrote:

> What is the encoding of your file?  UTF-8?  Or ISO-latin-1?
>
>
>
> On Fri, Sep 19, 2014 at 9:47 AM, Jim Scott <[email protected]> wrote:
>
> > ​I have a delimited file and it is blowing up when I try to trim
> characters
> > from a field containing the lowercase greek Mu character.
> >
> > Failure while running fragment. Unexpected byte 0xb5 at position 1014
> > encountered while decoding UTF8 string
> >
> > I'm kind of confused by this, as it is properly identifying the
> character,
> > so I'm not sure how it could be unexpected.
> > ​
> >
>



-- 
*Jim Scott*
Director, Enterprise Strategy & Architecture
+1 (347) 746-9281

 <http://www.mapr.com/>
[image: MapR Technologies] <http://www.mapr.com>

Reply via email to