Hello,

I am comparing the mysql data (by dumping into a file) to the textfile
imported by sqoop onto HDFS.
Am using the diff tool to do the same.

I observed the following differences:
mysql      -->       sqoop_text_output
\\n                       \n
\\t                        \n
\$                        $
0                         0.0

So, it seems like mysql auto escapes the output with a \. I got around that
by telling mysql not to do that so that i can compare properly. I had to
pass the --raw flag to mysql. Then the only difference i currently see is
that 0 being converted to 0.0 by sqoop (as mentioned in the docs).

How can i make mysql also convert the 0 to a 0.0 when it dumps to a csv?
Maybe the answer lines in the guts of sqoop that i can use myself. Or is it
possible to tell sqoop not to convert 0 to 0.0 ?

All in all, i am trying to verify the work done by sqoop for my
satisfaction. Once i verify the text data is being exported fine, i will
verify the same for the parquet format.

Thanks,
~Pratik

Reply via email to