Hello, I am comparing MySQL data (dumped to a file) against the text file that sqoop imported onto HDFS. I am using the diff tool for the comparison.
I observed the following differences:

mysql    -->    sqoop_text_output
\\n             \n
\\t             \n
\$              $
0               0.0

So it seems that mysql automatically escapes its output with a backslash. I got around that by passing the --raw flag to mysql, which tells it not to escape, so that I can compare properly. The only difference I currently see is that 0 is converted to 0.0 by sqoop (as mentioned in the docs).

How can I make mysql also convert 0 to 0.0 when it dumps to a CSV? Maybe the answer lies in the guts of sqoop, in something I can use myself. Or is it possible to tell sqoop not to convert 0 to 0.0?

All in all, I am trying to verify the work done by sqoop for my own satisfaction. Once I verify that the text data is being exported fine, I will verify the same for the parquet format.

Thanks,
~Pratik
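
P.S. In case it helps to show what I mean, here is a rough sketch of the fallback I am considering: post-processing the mysql dump before running diff. This is not something sqoop or mysql provides. The function names, the tab delimiter, and the idea that I pass in the positions of the float/double columns myself are all my own assumptions. It only rewrites plain integers such as 0 or -5, and it splits naively on the delimiter, so it would break on fields that contain the delimiter.

```python
import re

# Matches plain integers such as "0", "42" or "-5" (assumption: no exponent, no sign prefix "+").
_INTEGER = re.compile(r"-?\d+")


def normalize_field(value):
    """Append ".0" to a plain integer so it reads like the sqoop text output."""
    if _INTEGER.fullmatch(value):
        return value + ".0"
    return value


def normalize_line(line, float_columns, delimiter="\t"):
    """Rewrite only the columns listed in float_columns (0-based positions)."""
    fields = line.rstrip("\n").split(delimiter)
    for index in float_columns:
        if index < len(fields):
            fields[index] = normalize_field(fields[index])
    return delimiter.join(fields)
```

For example, normalize_line("7\t0\tfoo", [1]) turns the second field into 0.0 and leaves the others alone. Values that already have a decimal point, and non-numeric values such as NULL, pass through unchanged.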
