embedded pig in the custer
Hi, I am using pig 0.14. How to run embedded pig (with PigServer) on the cluster which fully packs all classpath and default configuration for pig, hadoop, yarn, hdfs? With this simple solution java -cp myjar.jar mymainclass, it will definitely throw classnotfound exception, other exception. BR, Patcharee
Loading data from a CSV file which has '\n' character in a field
I am trying to load a csv file which has '\n' character in the field and Pig is considering that as a new record. I am missing the data in that particular column and getting additional records in the output table. I am using d = LOAD '/location/of/the/file/name_of_the_fiel.csv' USING org.apache.pig.piggybank.storage.CSVExcelStorage(',', 'YES_MULTILINE', 'UNIX', 'SKIP_INPUT_HEADER'); to allow the multi-line possibility in a field. Still I am facing the same issue. Where the data is shifting into next row. Appreciate any help. Thanks Sunil Kancharlapalli [http://www.cisco.com/web/europe/images/email/signature/logo05.jpg] Sunilmanohar Kancharlapalli Engineer - IT sunka...@cisco.com Phone: Cisco Systems Limited US Cisco.comhttp://www.cisco.com [Think before you print.]Think before you print. This email may contain confidential and privileged material for the sole use of the intended recipient. Any review, use, distribution or disclosure by others is strictly prohibited. If you are not the intended recipient (or authorized to receive for the recipient), please contact the sender by reply email and delete all copies of this message. For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/index.html