What's the best practice to create an external Hive table based on a CSV file on HDFS with 618 columns in the header?

Raymond Xie Mon, 23 Jul 2018 12:47:59 -0700

We are using Cloudera CDH 5.11

I have seen solutions for small xlsx files with only a handful of columns in
the header. In my case, the CSV file to be loaded into a new Hive table has
618 columns.


   1. Would the file be saved as Parquet by default if I upload it (after
      saving it as CSV first) through HUE -> File Browser? If not, where can
      I specify the file format?

   2. What would be the best way to create an external Impala table based on
      that location? Writing the DDL/schema by hand would be impractical
      with so many columns.
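
To illustrate the kind of thing I am hoping for: the DDL could perhaps be
generated from the header line instead of typed by hand. Below is a minimal
sketch, not a tested solution. It assumes the file is comma-delimited with a
single header row, that every column can initially be typed as STRING, and
that no data field contains an embedded comma. The function name, table name,
and HDFS directory are hypothetical placeholders.

```python
import csv
import io
import re


def build_external_table_ddl(header_line, table_name, hdfs_dir):
    """Build a CREATE EXTERNAL TABLE statement from a CSV header line.

    Assumptions: comma-delimited text, one header row, all columns STRING.
    """
    # Parse the header with the csv module so quoted column names are handled.
    raw_names = next(csv.reader(io.StringIO(header_line)))

    used = set()
    columns = []
    for raw in raw_names:
        # Lowercase, then replace anything that is not a letter, digit,
        # or underscore so the result is a safe identifier.
        base = re.sub(r"[^0-9a-z_]", "_", raw.strip().lower()) or "col"
        # Make duplicate names unique by appending a numeric suffix.
        name, n = base, 1
        while name in used:
            name = f"{base}_{n}"
            n += 1
        used.add(name)
        columns.append(f"  `{name}` STRING")

    return (
        f"CREATE EXTERNAL TABLE {table_name} (\n"
        + ",\n".join(columns)
        + "\n)\n"
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','\n"
        "STORED AS TEXTFILE\n"
        f"LOCATION '{hdfs_dir}'\n"
        "TBLPROPERTIES ('skip.header.line.count'='1');"
    )


if __name__ == "__main__":
    # Hypothetical three-column header standing in for the real 618 columns.
    print(build_external_table_ddl(
        "ID,First Name,first name", "demo.wide_csv", "/user/demo/wide_csv"))
```

The column types would still need to be adjusted by hand afterwards, since
the sketch types everything as STRING.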

Thank you very much.


*------------------------------------------------*
*Sincerely yours,*


*Raymond*
