RE: HiveContext ignores ("skip.header.line.count"="1")
I am not sure if we really want to support that with HiveContext, but a workround is to use the Spark package at https://github.com/databricks/spark-csv From: Felix Cheung [mailto:felixcheun...@hotmail.com] Sent: Tuesday, October 27, 2015 10:54 AM To: Daniel Haviv; user Subject: RE: HiveContext ignores ("skip.header.line.count"="1") Please open a JIRA? Date: Mon, 26 Oct 2015 15:32:42 +0200 Subject: HiveContext ignores ("skip.header.line.count"="1") From: daniel.ha...@veracity-group.com<mailto:daniel.ha...@veracity-group.com> To: user@spark.apache.org<mailto:user@spark.apache.org> Hi, I have a csv table in Hive which is configured to skip the header row using TBLPROPERTIES("skip.header.line.count"="1"). When querying from Hive the header row is not included in the data, but when running the same query via HiveContext I get the header row. I made sure that HiveContext sees the skip.header.line.count setting by running "show create table" Any ideas? Thank you. Daniel
Re: HiveContext ignores ("skip.header.line.count"="1")
I will Thank you. > On 27 באוק׳ 2015, at 4:54, Felix Cheungwrote: > > Please open a JIRA? > > > Date: Mon, 26 Oct 2015 15:32:42 +0200 > Subject: HiveContext ignores ("skip.header.line.count"="1") > From: daniel.ha...@veracity-group.com > To: user@spark.apache.org > > Hi, > I have a csv table in Hive which is configured to skip the header row using > TBLPROPERTIES("skip.header.line.count"="1"). > When querying from Hive the header row is not included in the data, but when > running the same query via HiveContext I get the header row. > > I made sure that HiveContext sees the skip.header.line.count setting by > running "show create table" > > Any ideas? > > Thank you. > Daniel
RE: HiveContext ignores ("skip.header.line.count"="1")
Please open a JIRA? Date: Mon, 26 Oct 2015 15:32:42 +0200 Subject: HiveContext ignores ("skip.header.line.count"="1") From: daniel.ha...@veracity-group.com To: user@spark.apache.org Hi,I have a csv table in Hive which is configured to skip the header row using TBLPROPERTIES("skip.header.line.count"="1").When querying from Hive the header row is not included in the data, but when running the same query via HiveContext I get the header row. I made sure that HiveContext sees the skip.header.line.count setting by running "show create table" Any ideas? Thank you. Daniel