Re: Getting all files of a table

2015-12-01 Thread Krzysztof Zarzycki
Great that worked! The only problem was that it returned all the files including _SUCCESS and _metadata, but I filtered only the *.parquet Thanks Michael, Krzysztof 2015-12-01 20:20 GMT+01:00 Michael Armbrust : > sqlContext.table("...").inputFiles > > (this is best

Getting all files of a table

2015-12-01 Thread Krzysztof Zarzycki
Hi there, Do you know how easily I can get a list of all files of a Hive table? What I want to achieve is to get all files that are underneath parquet table and using sparksql-protobuf[1] library(really handy library!) and its helper class ProtoParquetRDD: val protobufsRdd = new

Re: Getting all files of a table

2015-12-01 Thread Michael Armbrust
sqlContext.table("...").inputFiles (this is best effort, but should work for hive tables). Michael On Tue, Dec 1, 2015 at 10:55 AM, Krzysztof Zarzycki wrote: > Hi there, > Do you know how easily I can get a list of all files of a Hive table? > > What I want to achieve is