Also, I'm using phoenixTableAsDataFrame API to read from a pre-split phoenix table. How can we ensure read is parallelized across all executors? Would salting/pre-splitting tables help in providing parallelism? Appreciate any inputs.
Thanks Kanagha On Wed, Aug 16, 2017 at 10:16 PM, kanagha <er.kana...@gmail.com> wrote: > Hi Josh, > > Per your previous post, it is mentioned "The phoenix-spark parallelism is > based on the splits provided by the Phoenix query planner, and has no > requirements on specifying partition columns or upper/lower bounds." > > Does it depend upon the region splits on the input table for parallelism? > Could you please provide more details? > > > Thanks > > > > -- > View this message in context: http://apache-phoenix-user- > list.1124778.n5.nabble.com/phoenix-spark-options-not- > supporint-query-in-dbtable-tp1915p3810.html > Sent from the Apache Phoenix User List mailing list archive at Nabble.com. >