Re: Spark Streaming : Multiple sources found for csv : Error

2018-08-30 Thread Srabasti Banerjee
Hi Jorn, Do you have suggestions as to how to do that? The conflicting packages are being picked up by default from pom.xml. I am not invoking any additional packages while running spark submit on the thin jar. ThanksSrabasti Banerjee On Thursday, 30 August, 2018, 9:45:36 PM GMT-7, Jörn

Re: Spark Streaming : Multiple sources found for csv : Error

2018-08-30 Thread Srabasti Banerjee
Great we are already discussing/working to fix the issue.Happy to help if I can :-) Any workarounds that we can use for now? Please note I am not invoking any additional packages while running spark submit on the thin jar. Thanks,Srabasti Banerjee On Thursday, 30 August, 2018, 9:02:11

Re: Spark Streaming : Multiple sources found for csv : Error

2018-08-30 Thread Jörn Franke
Can’t you remove the dependency to the databricks CSV data source? Spark has them now integrated since some versions so it is not needed. > On 31. Aug 2018, at 05:52, Srabasti Banerjee > wrote: > > Hi, > > I am trying to run below code to read file as a dataframe onto a Stream (for > Spark

Re: Spark Streaming : Multiple sources found for csv : Error

2018-08-30 Thread Hyukjin Kwon
Yea, this is exactly what I have been worried of the recent changes (discussed in https://issues.apache.org/jira/browse/SPARK-24924) See https://github.com/apache/spark/pull/17916. This should be fine in upper Spark versions. FYI, +Wechen and Dongjoon I want to add Thomas Graves and Gengliang

Spark Streaming : Multiple sources found for csv : Error

2018-08-30 Thread Srabasti Banerjee
Hi, I am trying to run below code to read file as a dataframe onto a Stream (for Spark Streaming) developed via Eclipse IDE, defining schemas appropriately, by running thin jar on server and am getting error below. Tried out suggestions from researching on internet based on