Hi Jorn,
Do you have suggestions as to how to do that?
The conflicting packages are being picked up by default from pom.xml. I am not
invoking any additional packages while running spark submit on the thin jar.
ThanksSrabasti Banerjee
On Thursday, 30 August, 2018, 9:45:36 PM GMT-7, Jörn
Great we are already discussing/working to fix the issue.Happy to help if I can
:-)
Any workarounds that we can use for now?
Please note I am not invoking any additional packages while running spark
submit on the thin jar.
Thanks,Srabasti Banerjee
On Thursday, 30 August, 2018, 9:02:11
Can’t you remove the dependency to the databricks CSV data source? Spark has
them now integrated since some versions so it is not needed.
> On 31. Aug 2018, at 05:52, Srabasti Banerjee
> wrote:
>
> Hi,
>
> I am trying to run below code to read file as a dataframe onto a Stream (for
> Spark
Yea, this is exactly what I have been worried of the recent changes
(discussed in https://issues.apache.org/jira/browse/SPARK-24924)
See https://github.com/apache/spark/pull/17916. This should be fine in
upper Spark versions.
FYI, +Wechen and Dongjoon
I want to add Thomas Graves and Gengliang
Hi,
I am trying to run below code to read file as a dataframe onto a Stream (for
Spark Streaming) developed via Eclipse IDE, defining schemas appropriately, by
running thin jar on server and am getting error below. Tried out suggestions
from researching on internet based on
+1 on beta support for scala 2.12
On Thu, Aug 30, 2018 at 2:33 PM, Stavros Kontopoulos <
stavros.kontopou...@lightbend.com> wrote:
> +1 that would be great Sean, also you put a lot of effort in there, would
> make sense to wait a bit.
>
> Stavros
>
> On Fri, Aug 31, 2018 at 12:00 AM, Sean Owen
+1 that would be great Sean, also you put a lot of effort in there, would
make sense to wait a bit.
Stavros
On Fri, Aug 31, 2018 at 12:00 AM, Sean Owen wrote:
> I know it's famous last words, but we really might be down to the last
> fix: https://github.com/apache/spark/pull/22264 More a
I know it's famous last words, but we really might be down to the last fix:
https://github.com/apache/spark/pull/22264 More a question of making tests
happy at this point I think than fundamental problems. My goal is to make
sure we can release a usable, but beta-quality, 2.12 release of Spark in
Let's see how they go. At some point we do need to cut the release. That
argument can be made on every feature, and different people place different
value / importance on different features, so we could just end up never
making a release.
On Thu, Aug 30, 2018 at 1:56 PM antonkulaga wrote:
>
I wanted to call any interested eyes to this discussion:
https://github.com/apache/spark/pull/22179
What are you interested in accomplishing?
The spark.ml package has provided a machine learning API based on
DataFrames for quite some time. If you are interested in mixing query
processing and machine learning, this is certainly the best place to start.
See here:
Is there a plan to support SQL extensions for mllib? Or is there an effort
already underway?
Any information is appreciated.
Thanks in advance.
Hemant
12 matches
Mail list logo