temed,
greets_stemed, greeted_stemed] |
+--+
---+
From: Michael Armbrust <mich...@databricks.com>
Date: Tuesday, January 5, 2016 at 12:58 PM
To: Andrew Davidson <a...@santacruzintegration.com>
Cc: "user @spark" <user@spark.apache.org>
Subject: Re: problem with DataFrame
>
> I really appreciate your help. The following code works.
>
Glad you got it to work!
> Is there a way this example can be added to the distribution to make it
> easier for future Java programmers? It took me a long time to get to this
> simple solution.
>
I'd welcome a pull request that added
> From: Michael Armbrust <mich...@databricks.com>
> Date: Wednesday, January 6, 2016 at 11:09 AM
> To: Andrew Davidson <a...@santacruzintegration.com>
> Cc: "user @spark" <user@spark.apache.org>
> Subject: Re: problem with DataFrame df.withColumn()
> org.apache.spark.sql.AnalysisException: resolved attribute(s) missing
> I am trying to implement the org.apache.spark.ml.Transformer interface in
> Java 8.
>
> My understanding is that the pseudocode for transformers is something like
>
> @Override
> public DataFrame transform(DataFrame df) {
>   1. Select the input column
>   2. Create a new column
>   3. Append the
I am having a heck of a time writing a simple transformer in Java. I assume
that my Transformer is supposed to append a new column to the dataFrame
argument. Any idea why I get the following exception in Java 8 when I try to
call DataFrame withColumn()? The JavaDoc says withColumn() "Returns a new
It's not really possible to convert an RDD to a Column. You can think of a
Column as an expression that produces a single output given some set of
input columns. If I understand your code correctly, I think this might be
easier to express as a UDF:
sqlContext.udf().register("stem", new
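
To illustrate the idea that a Column is a per-row expression rather than a dataset, here is a minimal sketch in plain Java. It deliberately does not use the Spark API: rows are modeled as maps, and the names ColumnSketch, stem, and withColumn are hypothetical and chosen only for this illustration. The toy stem function is an assumption as well and is not a real stemming algorithm.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Conceptual sketch only: plain Java collections, not the Spark API.
final class ColumnSketch {

    // Hypothetical toy stemmer: strips a trailing "ed" or "s".
    // Not a real stemming algorithm.
    static String stem(String word) {
        if (word.endsWith("ed") && word.length() > 3) {
            return word.substring(0, word.length() - 2);
        }
        if (word.endsWith("s") && word.length() > 2) {
            return word.substring(0, word.length() - 1);
        }
        return word;
    }

    // Analogue of withColumn: evaluate a per-row expression and
    // append its single output under a new column name.
    static List<Map<String, Object>> withColumn(
            List<Map<String, Object>> rows,
            String name,
            Function<Map<String, Object>, Object> expr) {
        List<Map<String, Object>> out = new ArrayList<>();
        for (Map<String, Object> row : rows) {
            // Copy the row so the input "table" is left untouched.
            Map<String, Object> copy = new LinkedHashMap<>(row);
            copy.put(name, expr.apply(row));
            out.add(copy);
        }
        return out;
    }

    public static void main(String[] args) {
        List<Map<String, Object>> rows = new ArrayList<>();
        for (String w : new String[] {"greet", "greets", "greeted"}) {
            Map<String, Object> row = new LinkedHashMap<>();
            row.put("word", w);
            rows.add(row);
        }
        // The "column" is just a function of the input columns of one row.
        List<Map<String, Object>> result =
                withColumn(rows, "stemmed", r -> stem((String) r.get("word")));
        result.forEach(System.out::println);
    }
}
```

The expression passed to withColumn only ever sees one row at a time, which is why a separately computed RDD cannot stand in for it. A UDF plays the same role in Spark: it wraps a row-level function so it can be used as a column expression.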