Re: Appending a static dataframe to a stream-created Parquet file fails

2021-09-05 Thread eugen . wintersberger
> > "parquet")
> >   .option("path", path.toString)
> >   .outputMode("append")
> >   .start()
> >   .processAllAvailable()
> > spark.read.format("parquet").load(path.toString).count mustBe 1159
> >
> > logLinesDF.write.format("parquet").mode("append").save(path.toString)
> > spark.read.format("parquet").load(path.toString).count mustBe 2*1159
> > }
> >
> > Does anyone have an idea what I am doing wrong here?
> >
> > Thanks in advance,
> > Eugen Wintersberger

Appending a static dataframe to a stream-created Parquet file fails

2021-09-02 Thread eugen . wintersberger
ing).count mustBe 2*1159
}

Does anyone have an idea what I am doing wrong here?

Thanks in advance,
Eugen Wintersberger

Re: Append to an existing Delta Lake using structured streaming

2021-07-21 Thread eugen . wintersberger
21, 2021 at 2:49 AM, > wrote:
> > Hi all,
> >
> > I stumbled upon an interesting problem. I have an existing Delta Lake with data recovered from a backup and would like to append to this Delta Lake using Spark structured streaming. This does not wor

Append to an existing Delta Lake using structured streaming

2021-07-21 Thread eugen . wintersberger
file with structured streaming, then appending to this file with a streaming job (at least with the same job) works flawlessly. Did I misunderstand something here?

Best regards,
Eugen Wintersberger

Kafka structured streaming - how to read headers

2020-12-03 Thread eugen . wintersberger
Hi folks,

I am trying to read the message headers from a Kafka structured stream, which should be stored in a column named ``headers``. I tried something like this:

val stream = sparkSession.readStream.format("kafka")..load()
stream.map(row => {
  ...
  val headers =

Fitting only the intercept for LinearRegression

2020-03-21 Thread eugen . wintersberger
Hi,

I was wondering if it would be possible to fit only the intercept on a LinearRegression instance by providing a known coefficient?

Here is some background information: we have a problem where linear regression is well suited as a predictor. However, the model requires continuous adaptation.
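
To make the intercept question concrete, here is a minimal sketch in plain Scala of what "fitting only the intercept" amounts to when the coefficients are held fixed: under least squares, the best intercept is the mean of the residuals y - w·x. This is not a Spark ML API; the object `InterceptOnlyFit`, the helper `fitIntercept`, and the sample data are hypothetical names and values chosen for illustration.

```scala
object InterceptOnlyFit {

  /** Least-squares intercept for y ≈ w·x + b when the coefficients w are fixed.
    * Hypothetical helper for illustration; not part of Spark ML.
    */
  def fitIntercept(
      features: Seq[Array[Double]],
      labels: Seq[Double],
      coefficients: Array[Double]): Double = {
    require(features.nonEmpty, "need at least one sample")
    require(features.length == labels.length, "features and labels must align")

    // Residual of each sample with respect to the known coefficients.
    val residuals = features.zip(labels).map { case (x, y) =>
      require(x.length == coefficients.length, "feature dimension mismatch")
      y - x.zip(coefficients).map { case (xi, wi) => xi * wi }.sum
    }

    // The mean residual minimises the squared error over b.
    residuals.sum / residuals.length
  }

  def main(args: Array[String]): Unit = {
    // Made-up data generated from y = 2x + 3, with the slope 2.0 assumed known.
    val xs = Seq(Array(1.0), Array(2.0), Array(3.0))
    val ys = Seq(5.0, 7.0, 9.0)
    println(fitIntercept(xs, ys, Array(2.0))) // prints 3.0
  }
}
```

Because the update is just an average, it can be recomputed cheaply whenever new data arrives, which may suit a model that has to be refreshed continuously.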