Re: Print on screen DataStream content

2020-11-24 Thread Simone Cavallarin
ok, thanks you all for the help! s


From: David Anderson 
Sent: 24 November 2020 15:16
To: Simone Cavallarin 
Cc: user@flink.apache.org 
Subject: Re: Print on screen DataStream content

Simone,

What you want to do is to override the toString() method on Event so that it 
produces a more helpful String as its result, and then use

stream.print()

in your IDE (where stream is a DataStream).

By the way, printOrTest(stream) isn't part of Flink -- that's just something 
used by the training exercises.

Best,
David

On Tue, Nov 24, 2020 at 4:08 PM Simone Cavallarin 
mailto:cavalla...@hotmail.com>> wrote:
Hi,

yes, I would like to debug locally on my IDE.

This is what I tried so far, but no luck.

a)String ff = result.toString();
System.out.print(ff);

 b)   printOrTest(stream);

 c)stream.print();

  d)   System.out.println(stream.print());


This is the output and to me it looks like an "IDcode" that indicate each Event 
on my application but i don't know how to access the data inside. My Event is 
wrapped in a POJO. So each Event is something like this ['tom', 1.70, 55, 
1929219939, M]

p1.Event@1ca90014
p1.Event@5e550c0c
p1.Event@402baa28
p1.Event@5cf8ae00
p1.Event@805bc2b
p1.Event@7e46f44b
p1.Event@63a17b41

Apologies, but I really don't understand if I write `DataStream#print()` I just 
receive an error... i'm missing something here..
On the example there are System.out.println().
[cid:175facf17c7cb971f161]
Thanks

From: Timo Walther mailto:twal...@apache.org>>
Sent: 24 November 2020 11:50
To: user@flink.apache.org<mailto:user@flink.apache.org> 
mailto:user@flink.apache.org>>
Subject: Re: Print on screen DataStream content

Hi Simone,

if you are just executing DataStream pipelines locally in your IDE while
prototyping. You should be able to use `DataStream#print()` which just
prints to standard out [1] (It might be hidden between the log messages).

For debugging locally, you can also just set breakpoints in your
functions like in `StatefulSessionCalculator` and use the debugging mode
of your IDE.

Regards,
Timo

[1]
https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Fflink%2Fblob%2Fmaster%2Fflink-examples%2Fflink-examples-streaming%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fflink%2Fstreaming%2Fexamples%2Fwordcount%2FWordCount.javadata=04%7C01%7C%7Ce156804411e14d71ebec08d8906f357a%7C84df9e7fe9f640afb435%7C1%7C0%7C637418154590426156%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000sdata=OLLSzUOK5GhE9ES5Y7XsOXS2KKhUdZfcOkuDFFDfML8%3Dreserved=0<https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Fflink%2Fblob%2Fmaster%2Fflink-examples%2Fflink-examples-streaming%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fflink%2Fstreaming%2Fexamples%2Fwordcount%2FWordCount.java=04%7C01%7C%7C609e34a624744519c61108d8908bf2bd%7C84df9e7fe9f640afb435%7C1%7C0%7C637418278029448824%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000=4WKJVCOaSusacMdu7vykjToOKC%2FlF3BmK9Zzii1ZysM%3D=0>


On 24.11.20 11:09, Simone Cavallarin wrote:
> I tried to `DataStream#print()` but I don't quite understand how to
> implement it. Could you please give me an example? I'm using Intellij so
> what I would need is just to see the data on my screen.
>
> Thanks
>
> 
> *From:* David Anderson mailto:da...@alpinegizmo.com>>
> *Sent:* 24 November 2020 10:01
> *To:* Pankaj Chand 
> mailto:pankajchanda...@gmail.com>>
> *Cc:* Austin Cawley-Edwards 
> mailto:austin.caw...@gmail.com>>; Simone Cavallarin
> mailto:cavalla...@hotmail.com>>; 
> user@flink.apache.org<mailto:user@flink.apache.org> 
> mailto:user@flink.apache.org>>
> *Subject:* Re: Print on screen DataStream content
> When Flink is running on a cluster, `DataStream#print()` prints to files
> in the log directory.
>
> Regards,
> David
>
> On Tue, Nov 24, 2020 at 6:03 AM Pankaj Chand 
> mailto:pankajchanda...@gmail.com>
> <mailto:pankajchanda...@gmail.com>> wrote:
>
> Please correct me if I am wrong. `DataStream#print()` only prints to
> the screen when running from the IDE, but does not work (print to
> the screen) when running on a cluster (even a local cluster).
>
> Thanks,
>
> Pankaj
>
> On Mon, Nov 23, 2020 at 5:31 PM Austin Cawley-Edwards
> mailto:austin.caw...@gmail.com> 
> <mailto:austin.caw...@gmail.com>> wrote:
>
> Hey Simone,
>
> I'd suggest trying out the `DataStream#print()` function to
> start, but there are a few other easy-to-integrate sinks for
> testing that y

Re: Print on screen DataStream content

2020-11-24 Thread David Anderson
Simone,

What you want to do is to override the toString() method on Event so that
it produces a more helpful String as its result, and then use

stream.print()

in your IDE (where stream is a DataStream).

By the way, printOrTest(stream) isn't part of Flink -- that's just
something used by the training exercises.

Best,
David

On Tue, Nov 24, 2020 at 4:08 PM Simone Cavallarin 
wrote:

> Hi,
>
> yes, I would like to debug locally on my IDE.
>
> This is what I tried so far, but no luck.
>
> a)String ff = result.toString();
> System.out.print(ff);
>
>  b)   printOrTest(stream);
>
>  c)stream.print();
>
>   d)   System.out.println(stream.print());
>
>
> This is the output and to me it looks like an "IDcode" that indicate each
> Event on my application but i don't know how to access the data inside. My
> Event is wrapped in a POJO. So each Event is something like this ['tom',
> 1.70, 55, 1929219939, M]
>
> p1.Event@1ca90014
> p1.Event@5e550c0c
> p1.Event@402baa28
> p1.Event@5cf8ae00
> p1.Event@805bc2b
> p1.Event@7e46f44b
> p1.Event@63a17b41
>
> Apologies, but I really don't understand if I write `DataStream#print()` I
> just receive an error... i'm missing something here..
> On the example there are System.out.println().
>
> Thanks
> --
> *From:* Timo Walther 
> *Sent:* 24 November 2020 11:50
> *To:* user@flink.apache.org 
> *Subject:* Re: Print on screen DataStream content
>
> Hi Simone,
>
> if you are just executing DataStream pipelines locally in your IDE while
> prototyping. You should be able to use `DataStream#print()` which just
> prints to standard out [1] (It might be hidden between the log messages).
>
> For debugging locally, you can also just set breakpoints in your
> functions like in `StatefulSessionCalculator` and use the debugging mode
> of your IDE.
>
> Regards,
> Timo
>
> [1]
>
> https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Fflink%2Fblob%2Fmaster%2Fflink-examples%2Fflink-examples-streaming%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fflink%2Fstreaming%2Fexamples%2Fwordcount%2FWordCount.javadata=04%7C01%7C%7Ce156804411e14d71ebec08d8906f357a%7C84df9e7fe9f640afb435%7C1%7C0%7C637418154590426156%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000sdata=OLLSzUOK5GhE9ES5Y7XsOXS2KKhUdZfcOkuDFFDfML8%3Dreserved=0
>
>
> On 24.11.20 11:09, Simone Cavallarin wrote:
> > I tried to `DataStream#print()` but I don't quite understand how to
> > implement it. Could you please give me an example? I'm using Intellij so
> > what I would need is just to see the data on my screen.
> >
> > Thanks
> >
> > ------------
> > *From:* David Anderson 
> > *Sent:* 24 November 2020 10:01
> > *To:* Pankaj Chand 
> > *Cc:* Austin Cawley-Edwards ; Simone
> Cavallarin
> > ; user@flink.apache.org 
> > *Subject:* Re: Print on screen DataStream content
> > When Flink is running on a cluster, `DataStream#print()` prints to files
> > in the log directory.
> >
> > Regards,
> > David
> >
> > On Tue, Nov 24, 2020 at 6:03 AM Pankaj Chand  > <mailto:pankajchanda...@gmail.com >> wrote:
> >
> > Please correct me if I am wrong. `DataStream#print()` only prints to
> > the screen when running from the IDE, but does not work (print to
> > the screen) when running on a cluster (even a local cluster).
> >
> > Thanks,
> >
> > Pankaj
> >
> > On Mon, Nov 23, 2020 at 5:31 PM Austin Cawley-Edwards
> > mailto:austin.caw...@gmail.com
> >> wrote:
> >
> > Hey Simone,
> >
> > I'd suggest trying out the `DataStream#print()` function to
> > start, but there are a few other easy-to-integrate sinks for
> > testing that you can check out in the docs here[1]
> >
> > Best,
> > Austin
> >
> > [1]:
> >
> https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fci.apache.org%2Fprojects%2Fflink%2Fflink-docs-release-1.11%2Fdev%2Fdatastream_api.html%23data-sinksdata=04%7C01%7C%7Ce156804411e14d71ebec08d8906f357a%7C84df9e7fe9f640afb435%7C1%7C0%7C637418154590426156%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000sdata=eJzrCz1ZOIY%2FHp3Vy7vnhOSukKO9%2F58lxHOSdJSj9bY%3Dreserved=0
> > <
> https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fci.apache.org%2Fprojects%2Fflink%2Fflink-docs-release-1.11%2Fdev%2Fdatastream_api.h

Re: Print on screen DataStream content

2020-11-24 Thread Simone Cavallarin
Hi,

yes, I would like to debug locally on my IDE.

This is what I tried so far, but no luck.

a)String ff = result.toString();
System.out.print(ff);

 b)   printOrTest(stream);

 c)stream.print();

  d)   System.out.println(stream.print());


This is the output and to me it looks like an "IDcode" that indicate each Event 
on my application but i don't know how to access the data inside. My Event is 
wrapped in a POJO. So each Event is something like this ['tom', 1.70, 55, 
1929219939, M]

p1.Event@1ca90014
p1.Event@5e550c0c
p1.Event@402baa28
p1.Event@5cf8ae00
p1.Event@805bc2b
p1.Event@7e46f44b
p1.Event@63a17b41

Apologies, but I really don't understand if I write `DataStream#print()` I just 
receive an error... i'm missing something here..
On the example there are System.out.println().
[cid:cd68857a-2550-4e16-b381-554f86d3812f]
Thanks

From: Timo Walther 
Sent: 24 November 2020 11:50
To: user@flink.apache.org 
Subject: Re: Print on screen DataStream content

Hi Simone,

if you are just executing DataStream pipelines locally in your IDE while
prototyping. You should be able to use `DataStream#print()` which just
prints to standard out [1] (It might be hidden between the log messages).

For debugging locally, you can also just set breakpoints in your
functions like in `StatefulSessionCalculator` and use the debugging mode
of your IDE.

Regards,
Timo

[1]
https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Fflink%2Fblob%2Fmaster%2Fflink-examples%2Fflink-examples-streaming%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fflink%2Fstreaming%2Fexamples%2Fwordcount%2FWordCount.javadata=04%7C01%7C%7Ce156804411e14d71ebec08d8906f357a%7C84df9e7fe9f640afb435%7C1%7C0%7C637418154590426156%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000sdata=OLLSzUOK5GhE9ES5Y7XsOXS2KKhUdZfcOkuDFFDfML8%3Dreserved=0


On 24.11.20 11:09, Simone Cavallarin wrote:
> I tried to `DataStream#print()` but I don't quite understand how to
> implement it. Could you please give me an example? I'm using Intellij so
> what I would need is just to see the data on my screen.
>
> Thanks
>
> 
> *From:* David Anderson 
> *Sent:* 24 November 2020 10:01
> *To:* Pankaj Chand 
> *Cc:* Austin Cawley-Edwards ; Simone Cavallarin
> ; user@flink.apache.org 
> *Subject:* Re: Print on screen DataStream content
> When Flink is running on a cluster, `DataStream#print()` prints to files
> in the log directory.
>
> Regards,
> David
>
> On Tue, Nov 24, 2020 at 6:03 AM Pankaj Chand  <mailto:pankajchanda...@gmail.com>> wrote:
>
> Please correct me if I am wrong. `DataStream#print()` only prints to
> the screen when running from the IDE, but does not work (print to
> the screen) when running on a cluster (even a local cluster).
>
> Thanks,
>
> Pankaj
>
> On Mon, Nov 23, 2020 at 5:31 PM Austin Cawley-Edwards
> mailto:austin.caw...@gmail.com>> wrote:
>
> Hey Simone,
>
> I'd suggest trying out the `DataStream#print()` function to
> start, but there are a few other easy-to-integrate sinks for
> testing that you can check out in the docs here[1]
>
> Best,
> Austin
>
> [1]:
> 
> https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fci.apache.org%2Fprojects%2Fflink%2Fflink-docs-release-1.11%2Fdev%2Fdatastream_api.html%23data-sinksdata=04%7C01%7C%7Ce156804411e14d71ebec08d8906f357a%7C84df9e7fe9f640afb435%7C1%7C0%7C637418154590426156%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000sdata=eJzrCz1ZOIY%2FHp3Vy7vnhOSukKO9%2F58lxHOSdJSj9bY%3Dreserved=0
> 
> <https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fci.apache.org%2Fprojects%2Fflink%2Fflink-docs-release-1.11%2Fdev%2Fdatastream_api.html%23data-sinksdata=04%7C01%7C%7Ce156804411e14d71ebec08d8906f357a%7C84df9e7fe9f640afb435%7C1%7C0%7C637418154590426156%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000sdata=eJzrCz1ZOIY%2FHp3Vy7vnhOSukKO9%2F58lxHOSdJSj9bY%3Dreserved=0>
>
> On Mon, Nov 23, 2020 at 3:56 PM Simone Cavallarin
> mailto:cavalla...@hotmail.com>> wrote:
>
> Hi All,
>
> On my code I have a DataStream that I would like to access.
> I need to understand what I'm getting for each
> transformation to check if the data that I'm working on make
> sense. How can I print into the console or get a file (csv,
> txt) for the variables: "stream", "enriched&qu

Re: Print on screen DataStream content

2020-11-24 Thread Timo Walther

Hi Simone,

if you are just executing DataStream pipelines locally in your IDE while 
prototyping. You should be able to use `DataStream#print()` which just 
prints to standard out [1] (It might be hidden between the log messages).


For debugging locally, you can also just set breakpoints in your 
functions like in `StatefulSessionCalculator` and use the debugging mode 
of your IDE.


Regards,
Timo

[1] 
https://github.com/apache/flink/blob/master/flink-examples/flink-examples-streaming/src/main/java/org/apache/flink/streaming/examples/wordcount/WordCount.java



On 24.11.20 11:09, Simone Cavallarin wrote:
I tried to `DataStream#print()` but I don't quite understand how to 
implement it. Could you please give me an example? I'm using Intellij so 
what I would need is just to see the data on my screen.


Thanks


*From:* David Anderson 
*Sent:* 24 November 2020 10:01
*To:* Pankaj Chand 
*Cc:* Austin Cawley-Edwards ; Simone Cavallarin 
; user@flink.apache.org 

*Subject:* Re: Print on screen DataStream content
When Flink is running on a cluster, `DataStream#print()` prints to files 
in the log directory.


Regards,
David

On Tue, Nov 24, 2020 at 6:03 AM Pankaj Chand <mailto:pankajchanda...@gmail.com>> wrote:


Please correct me if I am wrong. `DataStream#print()` only prints to
the screen when running from the IDE, but does not work (print to
the screen) when running on a cluster (even a local cluster).

Thanks,

Pankaj

On Mon, Nov 23, 2020 at 5:31 PM Austin Cawley-Edwards
mailto:austin.caw...@gmail.com>> wrote:

Hey Simone,

I'd suggest trying out the `DataStream#print()` function to
start, but there are a few other easy-to-integrate sinks for
testing that you can check out in the docs here[1]

Best,
Austin

[1]:

https://ci.apache.org/projects/flink/flink-docs-release-1.11/dev/datastream_api.html#data-sinks

<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fci.apache.org%2Fprojects%2Fflink%2Fflink-docs-release-1.11%2Fdev%2Fdatastream_api.html%23data-sinks=04%7C01%7C%7C27267479d99245bad55408d8905fea55%7C84df9e7fe9f640afb435%7C1%7C0%7C637418088905500774%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000=9YFLY5slrok3wXJ5n1F3s0BNYuzNs%2F70RLWLsWJCkzE%3D=0>

On Mon, Nov 23, 2020 at 3:56 PM Simone Cavallarin
mailto:cavalla...@hotmail.com>> wrote:

Hi All,

On my code I have a DataStream that I would like to access.
I need to understand what I'm getting for each
transformation to check if the data that I'm working on make
sense. How can I print into the console or get a file (csv,
txt) for the variables: "stream", "enriched" and "result"?

I have tried different way but no way to get the data.

Thanks!
/
/
/
/
/        FlinkKafkaConsumer kafkaData =/
/                new FlinkKafkaConsumer("CorID_1", new
EventDeserializationSchema(), p);/
/        WatermarkStrategy wmStrategy =/
/                WatermarkStrategy/
/                        .forMonotonousTimestamps()/
/                        .withIdleness(Duration.ofMinutes(1))/
/                        .withTimestampAssigner((event,
timestamp) -> {/
/                            return event.get_Time();/
/                        });/
/        DataStream *stream *= env.addSource(/
/   
kafkaData.assignTimestampsAndWatermarks(wmStrategy));/

/
/
/        DataStream> *enriched* = stream/
/                .keyBy((Event KafkaMSG) -> KafkaMSG.CorrID)/
/                .map(new StatefulSessionCalculator());/
/
/
/        WindowedStream, String,
TimeWindow> *result* = enriched/
/                .keyBy(new MyKeySelector())/
/   
.window(EventTimeSessionWindows.withDynamicGap(new

DynamicSessionWindows()));/





Re: Print on screen DataStream content

2020-11-24 Thread Simone Cavallarin
I tried to `DataStream#print()` but I don't quite understand how to implement 
it. Could you please give me an example? I'm using Intellij so what I would 
need is just to see the data on my screen.

Thanks


From: David Anderson 
Sent: 24 November 2020 10:01
To: Pankaj Chand 
Cc: Austin Cawley-Edwards ; Simone Cavallarin 
; user@flink.apache.org 
Subject: Re: Print on screen DataStream content

When Flink is running on a cluster, `DataStream#print()` prints to files in the 
log directory.

Regards,
David

On Tue, Nov 24, 2020 at 6:03 AM Pankaj Chand 
mailto:pankajchanda...@gmail.com>> wrote:
Please correct me if I am wrong. `DataStream#print()` only prints to the screen 
when running from the IDE, but does not work (print to the screen) when running 
on a cluster (even a local cluster).

Thanks,

Pankaj

On Mon, Nov 23, 2020 at 5:31 PM Austin Cawley-Edwards 
mailto:austin.caw...@gmail.com>> wrote:
Hey Simone,

I'd suggest trying out the `DataStream#print()` function to start, but there 
are a few other easy-to-integrate sinks for testing that you can check out in 
the docs here[1]

Best,
Austin

[1]: 
https://ci.apache.org/projects/flink/flink-docs-release-1.11/dev/datastream_api.html#data-sinks<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fci.apache.org%2Fprojects%2Fflink%2Fflink-docs-release-1.11%2Fdev%2Fdatastream_api.html%23data-sinks=04%7C01%7C%7C27267479d99245bad55408d8905fea55%7C84df9e7fe9f640afb435%7C1%7C0%7C637418088905500774%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000=9YFLY5slrok3wXJ5n1F3s0BNYuzNs%2F70RLWLsWJCkzE%3D=0>

On Mon, Nov 23, 2020 at 3:56 PM Simone Cavallarin 
mailto:cavalla...@hotmail.com>> wrote:
Hi All,

On my code I have a DataStream that I would like to access. I need to 
understand what I'm getting for each transformation to check if the data that 
I'm working on make sense. How can I print into the console or get a file (csv, 
txt) for the variables: "stream", "enriched" and "result"?

I have tried different way but no way to get the data.

Thanks!


FlinkKafkaConsumer kafkaData =
new FlinkKafkaConsumer("CorID_1", new 
EventDeserializationSchema(), p);
WatermarkStrategy wmStrategy =
WatermarkStrategy
.forMonotonousTimestamps()
.withIdleness(Duration.ofMinutes(1))
.withTimestampAssigner((event, timestamp) -> {
return event.get_Time();
});
DataStream stream = env.addSource(
kafkaData.assignTimestampsAndWatermarks(wmStrategy));

DataStream> enriched = stream
.keyBy((Event KafkaMSG) -> KafkaMSG.CorrID)
.map(new StatefulSessionCalculator());

WindowedStream, String, TimeWindow> result = 
enriched
.keyBy(new MyKeySelector())
.window(EventTimeSessionWindows.withDynamicGap(new 
DynamicSessionWindows()));


Re: Print on screen DataStream content

2020-11-24 Thread David Anderson
When Flink is running on a cluster, `DataStream#print()` prints to files in
the log directory.

Regards,
David

On Tue, Nov 24, 2020 at 6:03 AM Pankaj Chand 
wrote:

> Please correct me if I am wrong. `DataStream#print()` only prints to the
> screen when running from the IDE, but does not work (print to the screen)
> when running on a cluster (even a local cluster).
>
> Thanks,
>
> Pankaj
>
> On Mon, Nov 23, 2020 at 5:31 PM Austin Cawley-Edwards <
> austin.caw...@gmail.com> wrote:
>
>> Hey Simone,
>>
>> I'd suggest trying out the `DataStream#print()` function to start, but
>> there are a few other easy-to-integrate sinks for testing that you can
>> check out in the docs here[1]
>>
>> Best,
>> Austin
>>
>> [1]:
>> https://ci.apache.org/projects/flink/flink-docs-release-1.11/dev/datastream_api.html#data-sinks
>>
>> On Mon, Nov 23, 2020 at 3:56 PM Simone Cavallarin 
>> wrote:
>>
>>> Hi All,
>>>
>>> On my code I have a DataStream that I would like to access. I need to
>>> understand what I'm getting for each transformation to check if the data
>>> that I'm working on make sense. How can I print into the console or get a
>>> file (csv, txt) for the variables: "stream", "enriched" and "result"?
>>>
>>> I have tried different way but no way to get the data.
>>>
>>> Thanks!
>>>
>>>
>>> *FlinkKafkaConsumer kafkaData =*
>>> *new FlinkKafkaConsumer("CorID_1", new
>>> EventDeserializationSchema(), p);*
>>> *WatermarkStrategy wmStrategy =*
>>> *WatermarkStrategy*
>>> *.forMonotonousTimestamps()*
>>> *.withIdleness(Duration.ofMinutes(1))*
>>> *.withTimestampAssigner((event, timestamp) -> {*
>>> *return event.get_Time();*
>>> *});*
>>> *DataStream stream = env.addSource(*
>>> *kafkaData.assignTimestampsAndWatermarks(wmStrategy));*
>>>
>>> *DataStream> enriched = stream*
>>> *.keyBy((Event KafkaMSG) -> KafkaMSG.CorrID)*
>>> *.map(new StatefulSessionCalculator());*
>>>
>>> *WindowedStream, String, TimeWindow> result
>>> = enriched*
>>> *.keyBy(new MyKeySelector())*
>>> *.window(EventTimeSessionWindows.withDynamicGap(new
>>> DynamicSessionWindows()));*
>>>
>>


Re: Print on screen DataStream content

2020-11-23 Thread Pankaj Chand
Please correct me if I am wrong. `DataStream#print()` only prints to the
screen when running from the IDE, but does not work (print to the screen)
when running on a cluster (even a local cluster).

Thanks,

Pankaj

On Mon, Nov 23, 2020 at 5:31 PM Austin Cawley-Edwards <
austin.caw...@gmail.com> wrote:

> Hey Simone,
>
> I'd suggest trying out the `DataStream#print()` function to start, but
> there are a few other easy-to-integrate sinks for testing that you can
> check out in the docs here[1]
>
> Best,
> Austin
>
> [1]:
> https://ci.apache.org/projects/flink/flink-docs-release-1.11/dev/datastream_api.html#data-sinks
>
> On Mon, Nov 23, 2020 at 3:56 PM Simone Cavallarin 
> wrote:
>
>> Hi All,
>>
>> On my code I have a DataStream that I would like to access. I need to
>> understand what I'm getting for each transformation to check if the data
>> that I'm working on make sense. How can I print into the console or get a
>> file (csv, txt) for the variables: "stream", "enriched" and "result"?
>>
>> I have tried different way but no way to get the data.
>>
>> Thanks!
>>
>>
>> *FlinkKafkaConsumer kafkaData =*
>> *new FlinkKafkaConsumer("CorID_1", new
>> EventDeserializationSchema(), p);*
>> *WatermarkStrategy wmStrategy =*
>> *WatermarkStrategy*
>> *.forMonotonousTimestamps()*
>> *.withIdleness(Duration.ofMinutes(1))*
>> *.withTimestampAssigner((event, timestamp) -> {*
>> *return event.get_Time();*
>> *});*
>> *DataStream stream = env.addSource(*
>> *kafkaData.assignTimestampsAndWatermarks(wmStrategy));*
>>
>> *DataStream> enriched = stream*
>> *.keyBy((Event KafkaMSG) -> KafkaMSG.CorrID)*
>> *.map(new StatefulSessionCalculator());*
>>
>> *WindowedStream, String, TimeWindow> result =
>> enriched*
>> *.keyBy(new MyKeySelector())*
>> *.window(EventTimeSessionWindows.withDynamicGap(new
>> DynamicSessionWindows()));*
>>
>


Re: Print on screen DataStream content

2020-11-23 Thread Austin Cawley-Edwards
Hey Simone,

I'd suggest trying out the `DataStream#print()` function to start, but
there are a few other easy-to-integrate sinks for testing that you can
check out in the docs here[1]

Best,
Austin

[1]:
https://ci.apache.org/projects/flink/flink-docs-release-1.11/dev/datastream_api.html#data-sinks

On Mon, Nov 23, 2020 at 3:56 PM Simone Cavallarin 
wrote:

> Hi All,
>
> On my code I have a DataStream that I would like to access. I need to
> understand what I'm getting for each transformation to check if the data
> that I'm working on make sense. How can I print into the console or get a
> file (csv, txt) for the variables: "stream", "enriched" and "result"?
>
> I have tried different way but no way to get the data.
>
> Thanks!
>
>
> *FlinkKafkaConsumer kafkaData =*
> *new FlinkKafkaConsumer("CorID_1", new
> EventDeserializationSchema(), p);*
> *WatermarkStrategy wmStrategy =*
> *WatermarkStrategy*
> *.forMonotonousTimestamps()*
> *.withIdleness(Duration.ofMinutes(1))*
> *.withTimestampAssigner((event, timestamp) -> {*
> *return event.get_Time();*
> *});*
> *DataStream stream = env.addSource(*
> *kafkaData.assignTimestampsAndWatermarks(wmStrategy));*
>
> *DataStream> enriched = stream*
> *.keyBy((Event KafkaMSG) -> KafkaMSG.CorrID)*
> *.map(new StatefulSessionCalculator());*
>
> *WindowedStream, String, TimeWindow> result =
> enriched*
> *.keyBy(new MyKeySelector())*
> *.window(EventTimeSessionWindows.withDynamicGap(new
> DynamicSessionWindows()));*
>


Print on screen DataStream content

2020-11-23 Thread Simone Cavallarin
Hi All,

On my code I have a DataStream that I would like to access. I need to 
understand what I'm getting for each transformation to check if the data that 
I'm working on make sense. How can I print into the console or get a file (csv, 
txt) for the variables: "stream", "enriched" and "result"?

I have tried different way but no way to get the data.

Thanks!


FlinkKafkaConsumer kafkaData =
new FlinkKafkaConsumer("CorID_1", new 
EventDeserializationSchema(), p);
WatermarkStrategy wmStrategy =
WatermarkStrategy
.forMonotonousTimestamps()
.withIdleness(Duration.ofMinutes(1))
.withTimestampAssigner((event, timestamp) -> {
return event.get_Time();
});
DataStream stream = env.addSource(
kafkaData.assignTimestampsAndWatermarks(wmStrategy));

DataStream> enriched = stream
.keyBy((Event KafkaMSG) -> KafkaMSG.CorrID)
.map(new StatefulSessionCalculator());

WindowedStream, String, TimeWindow> result = 
enriched
.keyBy(new MyKeySelector())
.window(EventTimeSessionWindows.withDynamicGap(new 
DynamicSessionWindows()));