To aitozi. Cheers Minglei
> 在 2018年6月27日,下午5:46,shimin yang <ysmcl...@gmail.com> 写道: > > Aitozi > > We are using hyperloglog to count daily uv, but it only provided an > approximate value. I also tried the count distinct in flink table without > window, but need to set the retention time. > > However, the time resolution of this operator is 1 millisecond, so it ends up > with too many timers in the java heap which might leads to OOM. > > Cheers > Shimin > > 2018-06-27 17:34 GMT+08:00 zhangminglei <18717838...@163.com > <mailto:18717838...@163.com>>: > Aitozi > > From my side, I do not think distinct is very easy to deal with. Even though > together work with kafka support exactly-once. > > For uv, we can use a bloomfilter to filter pv for geting uv in the end. > > Window is usually used in an aggregate operation, so I think all should be > realized by windows. > > I am not familiar with this fields, so I still want to know what others > response this question. > > Cheers > Minglei > > > > > 在 2018年6月27日,下午5:12,aitozi <gjying1...@gmail.com > > <mailto:gjying1...@gmail.com>> 写道: > > > > Hi, community > > > > I am using flink to deal with some situation. > > > > 1. "distinct count" to calculate the uv/pv. > > 2. calculate the topN of the past 1 hour or 1 day time. > > > > Are these all realized by window? Or is there a best practice on doing this? > > > > 3. And when deal with the distinct, if there is no need to do the keyBy > > previous, how does the window deal with this. > > > > Thanks > > Aitozi. > > > > > > > > -- > > Sent from: > > http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/ > > <http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/> > > >