Hi Martijn,

big +1 for this effort. Thanks a lot for pushing this initiative forward!

Cheers,
Till

On Fri, Jan 14, 2022 at 11:49 AM Konstantin Knauf <kna...@apache.org> wrote:

> Hi Martijn,
>
> I think this is a great initiative. Thank you for pursuing this. It allows
> us to
>
> a) generate better insights into the usage of Apache Flink and its
> documentation as shown in the video
> a) do this in a privacy preserving way and
> c) act as a role model for other Apache projects on this matter
>
> Big +1. I am happy to help, if I can.
>
> Cheers,
>
> Konstantin
>
>
>
> On Fri, Jan 14, 2022 at 11:21 AM Martijn Visser <mart...@ververica.com>
> wrote:
>
> > Hi everyone,
> >
> > The Flink website currently uses Google Analytics to track how visitors
> of
> > the website are interacting with it. It provides insights into which
> > documentation pages are visited, how users are using the website (what's
> > the cycle of pages they visit before exiting the page), if they are
> > downloading Flink etc. However, the Apache Software Foundation
> discourages
> > using Google Analytics [1] unless meeting certain requirements. The Flink
> > website currently does not meet those requirements.
> >
> > I do believe that it's useful to understand what parts of a website are
> > important to users, what features are most frequently read up on, where
> > they get lost in the docs, etc. so we can better understand how users use
> > the system, the website, and the docs and where to focus improvements
> next.
> >
> > I would like to move the Flink website from Google Analytics to an
> > alternative as soon as possible for Flink. I would be in favour of
> opening
> > up insights to this data for everyone too, it's public data anyway.
> >
> > For the past couple of months, I've been engaging in a conversation with
> > ASF Legal and ASF Infra about setting up a privacy-friendly alternative
> for
> > Google Analytics for all ASF projects via the priv...@apache.org mailing
> > list (I can't find a public web archive link for this unfortunately). As
> > part of that discussion, I've done a test with the open source and
> > self-hosted version of Matomo [2], taking a look at the privacy
> > implications and the functionality that this tool offers. You can watch a
> > recording of that experiment [3] and view the test setup I've used [4].
> >
> > The current status is that ASF Legal, ASF Infra and I have agreed to take
> > the next step on this project. This step means that:
> >
> > * I set up Matomo on a VM provided by ASF Infra
> > * A new DNS name is created (either https://analytics.apache.org/ or
> > https://matomo.analytics.apache.org/) by ASF Infra
> > * The Flink website is adjusted to remove the tracking from Google
> > Analytics and include the necessary Javascript to allow tracking of the
> > Flink website and documentation in Matomo
> >
> > If this test would be successful, ASF Infra would take over the hosting
> of
> > this solution and provide it to all ASF projects.
> >
> > I would like to understand from the Flink community:
> >
> > 1. Do you think this is a good idea?
> >
> > 2. If yes, I need a couple of PMCs for requesting a VM from Apache Infra
> > [5]
> >
> > Best regards,
> >
> > Martijn
> > https://twitter.com/MartijnVisser82
> >
> > [1] https://privacy.apache.org/faq/committers.html
> > [2] https://matomo.org/
> > [3]
> >
> >
> https://drive.google.com/file/d/1yomYhLoyrzBW620bpn_dROiwyvSCzuvt/view?usp=sharing
> > [4] https://github.com/MartijnVisser/matomo-analytics
> > [5] https://infra.apache.org/vm-for-project.html
> >
>
>
> --
>
> Konstantin Knauf
>
> https://twitter.com/snntrable
>
> https://github.com/knaufk
>

Reply via email to