[ 
https://issues.apache.org/jira/browse/SOLR-15189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17305989#comment-17305989
 ] 

Jan Høydahl commented on SOLR-15189:
------------------------------------

Thanks for doing this Alexandre.

Guess this gives us some insight into traffic patterns. Would be interesting to 
see the source of the refguide 6_6 traffic, but I'm quite convinced it is 
Google, since 6.6 guide still shows up on top for some queries at my end.

We should prepare for sunsetting GA on both Lucene and Solr sites, note board 
member Justin's comment on lucene dev list 
https://lists.apache.org/thread.html/re44bf57334ca786b3b5c7c66f27c45604e15f9850920dd565ad64889%40%3Cdev.lucene.apache.org%3E

Also, I just noticed that INFRA must have started tracking downloads again, 
since numbers have started appearing in 
https://uls.apache.org/exports/solr.apache.org.yaml. Those numbers are so low 
that I wonder if they are counting the right thing. My guess is that Solr 
numbers are hidden in Lucene's stats since the current artifacts are actually 
in lucene/solr folder of download site.

Of course any download numbers from this stat will only be the number of clicks 
from homepage to the mirrors - and we don't know anything about how many of 
those that actually ends up as a download, and we cannot know how many visit 
the mirrors directly the next time. Also, with Docker rising the number of 
Docker image pulls are euqally and increasingly interesting. We can get a count 
of total pulls with this command 

{{curl -s https://hub.docker.com/v2/repositories/library/solr/ | jq -r 
".pull_count"}}

I have created SOLR-15275 for getting rid of GA again. Can we perhaps set April 
1st as a date for that? Then we have about 1 month stats in GA to continue 
analyzing.

> Tracking downloads on new Solr site
> -----------------------------------
>
>                 Key: SOLR-15189
>                 URL: https://issues.apache.org/jira/browse/SOLR-15189
>             Project: Solr
>          Issue Type: Sub-task
>            Reporter: Jan Høydahl
>            Assignee: Alexandre Rafalovitch
>            Priority: Major
>         Attachments: Analytics All Web Site Data All Traffic 
> 20210219-20210320.pdf, Analytics All Web Site Data Pages 20210219-20210320.pdf
>
>
> On lucene.apache.org we use Google Analytics tracking
> {quote}GOOGLE_ANALYTICS_TRACKING_ID = 'UA-94576-12'
> {quote}
> I think the reason was so that we could estimate downloads from mirrors, by 
> counting number of clicks on the links from download pages. But are anyone 
> ever looking at or publishing those numbers?
> The ASF wants projects to stop using 3rd party tracking of users and instead 
> ask INFRA for aggregated stats for the page. WDYT? Should we
>  # Remove trackers from both sites and rely on stats from infra
>  # Continue using Google analytics, but have someone actually publish numbers 
> from it every month?
>  # Use some other way of counting downloads?
> h2. What do we get without a tracker?
> INFRA provides anonymous page view stats here 
> [https://uls.apache.org/exports/lucene.apache.org.yaml] which gives some 
> insight. But not downloads specifically. We see 12k visits to Solr downloads 
> page last months, but we don't know how many of those clicked...
> {code:java}
> Sheet3:
>   Name: Most visited pages, past month
>   Values:
>     /solr/index.html: 33604
>     /index.html: 27588
>     /solr/downloads.html: 12118
>     /core/2_9_4/queryparsersyntax.html: 11135
>     /core/index.html: 10353
>     /solr/guide/solr-tutorial.html: 9734
>     /solr/resources.html: 8014
>     /solr/features.html: 7046
>     /solr/guide/8_8/solr-tutorial.html: 6099
>     /solr/news.html: 5843
>     /solr/guide/6_6/the-standard-query-parser.html: 5216
>     /solr/guide/index.html: 4430
>     /solr/guide/6_6/common-query-parameters.html: 4379
>     /core/downloads.html: 3644
> {code}
> There's an interesting section at the bottom of that YAML page, wonder if it 
> could be enabled in some way
> {code}
> Sheet6:
>   Name: Downloads, past month
>   Values: {}
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to