I'm not Adrian but we work together on this project, and that's indeed
what we're doing, and the guess was correct as well.
Thanks so far!

On Mon 15.05 08:45, Addshore wrote:
> I believe in this case data is being crunched, in hadoop, which is where
> the WDQS access logs are.
> And I think the page in question that Adrian wanted to load was
> https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples,
> at a guess he is looking at how often these example queries are requested
> via the service.
> 
> On Mon, 15 May 2017 at 00:22 Nuria Ruiz <nu...@wikimedia.org> wrote:
> 
> > >(i.e. implying that we need to collect the data somewhere else, and move
> > to production for number crunching only)?
> > I think we should probably set up a sync up so you get an overview of how
> > this works cause this is a brief response. Data is harvested in some
> > production machines, it is processed (in different production machines) and
> > moved to stats machines (also production but a sheltered environment). We
> > do not use stats machines to harvest data. They just provide access to it
> > and are sized so you can process and crunch data, this talk explains a bit
> > how does this all works: https://www.youtube.com/watch?v=tx1pagZOsiM
> >
> > We might be talking pass each other here, if so, a meeting might help.
> >
> >
> > >Nuria, what exactly do you have in mind when you say "a development
> > instance of Wikidata"?
> > If you need to look at a wikidata query and see what it shows on the logs
> > when you  query x or y, that step should be done on a (wikidata) *test
> > environment* that logs the http requests for your queries as received by
> > the server. So you can "test" your queries agains a server and see how
> > those are received.
> >
> >
> > Thanks,
> >
> > Nuria
> >
> >
> >
> >
> >
> > On Sun, May 14, 2017 at 1:10 PM, Adrian Bielefeldt <
> > adrian.bielefe...@mailbox.tu-dresden.de> wrote:
> >
> >> Hi Addshore,
> >> thanks for the advice, I can now connect.
> >>
> >> Greetings,
> >>
> >> Adrian
> >>
> >>
> >> On 05/13/2017 05:47 PM, Addshore wrote:
> >>
> >> You should be able to connect to query.wikidata.org via the webproxy.
> >>
> >> https://wikitech.wikimedia.org/wiki/HTTP_proxy
> >>
> >> On Sat, 13 May 2017 at 15:23 Adrian Bielefeldt <
> >> adrian.bielefe...@mailbox.tu-dresden.de> wrote:
> >>
> >>> Hello Nuri,
> >>>
> >>> I'm working on a project
> >>> <https://meta.wikimedia.org/wiki/Research:Understanding_Wikidata_Queries>
> >>> analyzing the wikidata SPARQL-queries. We extract specific fields (e.g.
> >>> uri_query, hour) from wmf.wdqs_extract, parse the queries with a java
> >>> program using open_rdf as the parser and then analyze it for different
> >>> metrics like variable count, which entities are being used and so on.
> >>>
> >>> At the moment I'm working on checking which entries equal one of the
> >>> example queries at
> >>> https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples
> >>> using this
> >>> <https://github.com/Wikidata/QueryAnalysis/blob/master/src/main/java/general/Main.java#L339-L376>
> >>> code. Unfortunately the program cannot connect to the website, so I'm
> >>> assuming I have to create an exception for this request or ask for it to 
> >>> be
> >>> created.
> >>>
> >>> Greetings,
> >>>
> >>> Adrian
> >>> _______________________________________________
> >>> Analytics mailing list
> >>> Analytics@lists.wikimedia.org
> >>> https://lists.wikimedia.org/mailman/listinfo/analytics
> >>>
> >>
> >>
> >> _______________________________________________
> >> Analytics mailing 
> >> listAnalytics@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/analytics
> >>
> >>
> >>
> >> _______________________________________________
> >> Analytics mailing list
> >> Analytics@lists.wikimedia.org
> >> https://lists.wikimedia.org/mailman/listinfo/analytics
> >>
> >>
> > _______________________________________________
> > Analytics mailing list
> > Analytics@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/analytics
> >

> _______________________________________________
> Analytics mailing list
> Analytics@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/analytics

Attachment: signature.asc
Description: PGP signature

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to