[Analytics] Fwd: [Wiki-research-l] [events] Wiki Workshop 2023 Call for Papers

2023-03-02 Thread Leila Zia
and join us in conversations about research on the Wikimedia projects. Best, Leila -- Leila Zia Head of Research Wikimedia Foundation -- Forwarded message - From: Martin Gerlach Date: Mon, Feb 20, 2023 at 1:29 AM Subject: [Wiki-research-l] [events] Wiki Workshop 2023 Call

[Analytics] Re: [event] Wiki Workshop 2022 - Registration open

2022-06-06 Thread Leila Zia
://wikiworkshop.org/2022/#papers . Best, Leila, on behalf of Wiki Workshop 2022 organizers On Fri, Apr 8, 2022 at 8:03 AM Leila Zia wrote: > Hi all, > > The registration for Wiki Workshop 2022 [1] is now open. The event is > virtually held on April 25, 12:00-18:30 UTC and as part of The Web >

[Analytics] [event] Wiki Workshop 2022 - Registration open

2022-04-08 Thread Leila Zia
://hls.harvard.edu/faculty/directory/10519/Lessig [5] (privacy statement for the Google form survey [6]) https://docs.google.com/forms/d/e/1FAIpQLSctlkUv8FasB2Nc4RvThnxAbjPzUwmnxB2FwnNkZlKG1NPOTg/viewform [6] https://foundation.wikimedia.org/wiki/Legal:Wiki_Workshop_Registration_Privacy_Statement -- Leila

[Analytics] Re: [events] Wiki Workshop 2022 Announcement and Call for Papers

2022-03-04 Thread Leila Zia
Hi all, A reminder that if you're considering submitting your ongoing or completed Wikimedia related research to Wiki Workshop, the non-archival deadline is on March 10. Submission instructions at https://wikiworkshop.org/2022/#submission . Best, Leila On Fri, Jan 28, 2022 at 2:34 PM Leila Zia

[Analytics] Re: [events] Wiki Workshop 2022 Announcement and Call for Papers

2022-01-28 Thread Leila Zia
/#call . See my original email below for more details. Best, Leila On Mon, Dec 20, 2021 at 9:53 PM Leila Zia wrote: > > Hi everyone, > > Summary: Wiki Workshop 2022 [0] will take place virtually as part of > The Web Conference 2022 [1]. Call for papers is now open: > https://wik

[Analytics] The Wikimedia Foundation Research Award of the Year - Call for Nominations

2022-01-07 Thread Leila Zia
at wmf-ray-2...@easychair.org or here. Best, Benjamin Mako Hill (University of Washington) Leila Zia (Wikimedia Foundation) ___ Analytics mailing list -- analytics@lists.wikimedia.org To unsubscribe send an email to analytics-le...@lists.wikimedia.org

[Analytics] [events] Wiki Workshop 2022 Announcement and Call for Papers

2021-12-20 Thread Leila Zia
or at wikiworks...@googlegroups.com. Looking forward to seeing many of you in this year's edition. Best, Srijan Kumar, Georgia Tech Emily Lesack, Wikimedia Foundation Miriam Redi, Wikimedia Foundation Bob West, EPFL Leila Zia, Wikimedia Foundation [0] https://wikiworkshop.org/2022/ [1] https

Re: [Analytics] [events] Wiki Workshop 2021 Announcement and Call for Papers

2021-04-09 Thread Leila Zia
/1FAIpQLSeiq7MUp9ln8Z9KijslxRh18eT0bqCpQqAGjunC4n99WMumSw/viewform . If you're interested in research on or about the Wikimedia projects, don't miss it. :) Best, Leila, Miriam and Bob On Fri, Mar 26, 2021 at 5:27 PM Leila Zia wrote: > > Hi everyone, > > We are looking forward to hosting you in W

Re: [Analytics] [events] Wiki Workshop 2021 Announcement and Call for Papers

2021-03-26 Thread Leila Zia
check out the accepted papers and the invited speakers' list on the website: https://wikiworkshop.org/2021/ Best, Leila, Miriam and Bob On Wed, Jan 6, 2021 at 8:52 AM Leila Zia wrote: > > Hi everyone, > > We are delighted to announce that Wiki Workshop 2021 will be held > virtuall

[Analytics] [events] Wiki Workshop 2021 Announcement and Call for Papers

2021-01-06 Thread Leila Zia
edition. Best, Miriam Redi, Wikimedia Foundation Bob West, EPFL Leila Zia, Wikimedia Foundation [1] https://www2021.thewebconf.org/ ___ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics

Re: [Analytics] Upcoming WMF/Research-Team Office hours on September 1st, 2020

2020-09-01 Thread Leila Zia
A friendly reminder that we will kick off this meeting in a couple of minutes. Join us if you want to talk about research. :) On Fri, Aug 28, 2020 at 10:13 AM Martin Gerlach wrote: > > Hi all, > > Join the Research Team at the Wikimedia Foundation [1] for their monthly > Office hours on

[Analytics] [job] We're hiring

2020-08-05 Thread Leila Zia
! :) Research Scientist (Disinformation): https://boards.greenhouse.io/wikimedia/jobs/2267633 Research Engineer: https://boards.greenhouse.io/wikimedia/jobs/2267741 If you have questions, please don't hesitate to reach out. Best, Leila [1] https://research.wikimedia.org/team.html -- Leila Zia

Re: [Analytics] Wiki Workshop 2020 Announcement and Call for Papers

2020-04-18 Thread Leila Zia
forward to seeing those of you who will attend. :) Leila On Mon, Mar 16, 2020 at 12:42 PM Leila Zia wrote: > Hi Pine, > > We're considering the options and going through the pros and cons of > it. One consideration is that folks whose native language is not > English or

Re: [Analytics] Wiki Workshop 2020 Announcement and Call for Papers

2020-03-27 Thread Leila Zia
n Fri, Mar 13, 2020 at 2:42 PM Leila Zia wrote: > > Hi all, > > We have an update for you regarding Wiki Workshop 2020 [0] in light of > the global health situation related to COVID-19. > > ==Summary== > We have turned Wiki Workshop 2020 from an in-perso

Re: [Analytics] Analytics/Research Office hours on 2020-03-25 at 17.00-18.00 (UTC)

2020-03-25 Thread Leila Zia
This is happening now and for the next 53 min. ;) Show up if you have questions for us. Best, Leila -- Leila Zia Head of Research Wikimedia Foundation On Fri, Mar 20, 2020 at 3:03 AM Martin Gerlach wrote: > > Hi all, > > join us for our monthly Analytics/Research Office hours o

Re: [Analytics] Wiki Workshop 2020 Announcement and Call for Papers

2020-03-16 Thread Leila Zia
, I watch or share presentations after they have > occurred. > > Pine > ( https://meta.wikimedia.org/wiki/User:Pine ) > > On Fri, Mar 13, 2020 at 9:43 PM Leila Zia wrote: > > > > Hi all, > > > > We have an update for you regarding Wiki Workshop 2020 [0] in

Re: [Analytics] Wiki Workshop 2020 Announcement and Call for Papers

2020-03-13 Thread Leila Zia
/Zoom_Video_Communications [2] https://wikiworkshop.org/2020/#organization On Wed, Nov 27, 2019 at 7:13 PM Leila Zia wrote: > > Hi everyone, > > We are delighted to announce that Wiki Workshop 2020 will be held in > Taipei on April 20 or 21, 2020 (the date to be finalized soon) and as &g

Re: [Analytics] Announcement - Mediawiki History Dumps

2020-02-11 Thread Leila Zia
ne of the first two approaches. * I'm not sure if you intend to make the dataset more discoverable through places such as https://datasetsearch.research.google.com/ . You may want to consider that. Thanks, Leila -- Leila Zia Head of Research Wikimedia Foundation On Mon, Feb 10, 2020 at 9:28 P

Re: [Analytics] SparkContext stopped and cannot be restarted

2020-02-07 Thread Leila Zia
On Fri, Feb 7, 2020 at 12:45 PM Nuria Ruiz wrote: > > and the verdict (supported by you) was that we should use this list or > the public IRC channel. > Indeed, eh? I suggest we revisit that to send questions to > analytics-internal but if others disagree, I am fine with either. > my 2 cents: I

Re: [Analytics] Subject: New Office hours for WMF/Research starting in January 2020

2020-01-17 Thread Leila Zia
-- Leila Zia Head of Research Wikimedia Foundation On Mon, Dec 16, 2019 at 3:39 AM Martin Gerlach wrote: > > Hi all, > > > We, the Research team at Wikimedia Foundation, have received some requests > over the past months for making ourselves more available to answer some of > t

Re: [Analytics] Wiki Workshop 2020 Announcement and Call for Papers

2020-01-14 Thread Leila Zia
questions, let us know. Best, Leila [1] 2020's program committee: http://wikiworkshop.org/2020/#program-committee On Wed, Nov 27, 2019 at 7:13 PM Leila Zia wrote: > > Hi everyone, > > We are delighted to announce that Wiki Workshop 2020 will be held in > Taipei on April 20 or 21,

[Analytics] Wiki Workshop 2020 Announcement and Call for Papers

2019-11-27 Thread Leila Zia
, please let us know on this list or at wikiworks...@googlegroups.com. Looking forward to seeing you in Taipei. Best, Miriam Redi, Wikimedia Foundation Bob West, EPFL Leila Zia, Wikimedia Foundation [1] https://www2020.thewebconf.org/ ___ Analytics

[Analytics] Fwd: [Wikitech-l] BREAKING CHANGE: schema update, xml dumps

2019-11-27 Thread Leila Zia
FYI -- Forwarded message - From: Ariel Glenn WMF Date: Wed, Nov 27, 2019 at 5:38 AM Subject: [Wikitech-l] BREAKING CHANGE: schema update, xml dumps To: Wikipedia Xmldatadumps-l , Wikimedia developers We plan to move to the new schema for xml dumps for the February 1, 2020 run.

Re: [Analytics] Wikimedia Research Showcase - Feedback time

2019-11-15 Thread Leila Zia
/1FAIpQLSecgn8cMu5IfTYRgn93bfOiJVEIL09RRf_WV0dVr6ZnJ8UU_w/viewform Thanks to those of you who have participated so far. Thanks, Leila On Fri, Nov 8, 2019 at 12:46 PM Leila Zia wrote: > > Hi all, > > Wikimedia Research Showcase [1] is almost six years old and we're > using the birthday opportunity to step back and reflect on the pa

[Analytics] Wikimedia Research Showcase - Feedback time

2019-11-08 Thread Leila Zia
forward. Please submit your responses by 2019-11-22. Sincerely, Jonathan Morgan and Leila Zia Research, Wikimedia Foundation [1] https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase [2] If you want to participate but not through Google Forms, ping me off-list and I'll send you a pdf file you can

Re: [Analytics] [Wiki-research-l] Analytics clients (stat/notebook hosts) and backups of home directories

2019-07-15 Thread Leila Zia
scussing team practices for data/project backups tomorrow and plan to > >> come out with some proposals, at least for the short term. > >> > >> Are there any existing processes or guidelines I should be aware of? > >> > >> Thanks! > >>

Re: [Analytics] [Wiki-research-l] Analytics clients (stat/notebook hosts) and backups of home directories

2019-07-10 Thread Leila Zia
Hi Luca, Thanks for the heads up. Isaac is coordinating a response from the Research side. I have one question for you: As you allow/encourage for more copies of the files to exist, what is the mechanism you'd like to put in place for reducing the chances of PII to be copied in new folders that

Re: [Analytics] [Wikimedia Research Showcase] June 26, 2019 at 11:30 AM PST, 19:30 UTC

2019-06-27 Thread Leila Zia
summaries. Best, Leila p.s. inspired by your question, I'm thinking maybe we should ask Jonathan Chang to write a blog post about it. That's for later though. :) -- Leila Zia Principal Research Scientist, Head of Research Wikimedia Foundation On Thu, Jun 27, 2019 at 2:50 PM James Salsman wrote

[Analytics] [Wikimedia Research Showcase] March 20 at 11:30 AM PST, 18:30 UTC

2019-03-18 Thread Leila Zia
Hi all, The next Research Showcase, “Learning How to Correct a Knowledge Base from the Edit History” and “TableNet: An Approach for Determining Fine-grained Relations for Wikipedia Tables” will be live-streamed this Wednesday, March 20, 2019, at 11:30 AM PST/18:30 UTC (Please note the change in

Re: [Analytics] DEPRECATION WARNING: dbstore1002 is going to be decommissioned on March 4th

2019-02-22 Thread Leila Zia
On Fri, Feb 22, 2019 at 2:45 AM Luca Toscano wrote: > > the Analytics team has been working with the SRE Data Persistence team during > the last months to replace dbstore1002 with three brand new nodes, > dbstore100[3-5]. We are moving from a single mysql instance (multi-source) to > a

Re: [Analytics] Farewell, Erik!

2019-02-06 Thread Leila Zia
Erik, It's been an incredible honor to work with you as a colleague and a volunteer. Thank you for the stats and all the conversations about categories, topics, languages, ..., but even more so for showing me the path and the purpose, time after time. I will dearly miss you in Wikimedia

Re: [Analytics] Wikistats2 - Metrics available for project families

2018-12-12 Thread Leila Zia
On Wed, Dec 12, 2018 at 11:40 AM Nuria Ruiz wrote: > > Hello! > > The Analytics team would like to announce that we have now in Wikistats2 > metrics available for what we are calling (for the lack of a better name) > "project families". That is, "all wikipedias", "all wikibooks"..etc > > See,

Re: [Analytics] Pageviews by agent for May 18-21 2015

2018-11-13 Thread Leila Zia
will address this question or if I'm missing something obvious (apologies in advance if that's the case). Best, Leila -- Leila Zia Senior Research Scientist, Lead Wikimedia Foundation On Tue, Nov 13, 2018 at 6:41 AM Jennifer Pan wrote: > > Hi there, > > > I'm an assistant professor in

Re: [Analytics] Statistics about republication of Wikimedia content

2018-10-17 Thread Leila Zia
Hi Pine, On Tue, Sep 18, 2018 at 12:11 PM Pine W wrote: > > Hi Analytics, > > Are views of republished Wikimedia content, such as on Google and Youtube, > something that we could include in addition to Wikimedia pageview statistics? > I imagine that this would require cooperation from

Re: [Analytics] [Wiki-research-l] New files for geo coded Wikimedia stats

2018-07-27 Thread Leila Zia
net_users > [8] Talk page: https://bit.ly/2L5Z2P4 section 'Wikipedia vs Worldbank > population counts' > [9] Talk page: https://bit.ly/2NJUoIu section 'Wikipedia vs Worldbank > internet percentages' > ___________ > Wiki-research-l mailing list &

Re: [Analytics] Statistical point of view to the visitors and promotional user names

2018-06-12 Thread Leila Zia
[coming back from a private response to the public list, with Ladislav's permission.] On Fri, Jun 1, 2018 at 5:03 AM Ladislav Nešněra wrote: > > Re: to https://lists.wikimedia.org/pipermail/analytics/2018-May/006349.html > > Hi Leila, > I'm sorry for the delay but I'm not subscriber of this

[Analytics] Open position - Research Scientist

2018-06-11 Thread Leila Zia
and friends. Best, Leila -- Leila Zia Senior Research Scientist, Lead Wikimedia Foundation ___ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics

Re: [Analytics] Statistical point of view to the visitors and promotional user names

2018-05-29 Thread Leila Zia
Hi Ladislav, We need some more explanation from you to be able to help. Are you interested to see how the reader behavior and usage of Wikipedia has changed as a result of this policy change? If not, can you elaborate? Thanks, Leila -- Leila Zia Senior Research Scientist, Lead Wikimedia

Re: [Analytics] Jeff Levesque: List of Articles By Categories (College Project)

2018-05-23 Thread Leila Zia
r taking the time to respond to me! No worries. :) Good luck! This class of yours sounds really exciting. Leila > > Thank you, > Jeff Levesque > > -Original Message- > From: Leila Zia <le...@wikimedia.org> > Sent: Wednesday, May 23, 2018 7:34 PM >

Re: [Analytics] Jeff Levesque: List of Articles By Categories (College Project)

2018-05-23 Thread Leila Zia
/listinfo/analytics [2] https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Traffic/Pageviews -- Leila Zia Senior Research Scientist, Lead Wikimedia Foundation On Wed, May 23, 2018 at 3:22 PM, Wikimedia Answers <answ...@wikimedia.org> wrote: > Forwarding for your evaluation :)

Re: [Analytics] Content of wmf.wdqs_extract

2018-05-08 Thread Leila Zia
A couple of pointers as Stas was not involved in the details of the extraction. Adrian: you can dig the history behind the extraction at https://phabricator.wikimedia.org/T146064 Please also check the codes at https://gerrit.wikimedia.org/r/#/c/311964/ for details, specifically wdqs_extract.hql

Re: [Analytics] How best to accurately record page interactions in Page Previews

2018-04-12 Thread Leila Zia
Thank you, Tilman. This is very helpful. Leila On Thu, Feb 8, 2018 at 1:50 AM, Tilman Bayer <tba...@wikimedia.org> wrote: > Hi Leila, > > On Wed, Jan 17, 2018 at 10:46 AM, Leila Zia <le...@wikimedia.org> wrote: > >> Hi Sam, >> >> On Wed,

Re: [Analytics] [Services] Getting more than just 1000 top articles from REST API

2018-04-02 Thread Leila Zia
ne suggestion here is that if you want to find articles that are consistently high-page-view (and not part of spike/trend-views), you increase the time-window to 6 months or longer. Best, Leila​ ​ ​ -- Leila Zia Senior Research Scientist, Lead Wikimedia Foundation ​ __

Re: [Analytics] How best to accurately record page interactions in Page Previews

2018-01-17 Thread Leila Zia
Hi Sam, On Wed, Jan 17, 2018 at 1:51 AM, Sam Smith wrote: > IMO #1 is preferable from the operations and performance perspectives as the > response is always served from the edge and includes very few headers, > whereas the request in #2 may be served by the application

Re: [Analytics] Wikipedia aggregate clickstream data released

2018-01-16 Thread Leila Zia
contributed to it at https://blog.wikimedia.org/2018/01/16/wikipedia-rabbit-hole-clickstream/ Best, Leila -- Leila Zia Senior Research Scientist Wikimedia Foundation On Tue, Feb 17, 2015 at 11:00 AM, Dario Taraborelli < dtarabore...@wikimedia.org> wrote: > We’re glad to announce th

Re: [Analytics] Research Showcase Wednesday, November 15, 2017 at 11:30 AM (PST) 18:30 UTC

2017-11-15 Thread Leila Zia
On Wed, Nov 15, 2017 at 11:02 AM, Jan Ainali wrote: > Wasn't 18:30 UTC was 30 minutes ago? That seems to be a typo. It's at 19:30 UTC. Sorry about that. Best, Leila > > Med vänliga hälsningar > Jan Ainali > http://ainali.com > > 2017-11-15 19:53 GMT+01:00 Sarah R

Re: [Analytics] research process (was Re: Google Code-in: Get your tasks for young contributors prepared!)

2017-11-03 Thread Leila Zia
t on our end. We will do our best. The ticket for tracking this task is https://phabricator.wikimedia.org/T179693 . Best, Leila ​ -- Leila Zia Senior Research Scientist Wikimedia Foundation ​ > > > /Lars > > ___ > Analytics m

Re: [Analytics] Analytics project request

2017-07-24 Thread Leila Zia
/Community_Health#Segment_3:_Research_on_harassment CD - Structured Data https://meta.wikimedia.org/wiki/Wikimedia_Foundation_Annual_Plan/2017-2018/Final/Structured_Data#Segment_4:_Programs [2] https://meta.wikimedia.org/wiki/Research:Wikipedia_clickstream -- Leila Zia Senior Research Scientist Wikimedia

Re: [Analytics] Analytics project request

2017-07-24 Thread Leila Zia
I'll review Daniel's email and will get back to him/you on this list in the next day or so. Leila -- Leila Zia Senior Research Scientist Wikimedia Foundation On Mon, Jul 24, 2017 at 7:59 AM, Nuria Ruiz <nu...@wikimedia.org> wrote: > Daniel, > > Singining an NDA is not enoug

Re: [Analytics] new mediawiki_history snapshot available

2017-07-12 Thread Leila Zia
end 2017/ begginning 2018. Thank you! > > On Wed, Jul 12, 2017 at 12:22 PM, Leila Zia <le...@wikimedia.org> wrote: >> >> On Wed, Jul 12, 2017 at 12:16 PM, Nuria Ruiz <nu...@wikimedia.org> wrote: >> > Further clarification that this snapshot of data is not yet public

Re: [Analytics] [Wiki-research-l] Wikipedia Detox: Scaling up our understanding of harassment on Wikipedia

2017-06-22 Thread Leila Zia
Attacks Seen at Scale <https://arxiv.org/abs/1610.08914>, Section 3 should give you a relatively detailed description of how this question was approached. Best, Leila Pine > > On Wed, Jun 21, 2017 at 2:08 PM, Leila Zia <le...@wikimedia.org> wrote: > >> Hi Dan, >>

Re: [Analytics] web log data

2017-03-06 Thread Leila Zia
that we cannot be of more help for your research at this point. Best, Leila [1] https://www.mediawiki.org/wiki/Wikimedia_Research/Formal_collaborations#How_are_formal_research_collaborations_created.3F are met and Leila Zia Senior Research Scientist Wikimedia Foundation On Mon, Mar 6, 2017 at 6:28 AM

Re: [Analytics] stats.grok.se used in study about Snowden and internet traffic

2017-01-18 Thread Leila Zia
+ Juliet, as this is something Communications may want to follow up given that stats.groke.se is not maintained by a Wikimedia Foundation member. Thanks for sharing this. Leila Leila Zia Senior Research Scientist Wikimedia Foundation On Wed, Jan 18, 2017 at 9:00 AM, Andrew Otto <

Re: [Analytics] Fwd: [Query Logs] Research:Understanding Wikidata Queries

2017-01-16 Thread Leila Zia
On Tue, Jan 3, 2017 at 9:30 AM, Stas Malyshev wrote: > Hi! > > > 1. Is there a unique key for the query log? The log I am refering to > > is the *wdqs_extract* table**from > > the hive database wmf.**We would like to be able to > > permanently link our

Re: [Analytics] Upcoming Research Showcase, November 16, 2016

2016-11-16 Thread Leila Zia
Hi all, A reminder that this is happening in 2 hours from now. Best, Leila On Wed, Nov 9, 2016 at 2:29 PM, Leila Zia <le...@wikimedia.org> wrote: > [Apologies for cross-posting] > > Hi everyone, > > Almost a year ago, we [1] embarked on a research project to understand wh

Re: [Analytics] ensuring reader anonymity

2016-11-11 Thread Leila Zia
hashing IP addresses in webrequest logs. Best, Leila On Fri, Nov 11, 2016 at 11:16 AM, Leila Zia <le...@wikimedia.org> wrote: > ​Hi Pine, > > On Fri, Nov 11, 2016 at 10:39 AM, Pine W <wiki.p...@gmail.com> wrote: > >> On Fri, Nov 11, 2016 at 9:25 AM, Leila Zia <le...@w

Re: [Analytics] ensuring reader anonymity

2016-11-11 Thread Leila Zia
​Hi Pine, On Fri, Nov 11, 2016 at 10:39 AM, Pine W <wiki.p...@gmail.com> wrote: > On Fri, Nov 11, 2016 at 9:25 AM, Leila Zia <le...@wikimedia.org> wrote: > >> Nuria, regarding the IP addresses specifically (not the proxy, for which, >> I'll need more time to go t

[Analytics] Upcoming Research Showcase, November 16, 2016

2016-11-09 Thread Leila Zia
make it, please feel free to watch the video later and get in touch with us with questions/comments. :) Best, Leila -- Leila Zia Senior Research Scientist Wikimedia Foundation ​[1] WMF Research and researchers from three academic institutions: EPFL, GESIS, and Stanford University, in collaborat

[Analytics] Fwd: [Research-Internal] Fwd: Dumps Rewrite getting underway (help needed!)

2016-09-13 Thread Leila Zia
​FYI -- Forwarded message -- From: Ariel Glenn WMF Date: Mon, Sep 12, 2016 at 9:07 AM Subject: [Research-Internal] Fwd: Dumps Rewrite getting underway (help needed!) To: research-inter...@lists.wikimedia.org -- Forwarded message -- From:

Re: [Analytics] Split testing example implementations

2016-09-07 Thread Leila Zia
Hi Jan, I don't know of documented examples (the A/B testing design depends on the question you want to answer). If you want to chat about this more, I'd be happy to brainstorm with you about your options. Message me off-list and we can set up a time if that's helpful. Best, Leila Leila Zia

Re: [Analytics] Getting search engine terms for specific wikibook?

2016-09-06 Thread Leila Zia
re <https://www.mediawiki.org/wiki/Wikimedia_Research/Formal_collaborations>. Leila Zia Senior Research Scientist Wikimedia Foundation On Mon, Sep 5, 2016 at 11:19 AM, Nuria Ruiz <nu...@wikimedia.org> wrote: > >By the way, what about alternate, external methods such as subscribing > &g

Re: [Analytics] Analysing link

2016-08-26 Thread Leila Zia
On Fri, Aug 26, 2016 at 1:38 AM, Federico Leva (Nemo) wrote: > Jan Dittrich, 26/08/2016 10:03: > >> or even click paths >> > > Do you know about https://meta.wikimedia.org/wik > i/Research:Improving_link_coverage/Release_page_traces ? > ​and

Re: [Analytics] [Pageview API] Data Retention Question

2016-07-29 Thread Leila Zia
Dan, Thanks for reaching out. 18 months is enough for my use cases as long as the dumps capture the exact data structure. Best, Leila -- Leila Zia Senior Research Scientist Wikimedia Foundation On Fri, Jul 29, 2016 at 11:51 AM, Amir E. Aharoni < amir.ahar...@mail.huji.ac.il> wrote: >

Re: [Analytics] [Wiki-research-l] question about Pageviews dumps

2016-07-01 Thread Leila Zia
Hi Marc, On Tue, Jun 28, 2016 at 6:36 AM, Marc Miquel wrote: > Since this would be for a research project I might ask funding for it, I > would like to know if I could count on that, what is the nature of the > available data, and what would be the procedure to obtain

Re: [Analytics] Zika

2016-02-14 Thread Leila Zia
​Hey Dan,​ On Sun, Feb 14, 2016 at 3:02 AM, Dan Andreescu wrote: > So, I felt personally compelled in the case of Zika, and the confusing > coverage it has seen, to offer to personally help. ​Which aspect of the coverage are you referring to as confusing?​ > I can

Re: [Analytics] Canonical location for metrics documentation

2015-10-14 Thread Leila Zia
On Wed, Oct 14, 2015 at 8:05 AM, Dan Andreescu wrote: > > > I'm not saying it's easy, but I think having documentation in more than > one place is an awful experience for newcomers. > I second this as a problem. I make a joke of it each time I want to explain to a

Re: [Analytics] Canonical location for metrics documentation

2015-10-14 Thread Leila Zia
Makes sense to me. On Wed, Oct 14, 2015 at 11:27 AM, Neil P. Quinn wrote: > Keep in mind that, when I say "metrics documentation", I'm not referring > to documentation about Hive > , the webrequest > logs

Re: [Analytics] Echo databases on analytics-store?

2015-10-09 Thread Leila Zia
On Fri, Oct 9, 2015 at 1:26 PM, Neil P. Quinn wrote: > I'm trying to gather some stats on the use of Echo notifications across > wikis, and I'd like to join the `echo_events` table with the `user` table > for a given wiki. > I'm not sure what kind of information you need

Re: [Analytics] Users changing language version through interwiki links

2015-09-12 Thread Leila Zia
Hi Strainu, On Sat, Sep 12, 2015 at 5:43 AM, Strainu wrote: > > I think for smaller wikis this would be an interesting way to know which > domains/articles to work on. > What I'm saying is not directly related to your data request but to your comment above: We've been

Re: [Analytics] [Survey] Pageview API

2015-09-11 Thread Leila Zia
It's getting exciting. :-) I'd go with choice 2 since it gives more control to the user while offering what the user can get through choice 1 as well. Question: will we get page_ids or page_titles or both? It's good to have both. Leila On Fri, Sep 11, 2015 at 3:00 PM, Dan Andreescu

Re: [Analytics] Breakdown of unique visitors by country (and by project)

2015-09-08 Thread Leila Zia
Hi Cristian, On Tue, Sep 8, 2015 at 10:42 AM, Cristian Consonni wrote: > > we (Wikimedia Italia) are starting writing a proposal for a EU project > disclaimer: I'm not in Analytics. We don't have unique counts (per country/project, or otherwise) as you already

Re: [Analytics] user table information

2015-06-29 Thread Leila Zia
Thanks a lot everyone. :-) On Mon, Jun 29, 2015 at 6:21 PM, Gergo Tisza gti...@wikimedia.org wrote: On Sat, Jun 27, 2015 at 2:30 PM, Leila Zia le...@wikimedia.org wrote: For the article recommendation test, we queried user table to get editors' email addresses. We then excluded the emails

[Analytics] user table information

2015-06-27 Thread Leila Zia
Hi, For the article recommendation test, we queried user table to get editors' email addresses. We then excluded the emails that were not verified. We've received a comment here https://meta.wikimedia.org/wiki/Research_talk:Increasing_article_coverage#Usage_of_user_database that suggests the user

Re: [Analytics] [Research-Internal] Revision history of deleted pages

2015-06-25 Thread Leila Zia
Aaron, any chance you know the answer to this question? I have a vague memory that we talked about deleted pages and their text some time back. This data should live somewhere, right? given that deleted pages can be restored. Thanks, Leila On Wed, Jun 24, 2015 at 2:03 PM, Leila Zia le

Re: [Analytics] [Research-Internal] Revision history of deleted pages

2015-06-24 Thread Leila Zia
switching to the public list with Bob's permission. On Wed, Jun 24, 2015 at 1:58 PM, Robert West robert.bob.w...@gmail.com wrote: Hi everyone, I'd like to find all enwiki articles that were ever marked with the {{hoax}} template. Pages with that template mostly end up being deleted, so

Re: [Analytics] Search dashboards are now running on live data

2015-05-22 Thread Leila Zia
On Fri, May 22, 2015 at 3:14 PM, Luis Villa lvi...@wikimedia.org wrote: 68,000 searches/day seems *really* low, right, but I'm not sure search sessions per day is the same as the number of searches per day. Oliver, what definition of a search session do you use? How do you compute it? Leila

Re: [Analytics] clicks on red links

2015-05-22 Thread Leila Zia
Hi Amir, As far as I know and as mentioned by others, the exact statistics you're looking for don't exist. More comments in-line. On Wed, May 20, 2015 at 10:37 PM, Amir E. Aharoni amir.ahar...@mail.huji .ac.il wrote: Hi, Are there statistics about the number of people who click on red

[Analytics] May 2015 research showcase

2015-05-11 Thread Leila Zia
Hi everyone, The next research showcase will be live-streamed this Wednesday, May 13 at 11.30 PT. The streaming link will be posted on the lists a few minutes before the showcase starts and as usual, you can join the conversation on IRC at #wikimedia-research. We look forward to seeing you!

Re: [Analytics] [Wiki-research-l] April 2015 research showcase: remix and reuse in collaborative communities; the oral citations debate

2015-04-30 Thread Leila Zia
A reminder that this event will start in 10 minutes. You can watch the event on YouTube here http://youtu.be/upQXecRNcdw. As usual, we will be in #wikimedia-research for questions and chat. :-) On Thu, Apr 16, 2015 at 12:43 PM, Dario Taraborelli dtarabore...@wikimedia.org wrote: I am thrilled

Re: [Analytics] Research Showcase Starting in 8 minutes!

2015-03-25 Thread Leila Zia
The youtube link has changed to: http://youtu.be/PHQqicVoVx4 On Wed, Mar 25, 2015 at 11:22 AM, Ellery Wulczyn ewulc...@wikimedia.org wrote: Today we will have two presentation: 1. User Session Identification by Aaron Halfaker 2. Mining Missing Hyperlinks in Wikipedia by Bob West. You can

[Analytics] [Announcement] March 2015 Research Showcase

2015-03-20 Thread Leila Zia
Hi, This month's research showcase https://www.mediawiki.org/w/index.php?title=Analytics/Research_and_Data/Showcase#March_2015 is scheduled for Wednesday, March 25, 11:30 (PST). We will have two presentations on user session identification by Aaron Halfaker, and mining missing hyperlinks in

[Analytics] [Technical] which pageview definition

2015-03-15 Thread Leila Zia
Hi, I'm trying to figure out which of the two pageview definitions we currently have I can use for a question Bob and I are trying to address. It would be great if you share your thoughts. If you choose to do so, please do it by Tuesday, eod, PST. More details: *What are we doing?* We are

Re: [Analytics] [Cluster] Monitoring the impact Hive jobs have on the Analytics cluster

2015-03-08 Thread Leila Zia
This is really useful, Christian. Thanks for explaining and documenting it. Leila On Sat, Mar 7, 2015 at 6:14 AM, Christian Aistleitner christ...@quelltextlich.at wrote: Hi, around running jobs on the Analytics cluster, I've sometime seen people say in IRC: “Let's run this heavy job. I'll

Re: [Analytics] analytics-store replag

2015-03-05 Thread Leila Zia
Hi Sean, Thanks for the email. The two create queries are mine. Should I kill one? Leila On Mar 5, 2015 7:09 AM, Sean Pringle sprin...@wikimedia.org wrote: Just a heads-up: Analytics-store is seeing several hours of replag on s1, s4, and s6. s4 is me doing a commonswiki schema change,

Re: [Analytics] analytics-store replag

2015-03-05 Thread Leila Zia
Hi Sean, On Thu, Mar 5, 2015 at 9:59 PM, Sean Pringle sprin...@wikimedia.org wrote: Hi Leila On Fri, Mar 6, 2015 at 1:38 AM, Leila Zia le...@wikimedia.org wrote: Hi Sean, Thanks for the email. The two create queries are mine. Should I kill one? Lag has now reached 24h for s1

Re: [Analytics] February 2015 Research Showcase: Global South survey results; data imports in OpenStreetMap

2015-02-18 Thread Leila Zia
This is happening in 15 minutes. Here is the link for watching it: http://youtu.be/yaj9dfHjkOA We will be in IRC channel #wikimedia-research for taking your questions. :-) On Wed, Feb 11, 2015 at 5:21 PM, Dario Taraborelli dtarabore...@wikimedia.org wrote: I am thrilled to announce our

Re: [Analytics] Welcome Joseph

2015-02-18 Thread Leila Zia
Welcome to the team, Joseph! b.t.w., I didn't know you have a background in NLP. That skill may become handy soon. ;-) On Wed, Feb 18, 2015 at 6:37 PM, Toby Negrin tneg...@wikimedia.org wrote: Hi Everyone, I'd like to welcome Joseph Allemendou to the Analytics team! We are really excited to

Re: [Analytics] DNT, standards, and expectations

2015-01-16 Thread Leila Zia
On Fri, Jan 16, 2015 at 4:56 PM, Ori Livneh o...@wikimedia.org wrote: On Fri, Jan 16, 2015 at 4:25 PM, Dario Taraborelli dtarabore...@wikimedia.org wrote: I second Aaron’s concerns, which I previously expressed during the consultation about the new privacy policy. My main objection to the

Re: [Analytics] most clicked links in articles

2015-01-12 Thread Leila Zia
Hi Amir, We're working on a link improvement project [1] that will answer your first two questions. The first round of tests will be on ptwiki, then enwiki, and depending on the results we may add more languages. The algorithm used is robust to the choice of language, its accuracy, however,

Re: [Analytics] Per-namespace daily edit numbers

2015-01-08 Thread Leila Zia
Gergo, this table has edits per name space aggregated by month. In your original email, you ask for edit count and time of edit. If that's the case, this table can't help (but how Aaron has generated this table can). mmonth: last day of the month (month is MM form) reverted: total number of

Re: [Analytics] WikiGrok and EventLogging

2015-01-07 Thread Leila Zia
before deploying! On Tue, Jan 6, 2015 at 4:55 PM, Ryan Kaldari rkald...@wikimedia.org wrote: I can elaborate on this after I finished the SWAT deployment Gimme 30 minutes or so. On Tue, Jan 6, 2015 at 4:51 PM, Leila Zia le...@wikimedia.org wrote: Hi, The mobile team is planning

[Analytics] WikiGrok and EventLogging

2015-01-06 Thread Leila Zia
Hi, The mobile team is planning to switch WikiGrok on for non-logged in users next week (2014-01-12). The widget will be on on 166,029 article pages in enwiki. There are two EventLogging schema that may collect data heavily and we want to make sure EL can handle the influx of data. The two

Re: [Analytics] Getting Access to Wikipedia Database

2014-12-24 Thread Leila Zia
Hi Neta, On Wed, Dec 24, 2014 at 7:19 AM, Neta Livneh neta.liv...@gmail.com wrote: Actually, this is a great opportunity to say that I would love to get you guys involved or at least hear insights from the analytics team regarding the project's direction. Feel free to keep me in the loop

Re: [Analytics] Switching the RD team to Phabricator

2014-12-15 Thread Leila Zia
without reviewing our prioritization process. Shall we make this a Q3 goal since people seem really into it? On Dec 15, 2014, at 10:44 AM, Leila Zia le...@wikimedia.org wrote: Hi Oliver, I'd like to give Phabricator a try. I suggest the following steps if we decide to do it: 1. We

Re: [Analytics] Switching the RD team to Phabricator

2014-12-15 Thread Leila Zia
On Mon, Dec 15, 2014 at 10:48 AM, Toby Negrin tneg...@wikimedia.org wrote: Shall we make this a Q3 goal since people seem really into it? I'm not sure. If it involves figuring out prioritization, it can be a good idea. On Dec 15, 2014, at 10:44 AM, Leila Zia le...@wikimedia.org wrote

Re: [Analytics] EventLogging data QA

2014-12-15 Thread Leila Zia
On Mon, Dec 15, 2014 at 10:06 AM, Toby Negrin tneg...@wikimedia.org wrote: I share Christian's concerns - Dario/Leila - can you comment based on your recent experiences with WikiGrok? I agree with Christian. QA in beta labs is good but not enough. We still need to do QA when a feature goes

Re: [Analytics] EventLogging data QA

2014-12-15 Thread Leila Zia
are talking add-block that can be tested even earlier, vagrant will be a fine venue. All the issues related to the client (browser) not emitting events can be tested on the development environment with ease. On Mon, Dec 15, 2014 at 4:18 PM, Leila Zia le...@wikimedia.org javascript:_e(%7B%7D

[Analytics] EventLogging and Adblock on Linux/Firefox

2014-12-11 Thread Leila Zia
Hi everyone, From some initial tests it appears to me that EventLogging is not logging events from Linux/Firefox when Adblock is enabled. I'm on Ubuntu 14.04, Firefox 34.0, and Adblock Plus 2.6.6. When I disable Adblock, I see event.gif?{...} in Console, when I enable it, I don't. Just to make

Re: [Analytics] EventLogging and Adblock on Linux/Firefox

2014-12-11 Thread Leila Zia
good catch, didn't know there is a venue of them. My test was with Adblock Plus on Linux/Firefox. I installed Adblock Plus on Chrome just now and tested. Linux/Chrome logs events without a problem. On Thu, Dec 11, 2014 at 3:00 PM, Federico Leva (Nemo) nemow...@gmail.com wrote: Is everyone

Re: [Analytics] EventLogging and Adblock on Linux/Firefox

2014-12-11 Thread Leila Zia
[1]. Thanks, Dan [1]: Yes, that's what it is. Rain. You Californians might call it stormageddon or whatever, but the rest of the world calls it rain. ;-) On 11 December 2014 at 14:15, Leila Zia le...@wikimedia.org wrote: Hi everyone, From some initial tests it appears to me

  1   2   >