Re: [Analytics] Wikimedia monthly research & data showcase: live streamed tomorrow

2014-04-16 Thread Leila Zia
A reminder that this will start in 10 minutes. You can find the link to the live streaming here . On Tue, Apr 15, 2014 at 10:51 AM, Dario Taraborelli < dtarabore...@wikimedia.org> wrote: > The next Research & Data > showcase

Re: [Analytics] Survey tool for features

2014-04-29 Thread Leila Zia
On 04/29/2014 11:30 AM, Steven Walling wrote: On Tue, Apr 29, 2014 at 10:34 AM, Mark Holmquist mailto:mtrac...@member.fsf.org>> wrote: But a dark threat loomed over the land. With one product using SurveyMonkey, other products seemed poised to use it, too [1]. The compromise buil

Re: [Analytics] db1047 & one box to rule them all

2014-04-30 Thread Leila Zia
Hi Sean, I am very excited about this. Thank you. :-) Re unified views: On Wed, Apr 30, 2014 at 6:59 AM, Dan Andreescu wrote: > This is awesome, thank you Sean > >> *This is probably my bad, but I understood the goal to be having a >>> single db containing unified, core tablets. So, we'd ha

[Analytics] Monthly Research & Data Showcase this Wednesday

2014-07-15 Thread Leila Zia
The next Research & Data Showcase will be live-streamed this Wednesday, 7/16 at 11.30 PT. The streaming link will be posted on the lists a few minutes before the showcase starts and as usual, you can join the conversation on IRC at # wikimedia-research. We look forward to seeing you! Leila This

Re: [Analytics] Monthly Research & Data Showcase this Wednesday

2014-07-16 Thread Leila Zia
The Research and Data Showcase will start in few minutes. The updated list of speakers is here: https://www.mediawiki.org/wiki/Analytics/Research_and_Data/Showcase#July_2014 You can watch the YouTube steaming here: http://youtu.be/1E4JcxTgmco On Mon, Jul 14, 2014 at 12:45 PM, Leila Zia wrote

[Analytics] Research and Data Showcase Survey

2014-07-16 Thread Leila Zia
Hi all, Starting December 2013, Research and Data has had eight showcases. We would like to hear your feedback about them through this survey*: https://www.surveymonkey.com/s/ResearchandData The deadline for filling out the survey is Wednesday, July 30. We will present the results in the Au

Re: [Analytics] Media Viewer User Preference Data

2014-07-16 Thread Leila Zia
On Wed, Jul 16, 2014 at 5:01 AM, Erik Zachte wrote: > It seems to me that if we could present half of a target population with > old and half with new settings from the outset (e.g. by focussing on new > users only), then the outcome would be more convincing. > Disclaimer: I have not been follow

Re: [Analytics] data visualization about Wikipedia (I am asking for a dataset that I cannot produce by myself)

2014-08-07 Thread Leila Zia
Hi Thomas, Disclaimer: I'm not in Wikimania, so it's hard for me to know how much more you've discussed this with others in the team. I know you've had some discussions with Oliver, too. For the record, in the Metrics Meeting on July 31, the Analytics team presented an overview of Mobile.[1

[Analytics] Monthly Research & Data Showcase this Wednesday

2014-08-18 Thread Leila Zia
The next Research & Data showcase will be live-streamed this Wednesday, 8/20 at 11.30 PT. The streaming link will be posted on the lists a few minutes before the showcase starts and as usual, you can join the conversation on IRC

Re: [Analytics] Monthly Research & Data Showcase this Wednesday

2014-08-20 Thread Leila Zia
This is a reminder that this event will happen in less than 35 min. Here's the streaming link: https://www.youtube.com/watch?v=wgnnVG7sLQ0 On Mon, Aug 18, 2014 at 9:37 AM, Leila Zia wrote: > The next Research & Data showcase > <https://www.mediawiki.org/wiki/Analytics

Re: [Analytics] Anonymizing and releasing 'edits per country' data for Wiki Projects

2014-08-25 Thread Leila Zia
FWIW: depending on the threshold chosen in step 2 of Anonymization suggested by Yuvi, some of the countries/languages will have no data. This data will solve the problem for some of the partners, but not all of them. On Monday, August 25, 2014, Jessie Wild wrote: > THIS IS SO USEFUL! > > For gra

Re: [Analytics] pitching the Gender Edit Dashboard

2014-08-26 Thread Leila Zia
Thanks for initiating this thread, Kaldari. On Mon, Aug 25, 2014 at 9:30 PM, Ryan Kaldari wrote: > Part of scientific investigation is forming a hypothesis, but that's > difficult to do when you don't even have anecdotal evidence. There's > nothing wrong with beginning an investigation with impe

Re: [Analytics] pitching the Gender Edit Dashboard

2014-08-29 Thread Leila Zia
On Fri, Aug 29, 2014 at 4:58 AM, Dan Andreescu wrote: > >>- I wonder if we might explore ways to improve such a survey. For >>example, we might include the gender question in the signup form for a >>small percentage of newly registered users. >> >> This experiment sounds more useful

[Analytics] Welcome Marcel Ruiz Forns to the Analytics Development team

2014-10-08 Thread Leila Zia
Welcome, Marcel! It's great to have you on the team. :-) On Wed, Oct 8, 2014 at 1:57 PM, Tomasz Finc > wrote: > Welcome Marcel. Great to have you here. Do come by and say hello while > your in SF. > > On Tue, Oct 7, 2014 at 4:44 PM, Toby Negrin > wrote: > > Hi Everyone, > > > > I'd like to welco

Re: [Analytics] Wikimedia Research showcase – October 15 2014, 11.30 PT

2014-10-15 Thread Leila Zia
This is the streaming link you can join to watch the showcase: http://youtu.be/-We4GZbH3Iw On Tue, Oct 14, 2014 at 10:26 PM, Dario Taraborelli < dtarabore...@wikimedia.org> wrote: > After a break in September, we’re resuming our monthly Research and Data > showcase >

Re: [Analytics] s1-analytics-slave impressively slow queries

2014-11-11 Thread Leila Zia
Sean, What Nuria said. It seems we've missed this one. Sorry for the trouble. Leila On Mon, Nov 10, 2014 at 8:01 AM, Nuria Ruiz wrote: > cc-ing leila as we were experimenting with these some weeks back in SF, I > think they can be killed w/o problems. I did not know they were still > runnin

Re: [Analytics] [LangEng] Analytics

2014-11-13 Thread Leila Zia
posted in the message is the only output I am seeing. I >>>>> do not see the URL-encoded section or the validation section. I think >>>>> there >>>>> may be something wrong with my testing setup. >>>>> >>>>> Niklas Laxst

[Analytics] EventLogging and Adblock on Linux/Firefox

2014-12-11 Thread Leila Zia
Hi everyone, From some initial tests it appears to me that EventLogging is not logging events from Linux/Firefox when Adblock is enabled. I'm on Ubuntu 14.04, Firefox 34.0, and Adblock Plus 2.6.6. When I disable Adblock, I see event.gif?{...} in Console, when I enable it, I don't. Just to make

Re: [Analytics] EventLogging and Adblock on Linux/Firefox

2014-12-11 Thread Leila Zia
quot;:"MobileWebClickTracking","webHost":" >>>en.m.wikipedia.org","wiki":"enwiki"};: >>> >>> >>> As far as I know, I use default Adblock settings. >>> >>> If you want to do some troubleshooti

Re: [Analytics] EventLogging and Adblock on Linux/Firefox

2014-12-11 Thread Leila Zia
good catch, didn't know there is a venue of them. My test was with Adblock Plus on Linux/Firefox. I installed Adblock Plus on Chrome just now and tested. Linux/Chrome logs events without a problem. On Thu, Dec 11, 2014 at 3:00 PM, Federico Leva (Nemo) wrote: > Is everyone talking of the same ex

Re: [Analytics] EventLogging and Adblock on Linux/Firefox

2014-12-11 Thread Leila Zia
your Adblock settings and what > filter subscription(s) you have set up? > > On Thu, Dec 11, 2014 at 3:00 PM, Leila Zia wrote: >> >> On my machine, Linux/Chrome works fine with Adblock 2.14.4 and Chrome >> 35.0.1914.114 >> >> On Thu, Dec 11, 2014 at 2:57 PM

Re: [Analytics] EventLogging and Adblock on Linux/Firefox

2014-12-11 Thread Leila Zia
ision":5929948,"schema":"MobileWebClickTracking","webHost":" >>>>>en.m.wikipedia.org","wiki":"enwiki"};: >>>>> >>>>> >>>>> As far as I know, I use default Adblock settings. >>&

Re: [Analytics] EventLogging and Adblock on Linux/Firefox

2014-12-11 Thread Leila Zia
> be ethical. I don't even think it's a bad thing for this rule to exist, to > be honest. > +1 > The % of users this affects is probably miniscule, since EasyPrivacy is > not a default subscription, AFAIK. > I don't have an idea about this one. Will keep an eye on

Re: [Analytics] Switching the R&D team to Phabricator

2014-12-15 Thread Leila Zia
Hi Oliver, I'd like to give Phabricator a try. I suggest the following steps if we decide to do it: 1. We block a 15-min team time in December during which R&D will play with Phabricator in https://phab-01.wmflabs.org/ If we all feel reasonably comfortable, then, 2. We switch to Ph

Re: [Analytics] Switching the R&D team to Phabricator

2014-12-15 Thread Leila Zia
s, ya know, >>> make waste. So let's talk about this more... >>> >>> On Mon, Dec 15, 2014 at 10:48 AM, Toby Negrin >>> wrote: >>> >>>> To be clear - I do not want to move to Fabricator without reviewing our >>>> prioritization p

Re: [Analytics] Switching the R&D team to Phabricator

2014-12-15 Thread Leila Zia
On Mon, Dec 15, 2014 at 10:48 AM, Toby Negrin wrote: > > > Shall we make this a Q3 goal since people seem really into it? > I'm not sure. If it involves figuring out prioritization, it can be a good idea. > On Dec 15, 2014, at 10:44 AM, Leila Zia wrote: > > Hi Olive

Re: [Analytics] EventLogging data QA

2014-12-15 Thread Leila Zia
On Mon, Dec 15, 2014 at 10:06 AM, Toby Negrin wrote: > > I share Christian's concerns - > > Dario/Leila - can you comment based on your recent experiences with > WikiGrok? > I agree with Christian. QA in beta labs is good but not enough. We still need to do QA when a feature goes to production a

Re: [Analytics] EventLogging data QA

2014-12-15 Thread Leila Zia
bs. If we >> are talking add-block that can be tested even earlier, vagrant will be a >> fine venue. All the issues related to the client (browser) not emitting >> events can be tested on the development environment with ease. >> >> >> >> On Mon, Dec 15, 201

Re: [Analytics] Getting Access to Wikipedia Database

2014-12-24 Thread Leila Zia
Hi Neta, On Wed, Dec 24, 2014 at 7:19 AM, Neta Livneh wrote: > > Actually, this is a great opportunity to say that I would love to get you > guys involved or at least hear insights from the analytics team regarding > the project's direction. > Feel free to keep me in the loop for the latter. B

[Analytics] WikiGrok and EventLogging

2015-01-06 Thread Leila Zia
Hi, The mobile team is planning to switch WikiGrok on for non-logged in users next week (2014-01-12). The widget will be on on 166,029 article pages in enwiki. There are two EventLogging schema that may collect data heavily and we want to make sure EL can handle the influx of data. The two sche

Re: [Analytics] WikiGrok and EventLogging

2015-01-07 Thread Leila Zia
ot;estimate" sampling ourselves. I imagine wikigrok is >>> been deployed to a number of users and it is with that usage the mobile >>> team could estimate the total throughput expected, with this throughput we >>> can recommend sampling ratios. >>> >>> >>>

Re: [Analytics] WikiGrok and EventLogging

2015-01-07 Thread Leila Zia
0.45% (1.25/sec) >> MobileWikiAppSearch 0.41% (1.13/sec) >> CentralAuth 0.40% (1.12/sec) >> >> On Wed, Jan 7, 2015 at 5:12 PM, Nuria Ruiz wrote: >> >>> &g

Re: [Analytics] Per-namespace daily edit numbers

2015-01-08 Thread Leila Zia
Gergo, this table has edits per name space aggregated by month. In your original email, you ask for edit count and time of edit. If that's the case, this table can't help (but how Aaron has generated this table can). mmonth: last day of the month (month is MM form) reverted: total number of re

Re: [Analytics] most clicked links in articles

2015-01-12 Thread Leila Zia
Hi Amir, We're working on a link improvement project [1] that will answer your first two questions. The first round of tests will be on ptwiki, then enwiki, and depending on the results we may add more languages. The algorithm used is robust to the choice of language, its accuracy, however, dep

Re: [Analytics] DNT, standards, and expectations

2015-01-15 Thread Leila Zia
Here's what we all agree on: We want the users of Wikimedia sites to have more control over whether their data is used for application improvement purposes. To be clear, we're not talking about data collected and deleted for operational purposes. Based on our conversations, we have three choices.

Re: [Analytics] DNT, standards, and expectations

2015-01-16 Thread Leila Zia
On Fri, Jan 16, 2015 at 4:56 PM, Ori Livneh wrote: > > On Fri, Jan 16, 2015 at 4:25 PM, Dario Taraborelli < > dtarabore...@wikimedia.org> wrote: > >> I second Aaron’s concerns, which I previously expressed during the >> consultation about the new privacy policy. My main objection to the >> propos

Re: [Analytics] s1-analytics-slave

2015-02-05 Thread Leila Zia
On Thu, Feb 5, 2015 at 6:45 AM, Aaron Halfaker wrote: > I've been slow to move some datasets off of s1-analytics-slave because it > remained available. If I were given ~ a week notice, it would be no > problem to move all datasets and work to analytics-store. > same here. __

[Analytics] [Technical] capacity planning for WikiGrok test 4

2015-02-17 Thread Leila Zia
Hi, The Mobile team will be running WikiGrok experiments in the first half of March 2015. Dario and I will be working closely with the team and will coordinate with Analytics-devs to make sure EventLogging can handle the throughput. The expected throughput is what EL experienced through the las

Re: [Analytics] February 2015 Research Showcase: Global South survey results; data imports in OpenStreetMap

2015-02-18 Thread Leila Zia
This is happening in 15 minutes. Here is the link for watching it: http://youtu.be/yaj9dfHjkOA We will be in IRC channel #wikimedia-research for taking your questions. :-) On Wed, Feb 11, 2015 at 5:21 PM, Dario Taraborelli < dtarabore...@wikimedia.org> wrote: > I am thrilled to announce our spe

Re: [Analytics] Welcome Joseph

2015-02-18 Thread Leila Zia
Welcome to the team, Joseph! b.t.w., I didn't know you have a background in NLP. That skill may become handy soon. ;-) On Wed, Feb 18, 2015 at 6:37 PM, Toby Negrin wrote: > Hi Everyone, > > I'd like to welcome Joseph Allemendou to the Analytics team! We are really > excited to get some of Josep

Re: [Analytics] analytics-store replag

2015-03-05 Thread Leila Zia
Hi Sean, Thanks for the email. The two create queries are mine. Should I kill one? Leila On Mar 5, 2015 7:09 AM, "Sean Pringle" wrote: > Just a heads-up: > > Analytics-store is seeing several hours of replag on s1, s4, and s6. > > s4 is me doing a commonswiki schema change, which should be d

Re: [Analytics] analytics-store replag

2015-03-05 Thread Leila Zia
Hi Sean, On Thu, Mar 5, 2015 at 9:59 PM, Sean Pringle wrote: > Hi Leila > > On Fri, Mar 6, 2015 at 1:38 AM, Leila Zia wrote: > > Hi Sean, > > > >Thanks for the email. The two create queries are mine. Should I kill > one? > > Lag has now reached 24h for

Re: [Analytics] [Cluster] Monitoring the impact Hive jobs have on the Analytics cluster

2015-03-08 Thread Leila Zia
This is really useful, Christian. Thanks for explaining and documenting it. Leila On Sat, Mar 7, 2015 at 6:14 AM, Christian Aistleitner < christ...@quelltextlich.at> wrote: > Hi, > > around running jobs on the Analytics cluster, I've sometime seen > people say in IRC: “Let's run this heavy job.

[Analytics] [Technical] which pageview definition

2015-03-15 Thread Leila Zia
Hi, I'm trying to figure out which of the two pageview definitions we currently have I can use for a question Bob and I are trying to address. It would be great if you share your thoughts. If you choose to do so, please do it by Tuesday, eod, PST. More details: *What are we doing?* We are bu

Re: [Analytics] Data on Wikidata description coverage

2015-03-19 Thread Leila Zia
Hi Dan, Here is one way: You can use http://tools.wmflabs.org/wikidata-terminator/ to go to the top 1000 missing descriptions in some languages. If you choose a specific language, you can download the full list for that languages. The lists are updated daily. Best, Leila On Thu, Mar 19, 2015

[Analytics] [Announcement] March 2015 Research Showcase

2015-03-20 Thread Leila Zia
Hi, This month's research showcase is scheduled for Wednesday, March 25, 11:30 (PST). We will have two presentations on user session identification by Aaron Halfaker, and mining missing hyperlinks in Wik

Re: [Analytics] Research Showcase Starting in 8 minutes!

2015-03-25 Thread Leila Zia
The youtube link has changed to: http://youtu.be/PHQqicVoVx4 On Wed, Mar 25, 2015 at 11:22 AM, Ellery Wulczyn wrote: > Today we will have two presentation: > > 1. User Session Identification by Aaron Halfaker > 2. Mining Missing Hyperlinks in Wikipedia by Bob West. > > You can follow the talk

Re: [Analytics] stats.grok.se not updating

2015-04-03 Thread Leila Zia
Hi Vipul, FYI, stats.grok.se is up again. Best, Leila On Wed, Apr 1, 2015 at 6:28 PM, Vipul Naik wrote: > Seems like stats.grok.se has at least a day's backlog (hasn't updated for > March 31, though it's already April 2 in UTC). Will it be updated soon? > > Vipul > > On Sun, Mar 22, 2015 a

Re: [Analytics] New fields in wmf.webrequest hive table

2015-04-10 Thread Leila Zia
Hi Joseph, Thanks for the update, and for doing this. These three items make the analysis of the data much easier on our end. We've had many requests in the past that required agent_type and access_method information and having them readily available is awesome! :-) Have a great weekend! Leil

Re: [Analytics] [Wiki-research-l] April 2015 research showcase: remix and reuse in collaborative communities; the oral citations debate

2015-04-30 Thread Leila Zia
A reminder that this event will start in 10 minutes. You can watch the event on YouTube here . As usual, we will be in #wikimedia-research for questions and chat. :-) On Thu, Apr 16, 2015 at 12:43 PM, Dario Taraborelli < dtarabore...@wikimedia.org> wrote: > I am thril

[Analytics] May 2015 research showcase

2015-05-11 Thread Leila Zia
Hi everyone, The next research showcase will be live-streamed this Wednesday, May 13 at 11.30 PT. The streaming link will be posted on the lists a few minutes before the showcase starts and as usual, you can join the conversation on IRC at #wikimedia-research. We look forward to seeing you! Leil

Re: [Analytics] Search dashboards are now running on live data

2015-05-22 Thread Leila Zia
On Fri, May 22, 2015 at 3:14 PM, Luis Villa wrote: > 68,000 searches/day seems *really* low, > right, but I'm not sure search sessions per day is the same as the number of searches per day. Oliver, what definition of a "search session" do you use? How do you compute it? Leila > Luis > > On Fr

Re: [Analytics] clicks on red links

2015-05-22 Thread Leila Zia
Hi Amir, As far as I know and as mentioned by others, the exact statistics you're looking for don't exist. More comments in-line. On Wed, May 20, 2015 at 10:37 PM, Amir E. Aharoni wrote: > Hi, > > Are there statistics about the number of people who click on red links in > Wikimedia projects?

Re: [Analytics] [Research-Internal] Revision history of deleted pages

2015-06-24 Thread Leila Zia
switching to the public list with Bob's permission. On Wed, Jun 24, 2015 at 1:58 PM, Robert West wrote: > Hi everyone, > > I'd like to find all enwiki articles that were ever marked with the > {{hoax}} template. Pages with that template mostly end up being deleted, so > they're not available in

Re: [Analytics] [Research-Internal] Revision history of deleted pages

2015-06-25 Thread Leila Zia
Aaron, any chance you know the answer to this question? I have a vague memory that we talked about deleted pages and their text some time back. This data should live somewhere, right? given that deleted pages can be restored. Thanks, Leila On Wed, Jun 24, 2015 at 2:03 PM, Leila Zia wrote

[Analytics] user table information

2015-06-27 Thread Leila Zia
Hi, For the article recommendation test, we queried user table to get editors' email addresses. We then excluded the emails that were not verified. We've received a comment here that suggests the use

Re: [Analytics] user table information

2015-06-29 Thread Leila Zia
Thanks a lot everyone. :-) On Mon, Jun 29, 2015 at 6:21 PM, Gergo Tisza wrote: > On Sat, Jun 27, 2015 at 2:30 PM, Leila Zia wrote: > >> For the article recommendation test, we queried user table to get >> editors' email addresses. We then excluded the emails that were

Re: [Analytics] Breakdown of unique visitors by country (and by project)

2015-09-08 Thread Leila Zia
Hi Cristian, On Tue, Sep 8, 2015 at 10:42 AM, Cristian Consonni wrote: > > we (Wikimedia Italia) are starting writing a proposal for a EU project > disclaimer: I'm not in Analytics. We don't have unique counts (per country/project, or otherwise) as you already guessed. If you can tell us more

Re: [Analytics] [Survey] Pageview API

2015-09-11 Thread Leila Zia
It's getting exciting. :-) I'd go with choice 2 since it gives more control to the user while offering what the user can get through choice 1 as well. Question: will we get page_ids or page_titles or both? It's good to have both. Leila On Fri, Sep 11, 2015 at 3:00 PM, Dan Andreescu wrote: > H

Re: [Analytics] Users changing language version through interwiki links

2015-09-12 Thread Leila Zia
Hi Strainu, On Sat, Sep 12, 2015 at 5:43 AM, Strainu wrote: > > I think for smaller wikis this would be an interesting way to know which > domains/articles to work on. > What I'm saying is not directly related to your data request but to your comment above: We've been working on a project to u

Re: [Analytics] Page visit request

2015-10-09 Thread Leila Zia
Hi Mark, On Fri, Oct 9, 2015 at 12:02 PM, Dan Andreescu wrote: > > To get the top 2 million articles you'll have to write a research proposal > and get one of the folks with access to volunteer to do this. This list is > probably your best chance of finding someone like that, just let everyone

Re: [Analytics] Echo databases on analytics-store?

2015-10-09 Thread Leila Zia
On Fri, Oct 9, 2015 at 1:26 PM, Neil P. Quinn wrote: > I'm trying to gather some stats on the use of Echo notifications across > wikis, and I'd like to join the `echo_events` table with the `user` table > for a given wiki. > I'm not sure what kind of information you need but there is a chance th

Re: [Analytics] Canonical location for metrics documentation

2015-10-14 Thread Leila Zia
On Wed, Oct 14, 2015 at 8:05 AM, Dan Andreescu wrote: > > > I'm not saying it's easy, but I think having documentation in more than > one place is an awful experience for newcomers. > I second this as a problem. I make a joke of it each time I want to explain to a newcomer what is documented whe

Re: [Analytics] Canonical location for metrics documentation

2015-10-14 Thread Leila Zia
Makes sense to me. On Wed, Oct 14, 2015 at 11:27 AM, Neil P. Quinn wrote: > Keep in mind that, when I say "metrics documentation", I'm not referring > to documentation about Hive > , the webrequest > logs

Re: [Analytics] [Spam] Re: User statistics for video marking ENWP 5m article milestone

2015-10-27 Thread Leila Zia
On Tue, Oct 27, 2015 at 9:56 AM, Aaron Halfaker wrote: > > If we want to critique how we communicate about something, we can't do it > in such general terms as "use 5+ edits". We need to know what meaning is > intended to be expressed. Only within the context of "meaning" can we talk > about "d

Re: [Analytics] A belated project completion shout-out

2015-10-29 Thread Leila Zia
Somehow I understand the firework language and this email better than all the other presentations we had so far. :D Congratulations, team! This is amazing! :-) L p.s. and I bake you a big cake the next time you're in town if you can wait until then. ;-) On Thu, Oct 29, 2015 at 2:40 PM, Dan Andre

Re: [Analytics] [link] Why Big Data Needs Thick Data

2016-01-27 Thread Leila Zia
search has not involved ethnographic research, however, it definitely has involved and will continue to involve a mix of qualitative and quantitative approaches. I hope this helps. And thanks again for starting this conversation. :-) Best, Leila Leila Zia Research Scientist Wikimedia Foundation On

Re: [Analytics] Zika

2016-02-14 Thread Leila Zia
​Hey Dan,​ On Sun, Feb 14, 2016 at 3:02 AM, Dan Andreescu wrote: > So, I felt personally compelled in the case of Zika, and the confusing > coverage it has seen, to offer to personally help. ​Which aspect of the coverage are you referring to as confusing?​ > I can run queries, test hypothes

Re: [Analytics] The WikiLove research project

2016-05-20 Thread Leila Zia
terest in this area, which is great. :) I'll follow up with Ashton. ​Best,​ Leila -- Leila Zia Research Scientist Wikimedia Foundation On Thu, May 19, 2016 at 2:05 PM, Ryan Kaldari wrote: > I think what they really want is research about whether or not WikiLove > has a positive imp

Re: [Analytics] [Wiki-research-l] question about Pageviews dumps

2016-06-28 Thread Leila Zia
+ Analytics On Tue, Jun 28, 2016 at 6:36 AM, Marc Miquel wrote: > Hello, > > I have a question for you regarding pageviews datadumps. > > I am considering to study reader engagement for different article topics > in different languages. Because of this, I would like to know if there is > any pl

Re: [Analytics] [Wiki-research-l] question about Pageviews dumps

2016-07-01 Thread Leila Zia
Hi Marc, On Tue, Jun 28, 2016 at 6:36 AM, Marc Miquel wrote: > Since this would be for a research project I might ask funding for it, I > would like to know if I could count on that, what is the nature of the > available data, and what would be the procedure to obtain this data and if > there w

Re: [Analytics] Heatmaps of readership and editor populations

2016-07-13 Thread Leila Zia
Hi Pine, Anything you can reuse from https://meta.wikimedia.org/wiki/Research:Mobile_trends ? Best, Leila Leila Zia Research Scientist Wikimedia Foundation On Wed, Jul 13, 2016 at 11:26 AM, Pine W wrote: > Hi Analytics, > > Are there heatmaps somewhere that show the geographic dis

Re: [Analytics] [Pageview API] Data Retention Question

2016-07-29 Thread Leila Zia
Dan, Thanks for reaching out. 18 months is enough for my use cases as long as the dumps capture the exact data structure. Best, Leila -- Leila Zia Senior Research Scientist Wikimedia Foundation On Fri, Jul 29, 2016 at 11:51 AM, Amir E. Aharoni < amir.ahar...@mail.huji.ac.il> wrote: >

Re: [Analytics] Analysing link

2016-08-26 Thread Leila Zia
On Fri, Aug 26, 2016 at 1:38 AM, Federico Leva (Nemo) wrote: > Jan Dittrich, 26/08/2016 10:03: > >> or even click paths >> > > Do you know about https://meta.wikimedia.org/wik > i/Research:Improving_link_coverage/Release_page_traces ? > ​and https://meta.wikimedia.org/wiki/Research:Wikipedia_Nav

Re: [Analytics] Getting search engine terms for specific wikibook?

2016-09-06 Thread Leila Zia
an be made possible, please read here <https://www.mediawiki.org/wiki/Wikimedia_Research/Formal_collaborations>. Leila Zia Senior Research Scientist Wikimedia Foundation On Mon, Sep 5, 2016 at 11:19 AM, Nuria Ruiz wrote: > >By the way, what about alternate, external methods such as subsc

Re: [Analytics] Split testing example implementations

2016-09-07 Thread Leila Zia
Hi Jan, I don't know of documented examples (the A/B testing design depends on the question you want to answer). If you want to chat about this more, I'd be happy to brainstorm with you about your options. Message me off-list and we can set up a time if that's helpful. Best, L

[Analytics] Fwd: [Research-Internal] Fwd: Dumps Rewrite getting underway (help needed!)

2016-09-13 Thread Leila Zia
​FYI -- Forwarded message -- From: Ariel Glenn WMF Date: Mon, Sep 12, 2016 at 9:07 AM Subject: [Research-Internal] Fwd: Dumps Rewrite getting underway (help needed!) To: research-inter...@lists.wikimedia.org -- Forwarded message -- From: Ariel Glenn WMF Date: Mo

[Analytics] Upcoming Research Showcase, November 16, 2016

2016-11-09 Thread Leila Zia
27;t make it, please feel free to watch the video later and get in touch with us with questions/comments. :) Best, Leila -- Leila Zia Senior Research Scientist Wikimedia Foundation ​[1] WMF Research and researchers from three academic institutions: EPFL, GESIS, and Stanford University, in colla

Re: [Analytics] ensuring reader anonymity

2016-11-11 Thread Leila Zia
​Hi Pine, On Fri, Nov 11, 2016 at 10:39 AM, Pine W wrote: > On Fri, Nov 11, 2016 at 9:25 AM, Leila Zia wrote: > >> Nuria, regarding the IP addresses specifically (not the proxy, for which, >> I'll need more time to go through the use-cases we've had and see if we ca

Re: [Analytics] ensuring reader anonymity

2016-11-11 Thread Leila Zia
hashing IP addresses in webrequest logs. Best, Leila On Fri, Nov 11, 2016 at 11:16 AM, Leila Zia wrote: > ​Hi Pine, > > On Fri, Nov 11, 2016 at 10:39 AM, Pine W wrote: > >> On Fri, Nov 11, 2016 at 9:25 AM, Leila Zia wrote: >> >>> Nuria, regarding the IP addresses

Re: [Analytics] Upcoming Research Showcase, November 16, 2016

2016-11-16 Thread Leila Zia
Hi all, A reminder that this is happening in 2 hours from now. Best, Leila On Wed, Nov 9, 2016 at 2:29 PM, Leila Zia wrote: > [Apologies for cross-posting] > > Hi everyone, > > Almost a year ago, we [1] embarked on a research project to understand who > Wikipedia

Re: [Analytics] Fwd: [Query Logs] Research:Understanding Wikidata Queries

2017-01-16 Thread Leila Zia
On Tue, Jan 3, 2017 at 9:30 AM, Stas Malyshev wrote: > Hi! > > > 1. Is there a unique key for the query log? The log I am refering to > > is the *wdqs_extract* table**from > > the hive database wmf.**We would like to be able to > > permanently link our own computed data with the l

Re: [Analytics] On Wikipedia edits archive per county.

2017-01-17 Thread Leila Zia
brainstorm with you about other innovative ways you can use Wikipedia/Wikimedia data for the purposes of these reports. If that's of interest, please ping me off-list. Best, Leila Leila Zia Senior Research Scientist Wikimedia Foundation On Tue, Jan 17, 2017 at 8:51 AM, Nuria Ruiz wrote

Re: [Analytics] stats.grok.se used in study about Snowden and internet traffic

2017-01-18 Thread Leila Zia
+ Juliet, as this is something Communications may want to follow up given that stats.groke.se is not maintained by a Wikimedia Foundation member. Thanks for sharing this. Leila Leila Zia Senior Research Scientist Wikimedia Foundation On Wed, Jan 18, 2017 at 9:00 AM, Andrew Otto wrote: >

Re: [Analytics] On Wikipedia edits archive per county.

2017-01-25 Thread Leila Zia
t; Thank you for your reply. Can we schedule a call for tomorrow or Monday >> morning? Let me know what times work for you best. >> >> Sincerely, >> >> Rafael Escalona Reynoso. >> >> On Tue, Jan 17, 2017 at 12:21 PM Leila Zia wrote: >> >>>

Re: [Analytics] web log data

2017-03-06 Thread Leila Zia
I'm sorry that we cannot be of more help for your research at this point. Best, Leila [1] https://www.mediawiki.org/wiki/Wikimedia_Research/Formal_collaborations#How_are_formal_research_collaborations_created.3F are met and Leila Zia Senior Research Scientist Wikimedia Foundation On Mon, Ma

Re: [Analytics] [Wiki-research-l] Wikipedia Detox: Scaling up our understanding of harassment on Wikipedia

2017-06-21 Thread Leila Zia
harassment research team meets every 2 weeks, if you're curious what's going on on this front and on our end and you want to listen in, please ping me. And, thank you for the offer to help. We may take you up on that. :) Best, Leila -- Leila Zia Senior Research Scientist Wikimedia Founda

Re: [Analytics] [Wiki-research-l] Wikipedia Detox: Scaling up our understanding of harassment on Wikipedia

2017-06-22 Thread Leila Zia
lt;https://arxiv.org/abs/1610.08914>, Section 3 should give you a relatively detailed description of how this question was approached. Best, Leila Pine > > On Wed, Jun 21, 2017 at 2:08 PM, Leila Zia wrote: > >> Hi Dan, >> >> Thanks for your note. :) >> >>

Re: [Analytics] new mediawiki_history snapshot available

2017-07-12 Thread Leila Zia
On Wed, Jul 12, 2017 at 12:16 PM, Nuria Ruiz wrote: > Further clarification that this snapshot of data is not yet public (meaning > available to the outside world, not just WMF/NAD holders) . Thanks for clarifying this and the work you and your team has put into this. > Our team is working towar

Re: [Analytics] new mediawiki_history snapshot available

2017-07-12 Thread Leila Zia
u! > > On Wed, Jul 12, 2017 at 12:22 PM, Leila Zia wrote: >> >> On Wed, Jul 12, 2017 at 12:16 PM, Nuria Ruiz wrote: >> > Further clarification that this snapshot of data is not yet public >> > (meaning >> > available to the outside world, not just W

Re: [Analytics] Analytics project request

2017-07-24 Thread Leila Zia
I'll review Daniel's email and will get back to him/you on this list in the next day or so. Leila -- Leila Zia Senior Research Scientist Wikimedia Foundation On Mon, Jul 24, 2017 at 7:59 AM, Nuria Ruiz wrote: > Daniel, > > Singining an NDA is not enough to get access to

Re: [Analytics] Analytics project request

2017-07-24 Thread Leila Zia
Plan/2017-2018/Final/Community_Health#Segment_3:_Research_on_harassment CD - Structured Data https://meta.wikimedia.org/wiki/Wikimedia_Foundation_Annual_Plan/2017-2018/Final/Structured_Data#Segment_4:_Programs [2] https://meta.wikimedia.org/wiki/Research:Wikipedia_clickstream -- Leila Zia Senior Research

Re: [Analytics] research process (was Re: Google Code-in: Get your tasks for young contributors prepared!)

2017-11-03 Thread Leila Zia
r best. The ticket for tracking this task is https://phabricator.wikimedia.org/T179693 . Best, Leila ​ -- Leila Zia Senior Research Scientist Wikimedia Foundation ​ > > > /Lars > > ___ > Analytics mailing list > Analytics@list

Re: [Analytics] Research Showcase Wednesday, November 15, 2017 at 11:30 AM (PST) 18:30 UTC

2017-11-15 Thread Leila Zia
On Wed, Nov 15, 2017 at 11:02 AM, Jan Ainali wrote: > Wasn't 18:30 UTC was 30 minutes ago? That seems to be a typo. It's at 19:30 UTC. Sorry about that. Best, Leila > > Med vänliga hälsningar > Jan Ainali > http://ainali.com > > 2017-11-15 19:53 GMT+01:00 Sarah R : >> >> Hi Everyone, >> >> Just

Re: [Analytics] research process (was Re: Google Code-in: Get your tasks for young contributors prepared!)

2017-11-17 Thread Leila Zia
s, and Tech Ops commitment and work. Thank you for your understanding, and I'm here to help if someone else picks up this task and they need Research input. Best, Leila -- Leila Zia Senior Research Scientist Wikimedia Foundation On Tue, Nov 7, 2017 at 12:22 PM, Nuria Ruiz wrote: > I

Re: [Analytics] Wikipedia aggregate clickstream data released

2018-01-16 Thread Leila Zia
contributed to it at https://blog.wikimedia.org/2018/01/16/wikipedia-rabbit-hole-clickstream/ Best, Leila -- Leila Zia Senior Research Scientist Wikimedia Foundation On Tue, Feb 17, 2015 at 11:00 AM, Dario Taraborelli < dtarabore...@wikimedia.org> wrote: > We’re glad to announce the rele

Re: [Analytics] How best to accurately record page interactions in Page Previews

2018-01-17 Thread Leila Zia
Hi Sam, On Wed, Jan 17, 2018 at 1:51 AM, Sam Smith wrote: > IMO #1 is preferable from the operations and performance perspectives as the > response is always served from the edge and includes very few headers, > whereas the request in #2 may be served by the application servers if the > user is

Re: [Analytics] Fwd: Help needed on web request analytics

2018-02-02 Thread Leila Zia
Update: I've read this thread and am in contact with the researcher on a separate thread. Best, Leila -- Leila Zia Senior Research Scientist Wikimedia Foundation On Mon, Jan 29, 2018 at 12:20 AM, Joseph Allemandou wrote: > Hi Simon, > I copy the analytics mailing list to this messa

Re: [Analytics] [Research-Internal] New SWAP (Jupyter Notebook) servers and updates!

2018-03-22 Thread Leila Zia
On Thu, Mar 22, 2018 at 12:34 PM, Andrew Otto wrote: > But there is good news too! Last week I rsynced everyone’s home directories > from notebook1001 over to notebook1003. Thanks for doing this. > OOooOo and there’s even more good news! I’ve made the notebooks able to > access system site pa

Re: [Analytics] Monitor the number of Wikipedia sites and the number of articles in each site

2018-03-29 Thread Leila Zia
Hi Victor et al., [going to a slight tangent.] On Thu, Mar 29, 2018 at 12:54 AM, Zainan Zhou (a.k.a Victor) > wrote: > >> >> >> As I mentioned to you, my team works on extracting the knowledge from >> Wikipedia. Currently it's undergoing a project that expands language >> coverage. >> > ​Please

  1   2   >