Re: [Analytics] Article about ML in production woes

2019-02-07 Thread Goran Milovanovic
Hi Andrew, I have recently started a six month AI/Machine Learning Engineering course which focuses exactly on the topics that you've shown interest in. So, >>> I'd love it if we had a working group (or whatever) that focused on how to standardize how we train and deploy ML for production use.

Re: [Analytics] [Research-Internal] Article about ML in production woes

2019-02-07 Thread Nuria Ruiz
Team, Since everyone is here, we will be working on a machine learning infrastructure program this year. I will set up meetings with everyone on this thread and some others in SRE and Audiences to get a "bag of requests" of things that are missing, first round of talks that I hope to finish next

Re: [Analytics] [Research-Internal] Article about ML in production woes

2019-02-07 Thread Miriam Redi
Hey Andrew! Thank you so much for sharing this and starting this conversation. We had a meeting at All Hands with all people interested in "Image Classification" https://phabricator.wikimedia.org/T215413 , and one of the open questions was exactly how to find a "common repository" for ML models that

Re: [Analytics] Article about ML in production woes

2019-02-07 Thread Aaron Halfaker
Just gave the article a quick read. I think this article pushes on some key issues for sure. I definitely agree with the focus on python/jupyter as essential for a productive workflow that leverages the best from research scientists. We've been thinking about what ORES 2.0 would look like and

Re: [Analytics] Further Development of Wikipedia statistics

2019-02-07 Thread Nuria Ruiz
Hello, Several things come to mind: Top views provides much of this info digested in a way that would not be hard to calculate what you want, gets data from pageviewAPI and does some useful filtering: https://tools.wmflabs.org/topviews/?project=de.wikipedia.org=all-access=last-month= You

Re: [Analytics] [Wikimedia-l] Farewell, Erik!

2019-02-07 Thread Leinonen Teemu
Hi Erik, When I saw the Wikistats the very first time in mid 2000 (?) I was very impressed. After meeting with Erik, I respected the project and him even more. The impact of Wikistats on researchers and students around the world, but also on the open data movement in general, has been

Re: [Analytics] [Wikimedia-l] Farewell, Erik!

2019-02-07 Thread Brad Patrick
Erik: From the early days until now, your quiet leadership and excellence have been a great credit to the organization and most importantly, your leadership by example has been an inspiration to untold numbers of people. But, actually, it’s not untold numbers because of your work! You tell it

[Analytics] Further Development of Wikipedia statistics

2019-02-07 Thread WikiPeter-HH
Hi, in light of the current switch from Wikistats 1 to Wikistats 2 I would like to express a strong desire to get some additional features for the statistics. The rationale for this request is described below: 1. How many articles account for 90 / 95 / 99 percent of all page views over a certain
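A minimal sketch of the first question in this request, assuming per-article view counts are already available as a list of integers (for example, exported from the pageview data mentioned elsewhere in this thread). The function name and the sample numbers are hypothetical and serve only as illustration; this is not how Wikistats computes anything.

```python
def articles_needed_for_share(view_counts, percent):
    """Return the smallest number of top-viewed articles whose combined
    views reach `percent` percent of all views.

    `view_counts` is a hypothetical list of per-article view totals.
    Integer arithmetic is used so that thresholds such as 90 or 99
    are compared exactly, without floating-point rounding.
    """
    if not 0 < percent <= 100:
        raise ValueError("percent must be in the range (0, 100]")

    ranked = sorted(view_counts, reverse=True)  # most viewed first
    total = sum(ranked)
    if total == 0:
        return 0

    running = 0
    for count, views in enumerate(ranked, start=1):
        running += views
        # Same test as running / total >= percent / 100, in integers.
        if running * 100 >= percent * total:
            return count
    return len(ranked)


# Made-up sample data: six articles with 100 views in total.
sample_views = [50, 30, 10, 5, 3, 2]
for pct in (90, 95, 99):
    print(pct, articles_needed_for_share(sample_views, pct))
```

Sorting first keeps the sketch short; for a full wiki the same cumulative sum could be computed over a pre-ranked list instead.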

Re: [Analytics] [Wikimedia-l] Farewell, Erik!

2019-02-07 Thread Philippe Beaudette
Like so many others, I was blown away by wikistats. I can’t begin to count the number of times I turned to it in my years at the WMF. And it goes without saying that Erik was an exemplary colleague, and a true gentleman. Enjoy your well-earned retirement. Philippe On Wed, Feb 6, 2019 at 9:27

[Analytics] Article about ML in production woes

2019-02-07 Thread Andrew Otto
Just came across https://www.confluent.io/blog/machine-learning-with-python-jupyter-ksql-tensorflow In it, the author discusses some of what he calls the 'impedance mismatch' between data engineers and production engineers. The links to Uber's Michelangelo