[Wikitech-l] Testing recitation-bot

2017-03-19 Thread Anthony Di Franco
fully on in the recent past? Who would be best to approach with such a request? Thanks any and all for any advice you can offer. Anthony ___ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] pywikibot troubleshooting in recitation-bot

2017-01-05 Thread Anthony Di Franco
135986 Any advice on managing this? On Sat, Dec 24, 2016 at 1:36 PM Legoktm wrote: Hi, +cc pywiki...@lists.wikimedia.org On 12/22/2016 03:55 PM, Anthony Di Franco wrote: > Hi all, > I'm doing some renovations on recitation-bot and running into trouble when > the time comes

[Wikitech-l] pywikibot troubleshooting in recitation-bot

2016-12-22 Thread Anthony Di Franco
bernetes rather than sun grid engine, so I suspect that the login state might not be making it into the container - can anyone advise on where the login state is maintained and whether this will be transferred into the kubernetes container? Thanks, Anthony

[Wikitech-l] Technical advice on expert review?

2016-04-16 Thread Anthony Cole
ugh I'm getting faster, and transcribing offline comments from the Word document into the wiki table will be a small chore. The wiki table pastes easily into Word with highlighting and formatting intact, but not vice versa. (I've also asked at Village pump (technical).) Any th

Re: [Wikitech-l] Release candidate for 1.24.0

2014-11-23 Thread Anthony Cole
Ignore my last post - I appended it to the wrong thread. Anthony Cole <http://en.wikipedia.org/wiki/User_talk:Anthonyhcole>

Re: [Wikitech-l] changing edit summaries

2014-11-23 Thread Anthony Cole
suke Matsubara's description, the extension he mentions above seems like it might fit our needs. Will it enable patrollers to add a comment to the edit summary? Does anyone know if it works on en.Wikipedia? 1. https://lists.wikimedia.org/pipermail/wikitech-l/2014-November/079418.html Anthony

Re: [Wikitech-l] Release candidate for 1.24.0

2014-11-22 Thread Anthony Cole
On Thu, Nov 13, 2014 at 9:57 PM, Yusuke Matsubara wrote: > On Thu, Nov 13, 2014 at 9:15 PM, Amir E. Aharoni wrote: >> I tried looking for it in Bugzilla; I expected to find a t

Re: [Wikitech-l] Revision metadata as a service?

2014-11-12 Thread Anthony Cole
comment? Would it interfere in any way with the normal practice of other editors who don't have that permission? Anthony Cole <http://en.wikipedia.org/wiki/User_talk:Anthonyhcole> On Wed, Nov 12, 2014 at 3:05 AM, Bartosz Dziewoński wrote: > On Wednesday, 12 November 201

Re: [Wikitech-l] Revision metadata as a service?

2014-11-11 Thread Anthony Cole
t Med Foundation are discussing this at the moment, and we see it as a very effective step toward safeguarding and improving our medical offering. If you could do this for us, it would be very much appreciated. Anthony Cole <http://en.wikipedia.org/wiki/User_talk:Anthonyhcole> On Mon

Re: [Wikitech-l] Top Level Design, LLC greedy registrar!

2014-04-05 Thread Anthony
The Chinese word for "wiki" is not "wiki". As far as sharing my own personal certainties, https://en.wikipedia.org/wiki/Generic_trademark would be a start, but for the most part certainties aren't something that can easily be shared. On Fri, Apr 4, 2014 at 5:12 PM, Federico Leva (Nemo) wrote: >

Re: [Wikitech-l] .wiki gTLD

2014-02-22 Thread Anthony
I wouldn't think any of those other than perhaps "media.wiki" would implicate a WMF trademark. As far as MediaWiki, WMF does claim a trademark on that. On Sat, Feb 22, 2014 at 5:17 AM, addshorewiki wrote: > en.wiki > data.wiki > meta.wiki > media.wiki > en.books.wiki > en.voyage.wiki > > Most of

Re: [Wikitech-l] [Xmldatadumps-l] Compressing full-history dumps faster

2014-01-21 Thread Anthony
If you're going to use xz then you wouldn't even have to recompress the blocks that haven't changed and are already well compressed. On Tue, Jan 21, 2014 at 5:26 PM, Randall Farmer wrote: > Ack, sorry for the (no subject); again in the right thread: > > > For external uses like XML dumps integr

Re: [Wikitech-l] Fwd: Participation in an Aaron Swartz Hackathon event

2013-10-11 Thread Anthony
It wasn't really a joke. On Oct 11, 2013 5:34 PM, "Petr Bena" wrote: > That's not a funny joke... > > On Fri, Oct 11, 2013 at 7:38 PM, Anthony wrote: > > Which websites are you planning on hacking into? > > > > > > On Fri, Oct 11, 2013 at 12

Re: [Wikitech-l] Fwd: Participation in an Aaron Swartz Hackathon event

2013-10-11 Thread Anthony
Which websites are you planning on hacking into? On Fri, Oct 11, 2013 at 12:17 PM, Quim Gil wrote: > There is a plan for a worldwide round of Aaron Hackathons, on the upcoming > Nov 8-10 weekend. > > http://aaronswartzhackathon.org/ > > Coordination: > https

Re: [Wikitech-l] How's the SSL thing going?

2013-08-01 Thread Anthony
On Thu, Aug 1, 2013 at 12:52 AM, Jeremy Baron wrote: > On Thu, Aug 1, 2013 at 4:28 AM, Anthony wrote: > > Does rapid key rotation in any way make a MITM attack less detectable? > > Presumably the NSA would have no problem getting a fraudulent certificate > > signed by

Re: [Wikitech-l] How's the SSL thing going?

2013-07-31 Thread Anthony
On Wed, Jul 31, 2013 at 5:59 PM, George Herbert wrote: > The second is site key security (ensuring the NSA never gets your private > keys). Who theoretically has access to the private keys (and/or the signing key) right now? The third is perfect forward security with rapid key rotation. > Does

Re: [Wikitech-l] Git for idiots

2013-05-08 Thread Anthony
I guess the viewpoint and perspective of the more experienced users may be different. The veterans may start to take some knowledge for granted, as something they think people would already know. For example, the underlying concept of git commit is something that I now take for

Re: [Wikitech-l] Prototyping Wiki Inline Comments

2013-05-04 Thread Anthony
iel Mietchen < daniel.mietc...@googlemail.com> wrote: > Hi Anthony, > > interesting feature. How would the system handle cases in which the > content originally pointed at when making the initial inline comment > has been changed? > > Daniel > > > On Fri, May

[Wikitech-l] Prototyping Wiki Inline Comments

2013-05-03 Thread Anthony
Dear all, I have applied for the Prototyping Inline Comments for the Google Summer of Code. Essentially, the project is an extension that allows any wiki user to select text and then make an inline comment or a reply to an existing inline comment. Imagine: a user lands in a Wikipedia article, sel

Re: [Wikitech-l] Why are we still using captchas on WMF sites?

2013-01-21 Thread Anthony
On Mon, Jan 21, 2013 at 3:00 AM, David Gerard wrote: > I mean, you could redefine "something that doesn't block all spambots > but does hamper a significant proportion of humans" as "successful", > but it would be a redefinition. It's not a definition, it's a judgment. And whether or not it's a

Re: [Wikitech-l] Update on IPv6

2012-06-16 Thread Anthony
On Thu, Jun 14, 2012 at 12:54 AM, Daniel Friesen wrote: > On Tue, 12 Jun 2012 05:14:07 -0700, Anthony wrote: > >> On Sun, Jun 10, 2012 at 2:03 PM, Marcin Cieslak wrote: >>> >>> You *DON'T* want to >>> renumber your whole home network every time your I

Re: [Wikitech-l] Update on IPv6

2012-06-12 Thread Anthony
On Sun, Jun 10, 2012 at 2:28 PM, David Gerard wrote: > On 9 June 2012 21:51, Anthony wrote: > >> Here at BestISP, we assign you a unique number that you can never >> change!  We attach this unique number to all your Internet >> communications, so that every time you go

Re: [Wikitech-l] Update on IPv6

2012-06-12 Thread Anthony
On Sun, Jun 10, 2012 at 2:03 PM, Marcin Cieslak wrote: > You *DON'T* want to > renumber your whole home network every time your ISP changes your IPv6 > prefix. If only they had some service which converted easy to remember names into IPv6 addresses. > Just because some people got away with t

Re: [Wikitech-l] Update on IPv6

2012-06-09 Thread Anthony
On Sat, Jun 9, 2012 at 4:29 PM, Anthony wrote: > On Fri, Jun 8, 2012 at 9:59 AM, Strainu wrote: >> 2012/6/8 Anthony : >>> No one has to break the loop.  The loop will break itself.  Either >>> enough people will get sick of NAT to cause demand for IPv6, or they >

Re: [Wikitech-l] Update on IPv6

2012-06-09 Thread Anthony
On Fri, Jun 8, 2012 at 9:59 AM, Strainu wrote: > 2012/6/8 Anthony : >> No one has to break the loop.  The loop will break itself.  Either >> enough people will get sick of NAT to cause demand for IPv6, or they >> won't. > > That's one way of seeing things, but

Re: [Wikitech-l] Update on IPv6

2012-06-09 Thread Anthony
On Sat, Jun 9, 2012 at 7:51 AM, Daniel Friesen wrote: > On Fri, 08 Jun 2012 03:49:01 -0700, Risker wrote: >> Do this now, please.  Even I can see how easy it ought to be to replace >> the last >> three digits of an IPv4 address with XXX in publicly viewable lists >> and logs and reduce the pub
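
The masking idea quoted above (replacing the tail of an IPv4 address with XXX before it appears in public lists and logs) can be sketched roughly as follows. This is only an illustration: the function name `mask_ipv4` is hypothetical, and the sketch assumes that "the last three digits" means the whole final octet. It is not how MediaWiki handles addresses.

```python
import ipaddress


def mask_ipv4(address: str) -> str:
    """Return the address with its final octet replaced by 'XXX'.

    Hypothetical helper for illustration only.
    """
    # Validate first, so malformed input raises ValueError instead of
    # being copied unmasked into a public log.
    parsed = ipaddress.IPv4Address(address)
    octets = str(parsed).split(".")
    return ".".join(octets[:3] + ["XXX"])


# 192.0.2.0/24 is reserved for documentation, so this is not a real user.
print(mask_ipv4("192.0.2.57"))
```

Masking the final octet still leaves the /24 network visible, so this reduces exposure rather than removing it.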

Re: [Wikitech-l] Update on IPv6

2012-06-08 Thread Anthony
On Fri, Jun 8, 2012 at 4:08 AM, Strainu wrote: > Risker, I think you're over-reacting here. Yes, there are risks > associated with IPv6. No, they haven't been addressed completely > before IPv6 day (apparently because of the very late moment the > decision to participate was taken). But it hasn't

Re: [Wikitech-l] [Wikimedia-l] Update on IPv6

2012-06-02 Thread Anthony
On Sat, Jun 2, 2012 at 9:59 AM, Leslie Carr wrote: > On Sat, Jun 2, 2012 at 6:13 AM, Anthony wrote: >> On Sat, Jun 2, 2012 at 8:49 AM, Thomas Dalton >> wrote: >>> On 2 June 2012 13:44, Anthony wrote: >>>> On Fri, Jun 1, 2012 at 7:27 PM, John Du Hart wrote

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table

2011-09-20 Thread Anthony
Thanks for the explanation. I guess I see what you're getting at now. Sorry I didn't see it sooner. On Tue, Sep 20, 2011 at 8:50 PM, Brion Vibber wrote: > On Tue, Sep 20, 2011 at 5:36 PM, Anthony wrote: > >> On Tue, Sep 20, 2011 at 3:37 PM, Happy Melon wrote: >

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table

2011-09-20 Thread Anthony
On Tue, Sep 20, 2011 at 3:37 PM, Happy Melon wrote: > It may or may not be an architecturally-better design to have it as a > separate table, although considering how rapidly MW's 'architecture' changes > I'd say keeping things as simple as possible is probably a virtue.  But that > is the basis o

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table

2011-09-20 Thread Anthony
On Tue, Sep 20, 2011 at 9:34 AM, Domas Mituzas wrote: >> >> Ah, okay.  I remember that's what happened in MyISAM but I figured >> they had that fixed in InnoDB. > > InnoDB has optimized path for index builds, not for schema changes. No support for built-in function-based indexes, right? (I searc

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table

2011-09-20 Thread Anthony
On Mon, Sep 19, 2011 at 10:39 PM, Daniel Friesen wrote: > On 11-09-19 06:39 PM, Anthony wrote: >> On Mon, Sep 19, 2011 at 3:57 PM, Brion Vibber wrote: >>> That's probably the simplest solution; adding a new empty table will be very >>> quick. It may make i

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table

2011-09-19 Thread Anthony
On Mon, Sep 19, 2011 at 3:57 PM, Brion Vibber wrote: > That's probably the simplest solution; adding a new empty table will be very > quick. It may make it slower to use the field though, depending on what all > uses/exposes it. Isn't adding a new column with all NULL values quick too?

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table (discussing r94289)

2011-09-18 Thread Anthony
On Sun, Sep 18, 2011 at 7:20 PM, Anthony wrote: > On Sun, Sep 18, 2011 at 7:07 PM, bawolff wrote: >> Anthony wrote: >> The pages you link to seem to indicate he's nothing more than a >> willy-on-wheels type vandal, who at worst tricked an admin into doing >> a d

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table (discussing r94289)

2011-09-18 Thread Anthony
On Sun, Sep 18, 2011 at 7:07 PM, bawolff wrote: > Anthony wrote: >> It does not involve generating hash collisions, but it involves >> finding various bugs in mediawiki and using them to vandalise, often >> by injecting javascript.  The best description I could find w

Re: [Wikitech-l] Fwd: Adding MD5 / SHA1 column to revision table (discussing r94289)

2011-09-18 Thread Anthony
On Sun, Sep 18, 2011 at 6:00 PM, Roan Kattouw wrote: > On Sun, Sep 18, 2011 at 11:00 PM, Anthony wrote: >> Now I don't know how important the CPU differences in calculating the >> two versions would be.  If they're significant enough, then fine, use >> MD5, but m

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table (discussing r94289)

2011-09-18 Thread Anthony
On Sun, Sep 18, 2011 at 6:01 PM, Anthony wrote: > There's also a > description at http://en.wikipedia.org/wiki/User:Grawp , which does > not do justice to the "mad hacker skillz" of this individual and his > intent on finding bugs in mediawiki and exploiting them.

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table (discussing r94289)

2011-09-18 Thread Anthony
On Sun, Sep 18, 2011 at 5:50 PM, Chad wrote: > On Sun, Sep 18, 2011 at 5:47 PM, Anthony wrote: >> On Sun, Sep 18, 2011 at 5:30 PM, Chad wrote: >>> On Sun, Sep 18, 2011 at 7:24 AM, Russell N. Nelson - rnnelson >>> wrote: >>>> It is meaningless to talk ab

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table (discussing r94289)

2011-09-18 Thread Anthony
On Sun, Sep 18, 2011 at 5:30 PM, Chad wrote: > On Sun, Sep 18, 2011 at 7:24 AM, Russell N. Nelson - rnnelson > wrote: >> It is meaningless to talk about cryptography without a threat model, just as >> Robert says. Is anybody actually attacking us? Or are we worried about >> accidental collision

[Wikitech-l] Fwd: Adding MD5 / SHA1 column to revision table (discussing r94289)

2011-09-18 Thread Anthony
On Sun, Sep 18, 2011 at 1:55 AM, Robert Rohde wrote: > If collision attacks really matter we should use SHA-1. If collision attacks really matter you should use, at least, SHA-256, no? > However, do > any of the proposed use cases care about whether someone might > intentionally inject a collisi
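
For readers comparing the three hash functions named in this thread, the following sketch computes each over the same text with Python's standard `hashlib` and reports the digest size. The sample text is made up for the example. Digest length is only one factor: it says nothing about the known collision attacks on MD5 and SHA-1 that the thread is arguing about.

```python
import hashlib

# Made-up stand-in for the text of one revision.
revision_text = "Overall, the city is relatively flat.".encode("utf-8")

digest_bits = {}
for name in ("md5", "sha1", "sha256"):
    hex_digest = hashlib.new(name, revision_text).hexdigest()
    # Each hexadecimal character encodes 4 bits of the digest.
    digest_bits[name] = len(hex_digest) * 4
    print(f"{name:>6}: {digest_bits[name]} bits  {hex_digest}")
```

A wider digest means a wider column in the revision table, which is part of the storage trade-off raised elsewhere in the discussion.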

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table (discussing r94289)

2011-09-18 Thread Anthony
On Sun, Sep 18, 2011 at 2:33 AM, Ariel T. Glenn wrote: > On 17-09-2011, Sat, at 22:55 -0700, Robert Rohde wrote: >> On Sat, Sep 17, 2011 at 4:56 PM, Anthony wrote: > > > >> > For offline analyses, there's no need to change the online d

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table (discussing r94289)

2011-09-18 Thread Anthony
On Sun, Sep 18, 2011 at 7:24 AM, Russell N. Nelson - rnnelson wrote: > It is meaningless to talk about cryptography without a threat model, just as > Robert says. Is > anybody actually attacking us? You mean, like Grawp?

Re: [Wikitech-l] Adding MD5 / SHA1 column to revision table (discussing r94289)

2011-09-17 Thread Anthony
On Sat, Sep 17, 2011 at 6:46 PM, Robert Rohde wrote: > Is there a good reason to prefer SHA-1? > > Both have weaknesses allowing one to construct a collision (with > considerable effort) Considerable effort? I can create an MD5 collision in a few minutes on my home computer. Is there anything e

Re: [Wikitech-l] Licensing (Was: WYSIWYG and parser plans)

2011-05-06 Thread Anthony
On Fri, May 6, 2011 at 2:37 PM, Chad wrote: > On Fri, May 6, 2011 at 2:14 PM, Brion Vibber wrote: >> I'd like to respectfully ask that this thread be taken offlist, perhaps to a >> wiki page or a private thread among those who are interested. >> >> There's no active intent to change any licensing

Re: [Wikitech-l] Licensing (Was: WYSIWYG and parser plans)

2011-05-06 Thread Anthony
On Fri, May 6, 2011 at 2:24 PM, Trevor Parscal wrote: > "Dynamic linking" implies we have something to dynamically link in the first > place. A parser library consisting of compiled PHP in this particular case. > > Let's just cross this hypothetical bridge when we come to it, shall we? I guess, b

Re: [Wikitech-l] Licensing (Was: WYSIWYG and parser plans)

2011-05-06 Thread Anthony
On Fri, May 6, 2011 at 10:55 AM, Bryan Tong Minh wrote: > Can we stop discussing this issue? I believe that most MediaWiki > developers are in fact not interested in changing the status quo with > regards to licensing, so there is no point in discussing it. That there isn't going to be a license

Re: [Wikitech-l] Licensing (Was: WYSIWYG and parser plans)

2011-05-06 Thread Anthony
's no different than piping to/from gzip, and I don't think anyone argues that *that* creates a derivative work. On Fri, May 6, 2011 at 10:38 AM, Jay Ashworth wrote: >> From: "Anthony" >> On Tue, May 3, 2011 at 6:56 PM, Aryeh Gregor >> > You can always *use*

Re: [Wikitech-l] Licensing (Was: WYSIWYG and parser plans)

2011-05-06 Thread Anthony
On Tue, May 3, 2011 at 6:56 PM, Aryeh Gregor wrote: > You can still link it with proprietary code as long as you don't > distribute the result, so it would be fine for research projects or > similar that rely on proprietary components. What happens if one of your employees or volunteers distribut

Re: [Wikitech-l] Licensing (Was: WYSIWYG and parser plans)

2011-05-06 Thread Anthony
On Fri, May 6, 2011 at 9:41 AM, Jay Ashworth wrote: > - Original Message - >> From: "Anthony" > >> On Wed, May 4, 2011 at 6:57 PM, Aryeh Gregor >> wrote: >> > Linking has no special status in the GPL -- it's just a question of >>

Re: [Wikitech-l] Licensing (Was: WYSIWYG and parser plans)

2011-05-05 Thread Anthony
On Tue, May 3, 2011 at 11:55 PM, Jay Ashworth wrote: > The reasons why many programmers prefer GPL to BSD -- to keep the work > they've invested long hours in for free from being submerged in someone's > commercial project with no recompense to them -- which GPL forbids and > BSD does not -- is wi

Re: [Wikitech-l] Licensing (Was: WYSIWYG and parser plans)

2011-05-05 Thread Anthony
On Wed, May 4, 2011 at 6:57 PM, Aryeh Gregor wrote: > Linking has no special status in the GPL -- it's just a question of > what legally constitutes a derivative work.  If a C program that > dynamically links to a library is legally a derivative work of that > library, It isn't. A C program whic

Re: [Wikitech-l] strange page id numbering

2011-02-15 Thread Anthony Ventresque (Dr)
> On Tue, Feb 15, 2011 at 12:58 PM, Q wrote: > > On 2/15/2011 11:34 AM, Anthony Ventresque (Dr) wrote: > >> Wikipedia... is that a relevant answer to your remark? > > > > There's about 284 of those, you'll have to be a bit more specific. > > Anyone

Re: [Wikitech-l] strange page id numbering

2011-02-15 Thread Anthony Ventresque (Dr)
> On Tue, Feb 15, 2011 at 9:29 AM, Anthony Ventresque (Dr) > wrote: > > I was indeed suspecting something like that, but the difference in number > > of pages is large while we are talking about a relatively short delay > > (minutes?). > > Depending on

Re: [Wikitech-l] strange page id numbering

2011-02-15 Thread Anthony Ventresque (Dr)
> Anthony Ventresque (Dr) wrote: > > Hi, > > > > > > I've found something strange in some files. The maximum ids for a page are: > > latest > > > > pages-articles.xml: 29189922 > > page.sql: 28707562 > > categorylinks.

[Wikitech-l] strange page id numbering

2011-02-14 Thread Anthony Ventresque (Dr)
92297 page.sql: 30480288 categorylinks.sql: 30479519 Any idea why these numbers are different? Thanks for your help, Anthony CONFIDENTIALITY: This email is intended solely for the person(s) named and may be confidential and/or privileged. If you are not the intended reci

Re: [Wikitech-l] categorisation issues in dumps

2011-02-14 Thread Anthony Ventresque (Dr)
: [Wikitech-l] categorisation issues in dumps Anthony Ventresque (Dr) wrote: > Hi, > > I am trying to build an offline version of the wikipedia categorisation tree. > As usual with projects on wikipedia, I've downloaded dumps (actually the > interesting one here is pages-articles.xm

[Wikitech-l] categorisation issues in dumps

2011-02-07 Thread Anthony Ventresque (Dr)
works" and "Category:1960" which is present on the web page. And it is the same for a lot of categories I tried: many links are missing in the dump, but are present on the web. Any idea why that is so? Thanks for your help, Anthony

Re: [Wikitech-l] WMF and IPv6

2011-02-03 Thread Anthony
On Thu, Feb 3, 2011 at 5:29 PM, Anthony wrote: > But, "supports IPv6" could be as simple as having an http proxy server > which sends (fake) IPv6 XFF headers. > > By fake, I mean that there's not even a need for the client to > actually use that IPv6 address, so lo

Re: [Wikitech-l] WMF and IPv6

2011-02-03 Thread Anthony
On Thu, Feb 3, 2011 at 5:20 PM, River Tarnell wrote: > In article , > Martijn Hoekstra   wrote: >>So what are exactly the implications for blocking and related issues >>when we will start to see ISP level NATing? > > Users will either need to move to an ISP that supports IPv6, or accept > that the

Re: [Wikitech-l] WMF and IPv6

2011-02-03 Thread Anthony
On Thu, Feb 3, 2011 at 5:10 PM, River Tarnell wrote: > In article , > Anthony   wrote: >>Is there a standard for using IPv6 inside X-Forwarded-For headers? > > There is no standard for X-Forwarded-For at all. Not even a de-facto one?

Re: [Wikitech-l] WMF and IPv6

2011-02-03 Thread Anthony
On Thu, Feb 3, 2011 at 4:45 PM, Brion Vibber wrote: > Front-end proxies need to speak IPv6 to the outside world so they can accept > connections from IPv6 clients, add the clients' IPv6 addresses to the HTTP > X-Forwarded-For header which gets passed to the Apaches, and then return the > response
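
As a rough illustration of the back-end half of the proxy step described above, the sketch below reads an X-Forwarded-For value that mixes IPv6 and IPv4 addresses. It assumes the common comma-separated convention (the same thread notes that no formal standard exists for this header), and `parse_forwarded_for` is a hypothetical name, not MediaWiki code.

```python
import ipaddress


def parse_forwarded_for(header_value: str):
    """Split an X-Forwarded-For value into parsed address objects.

    Hypothetical helper. Entries that are not valid IP addresses
    (for example the token "unknown") are skipped.
    """
    addresses = []
    for part in header_value.split(","):
        candidate = part.strip()
        if not candidate:
            continue
        try:
            addresses.append(ipaddress.ip_address(candidate))
        except ValueError:
            continue
    return addresses


# Documentation-range addresses only (2001:db8::/32 and 192.0.2.0/24).
chain = parse_forwarded_for("2001:db8::1, 192.0.2.7, unknown")
for address in chain:
    print(address.version, address)
```

Because any client can send this header, a real deployment would trust only the entries appended by its own proxies. The sketch does not attempt that check.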

Re: [Wikitech-l] From page history to sentence history

2011-01-21 Thread Anthony
ions to delete/oversight, that's still a viable solution. > On Thu, Jan 20, 2011 at 4:04 AM, Anthony wrote: >> It wouldn't be trivial, but it wouldn't be particularly hard either. >> Most of the work is already being done.  It's just being done >> inefficiently.

Re: [Wikitech-l] From page history to sentence history

2011-01-19 Thread Anthony
On Wed, Jan 19, 2011 at 7:49 PM, Happy-melon wrote: > "Anthony" wrote in message > news:AANLkTi=uk+uf3y_b+zld57wcfuef_7rf-bt8tnvtg...@mail.gmail.com... >> No, that's not the question.  The question is why are you >> uncompressing and undiffing (from DiffHistor

Re: [Wikitech-l] From page history to sentence history

2011-01-19 Thread Anthony
On Wed, Jan 19, 2011 at 3:33 AM, Aryeh Gregor wrote: > On Wed, Jan 19, 2011 at 3:59 AM, Anthony wrote: >> Why isn't this being used for the dumps? > > Well, the relevant code is totally unrelated, so the question is sort > of a non sequitur. No, the question is why the r

Re: [Wikitech-l] From page history to sentence history

2011-01-18 Thread Anthony
On Tue, Jan 18, 2011 at 7:21 PM, Aryeh Gregor wrote: > On Mon, Jan 17, 2011 at 9:12 PM, Roan Kattouw wrote: >> Wikimedia doesn't technically use delta compression. It concatenates a >> couple dozen adjacent revisions of the same page and compresses that >> (with gzip?), achieving very good compre
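
The effect Roan describes, where adjacent revisions of a page are concatenated and compressed together, can be demonstrated with a small experiment. Everything below is synthetic: the word list, the revision count and the use of Python's `gzip` module are assumptions made for the demonstration, not a description of Wikimedia's actual storage code.

```python
import gzip
import random

# Build synthetic revisions: each one is the previous text plus a small edit.
rng = random.Random(0)
words = ["paris", "city", "flat", "river", "seine", "history",
         "france", "capital", "museum", "bridge"]
current = " ".join(rng.choice(words) for _ in range(300))
revisions = []
for i in range(24):
    current += f" edit{i} " + rng.choice(words)
    revisions.append(current)

# Option A: compress every revision on its own.
separate = sum(len(gzip.compress(r.encode("utf-8"))) for r in revisions)

# Option B: concatenate the adjacent revisions, then compress once, so the
# compressor can refer back to the nearly identical previous revision.
concatenated = len(gzip.compress("".join(revisions).encode("utf-8")))

print(f"separate: {separate} bytes, concatenated: {concatenated} bytes")
```

The gain depends on each revision being smaller than the compressor's look-back window (32 KB for gzip), so the previous copy is still in range when the next one is encoded.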

Re: [Wikitech-l] From page history to sentence history

2011-01-17 Thread Anthony
On Mon, Jan 17, 2011 at 12:41 PM, Anthony wrote: > And to recognize what's going on when a sentence changes *and* is > moved from one paragraph to another, requires an even greater level of > natural language understanding.  Again though, you can probably get it > right most of

Re: [Wikitech-l] From page history to sentence history

2011-01-17 Thread Anthony
On Mon, Jan 17, 2011 at 10:40 AM, Alex Brollo wrote: > 2011/1/17 Bryan Tong Minh > >> >> Difficult, but doable. Jan-Paul's sentence-level editing tool is able >> to make the distinction. It would perhaps be possible to use that as a >> framework for sentence-level diffs. >> > > Difficult, but dif

Re: [Wikitech-l] From page history to sentence history

2011-01-17 Thread Anthony
On Sun, Jan 16, 2011 at 7:34 PM, Lars Aronsson wrote: > Many articles are so long, and have been edited so many > times, that the history view is almost useless. If I want > to find out when and how the sentence "Overall, the city > is relatively flat" in the article [[en:Paris]] has changed > ove

Re: [Wikitech-l] WYSIFTW status

2011-01-16 Thread Anthony
On Sun, Jan 16, 2011 at 8:24 PM, Krinkle wrote: > On 17 Jan 2011, at 02:12, Anthony wrote: > >> On Sun, Jan 16, 2011 at 7:16 PM, Magnus Manske >> wrote: >>> A quick update on WYSIFTW, my "augmented wikitext" editor. (Please >

Re: [Wikitech-l] WYSIFTW status

2011-01-16 Thread Anthony
On Sun, Jan 16, 2011 at 7:16 PM, Magnus Manske wrote: > A quick update on WYSIFTW, my "augmented wikitext" editor. (Please see > http://meta.wikimedia.org/wiki/WYSIFTW for details.) Shouldn't it be WYSIFWT?

Re: [Wikitech-l] Does anybody have the 20080726 dump version?

2011-01-04 Thread Anthony
On Sat, Jan 1, 2011 at 11:46 AM, Ariel T. Glenn wrote: > On 01-01-2011, Sat, at 16:42 +, David Gerard wrote: >> On 31 December 2010 17:09, Ariel T. Glenn wrote: >> >> > I'd like all the dumps from all the projects to be on line.  Being >> > realistic I think we would wind

Re: [Wikitech-l] Does anybody have the 20080726 dump version?

2010-12-31 Thread Anthony
File transfer is done. Thanks for helping with the transfer. Anthony On Fri, Dec 31, 2010 at 8:28 AM, Huib Laurens wrote: > If it fails i can give you access on others ways, its a dedicated > server that doesn't have a job right now... > > 2010/12/31, Anthony : >> On Fr

Re: [Wikitech-l] Does anybody have the 20080726 dump version?

2010-12-31 Thread Anthony
On Fri, Dec 31, 2010 at 10:54 AM, Platonides wrote: > Would be nice having an additional md5sum file for the uncompressed dumps. Yes. Here's what I found on my SATA and USB drives. I haven't had a chance to go through my IDE drives - that would take a while as I don't yet have a decent removabl

Re: [Wikitech-l] Does anybody have the 20080726 dump version?

2010-12-31 Thread Anthony
On Fri, Dec 31, 2010 at 4:08 AM, Ariel T. Glenn wrote: > Anthony: > > We would like to get copies of any of these dumps as well.  This > includes any of the other files: stubs, tables, the lot. > > If you have them for other languages or other time periods, that would > be

Re: [Wikitech-l] Does anybody have the 20080726 dump version?

2010-12-31 Thread Anthony
On Fri, Dec 31, 2010 at 1:47 AM, Huib Laurens wrote: > Okay, I emailed to Anthony how he can upload it. Transfer is in progress. ETA about 10 hours. md5sum is 30c9b48de3ede527289bcdb810126723 Hopefully there aren't any problems as I'm not quite sure how to resume up
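
A checksum like the one quoted above is normally verified on the receiving side once the transfer finishes. The sketch below shows one way to do that in Python without loading a multi-gigabyte dump into memory. The function name `md5_of_file` and the temporary stand-in file are illustrative assumptions. For a real dump, the path would point at the downloaded file and the result would be compared with the published md5sum.

```python
import hashlib
import os
import tempfile


def md5_of_file(path, chunk_size=1024 * 1024):
    """Hash a file in fixed-size chunks so it never has to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps reading until read() returns b"".
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Stand-in for a downloaded dump: a small temporary file with known contents.
payload = b"example dump contents\n" * 1000
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(payload)
    dump_path = tmp.name

try:
    computed = md5_of_file(dump_path, chunk_size=4096)
finally:
    os.remove(dump_path)

# The value the uploader would publish alongside the file.
published = hashlib.md5(payload).hexdigest()
print("checksum matches:", computed == published)
```

MD5 is adequate here for catching accidental corruption in transit. It is not a defence against deliberate tampering.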

Re: [Wikitech-l] Does anybody have the 20080726 dump version?

2010-12-30 Thread Anthony
On Thu, Dec 30, 2010 at 8:38 AM, Anthony wrote: > I just asked Dreamhost if they would give me permission to violate > their TOS for this one time one file. And the person who responded just told me that he's not authorized to give me permission to do that. So, any volunteers to host

Re: [Wikitech-l] Does anybody have the 20080726 dump version?

2010-12-30 Thread Anthony
ote: > Yes, I think they are the same! > > Is there any method to download it? > > Thanks very much!! > > > > On Wed, Dec 29, 2010 at 10:06 PM, Anthony wrote: > >> You talking about enwiki? >> >> I have enwiki-20080724-pages-articles.xml.bz2.  N

Re: [Wikitech-l] Does anybody have the 20080726 dump version?

2010-12-29 Thread Anthony
You talking about enwiki? I have enwiki-20080724-pages-articles.xml.bz2. Nothing for 20080726. On Wed, Dec 29, 2010 at 2:54 AM, Monica shu wrote: > @_...@... > > Thanks any way:) > > Anyone else hands  up? > > On Wed, Dec 29, 2010 at 3:18 PM, Chad wrote: > >> On Wed, Dec 29, 2010 at 12:16 AM,

Re: [Wikitech-l] Using MySQL as a NoSQL

2010-12-24 Thread Anthony
On Fri, Dec 24, 2010 at 4:08 AM, Domas Mituzas wrote: > Hi! > > A: >> It's easy to get fast results if you don't care about your reads being >> atomic (*), and I find it hard to believe they've managed to get >> atomic reads without going through MySQL. > > MySQL upper layers know nothing much abo

Re: [Wikitech-l] Using MySQL as a NoSQL

2010-12-23 Thread Anthony
On Thu, Dec 23, 2010 at 9:34 AM, Nikola Smolenski wrote: > I have recently encountered this text in which the author claims very > high MySQL speedups for simple queries (7.5 times faster than MySQL, > twice faster than memcached) by reading the data directly from InnoDB > where possible (MySQL is

Re: [Wikitech-l] Offline wiki tools

2010-12-15 Thread Anthony
On Thu, Dec 16, 2010 at 12:47 AM, Andrew Dunbar wrote: > At the moment I'm interested in .bz2 and .7z because those are the > formats WikiMedia currently publishes data in. I'm fairly certain the specific 7z format which Wikimedia uses doesn't allow for random access, because the dictionary is ne

Re: [Wikitech-l] dataset1, xml dumps

2010-12-15 Thread Anthony
On Wed, Dec 15, 2010 at 3:30 PM, Ariel T. Glenn wrote: > We are interested in other mirrors of the dumps; see > > http://meta.wikimedia.org/wiki/Mirroring_Wikimedia_project_XML_dumps On the talk page, it says "torrents are useful to save bandwidth, which is not our problem". If bandwidth is not

Re: [Wikitech-l] Offline wiki tools

2010-12-15 Thread Anthony
On Wed, Dec 15, 2010 at 12:01 PM, Andrew Dunbar wrote: > By the way I'm keen to find something similar for .7z I've written something similar for .xz, which uses LZMA2 same as .7z. It creates a virtual read-only filesystem using FUSE (the FUSE part is in perl, which uses pipes to dd and xzcat).
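
The random-access idea behind this message can be illustrated in a few lines. This is not the FUSE tool described above. It is a simplified sketch that compresses each block as an independent xz stream and keeps a side index of byte offsets, so one block can be decompressed without touching the rest. The helper names, block size and sample data are all assumptions made for the example.

```python
import lzma


def compress_in_blocks(data: bytes, block_size: int):
    """Compress data as a series of independent xz streams.

    Returns the concatenated archive and an index of
    (uncompressed_start, compressed_offset, compressed_length) tuples.
    """
    blobs, index, offset = [], [], 0
    for start in range(0, len(data), block_size):
        blob = lzma.compress(data[start:start + block_size],
                             format=lzma.FORMAT_XZ)
        index.append((start, offset, len(blob)))
        blobs.append(blob)
        offset += len(blob)
    return b"".join(blobs), index


def read_block(archive: bytes, index, block_number: int) -> bytes:
    """Decompress a single block using only its slice of the archive."""
    _, offset, length = index[block_number]
    return lzma.decompress(archive[offset:offset + length],
                           format=lzma.FORMAT_XZ)


data = bytes(range(256)) * 100          # 25,600 bytes of sample input
archive, index = compress_in_blocks(data, block_size=4096)
third_block = read_block(archive, index, 2)
print(len(index), "blocks;", len(third_block), "bytes read from block 2")
```

Smaller blocks make seeking cheaper but compress worse, because each block starts with an empty dictionary. The concatenated result is still readable by ordinary xz tools as a multi-stream file.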

Re: [Wikitech-l] [Xmldatadumps-admin-l] 2010-03-11 01:10:08: enwiki Checksumming pages-meta-history.xml.bz2 :D

2010-04-08 Thread Anthony
On Thu, Apr 8, 2010 at 7:34 PM, Q wrote: > -BEGIN PGP SIGNED MESSAGE- > Hash: SHA256 > > On 4/8/2010 4:28 PM, Anthony wrote: > > I'd like to add that the md5 of the *uncompressed* file is > > cd4eee6d3d745ce716db2931c160ee35 . That's what I got from bo

Re: [Wikitech-l] [Xmldatadumps-admin-l] 2010-03-11 01:10:08: enwiki Checksumming pages-meta-history.xml.bz2 :D

2010-04-08 Thread Anthony
ess than a day. On Mon, Mar 29, 2010 at 8:16 PM, Tomasz Finc wrote: > You can find all the md5sums at > > http://download.wikipedia.org/enwiki/20100130/enwiki-20100130-md5sums.txt > > --tomasz > > Anthony wrote: > >> Got an md5sum? >> >> >> On Mon,

Re: [Wikitech-l] [Xmldatadumps-admin-l] 2010-03-11 01:10:08: enwiki Checksumming pages-meta-history.xml.bz2 :D

2010-03-29 Thread Anthony
Got an md5sum? On Mon, Mar 29, 2010 at 5:46 PM, Tomasz Finc wrote: > I love lzma compression. > > enwiki-20100130-pages-meta-history.xml.bz2 280.3 GB > > enwiki-20100130-pages-meta-history.xml.7z 31.9 GB > > Download at http://tinyurl.com/yeelbse > > Enjoy! > > --tomasz > > Tomasz Finc wrote: >

Re: [Wikitech-l] hiphop! :)

2010-02-28 Thread Anthony
On Sun, Feb 28, 2010 at 4:33 PM, Domas Mituzas wrote: > > > > Nevertheless - a process isn't the same process when it's going at 10x > > the speed. This'll be interesting. > > not 10x. I did concurrent benchmarks for API requests (e.g. opensearch) on > modern boxes, and saw: > > HipHop: Requests p

Re: [Wikitech-l] User-Agent:

2010-02-17 Thread Anthony
On Wed, Feb 17, 2010 at 8:51 AM, Anthony wrote: > On Wed, Feb 17, 2010 at 6:54 AM, Domas Mituzas wrote: > >> > >> > It showed that there was quite a bit of bathwater thrown out. And at >> least >> > one very large baby (Google translation), which was

Re: [Wikitech-l] User-Agent:

2010-02-17 Thread Anthony
On Wed, Feb 17, 2010 at 6:54 AM, Domas Mituzas wrote: > > > > It showed that there was quite a bit of bathwater thrown out. And at > least > > one very large baby (Google translation), which was temporarily > > resurrected. We still don't know how many other, smaller, babies were > > thrown out,

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
On Tue, Feb 16, 2010 at 11:32 PM, Tim Starling wrote: > I think it's common knowledge among people who have been reading these > lists for a long time, that Anthony has a serious deficit in his > sarcasm detection department, and often gives inappropriate responses > to sarcastic

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
On Tue, Feb 16, 2010 at 11:18 PM, John Vandenberg wrote: > With this solution, it is now possible to determine how much of the > traffic was from valid services. i.e. google translate and other > useful services will identify themselves And what separates google translate from other useful ser

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
On Tue, Feb 16, 2010 at 9:47 PM, John Vandenberg wrote: > On Wed, Feb 17, 2010 at 1:00 PM, Anthony wrote: > > On Wed, Feb 17, 2010 at 11:57 AM, Domas Mituzas > wrote: > >> Probably everything looks easier from your armchair. I'd love to have > that >

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
> Anyway, you probably are missing one important point. > We're trying to make Wikipedia's service better. I'm sure you are. But that doesn't mean I agree with your methods. > Probably everything looks easier from your armchair. I'd love to have that > view! :) Then stop volunteering.

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
> > And "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT > > 5.1)", is pretty much > > useless, unless you've already identified the spammer through some other > > process. > > It isn't useless. It clearly shows that the user is acting malicious by >

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
On Tue, Feb 16, 2010 at 2:31 PM, Domas Mituzas wrote: > > Presumably some percentage of that 20-50% will come back as the > > spammers realize they have to supply the string. Presumably we > > then start playing whack-a-mole. > > Yes, we will ban all IPs participating in this. > Guess it's just

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
On Tue, Feb 16, 2010 at 11:32 AM, Ariel T. Glenn wrote: > In fact some WMF paid employees (including me) were in the channel at > that time and agreed with the decision. It seemed then and still seems > to me a reasonable course of action given the circumstances. I > understand it's aggravating t

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
On Tue, Feb 16, 2010 at 11:04 AM, Domas Mituzas wrote: > > No idea. For ages you've been able to just go onto the Wikimedia servers > > and change whatever you feel like, and answer to nobody? You must be > > misunderstanding my question or something. > > Kind of. Isn't that a good enough motiva

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
On Tue, Feb 16, 2010 at 10:39 AM, Domas Mituzas wrote: > > Cool. Who's your boss, and who's your boss's boss? Sorry, I couldn't > find > > you in the org chart or I'd just have looked that up myself. > > Nobody? Really? Were you doing this work as a contractor, or as a volunteer? Someone's go

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
On Tue, Feb 16, 2010 at 10:31 AM, Domas Mituzas wrote: > Hi! > > > Whose decision was this? > > Mine. > > > Were Erik, Sue, or Danese involved? > > No. Cool. Who's your boss, and who's your boss's boss? Sorry, I couldn't find you in the org chart or I'd just have looked that up myself.

Re: [Wikitech-l] User-Agent:

2010-02-16 Thread Anthony
On Mon, Feb 15, 2010 at 8:54 PM, Domas Mituzas wrote: > Hi! > > from now on specific per-bot/per-software/per-client User-Agent header is > mandatory for contacting Wikimedia sites. > > Domas > Hi, Whose decision was this? Were Erik, Sue, or Danese involved?
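
For client authors affected by the announcement quoted above, the sketch below shows one way to attach a specific, per-tool User-Agent using Python's standard library. The tool name, version and contact details are placeholders, and the "name/version (contact)" layout is an assumed convention rather than something stated in this thread. The request object is only built here, not sent.

```python
from urllib.request import Request


def build_request(url, tool_name, version, contact):
    """Create a request carrying a descriptive, per-tool User-Agent."""
    user_agent = f"{tool_name}/{version} ({contact})"
    return Request(url, headers={"User-Agent": user_agent})


request = build_request(
    "https://en.wikipedia.org/w/api.php?action=query&meta=siteinfo&format=json",
    tool_name="ExampleBot",                       # placeholder tool name
    version="0.1",
    contact="https://example.org/bot; bot@example.org",  # placeholder contact
)

# urllib stores header names in capitalized form, hence "User-agent".
print(request.get_header("User-agent"))
```

Passing the object to `urllib.request.urlopen` would send it with that header in place of the library's default.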
