It would be nice to have a global cleanup at some point, but it won't be
able to handle every situation.
I don't think relying on VE to do the cleanup is a good idea:

   - First, it will take a long time before all articles are edited with VE
   (maybe never)
   - Second, I'm not a big fan of VE changing wikitext in parts not
   modified by the user: experience shows that it messes up the diffs and
   makes it a lot more difficult to watch what VE is doing. It has been
   requested several times that VE not modify wikitext in places the user
   did not touch.


Things that are probably safe to fix automatically:

   - Whitespace characters between nowiki tags at the beginning of a line:
   remove everything, including the whitespace characters.
   - Whitespace characters between nowiki tags not at the beginning of a
   line: remove the tags, keep the whitespace characters.
   - Some characters (letters, digits, ...) between nowiki tags: remove the
   tags, keep the characters.
   - In a table, cell content with only a dash between nowiki tags: remove
   the tags, add a whitespace character before the dash.
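
To make the four rules above concrete, here is a rough Python sketch of how a cleanup pass could apply them. It is only an illustration: the function name `clean_nowiki` and the regular expressions are my own assumptions, not anything Parsoid or VE implements, and a real job would need a proper wikitext parser rather than regexes.

```python
import re

# Hypothetical rule table: (pattern, replacement), applied in order.
RULES = [
    # Table cell whose only content is a nowiki-wrapped dash: "|<nowiki>-</nowiki>" -> "| -"
    # (the space keeps the dash from being read as a "|-" row separator).
    (re.compile(r"^\|[ \t]*<nowiki>-</nowiki>[ \t]*$", re.MULTILINE), "| -"),
    # Whitespace-only nowiki at the beginning of a line: drop tags and whitespace.
    (re.compile(r"^<nowiki>[ \t]+</nowiki>", re.MULTILINE), ""),
    # Whitespace-only nowiki elsewhere: drop the tags, keep the whitespace.
    (re.compile(r"<nowiki>([ \t]+)</nowiki>"), r"\1"),
    # Only letters and digits inside nowiki: drop the tags, keep the characters.
    (re.compile(r"<nowiki>([^\W_]+)</nowiki>"), r"\1"),
]


def clean_nowiki(wikitext: str) -> str:
    """Apply the 'probably safe' nowiki cleanups to a wikitext string."""
    for pattern, replacement in RULES:
        wikitext = pattern.sub(replacement, wikitext)
    return wikitext
```

Anything that doesn't match one of these patterns (for example nowiki around brackets or quotes) is left alone on purpose.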

<nowiki /> tags are more difficult to fix automatically, I think:

   - Between quotes: allows mixing a literal quote with italics formatting
   - After the end of a wikilink: prevents the wikilink from extending into
   the following text (often an error due to a bug in VE, but sometimes
   intentional)
   - ...
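
Since these cases probably need a human decision, a script could at least list them for review instead of fixing them. A minimal sketch, assuming a hypothetical helper `classify_nowiki` and purely textual checks on the characters around each `<nowiki />` (no real wikitext parsing):

```python
import re

# Matches "<nowiki/>" and "<nowiki />".
SELF_CLOSING = re.compile(r"<nowiki\s*/>")


def classify_nowiki(wikitext: str) -> list[tuple[int, str]]:
    """Return (offset, kind) for each self-closing nowiki tag found."""
    findings = []
    for match in SELF_CLOSING.finditer(wikitext):
        before = wikitext[:match.start()]
        after = wikitext[match.end():]
        if before.endswith("'") and after.startswith("'"):
            kind = "between-quotes"   # separates a literal quote from '' markup
        elif before.endswith("]]"):
            kind = "after-wikilink"   # stops the link from absorbing trailing text
        else:
            kind = "other"
        findings.append((match.start(), kind))
    return findings
```

The output is only a worklist; whether a given tag is a VE artifact or intentional still has to be judged case by case.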

Nico


On Sun, Jun 21, 2015 at 8:43 PM, Amir E. Aharoni <
amir.ahar...@mail.huji.ac.il> wrote:

> Thanks Arlo. I added a few.
>
> But I'm not sure that it answers my original question: Will this be done
> every time a page happens to be edited in VE and saved, or will it be done
> globally on all pages in all wikis as some kind of maintenance job?
>
>
> --
> Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
> http://aharoni.wordpress.com
> ‪“We're living in pieces,
> I want to live in peace.” – T. Moore‬
>
> 2015-06-20 19:45 GMT+03:00 Arlo Breault <abrea...@wikimedia.org>:
>
> > On Friday, June 19, 2015 at 1:38 AM, Amir E. Aharoni wrote:
> > > There may be more - I'm still looking for these.
> >
> >
> > If you find any, please propose them on Parsoid’s normalization talk
> > page [0].
> > I’ve added the ones you’ve mentioned so far.
> >
> > We’ve documented [1] what’s currently been implemented.
> >
> > A few months back, Subbu solicited feedback [2] on what style norms
> > should be enforced. We’ve since added a `scrubWikitext` parameter to
> > Parsoid’s API that clients (like VE) can benefit from.
> >
> > Cleaning up our past transgressions is great. Helping to prevent their
> > continued existence is even better.
> >
> > I was reading the discussion on gradually enabling VE for new accounts
> > [3] and Kww writes there,
> >
> > “Further, we still have issues with stray nowiki tags being scattered
> > across articles. Until those are addressed, the notion that VE doesn't
> > cause extra work for experienced editors is simply a sign that the
> > metrics used to analyze effort were wrong. Jdforrester, can you explain
> > how a study that was intended to measure whether VE caused extra work
> > failed to note that even with the current limited use, it corrupts
> > articles at this kind of volume [4]? Why would we want to encourage
> > such a thing?”
> >
> > Makes me sad.
> >
> >
> > [0] https://www.mediawiki.org/wiki/Talk:Parsoid/Normalizations
> > [1] https://www.mediawiki.org/wiki/Parsoid/Normalizations
> > [2] https://lists.wikimedia.org/pipermail/wikitech-l/2015-April/081453.html
> > [3] https://en.wikipedia.org/wiki/Wikipedia:Village_pump_%28proposals%29#Gradually_enabling_VisualEditor_for_new_accounts
> > [4] https://en.wikipedia.org/w/index.php?title=Special:AbuseLog&offset=&limit=500&wpSearchFilter=550
> >
> >
> > _______________________________________________
> > Wikitech-l mailing list
> > Wikitech-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> >