https://bugzilla.wikimedia.org/show_bug.cgi?id=52941

       Web browser: ---
            Bug ID: 52941
           Summary: Refactor sanitizer to work on the DOM too
           Product: Parsoid
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: DOM
          Assignee: gwi...@wikimedia.org
          Reporter: gwi...@wikimedia.org
                CC: ssas...@wikimedia.org
    Classification: Unclassified
   Mobile Platform: ---

Currently we always serialize to wikitext and re-parse that to HTML, which runs
the sanitizer on the token stream to ensure that our final HTML does not cause
bad things to happen.

Soon both us and the Flow team want to store HTML from the VisualEditor
directly without first serializing to wikitext. This means that we need to
perform the sanitization on the HTML instead of the token stream. For
performance, sanitizing on the way in would be preferable. We should however
support re-sanitization when new issues were discovered. This could potentially
be coupled with the versioning discussed in bug 52937. A new sanitizer could
bump the version number, and the upgrade path would then run the new sanitizer
on old HTML (and probably update the storage with the newly sanitized version).

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to