On 05/06/15 23:43, Georg Koppen wrote: > 1) How is the subset determined? Where can I find the code (if there is > any) which is doing that?
We take the upstream list (currently Disconnect): https://services.disconnect.me/disconnect-plaintext.json (without the "Content" and "Legacy" categories) then we convert it to the binary format that we need using this script: https://github.com/mozilla-services/shavar-list-creation after removing these entries (for compatibility reasons): https://github.com/mozilla-services/shavar-list-exceptions (only the ones which aren't commented-out). > 2) How does Disconnect exactly determine which domains should be on this > blocklist in the first place? Where is the code or the rules governing > this process? I don't know for sure how they created their list in the first place, but I believe it was done manually. Other than their FAQ, I don't think they have published anything else about their process. Francois _______________________________________________ dev-privacy mailing list [email protected] https://lists.mozilla.org/listinfo/dev-privacy
