On 05/06/15 23:43, Georg Koppen wrote:
> 1) How is the subset determined? Where can I find the code (if there is
> any) which is doing that?

We take the upstream list (currently Disconnect):

  https://services.disconnect.me/disconnect-plaintext.json

(without the "Content" and "Legacy" categories), then we convert it to
the binary format that we need using this script:

  https://github.com/mozilla-services/shavar-list-creation

after first removing these entries (for compatibility reasons):

  https://github.com/mozilla-services/shavar-list-exceptions

(only the ones that aren't commented out).
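
To make the filtering steps above concrete, here is a rough Python sketch.
It is not the actual shavar-list-creation code. It assumes the upstream
JSON has a top-level "categories" object that maps each category name to a
list of organisations, each of the form {name: {homepage: [domains]}}, and
that the exceptions file holds one domain per line with "#" marking a
commented-out entry. The function names and sample data are hypothetical,
and the sketch stops before the conversion to the binary format.

```python
import json

# Categories dropped from the upstream list, as described above.
EXCLUDED_CATEGORIES = {"Content", "Legacy"}


def parse_exceptions(text):
    """Return the active exception entries (assumed: one per line, '#' = commented out)."""
    entries = set()
    for line in text.splitlines():
        line = line.strip()
        if line and not line.startswith("#"):
            entries.add(line)
    return entries


def extract_domains(blocklist, exceptions):
    """Collect domains from the kept categories, minus the active exceptions."""
    domains = set()
    for category, orgs in blocklist.get("categories", {}).items():
        if category in EXCLUDED_CATEGORIES:
            continue
        for org in orgs:  # assumed shape: {name: {homepage: [domains]}}
            for properties in org.values():
                for value in properties.values():
                    if isinstance(value, list):
                        domains.update(value)
    return sorted(domains - exceptions)


# Made-up sample data in the assumed upstream shape.
sample = json.loads("""
{"categories": {
  "Advertising": [{"ExampleAds": {"http://ads.example/": ["ads.example", "cdn.ads.example"]}}],
  "Content": [{"ExampleCDN": {"http://cdn.example/": ["cdn.example"]}}],
  "Legacy": [{"OldCo": {"http://old.example/": ["old.example"]}}]
}}
""")

# "tracker.example" is commented out, so only "cdn.ads.example" is an active exception.
exceptions = parse_exceptions("# tracker.example\ncdn.ads.example\n")

result = extract_domains(sample, exceptions)
print(result)
```

The resulting domain list would then be handed to the conversion script to
produce the binary format.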

> 2) How does Disconnect exactly determine which domains should be on this
> blocklist in the first place? Where is the code or the rules governing
> this process?

I don't know for sure how they created their list in the first place,
but I believe it was done manually. Other than their FAQ, I don't think
they have published anything about their process.

Francois
_______________________________________________
dev-privacy mailing list
[email protected]
https://lists.mozilla.org/listinfo/dev-privacy
