Re: [webkit-help] Feedback about Content Blocking Extensions from Adblock Plus

Benjamin Poulain Sun, 14 Jun 2015 20:43:23 -0700

Hi Sebastian,

Thanks for stating a thread for this. Let's see what we can do...

Did you already file radars for the issues? If you did, can you give theradar numbers? I'll link them with the meta radars tracking the featuresrequests we are getting for content blockers. If you did not fileradars, I'll do that.


On 6/14/15 7:07 AM, Sebastian Noack wrote:

I'm from Adblock Plus, and just read the articleon the WebKit website[1] about the new content blocking mechanism, introduced with Safari9. Thanks for providing some details. But I identified followingshortcomings that would effectively make the new mechanisminsufficient for us, or anybody supporting our filters [2], which areused by popular filter lists, including EasyList:

From the list, it seems to me that we should discuss concrete casesinstead of concrete solution.

The content blockers in WebKit are vastly different from what extensionsdo today. As such, a solution that works well for classic extensions maynot be the best way to solve the same problem in content blockers. Ifyou tell us about the actual problems (for example an example of awebsite were you can't filter a resource), it would be easier for us toidentify what we can do.

1. Most importantly, our exception rules are recursive. For example||example.com <http://example.com>$document prevents not onlydocuments loaded from example.com <http://example.com> being blocked.But also resources loaded as part of that document or in any of it'ssubframes or their subframes wouldn't be blocked either. However, thislogic doesn't seem to be possible with the ignore-previous-rulesaction. A recursive flag would come handy here.

That seems feasible. I have a couple of ideas on how to best achieve this.

Including the subframes is a bit worrying to me. A subframe of a trustedsource is typically not to be trusted. Do you have examples where thatis useful?

2. There doesn't seem to be a way to distinguish between document andsubdocument requests. While Adblock Plus blocks frames, it neverblocks the top level document, so that users can still access theresource that is blocked, when entering its URL in the address bar.

This sounds like a good idea for your use case.

Any suggestion on the format? What would be the best way to specify thisin your opinion?

3. A dedicated resource-type for XMLHttpRequests, objects (requestsloading a Flash element) and object subrequests (subsequent requestsissued by a Flash object) would certainly be useful as well. EasyListhas quite some filters specifically checking for those.

Targeting XHR specifically seems very easy to counter to me. Couldn'tone just use the Fetch API or Sockets to work around the rule?

Do you have an example where the distinction matters?

Regarding the object subrequest, that seems like a valuable thing to do.

4. Adblock Plus uses filters subscriptions (periodically downloadedfilter lists, like EasyList) as well as filters added by the user, todecide what to block. So we'd need a way to dynamically configureblock lists. I saw the pre-release announcements mentioning thenew setContentBlocker API for this purpose. I couldn't find anydetails on that, but I assume that you can simply pass in a block listas JavaScript object? But note that we'd need a way to invalidatepreviously set blocking rules when filters in Adblock Plus changed.However, a way to add new rules without flushing the previously setblock list would be extremely useful in some cases as well. Soideally, this API should let you modify the block list in place.

You can pass the rules as a JavaScript object, or as a serialized JSONstring.

When you set a new content blocker, it replaces the old one. Strictlyspeaking, the old one remains active until the new one is compiled andthen it is replaced.

There is a technical reason why you cannot modify/add/delete individualrules. In the engine, the rules are combined into giant state machines.The concept of rule does not exist past the compiler, after that all wehave is a very simple bytecode(http://trac.webkit.org/browser/trunk/Source/WebCore/contentextensions/DFABytecode.h)that executes several thousands triggers at once.

Note that compiling is not cheap. We are paying compile time whenloading rules in exchange for faster runtime and lower memory footprint.How often do you need to update the rules?


Cheers,
Benjamin

_______________________________________________
webkit-help mailing list
[email protected]
https://lists.webkit.org/mailman/listinfo/webkit-help

Re: [webkit-help] Feedback about Content Blocking Extensions from Adblock Plus

Reply via email to