On 8/11/2011 8:16 PM, David Barbour wrote:

On Thu, Aug 11, 2011 at 5:06 PM, BGB <cr88...@gmail.com> wrote:


    the big problem though:
    to try to implement this as a sole security model, and expecting
    it to be effective, would likely impact language design and
    programming strategy, and possibly lead to a fair amount of effort
    WRT "hole plugging" in an existing project.


A problem with language design is only a "big problem" if a lot of projects are using the language. Security is a big problem today because a lot of projects use languages that were not designed with effective security as a requirement.

or:
if the alteration would make the language unfamiliar to people;
or if one has, say, a large pile of code (for example, 500 kloc or 1 Mloc or more), where fundamental design changes could impact a non-trivial amount of said code.

for example, for a single developer, a fundamental redesign of a 750 kloc project is not a small task. it is much easier to find quick-and-dirty ways to patch up problems as they arise, or to find a few good (strategic and/or centralized) locations to add security checks, than to adopt a strategy which would essentially require lots of changes all over the place.




    how to effectively prevent spoofing (say, one manages to "extract"
    the key from a trusted app, and then signs a piece of malware with
    it).


Reason about security /inductively/. Assume the piece holding the key is secure up to its API. If you start with assumptions like: "well, let's assume the malware has backdoor access to your keys and such", you're assuming insecurity - you'll never reach security from there.


the problem, though, is that the person making the piece of malware may be able to get at the keys indirectly...

a simple example would be a login-style system:
the malware author wants to get the login key from, say, GoodApp;
they make a dummy or hacked version of the VM ("BadVM"), and run GoodApp on it;
GoodApp does its thing, and authorizes itself (revealing its key to BadVM);
the malware author takes this key, and puts it into "BadApp";
BadApp, when run on GoodVM, presents GoodApp's key, and so can do whatever GoodApp can do.

these types of problems are typically addressed (partially) by having the VM/... log into a server and authenticate keys over the internet, but there are drawbacks to this as well.
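(to make the replay concrete, here is a minimal sketch in plain C, with hypothetical names and a hard-coded key rather than anything from an actual VM: if the check only asks "can you present the raw key?", then a key lifted from GoodApp via a hacked VM works just as well when presented by BadApp. the usual partial fix is challenge-response, where the verifier sends a fresh nonce and the app returns a keyed hash of it, so the raw key never crosses the boundary; that is roughly the online-authentication route above, with its round-trip cost.)

#include <string.h>
#include <stdio.h>

/* hypothetical: the key GoodApp was authorized with */
static const char *good_app_key = "SECRET-1234";

/* naive check: any caller that can present the raw key is "trusted".
   a key extracted by running GoodApp on a hacked VM can simply be
   replayed here by BadApp. */
int app_key_matches(const char *presented_key)
{
    return strcmp(presented_key, good_app_key) == 0;
}

int main(void)
{
    /* BadApp replaying the key it lifted from GoodApp */
    if (app_key_matches("SECRET-1234"))
        printf("caller granted GoodApp's rights\n");
    return 0;
}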


Phrases such as 'trusted app' or 'trusted code' smell vaguely of brimstone - like a road built of good intentions. What is the app trusted with? How do we answer this question with a suitably fine-grained executable policy?

the terminology mostly comes from what I have read regarding the .NET and Windows security architecture...

but, generally the "trust" is presumably spread between several parties:
the vendors of the software (VM, apps, ...);
the user of the software.



    yes, there is still always the risk of a naive user confirming a
    piece of malware, but this is their own problem at this point.


I disagree. Computer security includes concerns such as limiting and recovering from damage, and awareness. And just 'passing the blame' to the user is a rather poor justification for computer security.

this is, however, how it is commonly handled with things like Windows.
if something looks suspect to the OS (bad file signing, the app trying to access system files, ...) then Windows pops up a dialogue "Do you want to allow this app to do this?"

if the user then confirms this, then yes, it is their problem.

the only "real" alternative is to assume that the user is "too stupid for their own good", and essentially disallow them from using the software outright. in practice, this is a much bigger problem, as then one has taken away user rights (say, they can no longer install non-signed apps on their system...).

systems which have taken the above policy have then often been manually "jailbroken" or "rooted" by their users, essentially gaining personal freedom at the risk of (potentially) compromising their personal security (or voiding their warranty, or breaking the law).

better I think to make the system do its best effort to keep itself secure, but then delegate to the user for the rest.



    if trying to use a feature simply makes code using it invalid
    ("sorry, I can't let you do that"), this works.


When I first got into language design, I thought as you did. Then I realized:
* With optional features, I have 2^N languages no matter how I implement them.
* I pay implementation, maintenance, debugging, documentation, and design costs for those features, along with different subsets of them.
* Library developers are motivated to write for the Least Common Denominator (LCD) language anyway, for reusable code.
* Library developers can (and will) create frameworks, interpreters, EDSLs to support more features above the LCD.
* Therefore, it is wasteful to spend my time on anything but the LCD features, and make life as cozy as possible for library developers and their EDSLs.

/The only cogent purpose of general purpose language design is to raise the LCD./

Optional features are a waste of time and effort, BGB - yours, and of everyone who as a result submits a bug report or wades through the documentation.

optional features are very common in most systems, though, and in this case most of the optional features are intended for library development and "low-level" programming, notably pointers, ...

so, code which doesn't need to use pointers, shouldn't use pointers, and if people choose not to use them, that is no problem for me.

however, for some tasks, like C interop, they can be fairly useful...

anyways, C# and C++ do basically the same thing...



    with a language/VM existing for approx 8 years and with ~ 540
    opcodes, ... I guess things like this are inevitable.


I think this is a property of your language design philosophy, rather than inherent to language development.


well, this language isn't exactly the same as something like Lua or Scheme, and is not intended to strive for elegance or minimalism or similar.

it is at this point sort of "between" lighter-weight languages (such as JavaScript) and heavier languages (such as C# or C++).

in an "ideal" world, it would be able to be usable in a similar abstraction domain roughly between C++ and JavaScript.



    but whitelisting is potentially much more effort than
    blacklisting, even if potentially somewhat better from a security
    perspective.


Effectiveness for effort, whitelisting is typically far better than blacklisting. In most cases, it is less effort. Always, it is far easier to reason about. I think you'll need to stretch to find rare counter-examples.


it depends I think.

I mostly figure one can blacklist most of the obvious holes (direct access to OS-level C APIs and unrestrained pointers, for example), and probably leave the rest for later.
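(for illustration, a minimal sketch with made-up symbol tables rather than anything from the actual VM: applied to, say, FFI symbol names, a blacklist leaves anything it didn't anticipate open by default, whereas a whitelist closes it by default.)

#include <string.h>

/* hypothetical tables, for illustration only */
static const char *ffi_blacklist[] = { "system", "exec", "fopen", NULL };
static const char *ffi_whitelist[] = { "printf", "sin", "cos", NULL };

/* blacklist: anything not explicitly forbidden is allowed
   (unknown or forgotten holes stay open) */
int blacklist_allows(const char *sym)
{
    int i;
    for (i = 0; ffi_blacklist[i]; i++)
        if (!strcmp(sym, ffi_blacklist[i]))
            return 0;
    return 1;
}

/* whitelist: anything not explicitly permitted is denied
   (unknown or forgotten holes stay closed) */
int whitelist_allows(const char *sym)
{
    int i;
    for (i = 0; ffi_whitelist[i]; i++)
        if (!strcmp(sym, ffi_whitelist[i]))
            return 1;
    return 0;
}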



    LambdaMoo found a MUD, if this is what was in question...


LambdaMoo is a user-programmable MUD, with prototype based objects and a Unix-like security model.


    as for "simple" or "efficient", a Unix-style security model
    doesn't look all that bad.


Unix security model is complicated, inefficient, and ineffective compared to object capability model. But I agree that you could do worse.

most of the security checking amounts to if/else and bit-masking and other things.

this then is wrapped in "CanIDoX()" style function calls.

if(!CanIHasCheezeburger(...))
{
    ... BARF and throw something...
}

really, it is not much different from (or worse than) dynamic type-checking...
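(as a concrete illustration, a minimal sketch with made-up struct, flag, and function names, not the actual VM code: a Unix-style check of this sort is just a few compares and a bit-mask.)

/* hypothetical permission bits and struct, roughly in the Unix "rwx" spirit */
#define ACC_UR 0400   /* owner may read   */
#define ACC_UW 0200   /* owner may write  */
#define ACC_GR 0040   /* group may read   */
#define ACC_GW 0020   /* group may write  */
#define ACC_OR 0004   /* others may read  */
#define ACC_OW 0002   /* others may write */

typedef struct {
    int uid, gid;     /* owner of the object/function */
    int mode;         /* permission bits, as above    */
} ObjAcl;

/* returns nonzero if a caller running as uid/gid may perform 'want',
   where 'want' is given as owner-column bits (ACC_UR, ACC_UW, ...) */
int CanIAccess(const ObjAcl *acl, int uid, int gid, int want)
{
    if (uid == 0)         return 1;                              /* "root" bypasses the checks */
    if (uid == acl->uid)  return (acl->mode & want) != 0;        /* owner column  */
    if (gid == acl->gid)  return (acl->mode & (want >> 3)) != 0; /* group column  */
    return (acl->mode & (want >> 6)) != 0;                       /* others column */
}

(something like CanIAccess(&fn_acl, ctx_uid, ctx_gid, ACC_UR) would then sit inside the "CanIDoX()"-style wrappers mentioned above; again, the names here are hypothetical.)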



    luckily, there are only a relatively small number of places I
    really need to put in security checks (mostly in the object system
    and similar). most of the rest of the typesystem or VM doesn't
    really need them.


I recommend you pursue control of the toplevel capabilities (FFI, and explicit forwarding of math, etc.) as you demonstrated earlier, perhaps add some support for 'freezing' objects to block implicit delegation of assignment, and simply forget about Unix or permissions checks.


the permissions are currently intended as the underlying model by which a lot of the above can be achieved.

granted, yes, there are potential differences between, say, making the FFI not visible (by setting the reference to null), causing the FFI object to be a no-op, or doing both, but either way...

I initially considered just making a single state flag, basically a system/user flag (similar to the Ring 0/1/2/3 concept in x86, or the "safe/unsafe" concept in .NET), but figured a naive Unix-like model could also provide limited protection of apps from each other, even when both are running in the same address space and may have access to many of the same objects, ...

so, for the most part, it amounts to a "root/non-root" and "A vs B" issue. as noted, I am currently making no attempt to implement either ACLs or the full scope of the POSIX security model (which does use ACLs and similar).

also, these checks will currently only apply to objects and functions; most other data types (lists, arrays, ...) will not have them. (for technical reasons, array-based checks would be considerably more frequent than toplevel checks: essentially, an array-based check would be invoked on every array access, rather than, say, only the first time a given UID+GID tries to access a particular function or method.) (or at least, this will be the case after I get around to: adding the relevant "access" field to the main VM thread contexts, locating the code for the hash table, and adding the access value into the hash function and entry match-check, ...).

technically, the access checks are done during the "recursive search" phase of lookups, whereas there is actually a "lookup hash" which is consulted before that, and a cache hit is assumed sufficient evidence that one has access (if one didn't have access, there would have been an exception, and the hash slot would have been set to "undefined" or similar).

(note that the hash is kept from getting "stale" by having certain operations, such as assigning to delegates, ..., essentially flush the hash).

(note: there is a similar hash used mostly for implementing per-class slot and method lookups and interface-method dispatches, but this is technically a different hash table in a different part of the codebase, and is flushed under different criteria).

this means, say, for:
for(i=0; i<100; i++)
    printf("test %d\n", i);

"printf()" may only needs a single access-rights check, rather than 100 such checks.


the reason so much hashing/caching/... is used is that, when the FFI gets involved, some of this stuff (database queries, code generation, ...) can actually get kind of slow (previously I was looking at 1-2s stalls during queries, although IIRC I went and optimized something, mostly a trivial reorganization of the DB structure and query mechanism, which unexpectedly resulted in a drastic speedup).

one can actually debate which is slower:
security access checks;
or performing queries against a largish database (~250k entries, IIRC), and dynamically generating glue code (writing out assembler, assembling it, and linking it into the program image), ...

really, it doesn't seem like all that big of a deal...

actually, a bit of trivia: even with the power of AVL trees, 250k entries in a single list can be a bit costly. splitting the DB into a number of smaller per-library lists seemed to notably speed up the query times (for whatever reason, 30 lists of ~8k entries each can be queried more quickly than 1 list of 250k entries). it is a mystery...

granted, yes, DB queries are typically implemented using for-loops, strings, "sprintf()", and sequential probing, ... but it works...
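(again purely for illustration, with made-up structs rather than the actual DB layout: a query in the "for-loops and sprintf()" style, where splitting the metadata into per-library lists means each lookup only walks one small list instead of the whole ~250k entries.)

#include <stdio.h>
#include <string.h>

/* hypothetical flattened metadata entry: "libname:symbol" -> signature string */
typedef struct { const char *key; const char *sig; } DbEnt;
typedef struct { const char *lib; const DbEnt *ents; int n_ents; } DbLib;

const char *QuerySig(const DbLib *libs, int n_libs,
                     const char *lib, const char *sym)
{
    char key[256];
    int i, j;
    snprintf(key, sizeof(key), "%s:%s", lib, sym);
    for (i = 0; i < n_libs; i++) {
        if (strcmp(libs[i].lib, lib) != 0)
            continue;                  /* only walk the one matching per-library list */
        for (j = 0; j < libs[i].n_ents; j++)
            if (!strcmp(libs[i].ents[j].key, key))
                return libs[i].ents[j].sig;
    }
    return NULL;                       /* not found */
}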


or such...

_______________________________________________
fonc mailing list
fonc@vpri.org
http://vpri.org/mailman/listinfo/fonc
