Re: Classes on the stack trace

Nick Williams Wed, 31 Jul 2013 06:20:05 -0700

Okay, but I don't think @CallerSensitive is necessary to prevent MethodHandles 
from spoofing caller classes. I'll admit I don't know very much about 
MethodHandle, what it does, or how its invocation methods are different from 
Method.invoke(). In the code that I copied into this email from jvm.cpp last 
night, the third frame (frame #2) will always be returned unless it is one 
ignored by the security stack walk. @CallerSensitive doesn't come in to play 
here.


Nick

On Jul 31, 2013, at 2:36 AM, Peter Levart wrote:

> Hi Nick,
> 
> The @CallerSensitive annotation is an annotation that prevents some other 
> infrastructure, namely the MethodHandles, to "spoof" caller classes.
> 
> Try this:
> 
>         MethodHandles.Lookup lookup = MethodHandles.lookup().in(Object.class);
>         MethodHandle mh = lookup.findStatic(Class.class, "forName", 
> MethodType.methodType(Class.class, String.class));
> 
> ...you won't be able to pretend that you are the j.l.Object that is calling 
> method Class.forName(String)...
> 
> The annotation might have (or will have?) other effects like making sure some 
> infrastructure-inserted frames are hidden-away just for @CallerSensitive 
> methods which might be less optimal and not needed for normal methods.
> 
> 
> Regards, Peter
> 
> On 07/31/2013 01:11 AM, Nick Williams wrote:
>> Quick question for those of you that know anything about @CallerSensitive...
>> 
>> After looking at the code and experimenting some, I've discovered that 
>> getCallerClass() doesn't actually keep going until it finds the first method 
>> without @CallerSensitive. It only returns the caller of the caller. So, for 
>> example:
>> 
>> Stack 1
>> @CallerSensitive Reflection.getCallerClass()
>> @CallerSensitive MyClass1.method1();
>> MyClass2.method2();
>> 
>> In this case, getCallerClass() returns MyClass2.class. BUT:
>> 
>> Stack 2
>> @CallerSensitive Reflection.getCallerClass()
>> @CallerSensitive MyClass1.method1();
>> @CallerSensitive MyClass2.method2();
>> MyClass3.method3();
>> 
>> In this case, getCallerClass() STILL returns MyClass2.class. Based on the 
>> plain-language meaning of @CallerSensitive, I would expect getCallerClass() 
>> to return MyClass3.class in the second case. But, indeed, the JavaDoc for 
>> Reflection.getCallerClass() says: "Returns the class of the caller of the 
>> method calling this method." So, then, what's the point of @CallerSensitive? 
>> Looking at the code:
>> 
>>    vframeStream vfst(thread);
>>    // Cf. LibraryCallKit::inline_native_Reflection_getCallerClass
>>    for (int n = 0; !vfst.at_end(); vfst.security_next(), n++) {
>>      Method* m = vfst.method();
>>      assert(m != NULL, "sanity");
>>      switch (n) {
>>      case 0:
>>        // This must only be called from Reflection.getCallerClass
>>        if (m->intrinsic_id() != vmIntrinsics::_getCallerClass) {
>>          THROW_MSG_NULL(vmSymbols::java_lang_InternalError(), 
>> "JVM_GetCallerClass must only be called from Reflection.getCallerClass");
>>        }
>>        // fall-through
>>      case 1:
>>        // Frame 0 and 1 must be caller sensitive.
>>        if (!m->caller_sensitive()) {
>>          THROW_MSG_NULL(vmSymbols::java_lang_InternalError(), 
>> err_msg("CallerSensitive annotation expected at frame %d", n));
>>        }
>>        break;
>>      default:
>>        if (!m->is_ignored_by_security_stack_walk()) {
>>          // We have reached the desired frame; return the holder class.
>>          return (jclass) JNIHandles::make_local(env, 
>> m->method_holder()->java_mirror());
>>        }
>>        break;
>>      }
>>    }
>> 
>> It seems to me that @CallerSensitive is completely pointless. This is ALWAYS 
>> going to return the first non-reflection frame after frame 1, regardless of 
>> @CallerSensitive. If @CallerSensitive were really supposed to have an actual 
>> purpose, it would seem to me that the last part should be this:
>> 
>>        if (!m->is_ignored_by_security_stack_walk() && 
>> !m->caller_sensitive()) {
>>          // We have reached the desired frame; return the holder class.
>>          return (jclass) JNIHandles::make_local(env, 
>> m->method_holder()->java_mirror());
>>        }
>> 
>> Am I completely missing the point here? I just don't see a reason for 
>> @CallerSensitive. The code could do the exact same thing it currently is 
>> without @CallerSensitive (except for enforcing that frame 1 is 
>> @CallerSensitive, which really isn't necessary if you aren't using it in 
>> further frames).
>> 
>> Thoughts?
>> 
>> Nick
>> 
>> On Jul 30, 2013, at 10:33 AM, Jochen Theodorou wrote:
>> 
>>> Am 30.07.2013 16:16, schrieb Peter Levart:
>>>> On 07/30/2013 03:19 PM, Jochen Theodorou wrote:
>>>>> Am 30.07.2013 14:17, schrieb Peter Levart:
>>>>> [...]
>>>>>> So what would give Groovy or other language runtimes headaches when all
>>>>>> there was was a parameter-less getCallerClass() API? Aren't the
>>>>>> intermediate frames inserted by those runtimes controlled by the
>>>>>> runtimes? Couldn't the "surface" runtime-inserted methods capture the
>>>>>> caller and pass it down? I guess the problem is supporting calling the
>>>>>> caller-sensitive methods like Class.forName(String) and such which don't
>>>>>> have the overloaded variant taking caller Class or ClassLoader as an
>>>>>> argument...
>>>>> Speaking for Groovy...
>>>>> those intermediate frames are runtime controlled, yes, but passing down
>>>>> the caller class is exactly the problem. Imagine I would suggest that
>>>>> each and every method definition in Java automatically gets an
>>>>> additional parameter for the caller class, just to have access to it
>>>>> inside the method. You would not accept that for Java, would you? And so
>>>>> we cannot accept that for Groovy if we want to keep integration with
>>>>> Java...
>>>> Are you talking about internal Groovy implementation (the
>>>> runtime-inserted methods) or the publicly visible API?
>>> that's the problem, it is a mix, some internal, other not. We are going to 
>>> change that in Groovy 3
>>> 
>>>> One solution for
>>>> internal implementation of Groovy could be (speaking by heart since I
>>>> don't know the internals of Groovy) for the "surface" public API method
>>>> which doesn't have to have the special caller parameter, to capture the
>>>> caller with getCallerClass() parameterless API (possibly enclosed with a
>>>> quick check confirming that it might actually be needed) and bind it to
>>>> a ThreadLocal, then use this ThreadLocal down at the end...
>>> confirming that it might actually be needed is a problem. In the old 
>>> fallback path we don't know what we call until after we are deep in runtime 
>>> code, and there it is too late. In the other paths we could mark those 
>>> methods in a @CallerSensitive style and do it in that case only.
>>> 
>>>>> and the good integration with Java is one of the key points of
>>>>> Groovy. Even if we make something like that @CallerSensitive and add the
>>>>> parameter only in those cases, we break being able to override methods.
>>>> I guess I don't know every Groovy need to obtain the caller class. I
>>>> thought the problem was to support calling caller-sensitive methods in
>>>> Java API (like Class.forName(String)) from within Groovy code, where
>>>> there are runtime-inserted frames between the "call-site" and the target
>>>> method. Are there any other needs?
>>> ok, there is a misunderstanding...
>>> 
>>> if we call a Java implemented method from Groovy, which is using 
>>> getCallerClass() it may or may not work. In general this does not work and 
>>> our problem is not about that at all. With the change to let 
>>> getCallerClass() ignore some reflective frames it will work actually better 
>>> as long as we use our custom callsite caching implementation, it will not 
>>> work if indy is used or the fallback path.
>>> 
>>> To be able to call a method Class#forName(String), we need to "replace" it 
>>> with an implementation of our own, which we do with an approach similar to 
>>> extension methods (only that ours can hide existing implementation methods 
>>> for groovy). And in there we need to get to the caller class
>>> 
>>> Our problem though is @Grab which is an annotation to add elements to the 
>>> classpath while running a script.
>>> 
>>>>> Plus, before Groovy3 is not done we have to support several call paths.
>>>>> And the oldest one, which is still a fallback, does not support
>>>>> transporting the caller class through the runtime layers at all.
>>>>> Changing here is a breaking change.
>>>> Could you describe those call-paths? Examples of Groovy code and to what
>>>> it gets translated (equivalent Java code at call site) with a brief
>>>> description of what each intermediate layer (between the call-site and
>>>> the target method) does and at which point the caller class is extracted...
>>> the code generated at the call site depends on several factors actually... 
>>> The call site code itself is usually not very informative
>>> 
>>> I start with Groovy 1.0, since that is basically the fallback path. Here 
>>> this.foo() translates more or less to
>>> ScriptBytecodeAdapter.invokeMethod0(staticCallerClass, this,"foo")
>>> which basically does this.getMetaClass().invokeMethod(staticCallerClass, 
>>> this, "foo"). The problem is that the meta class might be user supplied and 
>>> the code executed in invokeMethod as well. The invocation is then finally 
>>> done by reflection. That means we have frames from ScriptBytecodeAdapter, 
>>> from the meta class, as well as maybe frames from a custom meta class and 
>>> reflection frames. At the level of ScriptBytecodeAdapter there is a means 
>>> of transporting the caller class, but that is the static one. Once there is 
>>> a subclass, this information is different from what is needed here and it 
>>> cannot simply be exchanged. Even if the bytecode adapter is changed, we 
>>> cannot change the public API for MetaClass#invokeMethod now. And then the 
>>> information would be lost.
>>> 
>>> In later versions of Groovy (since 1.6) we introduced a custom call site 
>>> caching technique, which uses runtime generated classes to create a helper 
>>> class per call site and is then used for invocation. At the callsite we 
>>> basically have something like callsiteArray[i].invoke(..). Here again the 
>>> staticCallerClass can be found. In this version we are able to "get" the 
>>> method we want to invoke, before invoking it (bypassing 
>>> MetaClass#invokeMethod). But to be able to get the method, certain 
>>> conditions have to be met (like no user supplied meta class). If they are 
>>> not met, then we do basically the same path as in 1.0, only that we don't 
>>> use ScriptBytecodeAdapter. Instead We use our CallSite class as entrance 
>>> point, which then makes the call to the meta class. In the "efficent" case 
>>> we have now frames from the callsite handling code between the callsite and 
>>> the target method only. This includes reflection in the first 
>>> instantiation, later the generated class is used so it reduces to two frame
>>> s of which one is the Callsite entrance point, the other a frame form the 
>>> generated method. In the fallback case we have frames from the callsite 
>>> handling code, plus meta class code, plus reflection of course. Again the 
>>> fallback case prevents us from transporting the caller information to the 
>>> target method. If we ignore the fallback case, then we could here maybe use 
>>> the Threadlocal information. It will require a new callsite interface for 
>>> the bytecode though, meaning this code will not work for precompiled grovvy 
>>> of older version, excluding from getting into Groovy 2.1.x, since it would 
>>> be a breaking change. The earliest version for that would be Groovy 2.2.0, 
>>> which is almost in RC now. Effectively it would mean we would have to do a 
>>> 2.3.0 very soon after most probably.
>>> 
>>> In Groovy 2 we added an indy implementation, which replaces the callsite 
>>> caching code.  At the callsite we have here basically invokedynamic "foo" 
>>> with IndyInterface#bootstrap. bootstrap will first introduce a target for 
>>> IndyInterface#selectMethod, since I need the runtime types instead of the 
>>> static ones. The static caller class information is here part of the 
>>> bootstrap method as Lookup object, added by invokedynamic itself. After 
>>> selectMethod is done we have an initial invocation using invokeExact and 
>>> later invocations by the handle stored in the callsite. Of course the same 
>>> conditions as for the callsite caching above have to be met, meaning the 
>>> fallback path might appear. That makes initially one IndyInterface frame, 
>>> then invokedynamic and lambda related frames, then optionally the traget 
>>> method, or in the fallback case the meta class frames plus reflection
>>> 
>>> 
>>> bye Jochen
>>> 
>>> -- 
>>> Jochen "blackdrag" Theodorou - Groovy Project Tech Lead
>>> blog: http://blackdragsview.blogspot.com/
>>> german groovy discussion newsgroup: de.comp.lang.misc
>>> For Groovy programming sources visit http://groovy-lang.org
>>> 
>

Re: Classes on the stack trace

Reply via email to