Re: [cfe-commits] [PATCH] Expressions have lvalues and rvalues

Ted Kremenek Wed, 15 Oct 2008 00:28:06 -0700


On Oct 12, 2008, at 2:12 AM, Zhongxing Xu wrote:

I don't see these are redundant. Values are raw bits interpretedwithin some context. We make a fundamental distinction between twotypes of values: value that represents some address and value thatnot. So I prefer we have two kinds of ConcreteInt: lval::ConcreteIntand nonlval::ConcreteInt. And they are seldom same.lval::ConcreteInt usually is very large.

You're right. I originally wrote this code this way for this veryreason, and in the course of this discussion I confused myself. I wasjust trying to think of whether or not an integer value was really anlvalue. Yes it can represent a location, but is it really an lvalue.The current way things are "typed" with lval and nonlval, however,makes the analyzer readily understand code like the following:


int foo() {
  return *&*((int*) 0xa0000);
}

In this example, the pointer cast causes the integer literal to betreated as an lval::ConcreteInt. This isn't really an lvalue though;it's really an rvalue that represents the location of something inmemory (at address 0xa0000). The '*' operator, however, expects anrvalue (per the wording in the C++ standard). So while nonlval andlval do reason about locations, they aren't really lvalues or rvaluesat all; just something that approximates them. Hence the motivationto change their names.

One thing about this is that it makes the transfer functionstructure basically fall out from what's in the C/C++ standards.For example, the '*' operator essentially has the following typesignature:
* : rval -> lval
Similarly, references to variables and the '&' operator could berepresented as follows:
variable reference:  (declrefexpr) -> lval
& : lval -> rval  (with the rval being an rval::MemoryRegion)
In contexts where an lval is used as an rval, we have an implicitconversion (as stated in the C++ standard). Such implicitconversations would be represented by a transfer function, whichcause a new state and ExplodedNode to be created to represent theeffect of this conversion. For example, an implicit conversion froman lval to rval could result in a value load (e.g., EvalLoad, whichwould have the type signature lval -> rval).
I don't understand your meaning very clearly. For * operator, wejust get its operand's rvalue, which is a location value. If we areat the LHS of an assignment, this location value is what we want. Ifwe are at the RHS of an assignment, we do another EvalLoad with thislocation value. For & operator, we get its operand's lvalue, andthis location value is the rvalue of the whole expression.

My pedantic point was that lval:: and nonlval:: classes are notlvalues and rvalues. We could change them to be as such; in this casethe transfer function of operator '*' would always return an lvalue,and then whatever used that lvalue would then perform the implicitconversion. We invert this in the static analyzer right now; thetransfer function logic for '*' does the implicit lvalue->rvalueconversion based on context (i.e., if the asLVal flag is not set).The current approach makes sense since we want to associate with agiven expression the value the expression evaluates to; this includesthe result of implicit conversions.

At the end of the day, however, lval:: and nonlval:: classes are notlvalues/rvalues respectively (in the C++ parlance). My question waswhether or not we should change the use of these classes so that theyEXACTLY map to lvalues/rvalues. After looking at the code, reviewingthe patch, thinking about the overall design of GRExprEngine, and allthe comments made here, I'm not in favor of this idea anymore. Ithink it is more useful to reason about locations versus non-locationsthan lvalues versus rvalues.

I am not letting the distinction between rvalues and lvalues happenin the Store. They do happen in GRExprEngine in my patch. Noticethat I only added a getLValue() to StoreManager. The intention is tolet the Store to return the lvalue of an expression.

To me that is the point of MemRegion. Shouldn't MemRegion be able torepresent the location in all cases? In my mind, an implementation ofStore should not depend on RValues.h. If a Store just reasons aboutregions, it doesn't need to think about lval objects, or is this notthe case?

I know I'm the one who came up with lval::FieldOffset,lval::ArrayOffset, etc. I'm questioning this decision. It just seemsto be hard-coding a particular Store's conception of memory into thelval classes. These concepts can easily be represented (far moreelegantly) as regions. Once you are just dealing with regions,StoreManager::getLValue() only needs to return a region type.

Because for different stores, we may have different representationof location values for the same expression. For example, inBasicStore, we may have a different location value for the lvalue ofexpression a[3] than in RegionStore

To me Stores can return different regions for a[3]. However, havingthe Store return an lval:: object has one distinct advantage that Isee: we don't have to define "Undefined" or "Unknown" for regions.

. So I let StoreManager to determine the concrete representation ofan expression's lvalue. Whether we want the lvalue or rvalue of anexpression is decided by the GRExprEngine according to the context,e.g., the position of the expression in the parent expression.


Right.

Yeah, we should limit this rvalue/lvalue distinction withinGRExprEngine. And the intention of my patch is this! Let mesummarize my patch in the following:
In GRExprEngine, we know when we want the rvalue of an expressionand when we want the lvalue of an expression. If we want the lvalueof an expression, we first evaluate all of its sub-exprs, then weask the Store what the concrete form of its lvalue. The Store returnus a location value.


This all makes much more sense to me now.

_______________________________________________
cfe-commits mailing list
[email protected]
http://lists.cs.uiuc.edu/mailman/listinfo/cfe-commits

Re: [cfe-commits] [PATCH] Expressions have lvalues and rvalues

Reply via email to