On 10/04/2013 11:55, Tom Bachmann wrote:
This is essentially a continuation of my email "General strategy for removing old assumptions".

One fairly evident problem with replacing the old assumptions system by the new one in a naive way is caching. Recall that sympy has a "global cache": any function decorated with @cacheit will benefit from it. The cache works by assuming that the return value of the function depends exclusively on its arguments (i.e. the function is side-effect-free).

This makes sense when all assumptions are stored with their symbols. (The pervasive use of caching has led to problems in the past, but I think it is fine in principle.) On the other hand, as soon as assumptions are separated from symbols, this will fail (and usually in subtle ways).
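To make the failure mode concrete, here is a toy model (nothing sympy-specific; all names are made up for illustration) of a cached query whose answer depends on a global assumptions store:

```python
# Toy model: a global cache keyed only on the argument, plus a global
# assumptions store that lives separately from the symbols, as in the
# new assumptions design.
cache = {}
assumptions = set()

def is_positive(name):
    # Cached query; the cache key ignores the assumptions entirely.
    if name not in cache:
        cache[name] = name in assumptions
    return cache[name]

assumptions.add("x")
first = is_positive("x")    # True, and now cached
assumptions.discard("x")
second = is_positive("x")   # still True: a stale, subtly wrong answer
```

The second call returns the cached answer even though the assumptions have changed, which is exactly the kind of subtle wrongness the global cache invites here.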

Let's be clear that this is an inherent feature of the new assumptions system: assumptions *cannot* in general be tied to symbols, consider for example "x > y". We *want* to be able to support such assumptions, and this invalidates the naive caching strategy.


What modules rely most heavily on caching? I know that series does very much, and meijerint does to some extent. What about other modules?


There have been long discussions about caching before, in various contexts. Let me try to summarize what I think was the consensus:

The global cache, as it stands, is bad. To name only two issues: it can grow without bound in long computations, and it has often led to very subtle bugs (when some function was not truly side-effect-free).

Any other general objections to the global cache?
There is also a performance impact. Fetching data from the cache has non-trivial overhead. Also, on PyPy, it effectively disables the JIT, since lookups in a global dictionary cannot be optimised much and are probably significantly slower than executing a method that has been compiled down to a dozen machine instructions. The same considerations will apply to CPython if we ever attempt to statically compile the core (using e.g. Cython).

It is also felt that the cache should be unnecessary, as demonstrated by the success of e.g. sympycore. On the other hand, we accept that some algorithms (e.g. gruntz) benefit greatly from caching. Even if it were possible to redesign them so as not to repeat the same computation over and over, it is felt that using a cache achieves the same thing at a much smaller cost.

Did I miss any important points regarding caching?


The question now becomes what to do about the cache, in the context of the new assumptions system, both in the long and in the short term. In the short term, I see three ways of making new assumptions work with the cache:

(1) disable the cache
(2) clear the cache on every (global) assumptions change
(3) hash the assumptions during caching

If these are all the possibilities, then this is really about deciding which is the least horrible. It would be tempting in (3) to hash only those assumptions which contain symbols that also appear among the arguments of the function, but this is wrong: the function may internally consult assumptions about symbols that do not appear in its arguments at all. Thus the difference between (2) and (3) is fairly small. (1) is going to make gruntz unusable. (2) or (3) could yield a usable gruntz, but this is not obvious. [There is a better chance now than during the last attempts, because since then I made some changes to gruntz which improve caching by creating fewer fresh dummies, and thus fewer extra assumptions.]
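For concreteness, option (3) could look roughly like this (a toy sketch; `global_assumptions` here is just a set standing in for the real assumptions context):

```python
# Toy sketch of (3): the cache key includes a snapshot of the current
# global assumptions, so a value is only reused when the assumptions
# at lookup time are identical to those at store time.
cache = {}
global_assumptions = set()

def cacheit(func):
    def wrapped(*args):
        key = (func.__name__, args, frozenset(global_assumptions))
        if key not in cache:
            cache[key] = func(*args)
        return cache[key]
    return wrapped

@cacheit
def is_positive(name):
    return name in global_assumptions

global_assumptions.add("x")
first = is_positive("x")     # True, cached under assumptions {"x"}
global_assumptions.discard("x")
second = is_positive("x")    # False: different key, so no stale hit
```

Note that, unlike (2), old entries are not discarded; they simply stop matching, so the cache can keep growing across assumption changes.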


I think that (2) is a bad strategy, even mid-term. The reason is that it forces us to stop "making new assumptions" during any computation that relies on caching. However, I think patterns like

x = Dummy()
with assuming(Q.positive(x)):
  do some stuff

(or x = Dummy(positive=True) in the old system) are just too nice to avoid. And even if we avoid them in, say, all of gruntz.py (which seems manageable), our cache will still be flushed any time we "accidentally" call some other function using this pattern.
(2) sounds to me like it would end up being the worst of both worlds, with caching overhead remaining unchanged, but the cache providing little benefit since it's cleared all the time. And even if we managed to avoid clearing during a gruntz() call, this would be brittle against changes in seemingly unrelated code.

Strategy (3) mitigates this to some extent, but not completely.
The big problem is that you need to identify which assumptions actually matter, which means keeping a record of every place where you make a decision using assumptions; otherwise it is just like (2).


However, what should be observed is that most modules will need changing for the new assumptions system anyway. So making *minor* changes to the caching strategy is actually fine. This opens two more options, I think:

(4) introduce a clear_cache keyword to assuming
(5) introduce local caches

By (4), I mean that we replace the above pattern by

x = Dummy()
with assuming(Q.positive(x), clear_cache=False):
  do some stuff.

clear_cache will be True by default, and do what you think it does. However, in the above situation it is safe not to clear the cache, since the dummy is fresh. I *think* this should solve most problems. Indeed, this pattern always works when creating "fresh" symbols, and so it works in all situations where the current assumptions are set up (since we currently never allow changing them).
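A minimal implementation sketch of such an assuming (names hypothetical; a plain dict and set stand in for the real global cache and assumptions context):

```python
from contextlib import contextmanager

cache = {"stale": 1}        # stand-in for the global cache
global_assumptions = set()  # stand-in for the assumptions context

@contextmanager
def assuming(*facts, clear_cache=True):
    # Clear on entry and exit so cached values cannot leak between
    # different assumption contexts; callers that only assume facts
    # about fresh symbols can opt out with clear_cache=False.
    if clear_cache:
        cache.clear()
    global_assumptions.update(facts)
    try:
        yield
    finally:
        global_assumptions.difference_update(facts)
        if clear_cache:
            cache.clear()

with assuming("x is positive", clear_cache=False):
    kept = dict(cache)      # cache survives untouched
with assuming("x is positive"):
    cleared = dict(cache)   # cache was flushed on entry
```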
(4) looks like (2), only with a nicer syntax.


By (5), I mean to add an additional "id" to the @cacheit decorator, roughly like this (code written for clarity, not performance):

def cacheit(id):
  def decorator(func):
    def wrapped(*args):
      if id in local_caches_dict:
        cache = local_caches_dict[id]
        if args in cache:
          return cache[args]
        res = func(*args)
        cache[args] = res
        return res
      else:
        # outermost call: create the local cache, recurse into it,
        # and let new_local_cache destroy it afterwards
        with new_local_cache(id):
          return wrapped(*args)
    return wrapped
  return decorator

Here new_local_cache is a context manager which creates and destroys an entry in local_caches_dict. This kind of cache decorator is tailored towards the use of caching in gruntz. Will it help in other cases?

How about letting gruntz() handle all the caching it needs explicitly? There doesn't seem to be much point in making local_caches_dict global. Having a gruntz_cache instead of local_caches_dict["gruntz"] seems cleaner.


It seems to me that, under the general strategy outlined in my previous mail, (4) and (5) are essentially equally beneficial. (5) feels slightly cleaner to me.

What do you guys think?
Removing the global cache is actually rather appealing, even though it would probably need to be replaced with several smaller (module-level?) caches.
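As a sketch of what a module-level cache might look like (nothing sympy-specific, just functools from the standard library):

```python
from functools import lru_cache

# A module-level cache: private to the module that owns it, cheap to
# clear in isolation, and invisible to unrelated computations.
@lru_cache(maxsize=None)
def square(n):
    return n * n  # stand-in for an expensive, side-effect-free step

square(3)
square(3)  # the second call is served from this module's own cache
hits = square.cache_info().hits   # one hit so far
square.cache_clear()              # clearing touches only this cache
```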

The caching problem would also be a lot less sensitive if Dummy('x', positive=True) could continue to be a purely local operation that doesn't affect any global state.
