Re: [sqlalchemy] Guidance regarding nested session usage

Mike Bayer Wed, 09 Aug 2017 13:27:06 -0700

On Wed, Aug 9, 2017 at 12:14 PM,  <a...@withplum.com> wrote:
> Hey,
>
> I'd like some help regarding nested session usage please.
>
> I'm working on an application that has an API layer but also has a lot of
> cron jobs (via Celery) and scripts. I'm trying to design the app in a way
> that my "business" logic is contained and re-usable by any of these
> interfaces.
>
> The SQLAlchemy session scope is request/task-wide (i.e requests and tasks
> remove the scoped session at the end) but I am doing explicit commits
> instead of committing on request end because I sometimes have to deal with
> complicated logic like creating/submitting transactions to payment
> processors etc.
>
> To start off, I use a context manager, much like the docs, which commits or
> rollbacks as necessary. I then have a layer of actions, which are considered
> "top-level" functions that can do a simple operation e.g update something or
> a collection of operations i.e create and submit a transaction. These
> actions use the context manager above to persist stuff and I've opted to
> keep all session "usage" in these actions alone and nowhere else in the
> code. Pretty soon, the need to use some of the simpler actions inside other,
> bigger actions arose which, after reading the docs, led me to turn
> autocommit=True and use session.begin(subtransactions=True). Note that I
> don't want to use savepoints, I just want to be able to use my actions
> inside other actions. The docs recommend that expire_on_commit is set to
> False with autocommit, which I've done but that led to a couple of
> situations where I was operating on out-of-date data hence I want to turn
> expire_on_commit to True again.
>
> My questions:
>
> (1) Does my application layout make sense from a SQLAlchemy perspective?


it's getting close to where I will have to make some major additions
to the docs as well as probably add some new patterns up.    The story
of "autocommit" mode as well as "subtransactions" are based on a
SQLAlchemy that was designed both when "always-transactions" was not
an assumption (it is now) as well as during Python 2.3, when not only
context managers hadn't been invented yet, we didn't even have
decorators.   The problem that "subtransactions" were meant to solve
are much better solved using normal Python today, that is, context
managers.    If you use the context manager pattern, use it with a
scoped_session and then have your context manager check if there's
already a session present (e.g. that your context managers are
nested), and then simply do nothing in the block if this is the case.




> (2) What is the problem with expire_on_commit=True and autocommit=True?

so here I need to go look at the docs and try to understand why this
isn't clear....(goes and looks) OK here's what it says:

> If used, it should always be combined with the usage of Session.begin() and 
> Session.commit(), to **ensure a transaction demarcation.**

> Executing queries **outside of a demarcated transaction is a legacy mode of 
> usage**, and can in some cases lead to concurrent connection checkouts.

> "***In the absence of a demarcated transaction***, the Session cannot make 
> appropriate decisions as to when autoflush should occur nor when 
> auto-expiration should occur, so these features should be disabled with 
> autoflush=False, expire_on_commit=False."

that is referring to the original, original, super old, never-use-it
version of autocommit, where you aren't calling begin(), which looked
like this (we are talking SQLAlchemy 0.2):

sess = create_session()

sess.add(object)
sess.flush()   # <--- commits a transaction

result = sess.query(SomeClass).all()

sess.add(some_other_object)
sess.flush()  # <-- -same

above, it would be very inconvenient if all the objects in memory were
cleared out every time you called flush.

Here's the problem with documentation.  If you go through the effort
to make them very specific and very accurate, you end up with too many
words, and the reader will often not understand the basic point being
made.  Which is above, **if you are not using begin() and commit(),
the way we just said you should, then you should turn off
autocommit=True**.   That section needs another rewrite but then again
the entire concept of "subtransactions" also needs to be discouraged
as these are all obsolete patterns.


> (3) I feel that, even with the context manager, the transaction boundaries
> are still blurry because the developer does not know what will actually get
> committed in the database. For example, if a previous part of the code
> changed something, then called an action that commits the session, the
> previous change will get committed as well.

So this whole part sounds wrong.   If you want your database function
to occur in the context of a larger transaction, then by definition,
there may be other pending data present.  Whether that data is pending
in the session of your Python application, or pending in the MVCC
buffer of your database, doesn't matter from a transaction-level point
of view.  It might matter for performance or debugging reasons, but in
that case, you'd want to just emit flush() at the top of the block, so
that those pending changes are on the server side of the transaction
rather than the client side, but all of it is still pending as far as
being permanent to disk and visible to the rest of the world.

if you have a function that wants to persist data out to the database
and it does not want to persist data that is already pending in the
ongoing tranasction, it should use a separate transaction.     This is
a common use case and it is what you do if you are for example putting
rows into a job queue type of table, or sending out messages that are
going to show up in some log or console output somewhere.


> I've searched around and found
> this: https://github.com/mitsuhiko/flask-sqlalchemy/pull/447 which basically
> issues a rollback on entering the context manager to ensure that only what
> is within the context manager will get committed. What do you think of it?

I'm much more a proponent of writing one's own patterns that suit
their application rather than making the prepackaged ones in something
like Flask fit.  I think if it implicitly rolls back, that's a
terrible idea because if you truly expect that nothing important is
present in the session, it should be asserting that and raising if
something is found (look in session.new, session.dirty,
session.deleted).




I
> can immediately see a problem where if I query for an object before passing
> it to an action, then use the context manager, all the work done on querying
> is lost since the object state is expired on rollback.
>
> I'd appreciate any advice/input.
>
> Best,
> Alex
>
> --
> SQLAlchemy -
> The Python SQL Toolkit and Object Relational Mapper
>
> http://www.sqlalchemy.org/
>
> To post example code, please provide an MCVE: Minimal, Complete, and
> Verifiable Example. See http://stackoverflow.com/help/mcve for a full
> description.
> ---
> You received this message because you are subscribed to the Google Groups
> "sqlalchemy" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to sqlalchemy+unsubscr...@googlegroups.com.
> To post to this group, send email to sqlalchemy@googlegroups.com.
> Visit this group at https://groups.google.com/group/sqlalchemy.
> For more options, visit https://groups.google.com/d/optout.

-- 
SQLAlchemy - 
The Python SQL Toolkit and Object Relational Mapper

http://www.sqlalchemy.org/

To post example code, please provide an MCVE: Minimal, Complete, and Verifiable 
Example.  See  http://stackoverflow.com/help/mcve for a full description.
--- 
You received this message because you are subscribed to the Google Groups 
"sqlalchemy" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to sqlalchemy+unsubscr...@googlegroups.com.
To post to this group, send email to sqlalchemy@googlegroups.com.
Visit this group at https://groups.google.com/group/sqlalchemy.
For more options, visit https://groups.google.com/d/optout.

Re: [sqlalchemy] Guidance regarding nested session usage

Reply via email to