subject:"Re\: \[sqlalchemy\] session.query\(\).get\(\) is unsupported during flush for getting an object that was just added\?"

Re: [sqlalchemy] session.query().get() is unsupported during flush for getting an object that was just added?

2012-01-26 Thread Michael Bayer

On Jan 26, 2012, at 11:28 AM, Kent Bower wrote:

I think I understand why, during a flush(), if I use session.query().get()
for an item that was just added during this flush, I don't get the persistent
object I might expect because the session still has it as pending even
though, logically, it is already persistent.

I don't suppose you have any desire to support that, huh? The use case would
be related to the future ticket http://www.sqlalchemy.org/trac/ticket/1939
(and http://www.sqlalchemy.org/trac/ticket/2350).

Attached is a script demonstrating the issue I've hit. I can work around it
with some difficulty, but I wanted your input and thoughts.

No, there's no plans to support this case at all; you're using the Session
inside of a mapper event, which is just not supported, and can never be due to
the nature of the unit of work. The most recent docstrings try to be very
explicit about this:

http://docs.sqlalchemy.org/en/latest/orm/events.html#sqlalchemy.orm.events.MapperEvents.before_update

I guess I have to add session.query() and get() in there as well.

The way the flush works is not as straightforward as persist object A; persist
object B; persist object C - that is, these are not atomic operations inside
the flush.It's more like, Perform step X for objects A, B, and C; perform
step Y for objects A, B and C. This is basically batching, and is necessary
since it is vastly more efficient than atomically completing each object one at
a time. Also, some decisions are needed by Y which can't always be made until
X has completed for objects involved in dependencies.

A side effect of batching is that if we provide a hook that emits after X and
before Y, you're being exposed to the objects in an unusual state. Hence, the
hooks that are in the middle like that are only intended to emit SQL on the
given Connection; not to do anything ORM level beyond assigning column-based
values on the immediate object.As always, before_flush() is where ORM-level
manipulations are intended to be placed.

--
You received this message because you are subscribed to the Google Groups
sqlalchemy group.
To post to this group, send email to sqlalchemy@googlegroups.com.
To unsubscribe from this group, send email to
sqlalchemy+unsubscr...@googlegroups.com.
For more options, visit this group at
http://groups.google.com/group/sqlalchemy?hl=en.

Re: [sqlalchemy] session.query().get() is unsupported during flush for getting an object that was just added?

2012-01-26 Thread Kent

Fair enough. I had enough understanding of what must be going on to
know flush isn't straightforward, but I'm still glad I asked. Sorry for
having not read the documents very well and thanks for your answer,
because from it, I surmise that before_flush() *is* safe for session
operations, which is very good to understand more clearly.

Thanks.

On 1/26/2012 12:06 PM, Michael Bayer wrote:

On Jan 26, 2012, at 11:28 AM, Kent Bower wrote:

I think I understand why, during a flush(), if I use session.query().get() for
an item that was just added during this flush, I don't get the persistent
object I might expect because the session still has it as pending even though,
logically, it is already persistent.

Attached is a script demonstrating the issue I've hit. I can work around it
with some difficulty, but I wanted your input and thoughts.

http://docs.sqlalchemy.org/en/latest/orm/events.html#sqlalchemy.orm.events.MapperEvents.before_update

I guess I have to add session.query() and get() in there as well.

The way the flush works is not as straightforward as persist object A; persist object B;
persist object C - that is, these are not atomic operations inside the flush.It's more
like, Perform step X for objects A, B, and C; perform step Y for objects A, B and C.
This is basically batching, and is necessary since it is vastly more efficient than atomically
completing each object one at a time. Also, some decisions are needed by Y which can't always be
made until X has completed for objects involved in dependencies.

Re: [sqlalchemy] session.query().get() is unsupported during flush for getting an object that was just added?

2012-01-26 Thread Michael Bayer

yup, before_flush is made for that, and I've for some time had some vague plans
to add some more helpers there so you could get events local to certain kinds
of objects in certain kinds of states, meaning it would look a lot like
before_update. But looping through .new, .dirty, and .deleted is how to do it
for now.

On Jan 26, 2012, at 12:12 PM, Kent wrote:

Fair enough. I had enough understanding of what must be going on to know
flush isn't straightforward, but I'm still glad I asked. Sorry for having
not read the documents very well and thanks for your answer, because from it,
I surmise that before_flush() *is* safe for session operations, which is very
good to understand more clearly.

Thanks.

On 1/26/2012 12:06 PM, Michael Bayer wrote:
On Jan 26, 2012, at 11:28 AM, Kent Bower wrote:

I think I understand why, during a flush(), if I use session.query().get()
for an item that was just added during this flush, I don't get the
persistent object I might expect because the session still has it as
pending even though, logically, it is already persistent.

I don't suppose you have any desire to support that, huh? The use case
would be related to the future ticket
http://www.sqlalchemy.org/trac/ticket/1939 (and
http://www.sqlalchemy.org/trac/ticket/2350).

Attached is a script demonstrating the issue I've hit. I can work around
it with some difficulty, but I wanted your input and thoughts.
No, there's no plans to support this case at all; you're using the Session
inside of a mapper event, which is just not supported, and can never be due
to the nature of the unit of work. The most recent docstrings try to be
very explicit about this:

http://docs.sqlalchemy.org/en/latest/orm/events.html#sqlalchemy.orm.events.MapperEvents.before_update

I guess I have to add session.query() and get() in there as well.

The way the flush works is not as straightforward as persist object A;
persist object B; persist object C - that is, these are not atomic
operations inside the flush.It's more like, Perform step X for objects
A, B, and C; perform step Y for objects A, B and C. This is basically
batching, and is necessary since it is vastly more efficient than atomically
completing each object one at a time. Also, some decisions are needed by Y
which can't always be made until X has completed for objects involved in
dependencies.

A side effect of batching is that if we provide a hook that emits after X
and before Y, you're being exposed to the objects in an unusual state.
Hence, the hooks that are in the middle like that are only intended to emit
SQL on the given Connection; not to do anything ORM level beyond assigning
column-based values on the immediate object.As always, before_flush() is
where ORM-level manipulations are intended to be placed.

Re: [sqlalchemy] session.query().get() is unsupported during flush for getting an object that was just added?

2012-01-26 Thread Kent

So, as a typical example of where it seems very natural to use
before_update, suppose you need to automatically update the not null
sequence of a related table. This but to get the sequence you need to
loop over the parent table's collection.

You want the sequence to be human friendly (natural primary key) and
also you want to be able to sort by sequence guaranteed in order without
the possibility of a database sequence wrap around. So you want the
sequence 1,2,3...

This seems extremely well fit for before_insert, like this:
==
parents_table = Table(parents, metadata,
Column(id, Integer, primary_key=True),
)

children_table = Table(children, metadata,
Column(parentid, Integer, ForeignKey('parents.id'),),
Column(sequence, Integer, primary_key=True),
)

class Parent(object):
pass

class Child(object):
pass

mapper(Parent, parents_table,
properties={'children': relationship(Child,
cascade='all,delete-orphan',
backref='parent')
})

mapper(Child, children_table)

@event.listens_for(Child, 'before_insert')
def set_sequence(mapper, connection, instance):
if instance.sequence is None:
instance.sequence = (max(c.sequence for c in
instance.parent.children) or 0) + 1

But this reaches across relationships, so that is actually not desired
here, is that correct?

For this, you would loop over session.new in before_update, is that how
you would approach this requirement?

On 1/26/2012 12:34 PM, Michael Bayer wrote:

On Jan 26, 2012, at 12:12 PM, Kent wrote:

Fair enough. I had enough understanding of what must be going on to know flush
isn't straightforward, but I'm still glad I asked. Sorry for having not read
the documents very well and thanks for your answer, because from it, I surmise
that before_flush() *is* safe for session operations, which is very good to
understand more clearly.

Thanks.

On 1/26/2012 12:06 PM, Michael Bayer wrote:

On Jan 26, 2012, at 11:28 AM, Kent Bower wrote:

I think I understand why, during a flush(), if I use session.query().get() for
an item that was just added during this flush, I don't get the persistent
object I might expect because the session still has it as pending even though,
logically, it is already persistent.

Attached is a script demonstrating the issue I've hit. I can work around it
with some difficulty, but I wanted your input and thoughts.

http://docs.sqlalchemy.org/en/latest/orm/events.html#sqlalchemy.orm.events.MapperEvents.before_update

I guess I have to add session.query() and get() in there as well.

The way the flush works is not as straightforward as persist object A; persist object B;
persist object C - that is, these are not atomic operations inside the flush.It's more
like, Perform step X for objects A, B, and C; perform step Y for objects A, B and C.
This is basically batching, and is necessary since it is vastly more efficient than atomically
completing each object one at a time. Also, some decisions are needed by Y which can't always be
made until X has completed for objects involved in dependencies.

Re: [sqlalchemy] session.query().get() is unsupported during flush for getting an object that was just added?

2012-01-26 Thread Michael Bayer


On Jan 26, 2012, at 1:21 PM, Kent wrote:

 So, as a typical example of where it seems very natural to use 
 before_update, suppose you need to automatically update the not null 
 sequence of a related table. This but to get the sequence you need to loop 
 over the parent table's collection.
 
 You want the sequence to be human friendly (natural primary key) and also 
 you want to be able to sort by sequence guaranteed in order without the 
 possibility of a database sequence wrap around.  So you want the sequence 
 1,2,3...
 
 This seems extremely well fit for before_insert, like this:
 ==
 parents_table = Table(parents, metadata,
Column(id, Integer, primary_key=True),
 )
 
 children_table = Table(children, metadata,
Column(parentid, Integer, ForeignKey('parents.id'),),
Column(sequence, Integer, primary_key=True),
 )
 
 class Parent(object):
pass
 
 class Child(object):
pass
 
 mapper(Parent, parents_table,
properties={'children': relationship(Child,
cascade='all,delete-orphan',
backref='parent')
})
 
 mapper(Child, children_table)
 
 @event.listens_for(Child, 'before_insert')
 def set_sequence(mapper, connection, instance):
if instance.sequence is None:
instance.sequence = (max(c.sequence for c in instance.parent.children) 
 or 0) + 1
 ==
 
 But this reaches across relationships, so that is actually not desired here, 
 is that correct?

that is correct.

 
 For this, you would loop over session.new in before_update, is that how you 
 would approach this requirement?

If the value is based on what's already been INSERTed for previous rows, I'd 
emit a SQL statement to get at the value.If it's based on some kind of 
natural consideration that isn't dependent on the outcome of an INSERT 
statement, then you can do the looping above within the before_flush() event 
and assign everything at once.Basically you need to batch the same way 
the UOW itself does.


-- 
You received this message because you are subscribed to the Google Groups 
sqlalchemy group.
To post to this group, send email to sqlalchemy@googlegroups.com.
To unsubscribe from this group, send email to 
sqlalchemy+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/sqlalchemy?hl=en.

Re: [sqlalchemy] session.query().get() is unsupported during flush for getting an object that was just added?

Re: [sqlalchemy] session.query().get() is unsupported during flush for getting an object that was just added?

Re: [sqlalchemy] session.query().get() is unsupported during flush for getting an object that was just added?

Re: [sqlalchemy] session.query().get() is unsupported during flush for getting an object that was just added?

Re: [sqlalchemy] session.query().get() is unsupported during flush for getting an object that was just added?

5 matches

Site Navigation

Mail list logo

Footer information