[Python-Dev] Re: Should set objects maintain insertion order too?

Larry Hastings Sun, 29 Dec 2019 07:58:36 -0800


On 12/28/19 6:24 PM, Tim Peters wrote:

[Larry]

Here is the original description of my problem, from the original email in
this thread.  I considered this an adequate explanation of my problem
at the time.

I do have a use case for this. In one project I maintain a "ready" list of
jobs; I need to iterate over it, but I also want fast lookup because I
soemtimes remove jobs when they're subsequently marked "not ready".

Which is a description not of "the problem", but of the operations you
want to perform.  It's the first time in my life I've heard of a
"ready list of jobs" where the programmer sometimes decides later "oh,
no - this one and that one aren't ready after all" ;-)

It's a lightweight abstract dependency graph. Its nodes are opaque,only required to be hashable. And it doesn't require that you give itall the nodes in strict dependency order.

When you add a node, you can also optionally specify dependencies, andthose dependencies aren't required to be nodes that the graph haspreviously seen. So you can add a node A that depends on B and C,without showing it B or C first. When that happens it createsplaceholder nodes for B and C, and naturally these nodes have nodependencies. You can then later inform the graph that node B hasdependencies on X Y and Z.

(My specific use case is a package manager. Packages are identifiedwith unique strings. So the nodes I give the give the graph are simplythose package names. It's pretty common for me to get a package withdependencies on packages I haven't seen yet. Designing the graph tocreate placeholders let me skip making two passes over the package list,pre-creating the package objects, etc etc. This made the code simplerand presumably faster.)

The graph API maintains an externally-visible "ready" iterable ofnodes. And since B can go from ready to not-ready, it can get added to"ready" and subsequently removed.

Also, when a node is satisfied, you simply remove it from the graph. IfA depends on B and C, and they all represent jobs, and B and C have nodependencies, they'll be in the "ready" iterable. You iterate over"ready", and execute those jobs, and assuming they are successful youthen remove them from the graph. When A's dependencies are allsatisfied, it'll get added to the "ready" iterable. And naturally whenB and C were removed from the graph, they were removed from the "ready"iterable too.

Thus it's this "ready" collection that I want to a) be iterable, b) bestable, and c) support fast removal. I add and remove nodes from thatset a lot, which I realized would perturb the iteration order from runto run, etc etc etc, and here we are.

I grant you maybe it's a weird approach. But after two false starts,where I was starting to boil the oceans with ABCs and "node issatisfied" APis and separate iterator objects, I had a re-think and hiton this super lightweight design. I'm actually quite pleased withit--it's short, it has a minimal API, it's readable, it was easy to getright, and it solves my problem.



Happy new year,


//arry/

_______________________________________________
Python-Dev mailing list -- python-dev@python.org
To unsubscribe send an email to python-dev-le...@python.org
https://mail.python.org/mailman3/lists/python-dev.python.org/
Message archived at 
https://mail.python.org/archives/list/python-dev@python.org/message/ZV2MXPJ4X7KE7ISJ3N664EVYMYPRMJHX/
Code of Conduct: http://python.org/psf/codeofconduct/

[Python-Dev] Re: Should set objects maintain insertion order too?

Reply via email to