launchpad-dev team mailing list archive

Thread
Date

Re: The with statement

To: Barry Warsaw <barry@xxxxxxxxxxxxx>
From: Jeroen Vermeulen <jtv@xxxxxxxxxxxxx>
Date: Tue, 02 Feb 2010 18:21:04 +0100
Cc: Launchpad Developers <launchpad-dev@xxxxxxxxxxxxxxxxxxx>
In-reply-to: <20100127123104.3292fa5d@freewill.wooz.org>
User-agent: Thunderbird 2.0.0.23 (X11/20090817)

Barry Warsaw wrote:

A common question is whether to use these to manage transactions.  Jeroen and
I have debated this one.  It's worked well for me in my Storm-based
non-Launchpad applications, but Jeroen doesn't like their semantics for this.
I'll let him take it from there.

I've got nothing against the keyword per se. The common, simple "freemy resource" use-case will be convenient. Using it for that has myunreserved blessing.

(Irrelevant gripes: it's a thin layer of sugar; it doesn't scale wellfor more than resource; cleanup requirements should be encapsulated inthe type not the using code; phew, that's off my chest).

My objections relate to how exception handling is designed into theprotocol. It basically assumes that the code inside the "with" block iswritten to suit the resource's wishes, not the other way around.

I would very much like us to stay away from any use of "with" beyond asubstitute for "finally." If that becomes impossible, we should figureout some very limited usage patterns that work well for us. The "with"blocks should always do something that we can understand and predict.The language does not enforce that, and I see that as a maintenance risk.


Rant follows; most people will want to stop reading here.

To me, the way the protocol deals with exceptions smells ofover-specific design. An API like that can trick people intounnecessary complexity when they should be ignoring the extra knobs anddials. It's like the Unmount / Eject / Safely remove drive options inNautilus: in theory they let you do it just right, but in practice itall depends on the specific device and you just get more ways to do itwrong.

The justification for this design was transactions: it lets you writetransaction classes don't need explicit commits or aborts, because theycan see for themselves if the code block exits normally or with anexception.

That is not a design I like. Maybe it's just because I'm Dutch, butcommit should be explicit. That's your finish line. You may haveseparate exception handling around it depending on special needs in yourcode. Aborts can safely be implicit, partly because they shouldn'traise exceptions anyway, and you may want to do them even when noexception comes up.

So the primary use-case is one that may seem sensible, but definitelynot an approach I'd take. I'd go for an explicit commit; the "with"exit handler would abort if commit was not reached, regardless ofexceptions. The design in the PEP makes the finish line invisible, butat the same time stakes everything on whether you cross that invisibleline normally or by exception. And possibly the type of the exception.And possibly other details of the exception object. Or maybe it justinspects your call tree and figures out what it thinks you wanted. Thisadds a whole new dimension to a simple API that sits in yourerror-handling paths (traditionally the most numerous and leastwell-tested paths in an application) and that needs to be carefullydesigned and documented, not yet supported by documentation tools AFAIK,all justified by a doubtful use-case.

And then on flip side of the coin, the "with" keyword also implicitlyswallows or propagates exceptions, at the resource's discretion. Sayyou have a transaction manager that commits if your "with" block exitsnormally, or aborts if it raises. Now, what does this code do?


with my_transaction_manager():
  with some_resource():
    foo()

Will the transaction abort when foo() raises an exception? It alsodepends on the handler for some_resource, and how it feels about theexception. To deal with that properly, a resource should probably havecustom exception types to signal different exit conditions to its own"with" handler. But even then:


def foo():
    with some_resource() as x:
        bar(x)

def bar(x):
    with some_resource() as y:
        x.splat()
        y.splat()

Say one of the splat()s raises a custom exception. How sure are youthat it gets to the right cleanup? It could be handled by y, or by xand y both; the language doesn't say. Which behavior do you want? Thatdepends on the code in foo and bar, but it's actually dictated bysome_resource. It's up to some_resource to implement and documentsomething sensible: handle-and-raise, handle-and-swallow, raise up tothe handler for either x or y depending on which raised the exceptionand either swallow there or raise a different exception type, etc.Whatever it chooses to do may or may not be what you need in yourparticular piece of code. Actually the 3rd case is probably the mostsensible, but the "with" API discourages that one compared to the othertwo. The author of some_resource may go with one of the easy optionsand come up with a better one later, breaking your code.

So if you use the exception-handling parts of the protocol, "with" isbasically an alias for multiple different control structures. It hidesthe choice of control flow for your code inside the type. We need to bevery conservative with this, or risk losing sight of our exception handling.



Jeroen

Follow ups

Re: The with statement
From: Barry Warsaw, 2010-02-10

References

The with statement
From: Barry Warsaw, 2010-01-27