launchpad-dev team mailing list archive

Thread
Date

Re: performance tuesday - the rabbit has landed

To: Robert Collins <robertc@xxxxxxxxxxxxxxxxx>
From: Jeroen Vermeulen <jtv@xxxxxxxxxxxxx>
Date: Sat, 14 May 2011 10:06:02 +0700
Cc: Launchpad Community Development Team <launchpad-dev@xxxxxxxxxxxxxxxxxxx>
In-reply-to: <BANLkTinagp64kJA5ZGhNN2RPk9R9SXm0Gg@mail.gmail.com>
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.17) Gecko/20110424 Thunderbird/3.1.10

On 2011-05-11 10:13, Robert Collins wrote:

I suspect an easy migration target if folk want one would be to
migrate all the fire-and-forget jobs to trigger via rabbit (leaving
the payload in the db), by hooking a 'do it now' message into the
post-transaction actions in zope.

It's exciting news. We'll want to be careful in migrating jobs though:IIRC rabbit is nontransactional. That means we'll still need some wayfor consumers of jobs to recognize cases where the producer transactionaborted after firing off the job.

In some of those cases, executing a job unnecessarily won't hurt -- onesthat refresh statistics for example. In others, the job absolutely mustnot execute.

Without having looked into it properly, I think we'll need some kind ofwrapper to support this distinction. Traditional transactionalmessaging uses two-phase commit; other products use database queuessimilar to our Job. Both are probably overweight to the point where ourbaby would go out with the bathwater. We could fake it by queuing upjobs in memory and sending them after commit, but that leaves open awindow for message loss.

Another problem happens when things work too well: you create adatabase-backed object. You fire off a job related to that object. Youcommit. But the consumer of that job picks it up before your commit haspropagated and boom! The job dies in flames because it accesses objectsthat aren't decisively in the database yet.

I imagine both problems go away if every message carries a databasetransaction id, and the job runner keeps an eye on the databasetransaction log: the runner shouldn't consume a job until the producingtransaction has committed, and it should drop jobs whose producers haveaborted. Is something along those lines built in?



Jeroen

Follow ups

Re: performance tuesday - the rabbit has landed
From: Thomas Hervé, 2011-05-14
Re: performance tuesday - the rabbit has landed
From: Robert Collins, 2011-05-14
Re: performance tuesday - the rabbit has landed
From: Stuart Bishop, 2011-05-14

References

performance tuesday - the rabbit has landed
From: Robert Collins, 2011-05-11