launchpad-dev team mailing list archive

Introduction to and Minutes of yellow squad's weekly postmortem call: 2012-03-30

To: Launchpad Development List <launchpad-dev@xxxxxxxxxxxxxxxxxxx>
From: Gary Poster <gary.poster@xxxxxxxxxxxxx>
Date: Fri, 30 Mar 2012 11:17:01 -0400
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:11.0) Gecko/20120313 Thunderbird/11.0

= Introduction for Launchpad devs =

Yellow squad has had a weekly postmortem call for many months now--perhaps more than a year.

The purpose is to identify successes, problems, and useful tricks; to share them; and to see if we can problem-solve and identify action items. The call started because we were not taking the time to do these things, which was wasteful and had us repeating mistakes and not learning from them.

The call has been successful enough that we have kept at it all this time. We've done some problem analysis, and shared a number of nice tricks among ourselves. Some of the nicer or more notable tricks have led to emails to canonical-tech, sharing what we've discovered with a larger audience.

In the course of writing up notes for the current performance review period, I realized that we still were not doing as good a job on this as I'd like. In particular, last period we had some issues that we did not try to learn from.

I decided to try two small changes to make the call more rigorous--more likely to accomplish its goals.

 * I've expanded our checklist for the call with more pointed questions. You can see them near the top of https://dev.launchpad.net/yellow/ if you are curious.

 * I'm sending the minutes out to a wider audience.  Hello!

We tried the new, expanded checklist for the first time today. It worked well so far. :-) Similarly, today is the first day for our more public minutes. The rest of this email contains the minutes and action items for the call. Please let us know if you'd prefer to have these elsewhere, or if you like having them sent to the launchpad-dev list.


= Minutes of postmortem call 2012-03-30 =

gmb: had to rework a blog post, because it duplicated in part what benji wrote recently (http://blog.launchpad.net/general/parallelising-the-unparallelisable). Lesson to be learned: if multiple blog posts are written simultaneously, coordinate! This is clearly a case of a larger rule: if you work on similar projects simultaneously, coordinate. We try to apply the larger rule to our coding projects, so keep it in mind for everything else we do too.

frankban: the forked zope.testing egg that we have been using in Launchpad for many months had many tests that failed. After work by yellow squad, the upcoming version (p5) only has three tests that fail, but > 0 is a problem because it makes further changes to the egg (like the ones we did for bug 609986) difficult. This led to a large discussion. Points raised include the following.

 * Forking code means that you take responsibility for it, within the context of your project. This includes passing tests, and tests for your changes. Generally, forks hurt more than you think they will.

 * Patching (and patched eggs) is equivalent in that regard; and worse because subsequent developers do not have a branch to work with.

 * Doing a constant upgrade to your dependencies is the right thing to do in general, as advocated recently by someone on canonical-tech. It's like brushing your teeth: if you don't do it, things rot, and it's a lot more expensive to fix later. However, we acknowledged that it can be a very expensive regular process too.

 * We are far behind on zope eggs. Catching up will be expensive, and it is difficult to be motivated given the low activity/participation of the project.

 * Suggested process change: for future projects (e.g. SOA stand-alone projects) explicitly adopt a constant-upgrade policy, setting up expectations and schedules initially.

 * Suggested process change: never patch eggs; fork branches if you must, and make eggs from them.

 * Suggested process change: if you fork, make sure tests pass on branch before you leave, and in general take ownership of the code. Acknowledge that you are making a new package.

 * Action item: Gary will send an email about the discussion to the list. [I'm not sure if this counts or not ;-) but I'll send one under separate cover with this info]

benji: he found tty recorder "ttyrec" (http://0xcc.net/ttyrec/index.html.en), which might be a tool to improve his simple terminal sharing work in slack time (https://dev.launchpad.net/yellow/RemoteTerminalBroadcasting). If you want to play with it, he suggests installing it from apt, not from source, because source has some BSD-isms.

benji: using LXC for dev is nice. It puts up enough isolation between the host and the container to be sufficient for our use, but allows nice sharing. A current downside for him is that LP's make doesn't work until you flail a bit. [It worked for me, but it's been a while since I set it up, and we have seen multiple regressions/fixes over the past couple of months.]

benji: we should test our buildbot/lxc setup daily to watch out for regressions. gary_poster said that, similarly, it was his intent to run parallel tests constantly on ec2 once we got near the end of the project, because one of our goals given by Robert at project inception was to prevent spurious failures. However, gary_poster's AWS bill this month will be close to $400 because of his work and tests on EC2. Benji used http://calculator.s3.amazonaws.com/calc5.html to calculate that having juju buildbot master & slave with eight core machines for all would be almost $1500/month; tricking juju into giving us two small machines and an 8-core would be about $600.

 * Action item: gary_poster will manually run a master and slave today to see how we are doing (LXC bug 968371 was a problem yesterday; we also want to see how many test failures we have now, particularly after adding --shuffle to the test command)

 * Action item: gary_poster will add a card to the kanban board for making an automated juju setup and test run, at bac's suggestion. Maybe we'll run one of these daily. [DONE]

 * Action item: gary_poster will talk to flacoste about EC2 expenses.
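
On the --shuffle option mentioned in the action items above: running tests in a random order tends to expose tests that only pass because of state left behind (or not yet left behind) by a neighbour, which is one common source of spurious failures. The following is only a minimal sketch of that idea using Python's standard unittest module. It is not Launchpad's test runner, and the names LeakyTests, run_in_order and run_shuffled are made up for illustration.

```python
import random
import unittest

# Module-level state that leaks between tests (the bug being illustrated).
_shared = []


class LeakyTests(unittest.TestCase):
    def test_a_expects_empty(self):
        # Passes only if no earlier test has touched _shared.
        self.assertEqual(_shared, [])

    def test_b_appends(self):
        # Leaves state behind for whichever test runs next.
        _shared.append(1)
        self.assertTrue(_shared)


def run_in_order(names):
    """Run the named tests in the given order; return True if all pass."""
    _shared.clear()  # start every run from a clean slate
    suite = unittest.TestSuite([LeakyTests(name) for name in names])
    result = unittest.TestResult()
    suite.run(result)
    return result.wasSuccessful()


def run_shuffled(names, seed):
    """Shuffle with a fixed seed so a failing order can be reproduced."""
    order = list(names)
    random.Random(seed).shuffle(order)
    return order, run_in_order(order)


if __name__ == "__main__":
    names = ["test_a_expects_empty", "test_b_appends"]
    print("default order passes:", run_in_order(names))
    for seed in range(4):
        order, ok = run_shuffled(names, seed)
        print(seed, order, "pass" if ok else "FAIL")
```

Recording the seed alongside each run is the useful part: when a shuffled run fails, the same seed gives the same order, so the failure can be replayed instead of being written off as spurious.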
