openstack team mailing list archive

Re: Performance diagnosis of metadata query

To: openstack@xxxxxxxxxxxxxxxxxxx
From: David Kranz <david.kranz@xxxxxxxxxx>
Date: Thu, 29 Mar 2012 13:22:34 -0400
In-reply-to: <CAFoXKmp9FUj4ngHoJWt1bZ+=0Q5QYS78ZPyseURhzF1q2Kr2vg@mail.gmail.com>
User-agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:9.0) Gecko/20111222 Thunderbird/9.0.1

On 3/29/2012 12:46 PM, Justin Santa Barbara wrote:

    Is there a good way to map back where in the code these calls are
    coming from?
There's not a great way currently. I'm trying to get a patch in for Essex which will let deployments easily turn on SQL debugging (though this is proving contentious); it will have a configurable log level to allow for future improvements, and one of the things I'd like to add later is something like a stack trace on 'problematic' SQL (large row count, long query time). But that'll be in Folsom, or in G if we don't get logging into Essex.
In the meantime, it's probably not too hard to follow the code and infer where the calls are coming from. In the full log, there's a bit more context, and I've probably snipped some of that out; in this case the relevant code is get_metadata in the compute API service and get_instance_nw_info in the network service.
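
As a rough illustration of the "stack trace on problematic SQL" idea above: this is not the patch under discussion (which is not shown in this thread), just a minimal sketch using Python's standard sqlite3 module. The helper name traced_query and both thresholds are hypothetical.

```python
import logging
import sqlite3
import time
import traceback

LOG = logging.getLogger("sql.debug")

# Hypothetical thresholds, chosen only for illustration.
SLOW_QUERY_SECONDS = 0.5
LARGE_ROW_COUNT = 1000


def traced_query(conn, sql, params=(), slow=SLOW_QUERY_SECONDS,
                 large=LARGE_ROW_COUNT):
    """Run a query and log the caller's stack if it looks problematic.

    "Problematic" here means the query took at least `slow` seconds
    or returned at least `large` rows.
    """
    start = time.monotonic()
    rows = conn.execute(sql, params).fetchall()
    elapsed = time.monotonic() - start
    if elapsed >= slow or len(rows) >= large:
        # Drop the last frame (this helper) so the trace ends at the caller.
        stack = "".join(traceback.format_stack()[:-1])
        LOG.warning("problematic SQL (%d rows, %.3fs): %s\n%s",
                    len(rows), elapsed, sql, stack)
    return rows
```

With something like this in place, the log entry for a large or slow query carries the call path that issued it, so there is no need to infer it by reading the code.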
    Regardless, large table scans should be eliminated, especially if
    the table is mostly read, as the hit on an extra index on insert
    will be completely offset by the speedups on select.
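
To make the table-scan point concrete, here is a small sketch using Python's standard sqlite3 module. The table layout, column names, and index name are made up for illustration and are not taken from the actual Nova schema; the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Illustrative schema only; not the real Nova table definition.
conn.execute(
    "CREATE TABLE instance_metadata ("
    "id INTEGER PRIMARY KEY, instance_id INTEGER, "
    "meta_key TEXT, meta_value TEXT)"
)
conn.executemany(
    "INSERT INTO instance_metadata (instance_id, meta_key, meta_value) "
    "VALUES (?, ?, ?)",
    [(i % 1000, "k", "v") for i in range(10000)],
)

query = ("SELECT meta_key, meta_value FROM instance_metadata "
         "WHERE instance_id = ?")


def plan(conn, sql, params):
    """Return the query planner's description as a single string."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The human-readable detail is the last column of each row.
    return " ".join(row[-1] for row in rows)


before = plan(conn, query, (42,))   # full table scan
conn.execute(
    "CREATE INDEX ix_instance_metadata_instance_id "
    "ON instance_metadata (instance_id)"
)
after = plan(conn, query, (42,))    # index lookup
print(before)
print(after)
```

The first plan reports a scan of the whole table; after the index is created, the same query is answered by an index search on instance_id.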


Agreed - some of these problems are very clear-cut!
It does frustrate me that we've done so much programming work, but then not done the simple stuff at the end to make things work well. It feels a bit like we're shipping C code which we've compiled with -O0 instead of -O3.

Well, in a project with the style of fixed-date release (short-duration train-model) that OpenStack has, I think we have to accept that there will never be time to do anything except fight critical bugs "at the end". At least not until the project code is much more mature. In projects I have managed we always allocated time at the *beginning* of a release cycle for fixing some backlogged bugs and performance work. There is less pressure and the code is not yet churning. It is also important to have performance benchmark tests to make sure new features do not introduce performance regressions.
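
A benchmark test of this kind can be quite small. The following is only a sketch of the idea, with hypothetical helper names (best_time, check_budget) and a budget that would have to be calibrated per operation and per test machine:

```python
import time


def best_time(func, repeat=5):
    """Return the fastest wall-clock time of `repeat` calls to func."""
    best = float("inf")
    for _ in range(repeat):
        start = time.perf_counter()
        func()
        best = min(best, time.perf_counter() - start)
    return best


def check_budget(func, budget_seconds, repeat=5):
    """Fail if the best observed run of func exceeds its time budget."""
    elapsed = best_time(func, repeat)
    if elapsed > budget_seconds:
        raise AssertionError(
            "performance regression: %.4fs exceeds budget of %.4fs"
            % (elapsed, budget_seconds)
        )
    return elapsed
```

Taking the best of several runs reduces noise from other load on the machine, and a generous budget keeps the test from failing spuriously while still catching order-of-magnitude regressions such as a newly introduced table scan.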


 -David

Follow ups

Re: Performance diagnosis of metadata query
From: Justin Santa Barbara, 2012-03-29

References

Performance diagnosis of metadata query
From: Justin Santa Barbara, 2012-03-25
Re: Performance diagnosis of metadata query
From: Sean Dague, 2012-03-29
Re: Performance diagnosis of metadata query
From: Justin Santa Barbara, 2012-03-29