mysql-proxy-discuss team mailing list archive

Thread
Date

Re: funnel - a multiplexer plugin for mysql-proxy

To: nick loeve <trickie@xxxxxxxxx>
From: Kay Röpke <Kay.Roepke@xxxxxxx>
Date: Thu, 05 Feb 2009 20:54:26 +0100
Cc: mysql-proxy-discuss@xxxxxxxxxxxxxxxxxxx
In-reply-to: <83c2b2e00902050923t32aacbc0p3b105f887b657911@mail.gmail.com>
Sender: Kay.Roepke@xxxxxxx

Hi Nick!

On Feb 5, 2009, at 6:23 PM, nick loeve wrote:

Hi,

I have created an experimental branch of the mysql-proxy code on
launchpad, in order to show what I have done to implement a connection
multiplexer with backlog for mysql-proxy, and hopefully get some
feedback on our implementation/design. We called the plugin 'funnel'.
[...]
Our existing solution works, but we are looking at how to get more
performance using a plugin to mysql-proxy. The plugin in the branch I
posted accomplishes the main tasks I described above, but there are
more features that we would like to implement, mainly support for more
statistics via the admin plugin. I had to make a few changes to the
core state-machine that handles the front-end/back-end connection
state in order to achieve the backlog.

Very interesting! I will take a closer look at what the actualdifferences to the proxy plugin are later (Launchpad sadly doesn'tmake it easy to diff two files...).

Some assumptions:

We have some hardcoded 'assumptions' in the code base, such as only
ever using one backend (as we always have the funnel sitting in front
of a mysqld on the same host) and we have a single user/database for
most of slave architectures, so multi-user and or complex permissions
may not work correctly. I would like to eventually remove these
limitations/assumptions.


Fair enough for a first version, I'd say.

We are currently testing our plugin in a live environment, and our
benchmarks are proving that the mysql-proxy design is giving us better
capacity and lower average query times at peak traffic times.


I'd be interested in some of the boundary conditions of your setup:
 - how many queries/sec do you have?
 - what is the average/stddev of the execution time of those queries?
 - how large are the resultsets (approx in bytes or something)?

- how many clients are generally active at the same time (IOW what'sthe concurrency)?

The reason I'm asking is because I've seen situations where therelative timing of query arrival, the average size of the resultsetsand the execution times were favorable to the current single threadimplementation in that you would not really notice a slowdown comparedto going to mysqld directly.In general, I think it would be a safe statement to say that for highconcurrency with very small execution times and tiny resultsets thecurrent single threaded Proxy has the most trouble (all because theadded latency is much larger than the time spent for the query itself).It would be nice to see if this theory holds true for what you areseeing, as well.

Im particularly interested in the blueprint on launchpad aboutthreaded

I/O. We did have an attempt at adding a thread pool to our plugin in
order to handle some backlog clearing and some I/O, but without large
changes to the main proxy engine we did not succeed in getting stable
enough to really test out in our high traffic environment.


In fact, Jan and I have met today and talked on this very topic.

Soon we will pick up our efforts in adding multithreading (mostlyrevitalizing old patches).The current plan is the following (and we need to add these to theblueprints after our team meeting next week):

Step 1:
- accept connections on one thread

- have multiple worker threads the accepted filedescriptor gets handedoff to (via a queue)- all subsequent events on this filedescriptor will be handled by thethread it was handed tothis essentially means that all network traffic will be handled bymultiple threadssince we still have a global lock around the Lua state, everythingthat needs to go into Lua will run as before in a single thread


Step 2:
- give each thread its own local Lua state (still sharing the script)
- remove the global mutex

- access to global structures (backends, usually) will need some kindof synchronizationwe would like to use a shared-nothing approach, basically makingcopies of global structures and versioning them (checks can be donewith atomic ints, for example)

  LuaLanes is another alternative.

Step 1 is relatively easy compared to Step 2.

There are few things to take into consideration, of course, even withstep 1.My initial prototype picked a worker thread on every event, whichproved to be extremely heavyweight under high load, mostly because thequeue used was under high contention (reading data from a sockettended to be not much slower than the overhead of putting the eventinto a queue and letting a worker thread pull it out again. it was onehot queue...)The danger with making connections stick to one thread for theirentire lifetime is that one thread might end up with getting all theactive connections, and leave the other threads idling, thus turningthe entire thing into a more or less heavyweight single threadimplementation. I'm not yet sure how to solve this efficiently, but Iguess we will try different approaches before we pick a winner.

Step 2 is not without complications either. Since a copied globalstate would only be "mostly up to date", it's fairly important to pickthe places where we update it. In most cases the global state isrelatively static, but in some applications it might not be, e.g. inload balancing situations where backend weights are a function ofbackend system load, number of queries executed or something alongthose lines.In those situations, it might actually be cheaper to use a mutex toaccess global state rather than copying a lot, but that can lead tohigh lock contention, too. Maybe a non-locking alternative would bebetter, using atomic operations where they are available (currentlyglib is lacking them on HP/UX for PA-RISC iirc, maybe some AIX, too).Atomic ops are not without traps either, of course.In either case, we need to have an implementation to make these kindsof decisions, otherwise the effects are pure speculation.(As an aside: We cannot get away with defining the lua_lock/lua_unlockmacros to acquire/release a mutex because those only make the Luainterpreter itself threadsafe, not what we built on top of it...sadly)


Thanks for sharing!

cheers,
-k
--
Kay Roepke
Software Engineer, MySQL Enterprise Tools

Sun Microsystems GmbH    Sonnenallee 1, DE-85551 Kirchheim-Heimstetten
Geschaeftsfuehrer: Thomas Schroeder, Wolfang Engels, Dr. Roland Boemer
Vorsitz d. Aufs.rat.: Martin Haering                    HRB MUC 161028

Follow ups

Re: funnel - a multiplexer plugin for mysql-proxy
From: nick loeve, 2009-02-06
Mysql Proxy UseCase scenario question
From: zoly farkas, 2009-02-05

References

funnel - a multiplexer plugin for mysql-proxy
From: nick loeve, 2009-02-05