Re: funnel - a multiplexer plugin for mysql-proxy

Hi!

On Feb 6, 2009, at 9:35 AM, nick loeve wrote:

On Thu, Feb 5, 2009 at 8:54 PM, Kay Röpke <Kay.Roepke@xxxxxxx> wrote:
I'd be interested in some of the boundary conditions of your setup:
- how many queries/sec do you have?
- what is the average/stddev of the execution time of those queries?
- how large are the resultsets (approx in bytes or something)?
- how many clients are generally active at the same time (IOW what's the
concurrency)?

[...]
In the architecture above, around 500 clients are connected per slave,
and that can increase slightly at peak times. Those 500 client connections
are doing an average of 1K-1.5K queries per second per slave (at peak
times). Depending on slave hardware, sometimes up to 20% of queries
reach the backlog. We use persistent connections on that arch, so
average new connections per second is pretty low.

Persistent connections are definitely the way to go here, yeah, since every event on the sockets further limits the throughput capacity right now. Based on the numbers, I would say that making the network I/O multithreaded should show a tremendous performance boost, especially with some tuning of the number of worker threads.
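
Roughly, the shape I have in mind is sketched below: one libevent base per worker thread, with the acceptor handing new sockets to the workers over a notification pipe. This is purely illustrative; all of the names are invented for the sketch, and it is not Jan's patch:

    #include <event.h>
    #include <pthread.h>
    #include <unistd.h>
    #include <stdlib.h>

    #define N_WORKERS 4

    typedef struct {
        struct event_base *base;   /* private event loop for this thread */
        int notify_recv;           /* worker end of the notification pipe */
        int notify_send;           /* acceptor end: write new fds here */
        struct event notify_ev;
        pthread_t thread;
    } worker_t;

    static worker_t workers[N_WORKERS];

    /* runs in the worker thread when the acceptor hands over a socket */
    static void on_new_conn(int fd, short ev, void *arg) {
        worker_t *w = arg;
        int client_fd;

        if (read(w->notify_recv, &client_fd, sizeof(client_fd)) == sizeof(client_fd)) {
            /* register client_fd on w->base and drive the usual
             * read_query/read_query_result state machine from here */
        }
    }

    static void *worker_loop(void *arg) {
        worker_t *w = arg;
        event_base_dispatch(w->base);  /* each thread runs its own loop */
        return NULL;
    }

    static void start_workers(void) {
        int i, fds[2];

        for (i = 0; i < N_WORKERS; i++) {
            worker_t *w = &workers[i];
            if (pipe(fds) != 0) abort();
            w->notify_recv = fds[0];
            w->notify_send = fds[1];
            w->base = event_base_new();
            event_set(&w->notify_ev, w->notify_recv, EV_READ | EV_PERSIST,
                      on_new_conn, w);
            event_base_set(w->base, &w->notify_ev);
            event_add(&w->notify_ev, NULL);
            pthread_create(&w->thread, NULL, worker_loop, w);
        }
    }

    /* in the acceptor: round-robin each accepted socket to a worker */
    static void hand_off(int client_fd) {
        static int next = 0;
        worker_t *w = &workers[next++ % N_WORKERS];

        write(w->notify_send, &client_fd, sizeof(client_fd));
    }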

We have around 10 slave architectures with a similar ratio of slaves/clients/queries/timings to the one mentioned above, and quite a few more that have different replication setups and are tuned for a particular purpose.


The reason I'm asking is that I've seen situations where the relative timing of query arrival, the average size of the resultsets, and the execution times were favorable to the current single-threaded implementation, in that you would not really notice a slowdown compared to going to mysqld directly.

In general, I think it is safe to say that the current single-threaded Proxy has the most trouble with high concurrency, very small execution times, and tiny resultsets, because there the added latency is much larger than the time spent on the query itself. It would be nice to see whether this theory holds true for what you are seeing, as well.
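
As a back-of-envelope illustration (numbers invented for the sake of argument): if the Proxy adds t_p = 1 ms of serialized handling per query, a 0.2 ms primary-key lookup slows down by (0.2 + 1) / 0.2 = 6x, while a 200 ms reporting query sees (200 + 1) / 200 ≈ 1.005, i.e. about 0.5% overhead. The smaller the query, the more the fixed per-query latency dominates.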

Yes, that is exactly what we are seeing in our main slave architectures. We have some beefy hardware for our database slaves, but we struggle to push the queries in and out quickly enough to really make the database work hard and take advantage of the number of cores and the memory available. Across all our arches, our biggest bottleneck is connection handling and network I/O. We do not see this problem so much with the architectures tuned for returning larger result sets.

Good to know that the theory holds :)
It all comes down to the single thread, here's hoping we can quickly remedy that.

[...]
Step one sounds similar to what we tried to do within our plugin, but we increasingly had to re-implement parts of the core engine inside our plugin to accommodate multiple threads using libevent. I would be interested in helping out where possible to achieve what you described above.

I just briefly talked to Jan, and it seems he still has a somewhat clean patch from our multithreading efforts. IIUC he's pushing that to Launchpad today, so maybe that helps to show what's involved. Of course it's totally experimental and might prove disastrous if used in production etc. *disclaim* ;)

Implementing a backlog mostly in a plugin is probably rather painful in the long run, so I'd like to see this functionality go into the core proper.
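
For illustration only, the core-side backlog could be as simple as a mutex-protected GQueue of connections waiting for a free backend; the type and function names here are invented for this sketch, not existing mysql-proxy API:

    #include <glib.h>

    /* hypothetical core-side backlog: a FIFO of client connections
     * waiting for a free backend connection */
    typedef struct {
        GQueue *pending;   /* holds network_mysqld_con * (or similar) */
        GMutex *lock;      /* pre-2.32 glib API; assumes g_thread_init() ran */
    } proxy_backlog_t;

    static proxy_backlog_t *backlog_new(void) {
        proxy_backlog_t *b = g_new0(proxy_backlog_t, 1);

        b->pending = g_queue_new();
        b->lock    = g_mutex_new();
        return b;
    }

    static void backlog_push(proxy_backlog_t *b, gpointer con) {
        g_mutex_lock(b->lock);
        g_queue_push_tail(b->pending, con);
        g_mutex_unlock(b->lock);
    }

    /* called whenever a backend connection goes back into the pool;
     * returns NULL if nothing is waiting */
    static gpointer backlog_pop(proxy_backlog_t *b) {
        gpointer con;

        g_mutex_lock(b->lock);
        con = g_queue_pop_head(b->pending);
        g_mutex_unlock(b->lock);
        return con;
    }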

Last night I had a brief look at the funnel diff; here are some observations, in no particular order:

* in funnel_read_auth I would refrain from doing `do { ... } while(0);` just because there's a break statement in there (I guess it was just the fastest way to implement it...); see the small sketch after this list
* we are generally using G_STRLOC instead of __FILE__, __LINE__ now; this is not consistent in the proxy plugin either at the moment, just noting (and it's less to type :)); also illustrated after the list
* the diff shows what I've suspected for a long time: we need to refactor the proxy plugin into more manageable parts to avoid copying around this much code. I've noticed this before in a different plugin (for a completely different purpose, though), but somehow time was always too short to actually do it.
* that you removed all the Lua code gave me an interesting idea: it looks like we can factor that out completely as well, to make it optional, and of course pluggable, for plugin writers. I have some nifty ideas on this already. In general, the amount of code duplication that is currently necessary bothers me greatly.
* the limitations you mentioned in your initial mail regarding the number of backends and different users look relatively easy to lift. Backends: since you always get the first backend, replacing the constant with a function call that picks a backend based on some algorithm should be all that's necessary (a sketch follows the list). Users: if I'm not mistaken (and there's no bug in it), the connection pool code should provide this already.
* moving to a multithreaded network I/O implementation, the code is obviously not 100% safe because of the backends. For now I'd take a global lock when actually handling them (as long as they aren't modified at runtime that should be "safe", barring the fact that the UP/DOWN information has a race condition). This is something we have to fix properly in the core for sure, but clients should be prepared to handle failed queries by re-executing them anyway.
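
To make the first point concrete, this is the pattern I mean, reduced to a skeleton (packet_is_bad stands in for whatever the real checks are):

    /* current shape: a do/while(0) that only exists so `break' works */
    do {
        if (packet_is_bad) break;
        /* ... handle the auth packet ... */
    } while (0);

    /* equivalent, with one level of control flow less */
    if (!packet_is_bad) {
        /* ... handle the auth packet ... */
    }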
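
On the G_STRLOC point: glib's G_STRLOC macro expands to a "file:line" string literal, so the two calls below log the same location:

    #include <glib.h>

    g_critical("%s: unexpected state", G_STRLOC);               /* preferred */
    g_critical("%s:%d: unexpected state", __FILE__, __LINE__);  /* old style */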
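
And for the backends/locking points, the function call I mentioned could look roughly like this: a plain round-robin pick, with the global lock wrapped around the backend list. The types are minimal stand-ins for mysql-proxy's own, so treat the names as assumptions:

    #include <glib.h>

    /* minimal stand-ins for the proxy's backend types, for this sketch */
    typedef enum { BACKEND_STATE_UNKNOWN, BACKEND_STATE_UP, BACKEND_STATE_DOWN } backend_state_t;
    typedef struct { backend_state_t state; /* addr, pool, ... */ } backend_t;

    static GStaticMutex backends_lock = G_STATIC_MUTEX_INIT;

    /* pick the next UP backend round-robin; replaces the hardcoded
     * "first backend" the funnel plugin uses right now */
    static backend_t *pick_backend(GPtrArray *backends) {
        static guint next = 0;
        backend_t *picked = NULL;
        guint i;

        g_static_mutex_lock(&backends_lock);
        for (i = 0; i < backends->len; i++) {
            backend_t *cand = g_ptr_array_index(backends, next++ % backends->len);

            if (cand->state == BACKEND_STATE_UP) {  /* UP/DOWN is racy, see above */
                picked = cand;
                break;
            }
        }
        g_static_mutex_unlock(&backends_lock);

        return picked;
    }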

Other than that, I'm happy to see it took so little code to add this!

Thanks for sharing!


Np, I look forward to more :)

yay :)

Please note we are leaving for a team meeting in the US tomorrow and will be there the next week. I'll try to follow up on stuff, but responses will likely be delayed a bit.

cheers,
-k
--
Kay Roepke
Software Engineer, MySQL Enterprise Tools

Sun Microsystems GmbH    Sonnenallee 1, DE-85551 Kirchheim-Heimstetten
Geschaeftsfuehrer: Thomas Schroeder, Wolfgang Engels, Dr. Roland Boemer
Vorsitz d. Aufs.rat.: Martin Haering                    HRB MUC 161028



