maria-developers team mailing list archive

Thread
Date

Re: Ideas for improving MariaDB/MySQL replication

To: maria-developers@xxxxxxxxxxxxxxxxxxx
From: seppo.jaakola@xxxxxxxxxxxxx
Date: Mon, 15 Mar 2010 22:34:49 +0200
In-reply-to: <873a01lxey.fsf@knielsen-hq.org>
User-agent: Internet Messaging Program (IMP) H3 (4.3)

Hi Kristian,

I agree with Alex's response, and I'll pick the hopefully all theremaining questions to answer here.


Quoting Kristian Nielsen <knielsen@xxxxxxxxxxxxxxx>:

So the basic for such an interface would be the ability to installhooks to be

called with row data for every handler::write_row(), handler::update_row(),
and handler::delete_row() invocation, just like the current row-based
binlogging does. And similar for SQL statement execution like statement-based
logging does now. That should be clear enough.

Then comes the need to hook into transaction start and commit and opening
tables.  At this point, more of the internals of the MySQL server start to
appear, and some careful thought will be needed to get an interface that
exposes enough that plugins can do what they need, without exposing too much
internal details of how MySQL query execution is implemented.


Yes, that's a good plan.

(But note that this is two different issues regarding "internal
implementations". One is how the *query execution* is implemented. The other
is how the *plugins* are implemented. If I understood you correctly, the
interface used for semisync in MySQL fails on the latter point).

One example of how a lot of details from query execution pop up iswith regard

to the mixed-mode binlogging. This is where queries are logged as statements
when this is safe, and as row events when this is not safe (nondeterministic
queries). The concept of "mixed mode binlogging" certainly seems like
something that should be an implementation detail of the plugin, not part of
the interface. On the other hand, determining whether a query is safe for
statement-based logging is highly complex, and exposing enough of the server
for the plugin to be able to determine this by itself may be too much. (Maybe
just expose an is_safe_for_statement() function to plugins could be enough).

Mixed mode replication (or binlog format, as called in MySQL code), issomething I would leave completely for the DBMS to decide about. Thereplication plugin should offer calls for SQL and ROW levelreplication, and DBMS just decides which one to call in each case.Note that replication plugin has it next to impossible to judge ifpassed SQL statement is valid to be replicated directly or if ROWevent should be used (this decision would require parsing inreplicator...).

In general, it is better if replicator does not need to look insidereplication events at all. (ok, there are requirements for SQL levelfiltering heterogeneous replication, query rewriting etc..., which maybe valid use cases as well)

Another example of hairy details is all the extra information thatcan go with

an SQL statement into the binary log. Things like current timestamp, random
seed, user-set @variables, etc. To support a statement-based replication
plugin, we probably have to expose all of this on the interface in a clean
fashion.

SQL level replication requires that session context will bemaintained, passed for the replicator and enforced in the applyingside. This will be a bit complicated to implement, but is somethingthat cannot be avoided. Support for session context management must inreplication API, but the context can be presented as opaque object andreplicator does not need to know about the details.



Seppo
--
http://www.codership.com  seppo.jaakola@xxxxxxxxxxxxx
tel: +358 40 510 5938 skype: seppo_jaakola

References

Ideas for improving MariaDB/MySQL replication
From: Kristian Nielsen, 2010-01-22
Re: Ideas for improving MariaDB/MySQL replication
From: Alex Yurchenko, 2010-01-22
Re: Ideas for improving MariaDB/MySQL replication
From: Kristian Nielsen, 2010-01-22
Re: Ideas for improving MariaDB/MySQL replication
From: Alex Yurchenko, 2010-01-23
Re: Ideas for improving MariaDB/MySQL replication
From: Kristian Nielsen, 2010-01-25
Re: Ideas for improving MariaDB/MySQL replication
From: Alex Yurchenko, 2010-01-25
Re: Ideas for improving MariaDB/MySQL replication
From: Kristian Nielsen, 2010-03-15