maria-discuss team mailing list archive

Thread
Date

Re: Is disabling doublewrite safe on ZFS?

To: Reinis Rozitis <r@xxxxxxx>, maria-discuss@xxxxxxxxxxxxxxxxxxx
From: Gionatan Danti <g.danti@xxxxxxxxxx>
Date: Mon, 20 Aug 2018 18:29:06 +0200
In-reply-to: <000001d43887$1fc98d30$5f5ca790$@roze.lv>
Organization: Assyoma s.r.l.
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1

On 20/08/2018 15:10, Reinis Rozitis wrote:

Hi all,
anyone with some suggestion/insight on the matter?


While I can't comment on the intricacies or internals of MySQL being (un)able to recover after a crash without the doublewrite buffer, if you skim through the changelog between versions (be that upstream Oracle or downstream in Maria/Percona), nearly every second (even minor) version has some sort of dataloss/corruption/segfault type of bug. Just for example a recent comes into mind https://jira.mariadb.org/browse/MDEV-15764


D'oh! [1]

 From my experience I've been switching off doublewrite on MySQL (even on XFS and now on ZFS (because of compression)) for years and even in the few accidental powerloss/total crash cases I haven't seen a corruption caused by an unexpected reboot (possible write lost midflight). Most times mysql hasn't been unable to start just because of internal issues (which you solve by having slaves and backups).

Uhm, powerloss and segfault/segkill (ie: process crash) are quitedifferent. The first means *any* activity is stopped (ie: filesystem hasno means to write anything), while the latter means *mysqld* stopswriting but the filesystem can write the partial data received.

My point being - zfs in principle is the same as the "atomic write hardware" (eg either the block writes succeed fully or not at all) so if you turn off doublewrite on those fancy Fusionio cards, I don't see a reason why you can't do the same on zfs.

What I means is that while a ZFS write is an all-or-nothing affair, itcan write partial data from the application (mysqld) point of view. Whatit needs is a partial data from the application itself (ie: a crashingmysqld) - garbage in, garbage out. Doublewrite *shoult* catch that (ie:at restarting, mysqld would read the double buffer, detect it as corruptand discard it while not touching at all any previous data on maindatabase files).

My understanding (which *can* be wrong) is that MariaDB "atomic writesupport" is a mean to inform the underlying device of the entire writeprocess and to keep new data on "spare" location until the applicationitself (mysqld) commits the *entire*, verified write, enabling thehardware device to atomically swap/remap the affected data locations. Inthis case, a failed mysqld process will never reach the "commit phase",leaving the old data untouched.

Even if there are some edge cases where it could become "unsafe" most of the time you still run with better performance and considering the SSD wear level the hardware could fail (reach end of life) ~two times sooner ;)


Good point, this surely is a factor to evaluate.


p.s. sorry for the mail not being about the particular technical aspects rather than general thoughts


They are greatly appreciated!
Thanks.


[1] https://en.wikipedia.org/wiki/D%27oh!

--
Danti Gionatan
Supporto Tecnico
Assyoma S.r.l. - www.assyoma.it
email: g.danti@xxxxxxxxxx - info@xxxxxxxxxx
GPG public key ID: FF5F32A8

Follow ups

Re: Is disabling doublewrite safe on ZFS?
From: Reinis Rozitis, 2018-08-20

References

Is disabling doublewrite safe on ZFS?
From: Gionatan Danti, 2018-08-14
Re: Is disabling doublewrite safe on ZFS?
From: Vladislav Vaintroub, 2018-08-14
Re: Is disabling doublewrite safe on ZFS?
From: Gionatan Danti, 2018-08-14
Re: Is disabling doublewrite safe on ZFS?
From: Vladislav Vaintroub, 2018-08-16
Re: Is disabling doublewrite safe on ZFS?
From: Gionatan Danti, 2018-08-17
Re: Is disabling doublewrite safe on ZFS?
From: Gionatan Danti, 2018-08-20
Re: Is disabling doublewrite safe on ZFS?
From: Reinis Rozitis, 2018-08-20