maria-discuss team mailing list archive
-
maria-discuss team
-
Mailing list archive
-
Message #05643
Re: Limited Unicode Support?
-
To:
maria-discuss@xxxxxxxxxxxxxxxxxxx
-
From:
Hartmut Holzgraefe <hartmut@xxxxxxxxxxx>
-
Date:
Thu, 10 Oct 2019 16:12:27 +0200
-
In-reply-to:
<CALC4AYQhFEEk=BnSGsKUcg4+=9o5KO7GWbKDunf4_k6jm7ninA@mail.gmail.com>
-
User-agent:
Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.9.0
On 10.10.19 15:52, Björn Keil wrote:
> SHOW VARIABLES LIKE 'character%':
> character_set_client utf8
> character_set_connection utf8
> character_set_database utf8
> character_set_filesystem binary
> character_set_results utf8
> character_set_server latin1
> character_set_system utf8
> character_sets_dir /usr/share/mysql/charsets/
>
> As you can see here, MariaDB does not take the character '🙋' (
> https://www.fileformat.info/info/unicode/char/1f64b/index.htm ) and
> instead replaces it with four question marks and I have no idea why.
'🙋' is in the extended unicode plane, and so needs 4 bytes when
using UTF-8 encoding. Classic "utf8" charset only supports unicode
code points that use up to 3 bytes when UTF-7 encoded.
For emoticons etc. you need to use the newer utf8mb4 character set
instead.
--
Hartmut Holzgraefe - Principal Support Engineer (EMEA)
MariaDB Corporation - http://www.mariadb.com/
References