
maria-discuss team mailing list archive

Re: Limited Unicode Support?

 

On 10.10.19 15:52, Björn Keil wrote:

> SHOW VARIABLES LIKE 'character%':
> character_set_client utf8
> character_set_connection utf8
> character_set_database utf8
> character_set_filesystem binary
> character_set_results utf8
> character_set_server latin1
> character_set_system utf8
> character_sets_dir /usr/share/mysql/charsets/
> 
> As you can see here, MariaDB does not take the character '🙋' (
> https://www.fileformat.info/info/unicode/char/1f64b/index.htm ) and
> instead replaces it with four question marks and I have no idea why.

'🙋' (U+1F64B) lies outside the Basic Multilingual Plane, in one of the
supplementary Unicode planes, and so needs 4 bytes when using UTF-8
encoding. The classic "utf8" charset only supports Unicode code points
that use up to 3 bytes when UTF-8 encoded.
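
To make the byte counts concrete, here is a small Python sketch (not
MariaDB code) that prints the UTF-8 length of a few sample characters.
The helper names utf8_len and fits_legacy_utf8 are made up for this
illustration, and the check assumes that the 3-byte limit corresponds
to code points up to U+FFFF.

```python
# Illustration only: how many bytes a character needs in UTF-8, and
# whether it would fit a charset limited to 3 bytes per character.

def utf8_len(ch: str) -> int:
    """Return the number of bytes used to encode ch as UTF-8."""
    return len(ch.encode("utf-8"))

def fits_legacy_utf8(text: str) -> bool:
    """True if every code point is at most U+FFFF (3 bytes or fewer in UTF-8)."""
    return all(ord(ch) <= 0xFFFF for ch in text)

if __name__ == "__main__":
    for ch in ["A", "é", "€", "🙋"]:
        print(f"U+{ord(ch):04X}: {utf8_len(ch)} byte(s), "
              f"fits 3-byte utf8: {fits_legacy_utf8(ch)}")
```

Of the four samples, only '🙋' needs 4 bytes, so it is the only one
that fails the 3-byte check.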

For emoji and other 4-byte characters you need to use the newer
utf8mb4 character set instead.

-- 
Hartmut Holzgraefe - Principal Support Engineer (EMEA)
MariaDB Corporation - http://www.mariadb.com/

