← Back to team overview

maria-discuss team mailing list archive

Re: mariadb 10.0.17 keeps crashing

 

using systemtap I got

[Tue Mar  3 18:48:46 2015] SIGKILL was sent to ps (pid:9763) by processes
uid:0
[Tue Mar  3 18:49:46 2015] SIGKILL was sent to ps (pid:9824) by processes
uid:0
*[Tue Mar  3 18:50:44 2015] SIGKILL was sent to mysqld (pid:7597) by mysqld
uid:498*
[Tue Mar  3 18:50:46 2015] SIGKILL was sent to ps (pid:9890) by processes
uid:0
[Tue Mar  3 18:51:46 2015] SIGKILL was sent to ps (pid:9997) by processes
uid:0
[Tue Mar  3 18:52:35 2015] SIGKILL was sent to cdm (pid:10045) by cdm uid:0


I saw this several times but I wasn't able to find (in this case) which
process has the PID 498. @jocelyn, you are right...to tweaks at all for
tokudb (my bad here), although I wasn't expecting such *naive* approach, I
mean, I could have not enable the tokudb plugin and all other tables using
the other engines would work just fine (maybe just slow though)

free -m returns about 47M free...which isn't much...

the issue is that if I need to provide tokudb more RAM I will need to take
if from the InnoDB buffer pool, which we use a lot...the main reason to use
tokudb was to "compress" some huge tables that are not transactional nor
readed all the time.




On Tue, Mar 3, 2015 at 4:06 PM, Justin Swanhart <greenlion@xxxxxxxxx> wrote:

> Hi,
>
> Presumably, if the server is using too much ram it will a) crash and leave
> a stack trace (not happening) or b) invoke the OOMPK which leaves a trace
> in syslog, which also, presumably, is not happening, since she can find no
> trace of OOMPK activity.
>
> So memory is a good guess (output of free -m would be useful) I doubt it
> is memory related do to lack of evidence for it.
>
> --Justin
>
> On Tue, Mar 3, 2015 at 9:58 AM, jocelyn fournier <
> jocelyn.fournier@xxxxxxxxx> wrote:
>
>>  Hi,
>>
>> No variables to tweak the tokudb memory allocation ?
>> By default tokudb_cache_size takes half of the total system memory.
>> With 6G allocated to InnoDB buffer pool, and 1G to the MyISAM key buffer,
>> are you sure you have enough memory for TokuDB ?
>>
>>   Jocelyn
>>
>>
>> Le 03/03/2015 17:37, Gabriel Sosa a écrit :
>>
>>  @joocelyn Here is the my.cfg file
>>
>>  @justin I will check that post. Thanks
>>
>>  [mysqld]
>>
>>  #
>> -------------------------------------------------------------------------------
>> # ++++ General
>> #
>> -------------------------------------------------------------------------------
>> datadir                         = /var/lib/mysql/data
>> pid-file                        = /var/run/mysqld/mysqld.pid
>> socket                          = /var/lib/mysql/mysql.sock
>> tmpdir                          = /var/lib/mysql/tmp
>> port                            = 3306
>>
>>  #
>> -------------------------------------------------------------------------------
>> # ++++ Logging
>> #
>> -------------------------------------------------------------------------------
>> log-error                       = /var/lib/mysql/log/mysqld-error.log
>> long_query_time                 = 10
>> slow-query-log-file             =
>> /var/lib/mysql/log/mysqld-queries_slow.log
>> log-slave-updates
>> log-bin                         = /var/lib/mysql/log/bin/slave-bin
>> log-warnings                    = 1
>>
>>  max_relay_log_size              = 200M
>> relay_log_space_limit           = 25000M
>>
>>  #
>> -------------------------------------------------------------------------------
>> # ++++ Network
>> #
>> -------------------------------------------------------------------------------
>> max_connections                 = 3000
>> max_connect_errors              = 1000
>> wait_timeout                    = 120
>> connect_timeout                 = 30
>> interactive_timeout             = 3600
>> slave_net_timeout               = 120
>> back_log                        = 50
>> max_allowed_packet              = 128M
>>
>>
>>  #
>> -------------------------------------------------------------------------------
>> # ++++ Misc
>> #
>> -------------------------------------------------------------------------------
>> # Text Searchs
>> ft_min_word_len                  = 2
>> plugin-load  = ha_tokudb
>>
>>  #
>> -------------------------------------------------------------------------------
>> # ++++ Threads
>> #
>> -------------------------------------------------------------------------------
>> thread_concurrency              = 8
>> thread_cache                    = 64
>>
>>  #
>> -------------------------------------------------------------------------------
>> # ++++ Memory
>> #
>> -------------------------------------------------------------------------------
>> # Tables
>> table_cache                     = 4096
>> tmp_table_size                  = 256M
>> # Memory per Thread
>> sort_buffer_size                = 8M
>> read_buffer_size                = 4M
>> read_rnd_buffer_size            = 16M
>> # Query Cache
>> query_cache_type                = 1
>> query_cache_limit               = 2M
>> query_cache_size                = 64M
>>
>>  #
>> -------------------------------------------------------------------------------
>> # ++++ MyISAM Parameters
>> #
>> -------------------------------------------------------------------------------
>> key_buffer_size                 = 1024M
>> myisam_sort_buffer_size         = 64M
>> myisam_recover                  = FORCE,BACKUP
>>
>>  #
>> -------------------------------------------------------------------------------
>> # ++++ InnoDB Parameters
>> #
>> -------------------------------------------------------------------------------
>> # General
>> innodb_data_home_dir            = /var/lib/mysql/innodb
>> innodb_log_group_home_dir       = /var/lib/mysql/innodblogs
>> innodb_file_per_table
>> innodb_data_file_path           = ibdata1:100M:autoextend
>> innodb_status_file              = ib_status
>> innodb_autoextend_increment     = 10M
>> innodb_support_xa               = 0
>> innodb_thread_concurrency       = 8
>> innodb_flush_method             = O_DIRECT
>> innodb_flush_log_at_trx_commit  = 2
>> # Memory
>> innodb_buffer_pool_size         = 6G
>> innodb_additional_mem_pool_size = 8M
>> innodb_open_files               = 512
>> # Logging
>> innodb_log_buffer_size          = 8M
>> innodb_log_file_size            = 256M
>> innodb_log_files_in_group       = 2
>>
>>
>>  #
>> -------------------------------------------------------------------------------
>> # ++++ Replication : SLAVE Profile
>> #
>> -------------------------------------------------------------------------------
>> #skip-slave-start
>> server-id                       = 124388
>> relay-log                       = /var/lib/mysql/log/replication/slave-bin
>> relay-log-info-file             = /var/lib/mysql/log/replication/
>> slave-log.info
>> master-info-file                = /var/lib/mysql/log/replication/
>> master-log.info
>> max_binlog_size                 = 20971520
>>
>>  read-only                       = 1
>>
>>  [mysqldump]
>> quick
>> max_allowed_packet              = 128M
>>
>>  [isamchk]
>> key_buffer                      = 256M
>> sort_buffer_size                = 256M
>> read_buffer                     = 2M
>> write_buffer                    = 2M
>>
>>  [myisamchk]
>> key_buffer                      = 256M
>> sort_buffer_size                = 256M
>> read_buffer                     = 2M
>> write_buffer                    = 2M
>>
>>  [mysqlhotcopy]
>> interactive-timeout
>>
>> On Tue, Mar 3, 2015 at 12:56 PM, Justin Swanhart <greenlion@xxxxxxxxx>
>> wrote:
>>
>>> Hi,
>>>
>>>  You might try to find the source of the termination with this:
>>>
>>> http://www.percona.com/blog/2014/07/18/systemtap-solves-phantom-mysqld-sigterm-sigkill-issue/
>>>
>>> On Tue, Mar 3, 2015 at 8:45 AM, jocelyn fournier <
>>> jocelyn.fournier@xxxxxxxxx> wrote:
>>>
>>>>  Hi,
>>>>
>>>> Could you send your my.cnf ?
>>>>
>>>> Thanks,
>>>>   Jocelyn
>>>>
>>>> Le 03/03/2015 16:24, Gabriel Sosa a écrit :
>>>>
>>>>   Hello,
>>>>
>>>>  I've been a proudly user of mariadb 5.5.x for a long time now  but
>>>> given the nice feature setI decided to give mariadb 10.x a try.
>>>>
>>>>  I took one of our current slaves running mariadb 5.5.x (on centos
>>>> 6.5) and followed the upgrade steps using yum and ran *mysql_upgrade*. The
>>>> upgrade ran without any trouble...then I moved a huge table (about 1B
>>>> records right now) from InnoDB to TokuDB.
>>>>
>>>>  Now, every couple of hours I find that the replication is far behind
>>>> the master and the reason is because this server keeps checking tables
>>>> marked as crashed....
>>>>
>>>>  I can't seems to find any indicator of OOM killer in the system logs
>>>> NOR anything related to that in the mysql log, the only clue I have is:
>>>>
>>>>
>>>>  ---------------
>>>> *150303 10:08:57 mysqld_safe Number of processes running now: 0*
>>>> *150303 10:08:57 mysqld_safe mysqld restarted*
>>>> 150303 10:08:58 [Warning] 'THREAD_CONCURRENCY' is deprecated and will
>>>> be removed in a future release.
>>>> 150303 10:08:59 [Warning] option 'innodb-status-file': boolean value
>>>> 'ib_status' wasn't recognized. Set to OFF.
>>>> 150303 10:08:59 [Warning] option 'innodb-autoextend-increment':
>>>> unsigned value 10485760 adjusted to 1000
>>>> 150303 10:08:59 [Note] InnoDB: Using mutexes to ref count buffer pool
>>>> pages
>>>> 150303 10:08:59 [Note] InnoDB: The InnoDB memory heap is disabled
>>>> 150303 10:08:59 [Note] InnoDB: Mutexes and rw_locks use GCC atomic
>>>> builtins
>>>> 150303 10:08:59 [Note] InnoDB: Memory barrier is not used
>>>> 150303 10:08:59 [Note] InnoDB: Compressed tables use zlib 1.2.3
>>>> 150303 10:08:59 [Note] InnoDB: Using Linux native AIO
>>>> 150303 10:08:59 [Note] InnoDB: Using CPU crc32 instructions
>>>> 150303 10:08:59 [Note] InnoDB: Initializing buffer pool, size = 6.0G
>>>> 150303 10:08:59 [Note] InnoDB: Completed initialization of buffer pool
>>>> 150303 10:08:59 [Note] InnoDB: Highest supported file format is
>>>> Barracuda.
>>>> 150303 10:08:59 [Note] InnoDB: Log scan progressed past the checkpoint
>>>> lsn 14178608217047
>>>> 150303 10:08:59 [Note] InnoDB: Database was not shutdown normally!
>>>> 150303 10:08:59 [Note] InnoDB: Starting crash recovery.
>>>> 150303 10:08:59 [Note] InnoDB: Reading tablespace information from the
>>>> .ibd files...
>>>> 150303 10:09:05 [Note] InnoDB: Restoring possible half-written data
>>>> pages
>>>> 150303 10:09:05 [Note] InnoDB: from the doublewrite buffer...
>>>> InnoDB: Doing recovery: scanned up to log sequence number 14178613459456
>>>> InnoDB: Doing recovery: scanned up to log sequence number 14178618702336
>>>> InnoDB: Doing recovery: scanned up to log sequence number 14178623945216
>>>> InnoDB: Doing recovery: scanned up to log sequence number 14178629188096
>>>> InnoDB: Doing recovery: scanned up to log sequence number 14178634430976
>>>> InnoDB: Doing recovery: scanned up to log sequence number 14178639673856
>>>> InnoDB: Doing recovery: scanned up to log sequence number 14178644916736
>>>> ......
>>>> ......
>>>> ......
>>>> ......
>>>>  150303 10:09:14 [Note] InnoDB: Starting an apply batch of log records
>>>> to the database...
>>>> InnoDB: Progress in percent: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
>>>> 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41
>>>> 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66
>>>> 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91
>>>> 92 93 94 95 96 97 98 99
>>>> InnoDB: Apply batch completed
>>>> InnoDB: In a MySQL replication slave the last master binlog file
>>>> InnoDB: position 0 16433109, file name slave-bin.852141
>>>> InnoDB: Last MySQL binlog file position 0 16784214, file name
>>>> /var/lib/mysql/log/bin/slave-bin.005211
>>>> 150303 10:09:34 [Note] InnoDB: 128 rollback segment(s) are active.
>>>> 150303 10:09:34 [Note] InnoDB: Waiting for purge to start
>>>> 150303 10:09:34 [Note] InnoDB:  Percona XtraDB (http://www.percona.com)
>>>> 5.6.22-72.0 started; log sequence number 14178834821658
>>>> Tue Mar  3 10:09:36 2015 TokuFT recovery starting in env
>>>> /var/lib/mysql/data/
>>>> Tue Mar  3 10:09:36 2015 TokuFT recovery scanning backward from 1330297
>>>> Tue Mar  3 10:09:36 2015 TokuFT recovery bw_end_checkpoint at 1330297
>>>> timestamp 1425395277901416 xid 1330281 (bw_newer)
>>>> Tue Mar  3 10:09:36 2015 TokuFT recovery bw_begin_checkpoint at 1330281
>>>> timestamp 1425395273487081 (bw_between)
>>>> Tue Mar  3 10:09:36 2015 TokuFT recovery turning around at begin
>>>> checkpoint 1330281 time 4414335
>>>> Tue Mar  3 10:09:36 2015 TokuFT recovery starts scanning forward to
>>>> 1330297 from 1330281 left 16 (fw_between)
>>>> Tue Mar  3 10:09:36 2015 TokuFT recovery closing 14 dictionaries
>>>> Tue Mar  3 10:09:36 2015 TokuFT recovery making a checkpoint
>>>> Tue Mar  3 10:09:36 2015 TokuFT recovery done
>>>> 150303 10:09:36 [Note] Recovering after a crash using
>>>> /var/lib/mysql/log/bin/slave-bin
>>>>
>>>>  ---------------
>>>>
>>>>  The odd thing is that the 5.5.x version was working just fine (since
>>>> about a year) in the same hardware...nothing changed in that front.
>>>>
>>>>  Any clue?
>>>>
>>>>  Thank you in advance
>>>>
>>>>
>>>>
>>>>  _______________________________________________
>>>> Mailing list: https://launchpad.net/~maria-discuss
>>>> Post to     : maria-discuss@xxxxxxxxxxxxxxxxxxx
>>>> Unsubscribe : https://launchpad.net/~maria-discuss
>>>> More help   : https://help.launchpad.net/ListHelp
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> Mailing list: https://launchpad.net/~maria-discuss
>>>> Post to     : maria-discuss@xxxxxxxxxxxxxxxxxxx
>>>> Unsubscribe : https://launchpad.net/~maria-discuss
>>>> More help   : https://help.launchpad.net/ListHelp
>>>>
>>>>
>>>
>>
>>
>>  --
>> Gabriel Sosa
>> Sometimes the questions are complicated and the answers are simple. --
>> Dr. Seuss
>>
>>
>>
>


-- 
Gabriel Sosa
Sometimes the questions are complicated and the answers are simple. -- Dr.
Seuss

Follow ups

References