using systemtap I got
[Tue Mar 3 18:48:46 2015] SIGKILL was sent to ps (pid:9763) by
processes uid:0
[Tue Mar 3 18:49:46 2015] SIGKILL was sent to ps (pid:9824) by
processes uid:0
*[Tue Mar 3 18:50:44 2015] SIGKILL was sent to mysqld (pid:7597) by
mysqld uid:498*
[Tue Mar 3 18:50:46 2015] SIGKILL was sent to ps (pid:9890) by
processes uid:0
[Tue Mar 3 18:51:46 2015] SIGKILL was sent to ps (pid:9997) by
processes uid:0
[Tue Mar 3 18:52:35 2015] SIGKILL was sent to cdm (pid:10045) by cdm
uid:0
I saw this several times but I wasn't able to find (in this case)
which process has the PID 498. @jocelyn, you are right...to tweaks at
all for tokudb (my bad here), although I wasn't expecting such /naive/
approach, I mean, I could have not enable the tokudb plugin and all
other tables using the other engines would work just fine (maybe just
slow though)
free -m returns about 47M free...which isn't much...
the issue is that if I need to provide tokudb more RAM I will need to
take if from the InnoDB buffer pool, which we use a lot...the main
reason to use tokudb was to "compress" some huge tables that are not
transactional nor readed all the time.
On Tue, Mar 3, 2015 at 4:06 PM, Justin Swanhart <greenlion@xxxxxxxxx
<mailto:greenlion@xxxxxxxxx>> wrote:
Hi,
Presumably, if the server is using too much ram it will a) crash
and leave a stack trace (not happening) or b) invoke the OOMPK
which leaves a trace in syslog, which also, presumably, is not
happening, since she can find no trace of OOMPK activity.
So memory is a good guess (output of free -m would be useful) I
doubt it is memory related do to lack of evidence for it.
--Justin
On Tue, Mar 3, 2015 at 9:58 AM, jocelyn fournier
<jocelyn.fournier@xxxxxxxxx <mailto:jocelyn.fournier@xxxxxxxxx>>
wrote:
Hi,
No variables to tweak the tokudb memory allocation ?
By default tokudb_cache_size takes half of the total system
memory.
With 6G allocated to InnoDB buffer pool, and 1G to the MyISAM
key buffer, are you sure you have enough memory for TokuDB ?
Jocelyn
Le 03/03/2015 17:37, Gabriel Sosa a écrit :
@joocelyn Here is the my.cfg file
@justin I will check that post. Thanks
[mysqld]
#
-------------------------------------------------------------------------------
# ++++ General
#
-------------------------------------------------------------------------------
datadir = /var/lib/mysql/data
pid-file = /var/run/mysqld/mysqld.pid
socket = /var/lib/mysql/mysql.sock
tmpdir = /var/lib/mysql/tmp
port = 3306
#
-------------------------------------------------------------------------------
# ++++ Logging
#
-------------------------------------------------------------------------------
log-error =
/var/lib/mysql/log/mysqld-error.log
long_query_time = 10
slow-query-log-file =
/var/lib/mysql/log/mysqld-queries_slow.log
log-slave-updates
log-bin =
/var/lib/mysql/log/bin/slave-bin
log-warnings = 1
max_relay_log_size = 200M
relay_log_space_limit = 25000M
#
-------------------------------------------------------------------------------
# ++++ Network
#
-------------------------------------------------------------------------------
max_connections = 3000
max_connect_errors = 1000
wait_timeout = 120
connect_timeout = 30
interactive_timeout = 3600
slave_net_timeout = 120
back_log = 50
max_allowed_packet = 128M
#
-------------------------------------------------------------------------------
# ++++ Misc
#
-------------------------------------------------------------------------------
# Text Searchs
ft_min_word_len = 2
plugin-load = ha_tokudb
#
-------------------------------------------------------------------------------
# ++++ Threads
#
-------------------------------------------------------------------------------
thread_concurrency = 8
thread_cache = 64
#
-------------------------------------------------------------------------------
# ++++ Memory
#
-------------------------------------------------------------------------------
# Tables
table_cache = 4096
tmp_table_size = 256M
# Memory per Thread
sort_buffer_size = 8M
read_buffer_size = 4M
read_rnd_buffer_size = 16M
# Query Cache
query_cache_type = 1
query_cache_limit = 2M
query_cache_size = 64M
#
-------------------------------------------------------------------------------
# ++++ MyISAM Parameters
#
-------------------------------------------------------------------------------
key_buffer_size = 1024M
myisam_sort_buffer_size = 64M
myisam_recover = FORCE,BACKUP
#
-------------------------------------------------------------------------------
# ++++ InnoDB Parameters
#
-------------------------------------------------------------------------------
# General
innodb_data_home_dir = /var/lib/mysql/innodb
innodb_log_group_home_dir = /var/lib/mysql/innodblogs
innodb_file_per_table
innodb_data_file_path = ibdata1:100M:autoextend
innodb_status_file = ib_status
innodb_autoextend_increment = 10M
innodb_support_xa = 0
innodb_thread_concurrency = 8
innodb_flush_method = O_DIRECT
innodb_flush_log_at_trx_commit = 2
# Memory
innodb_buffer_pool_size = 6G
innodb_additional_mem_pool_size = 8M
innodb_open_files = 512
# Logging
innodb_log_buffer_size = 8M
innodb_log_file_size = 256M
innodb_log_files_in_group = 2
#
-------------------------------------------------------------------------------
# ++++ Replication : SLAVE Profile
#
-------------------------------------------------------------------------------
#skip-slave-start
server-id = 124388
relay-log =
/var/lib/mysql/log/replication/slave-bin
relay-log-info-file =
/var/lib/mysql/log/replication/slave-log.info
<http://slave-log.info>
master-info-file =
/var/lib/mysql/log/replication/master-log.info
<http://master-log.info>
max_binlog_size = 20971520
read-only = 1
[mysqldump]
quick
max_allowed_packet = 128M
[isamchk]
key_buffer = 256M
sort_buffer_size = 256M
read_buffer = 2M
write_buffer = 2M
[myisamchk]
key_buffer = 256M
sort_buffer_size = 256M
read_buffer = 2M
write_buffer = 2M
[mysqlhotcopy]
interactive-timeout
On Tue, Mar 3, 2015 at 12:56 PM, Justin Swanhart
<greenlion@xxxxxxxxx <mailto:greenlion@xxxxxxxxx>> wrote:
Hi,
You might try to find the source of the termination with
this:
http://www.percona.com/blog/2014/07/18/systemtap-solves-phantom-mysqld-sigterm-sigkill-issue/
On Tue, Mar 3, 2015 at 8:45 AM, jocelyn fournier
<jocelyn.fournier@xxxxxxxxx
<mailto:jocelyn.fournier@xxxxxxxxx>> wrote:
Hi,
Could you send your my.cnf ?
Thanks,
Jocelyn
Le 03/03/2015 16:24, Gabriel Sosa a écrit :
Hello,
I've been a proudly user of mariadb 5.5.x for a long
time now but given the nice feature setI decided to
give mariadb 10.x a try.
I took one of our current slaves running mariadb
5.5.x (on centos 6.5) and followed the upgrade steps
using yum and ran *mysql_upgrade*. The upgrade ran
without any trouble...then I moved a huge table
(about 1B records right now) from InnoDB to TokuDB.
Now, every couple of hours I find that the
replication is far behind the master and the reason
is because this server keeps checking tables marked
as crashed....
I can't seems to find any indicator of OOM killer in
the system logs NOR anything related to that in the
mysql log, the only clue I have is:
---------------
*150303 10:08:57 mysqld_safe Number of processes
running now: 0*
*150303 10:08:57 mysqld_safe mysqld restarted*
150303 10:08:58 [Warning] 'THREAD_CONCURRENCY' is
deprecated and will be removed in a future release.
150303 10:08:59 [Warning] option
'innodb-status-file': boolean value 'ib_status'
wasn't recognized. Set to OFF.
150303 10:08:59 [Warning] option
'innodb-autoextend-increment': unsigned value
10485760 adjusted to 1000
150303 10:08:59 [Note] InnoDB: Using mutexes to ref
count buffer pool pages
150303 10:08:59 [Note] InnoDB: The InnoDB memory
heap is disabled
150303 10:08:59 [Note] InnoDB: Mutexes and rw_locks
use GCC atomic builtins
150303 10:08:59 [Note] InnoDB: Memory barrier is not
used
150303 10:08:59 [Note] InnoDB: Compressed tables use
zlib 1.2.3
150303 10:08:59 [Note] InnoDB: Using Linux native AIO
150303 10:08:59 [Note] InnoDB: Using CPU crc32
instructions
150303 10:08:59 [Note] InnoDB: Initializing buffer
pool, size = 6.0G
150303 10:08:59 [Note] InnoDB: Completed
initialization of buffer pool
150303 10:08:59 [Note] InnoDB: Highest supported
file format is Barracuda.
150303 10:08:59 [Note] InnoDB: Log scan progressed
past the checkpoint lsn 14178608217047
150303 10:08:59 [Note] InnoDB: Database was not
shutdown normally!
150303 10:08:59 [Note] InnoDB: Starting crash recovery.
150303 10:08:59 [Note] InnoDB: Reading tablespace
information from the .ibd files...
150303 10:09:05 [Note] InnoDB: Restoring possible
half-written data pages
150303 10:09:05 [Note] InnoDB: from the doublewrite
buffer...
InnoDB: Doing recovery: scanned up to log sequence
number 14178613459456
InnoDB: Doing recovery: scanned up to log sequence
number 14178618702336
InnoDB: Doing recovery: scanned up to log sequence
number 14178623945216
InnoDB: Doing recovery: scanned up to log sequence
number 14178629188096
InnoDB: Doing recovery: scanned up to log sequence
number 14178634430976
InnoDB: Doing recovery: scanned up to log sequence
number 14178639673856
InnoDB: Doing recovery: scanned up to log sequence
number 14178644916736
......
......
......
......
150303 10:09:14 [Note] InnoDB: Starting an apply
batch of log records to the database...
InnoDB: Progress in percent: 0 1 2 3 4 5 6 7 8 9 10
11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44
45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61
62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78
79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95
96 97 98 99
InnoDB: Apply batch completed
InnoDB: In a MySQL replication slave the last master
binlog file
InnoDB: position 0 16433109, file name slave-bin.852141
InnoDB: Last MySQL binlog file position 0 16784214,
file name /var/lib/mysql/log/bin/slave-bin.005211
150303 10:09:34 [Note] InnoDB: 128 rollback
segment(s) are active.
150303 10:09:34 [Note] InnoDB: Waiting for purge to
start
150303 10:09:34 [Note] InnoDB: Percona XtraDB
(http://www.percona.com) 5.6.22-72.0 started; log
sequence number 14178834821658
Tue Mar 3 10:09:36 2015 TokuFT recovery starting in
env /var/lib/mysql/data/
Tue Mar 3 10:09:36 2015 TokuFT recovery scanning
backward from 1330297
Tue Mar 3 10:09:36 2015 TokuFT recovery
bw_end_checkpoint at 1330297 timestamp
1425395277901416 xid 1330281 (bw_newer)
Tue Mar 3 10:09:36 2015 TokuFT recovery
bw_begin_checkpoint at 1330281 timestamp
1425395273487081 (bw_between)
Tue Mar 3 10:09:36 2015 TokuFT recovery turning
around at begin checkpoint 1330281 time 4414335
Tue Mar 3 10:09:36 2015 TokuFT recovery starts
scanning forward to 1330297 from 1330281 left 16
(fw_between)
Tue Mar 3 10:09:36 2015 TokuFT recovery closing 14
dictionaries
Tue Mar 3 10:09:36 2015 TokuFT recovery making a
checkpoint
Tue Mar 3 10:09:36 2015 TokuFT recovery done
150303 10:09:36 [Note] Recovering after a crash
using /var/lib/mysql/log/bin/slave-bin
---------------
The odd thing is that the 5.5.x version was working
just fine (since about a year) in the same
hardware...nothing changed in that front.
Any clue?
Thank you in advance
_______________________________________________
Mailing list:https://launchpad.net/~maria-discuss <https://launchpad.net/%7Emaria-discuss>
Post to :maria-discuss@xxxxxxxxxxxxxxxxxxx <mailto:maria-discuss@xxxxxxxxxxxxxxxxxxx>
Unsubscribe :https://launchpad.net/~maria-discuss <https://launchpad.net/%7Emaria-discuss>
More help :https://help.launchpad.net/ListHelp
_______________________________________________
Mailing list: https://launchpad.net/~maria-discuss
<https://launchpad.net/%7Emaria-discuss>
Post to : maria-discuss@xxxxxxxxxxxxxxxxxxx
<mailto:maria-discuss@xxxxxxxxxxxxxxxxxxx>
Unsubscribe : https://launchpad.net/~maria-discuss
<https://launchpad.net/%7Emaria-discuss>
More help : https://help.launchpad.net/ListHelp
--
Gabriel Sosa
Sometimes the questions are complicated and the answers are
simple. -- Dr. Seuss
--
Gabriel Sosa
Sometimes the questions are complicated and the answers are simple. --
Dr. Seuss