
Re: connection handling is buggy (HY000/2002): Resource temporarily unavailable)

On 17.07.2017 15:02, Vladislav Vaintroub wrote:


On 17.07.2017 14:19, Reindl Harald wrote:
i can reproduce that also on the 12-core production server; in that case only 250 connections are failing, and here too only with keep-alive requests to the webserver, otherwise there is not enough load

i guess "futex(0x7f65e917eae0, FUTEX_WAIT_PRIVATE, 2, NULL) = -1 EAGAIN (Resource temporarily unavailable)" is the culprit but why needs that to be exposed to the client as connection error instead just let him wait and try internally again?
You are getting HY000/2002; the range starting with 2000 (and ending, iirc, at 3000) is for errors originating on the client. If I were to guess, it is a client-side connection timeout. The threadpool throttles thread creation, which means the following: if all threads are busy, the threadpool won't create a thread to handle a connection request immediately. The queuing time can be long enough to exceed the client-side or server-side connection timeout. One way to handle that is to increase the connection timeout; hopefully PHP allows that.
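
(For reference, a minimal sketch of what raising the client-side connect timeout could look like with mysqli; the 10-second value is only an example, and $sql_user/$sql_pwd/$sql_db are the same variables the reproducer quoted below pulls from serverconf.inc.php:)

<?php declare(strict_types=1);
require __DIR__ . '/php/serverconf.inc.php';
$conn = mysqli_init();
// raise the client-side connect timeout so queued connection requests
// are not aborted by the client while they wait for a pool thread
mysqli_options($conn, MYSQLI_OPT_CONNECT_TIMEOUT, 10); // seconds, example value
if(mysqli_real_connect($conn, 'localhost', $sql_user, $sql_pwd, $sql_db) === true)
{
  echo 'OK';
}
else
{
  echo 'FAILED: ' . mysqli_connect_error();
}
?>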

please look at the thread start

i can reproduce this with pool-of-threads as well as one-thread-per-connection, and both with "thread_pool_priority" set and with it kept at its default

Another thought: with your setting "thread_pool_priority = high", all requests have the same priority, whether connecting or handling queries. If you used the default setting, connection requests would get higher priority, which means they are generally handled faster than normal requests.
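
(For comparison, a my.cnf fragment with the priority left at its default would look roughly like this; 'auto' is, as far as I know, the default value and prioritises connection requests over work from already-connected clients:)

thread_handling           = pool-of-threads
thread_pool_oversubscribe = 10
thread_pool_idle_timeout  = 900
# thread_pool_priority    = auto   (default, left unset)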

see above; also another piece of my initial posting - why in the world does "Threadpool_threads" not reach its highest value under load, but only after the benchmark is finished?

what really worries me here is this while the load is running (mysqld restarted each time before starting the apache benchmark):

+-------------------------+-------+
| Variable_name           | Value |
+-------------------------+-------+
| Threadpool_idle_threads | 181   |
| Threadpool_threads      | 189   |
+-------------------------+-------+

after the benchmark has finished and the machine is idle:
+-------------------------+-------+
| Variable_name           | Value |
+-------------------------+-------+
| Threadpool_idle_threads | 207   |
| Threadpool_threads      | 208   |
+-------------------------+-------+

fcntl(88, F_SETFL, O_RDWR|O_NONBLOCK)   = 0
accept4(88, {sa_family=AF_UNIX}, [128->2], SOCK_CLOEXEC) = 95
fcntl(88, F_SETFL, O_RDWR)              = 0
fcntl(95, F_SETFD, FD_CLOEXEC)          = 0
futex(0x7f65e917eae0, FUTEX_WAIT_PRIVATE, 2, NULL) = -1 EAGAIN (Resource temporarily unavailable)
futex(0x7f65e917eae0, FUTEX_WAKE_PRIVATE, 1) = 0
futex(0x7f656bf20ca4, FUTEX_WAKE_OP_PRIVATE, 1, 1, 0x7f656bf20ca0, FUTEX_OP_SET<<28|0<<12|FUTEX_OP_CMP_GT<<24|0x1) = 1
futex(0x55ce6e382888, FUTEX_WAKE_PRIVATE, 1) = 1
poll([{fd=87, events=POLLIN}, {fd=88, events=POLLIN}], 2, -1) = 1 ([{fd=88, revents=POLLIN}])
fcntl(88, F_GETFL)                      = 0x2 (flags O_RDWR)
fcntl(88, F_SETFL, O_RDWR|O_NONBLOCK)   = 0
accept4(88, {sa_family=AF_UNIX}, [128->2], SOCK_CLOEXEC) = 104
fcntl(88, F_SETFL, O_RDWR)              = 0
fcntl(104, F_SETFD, FD_CLOEXEC)         = 0
futex(0x7f65be075ca4, FUTEX_WAKE_OP_PRIVATE, 1, 1, 0x7f65be075ca0, FUTEX_OP_SET<<28|0<<12|FUTEX_OP_CMP_GT<<24|0x1) = 1
futex(0x55ce6e382948, FUTEX_WAKE_PRIVATE, 1) = 1
poll([{fd=87, events=POLLIN}, {fd=88, events=POLLIN}], 2, -1) = 1 ([{fd=88, revents=POLLIN}])
fcntl(88, F_GETFL)                      = 0x2 (flags O_RDWR)
fcntl(88, F_SETFL, O_RDWR|O_NONBLOCK)   = 0
accept4(88, {sa_family=AF_UNIX}, [128->2], SOCK_CLOEXEC) = 101
fcntl(88, F_SETFL, O_RDWR)              = 0
fcntl(101, F_SETFD, FD_CLOEXEC)         = 0

On 16.07.2017 07:33, Reindl Harald wrote:
simple reproducer script below; obviously it needs the '-k' (keep-alive) flag, otherwise there is not enough contention on the database server

ab -c 200 -n 500000 -k http://corecms/connect-bench.php

[root@srv-rhsoft:~]$ cat php_error.log | wc -l
312326

<?php declare(strict_types=1);
// $sql_user, $sql_pwd and $sql_db come from the server config include
require __DIR__ . '/php/serverconf.inc.php';
$conn = mysqli_init();
mysqli_options($conn, MYSQLI_OPT_INT_AND_FLOAT_NATIVE, true);
// host 'localhost' connects over the unix socket, so the port argument is effectively ignored
if(mysqli_real_connect($conn, 'localhost', $sql_user, $sql_pwd, $sql_db, 3600, '', 0) === true)
{
  echo 'OK';
}
else
{
  echo 'FAILED';
}
?>

[harry@srv-rhsoft:~]$ php -v
PHP 7.1.8-dev (cli) (built: Jul 13 2017 17:26:17) ( NTS )

[harry@srv-rhsoft:~]$ rpm -q mariadb
mariadb-10.2.7-5.fc25.20170714.rh.x86_64

On 16.07.2017 06:55, Reindl Harald wrote:
i started to play around with "thread_handling = pool-of-threads" in 10.2.7 and at that point removed the @mysqli_real_connect() error suppression from my database layer, which also has a usleep() and retry loop in case the connection fails and so had completely buried the issue

PHP Warning: mysqli_real_connect() [function.mysqli-real-connect.php]: (HY000/2002): Resource temporarily unavailable

you should not see such messages when running "ab -c 200 -n 500000 -k http://corecms/show_content.php?sid=2" with "max_connections = 300"
_____________________________________

thread_handling  = one-thread-per-connection
[root@srv-rhsoft:~]$ cat php_error.log | wc -l
52596

thread_handling = pool-of-threads
thread_pool_idle_timeout = 900
[root@srv-rhsoft:~]$ cat php_error.log | wc -l
39282

thread_handling = pool-of-threads
thread_pool_oversubscribe = 10
thread_pool_idle_timeout = 900
thread_pool_priority = high
[root@srv-rhsoft:~]$ cat php_error.log | wc -l
24849

since my database layer does a usleep(100000) before each retry, and the retry loop still has error suppression, this means that for 10% of all requests the cms waits at least 0.1 seconds for the mariadb server; the 4300 requests/second could be much higher if every connection succeeded on the first try (at least the thread pool seems to work slightly better than without it)
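
(A rough sketch of the kind of retry loop described above; the helper name and the 3-attempt limit are made up for illustration and are not the actual database layer:)

<?php declare(strict_types=1);
// hypothetical connect-with-retry helper: suppress the connect error,
// sleep 0.1 seconds between attempts and give up after a few tries
function connect_with_retry(string $user, string $pwd, string $db, int $attempts = 3)
{
  for($i = 0; $i < $attempts; $i++)
  {
    $conn = mysqli_init();
    if(@mysqli_real_connect($conn, 'localhost', $user, $pwd, $db) === true)
    {
      return $conn;
    }
    usleep(100000); // 0.1 seconds before the next attempt
  }
  return false;
}
?>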
_____________________________________

what really worries me here is this while the load is running (mysqld restarted each time before starting the apache benchmark):

+-------------------------+-------+
| Variable_name           | Value |
+-------------------------+-------+
| Threadpool_idle_threads | 181   |
| Threadpool_threads      | 189   |
+-------------------------+-------+

after the benchmark has finished and the machine is idle:
+-------------------------+-------+
| Variable_name           | Value |
+-------------------------+-------+
| Threadpool_idle_threads | 207   |
| Threadpool_threads      | 208   |
+-------------------------+-------+

frankly i would expect these numbers to go up to at least 200 while the load is running, or not to go up that high at all, but not to refuse connections, which is IMHO the whole point of a pool
The thread count increases in the presence of contention. Benchmarks are usually written in such a way that all threads shut down at the same time at the end. This runs into internal locks (LOCK_thread_count, if I remember correctly). What happens in case of serious contention is that the threadpool notices the contention, starts worrying that all threads could become blocked and possibly deadlocked while not enough spare threads are left to handle new requests (should they come), and therefore increases the number of threads.

If you wait long enough after your benchmark (long enough being longer than thread_pool_idle_timeout seconds), the excess threads will disappear.
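
(A simple way to watch that decay is to poll the status counters while and after the benchmark runs; a minimal sketch, reusing the credentials from the reproducer and an arbitrarily chosen 5-second interval:)

<?php declare(strict_types=1);
// poll the threadpool status counters so the drop after
// thread_pool_idle_timeout becomes visible
require __DIR__ . '/php/serverconf.inc.php';
$conn = mysqli_connect('localhost', $sql_user, $sql_pwd, $sql_db);
while(true)
{
  $res = mysqli_query($conn, "SHOW GLOBAL STATUS LIKE 'Threadpool%'");
  while($row = mysqli_fetch_row($res))
  {
    echo $row[0] . ' = ' . $row[1] . "\n";
  }
  echo "---\n";
  sleep(5);
}
?>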

_____________________________________

the core-cms itself makes exactly two queries per request over 3 MyISAM tables: one on a cache table with a primary key, and the second a simple join over two tables with only 2 records, so not really something one would call real load

select SQL_NO_CACHE * from cms1_cache where `hash`='fullmenu_1_2_0de0_0';
select SQL_NO_CACHE * from `cms1_sub` join `cms1_haupt` on sid=2 and hid=shid;

mysqld as well as httpd have "LimitNOFILE=infinity" in their systemd units, and the connection type is a unix socket, so none of that should be a problem on an i7-3770 CPU @ 3.40GHz with 32 GB RAM
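
(For reference, that limit can be set with a systemd drop-in along these lines; the path is just an example for mariadb.service, and an analogous drop-in applies to httpd.service:)

# /etc/systemd/system/mariadb.service.d/limits.conf   (example path)
[Service]
LimitNOFILE=infinity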

it takes some time until the errors start to appear in the log, likely after httpd (mod_prefork) has forked enough workers to introduce real concurrency on the database

