MySQL Thread Pool: Problem Definition

A new thread pool plugin is now a part of the MySQL Enterprise Edition.

In this blog we will cover the problem that the thread pool is solving

and some high-level description of how it solves this problem.

In the traditional MySQL server model there is a one-to-one mapping between

thread and connection. Even the MySQL server has lots of code where thread

or some abbreviation of thread is actually representing a connection.

Obviously this mapping has served MySQL very well over the years, but there

are some cases where this model don't work so well.

One such case is where there are much more connections executing queries

simultaneously compared to the number of CPUs available in the server. The

MySQL Server also have scalability bottlenecks where performance suffers

when too many connections execute in parallel.

So effectively there are two reasons that can make performance suffer in

the original MySQL Server model.

The first is that many connections executing in parallel means that the

amount of data that the CPUs work on increases. This will decrease the

CPU cache hit rates. Lowering the CPU cache hit rate can have a significant

negative impact on server performance. Actually in some cases the amount

of memory allocated by the connections executing in parallel could at times

even supersede the memory available in the server. In this case we enter a

state called swapping which is very detrimental to performance.

The second problem is that the number of parallel queries and transactions

can have a negative impact on the throughput through the "critical sections"

of the MySQL Server (critical section is where mutexes are applied to

ensure only one CPU changes a certain data structure at a time, when such

a critical section becomes a scalability problem we call it a hot spot).

Statements that writes are more affected since they use more critical

sections.

Neither of those problems can be solved in the operating system scheduler.

However there are some operating systems that have attempted solving this

problem for generic applications on a higher level in the operating system.

Both of those problems have the impact that performance suffers more and

more as the number of statements executed in parallel increases.

In addition there are hot spots where the mutex is held for a longer time

when many concurrent statements and/or transactions are executed in

parallel. One such example is the transaction list in InnoDB where each

transaction is listed in a linked list. Thus when the number of concurrent

transactions increases the time to scan the list increases and the time

holding the lock increases and thus the hot spot becomes even hotter

as the concurrency increases.

Current solutions to these issues exist in InnoDB through use of the

configuration parameter --innodb-thread-concurrency. When this parameter

is set to a nonzero value, this indicates how many threads are

able to run through InnoDB code concurrently. This solution have its

use cases where it works well. It does however have the drawback that

the solution itself contains a hot spot that limits the MySQL server

scalability. It does also not contain any solution to limiting the

number of concurrent transactions.

In a previous alpha version of the MySQL Server (MySQL 6.0) a thread

pool was developed. This thread pool solved the problem with limiting

the number of concurrent threads executing. It did nothing to solve

the problem with limiting the number of concurrent transactions.

It was also a scalability bottleneck in itself. Finally it didn't

solve all issues regarding long queries and blocked queries.

This made it possible for the MySQL Server to become completely

blocked.

When developing the thread pool extension now available in the MySQL

Enterprise Edition we decided to start from a clean plate with the

following requirements:

1) Limit the number of concurrently executing statements to ensure

that each statement execution has sufficient CPU and memory resources

to fulfill its task.

2) Split threads and connection into thread groups that are

independently managed. This is to ensure that the thread pool

plugin itself doesn't become a scalability bottleneck. The

aim is that each thread group has one or zero active threads

at any point in time.

3) Limit the number of concurrently executing transactions

through prioritizing queued connections dependent on if

they have started a transaction or not.

4) Avoid deadlocks when a statement execution becomes long or

when the statement is blocked for some reason for an extended

time.

If you are interested in knowing more details of how the new

thread pool solves these requirements there will be a

webinar on Thursday 20 Oct 2011 at 9.00 PDT. Check here

for details on how to access it.

If you want to try out the thread pool go here.

参考：

http://mikaelronstrom.blogspot.ae/2011/10/mysql-thread-pool-problem-definition.html

上一篇：数据库中的SELECT语句逻辑执行顺序分析_MySQL
下一篇：mysql 大数据库的备份和还原_MySQL