Configuring MySQL to Use HugePages

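Preface:
Anyone with Oracle operations experience will usually configure HugePages on a server that has a lot of memory, for the following reasons.

Linux manages memory through the kswapd process and the page table structure (each process on the system has its own entries). Memory is managed in pages: to make full use of physical memory, the kernel applies an LRU policy, moving rarely used pages out to swap when appropriate and keeping frequently used data in physical memory. The default page size on Linux is 4 KB, so on a machine with a lot of physical memory the page tables hold a very large number of entries, which slows down address lookups by the CPU and also wastes memory. Since the amount of memory is fixed, the only way to reduce the number of entries is to increase the page size, and that is where HugePages come from: instead of the traditional small pages, the system uses large pages of 2 MB, 4 MB, 16 MB and so on (the default huge page size on x86_64 Linux is 2 MB), which cuts the number of mapping entries dramatically. If the system has a large amount of physical memory (more than 64 GB), using HugePages is recommended.

Notes
1. HugePages use shared memory. They are allocated and reserved while the operating system starts, and they are never swapped out.
2. Because they are never swapped out, memory reserved for HugePages cannot be used by other processes, so choose the value carefully to avoid wasting memory.
3. If HugePages demand grows, physical memory is added, a new instance is added to the server, or the SGA settings change, recalculate the number of HugePages required.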
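To put the preface's argument in numbers, here is a minimal sketch that counts how many pages are needed to map a 16 GiB region (the buffer pool size used later in this post) with 4 KiB pages and with 2 MiB pages. The function name pages_for is made up for illustration, and the count ignores the multi-level layout of real page tables.

```shell
# Number of pages needed to map a region.
# $1 = region size in MiB, $2 = page size in KiB
pages_for() {
    echo $(( $1 * 1024 / $2 ))
}

echo "4 KiB pages: $(pages_for 16384 4)"      # 4 KiB pages: 4194304
echo "2 MiB pages: $(pages_for 16384 2048)"   # 2 MiB pages: 8192
```

With 2 MiB pages the same region needs 512 times fewer mappings, which is the kind of reduction the PageTables comparison further down reflects.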
MySQL supports HugePages as well, so here is how to set them up.
1. First, check the shared memory segment limits

On CentOS 6 the default maximum shared memory segment size is 64 GB. If the server has no more than 128 GB of RAM, there is no need to change it.

    # Controls the maximum shared segment size, in bytes
    kernel.shmmax = 68719476736
    # Controls the maximum number of shared memory segments, in pages
    kernel.shmall = 4294967296

As a preview, compare PageTables on a host that does not use huge pages with a host that does:

    [root@crmdbL-172 ~]# free -m
                 total       used       free     shared    buffers     cached
    Mem:         32058      29144       2913          0         20      11526
    -/+ buffers/cache:      17597      14460
    Swap:         8191          3       8188
    [root@crmdbL-172 ~]# cat /proc/meminfo | grep PageTables
    PageTables:        44808 kB

    [root@node-207 ~]# free -m
                 total       used       free     shared    buffers     cached
    Mem:         32095      28501       3593          0         21       9233
    -/+ buffers/cache:      19246      12848
    Swap:         8095          0       8095
    [root@node-207 ~]# cat /proc/meminfo | grep PageTables
    PageTables:         5372 kB

node-207 is the host configured with huge pages in this post. The difference is 44808 - 5372 = 39436 kB.
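The comparison above can be repeated on any host with a small helper. This is only a sketch: pagetables_kb is a hypothetical helper name, and the two constants are simply the values printed above, with node-207 taken to be the host that uses huge pages.

```shell
# Print the PageTables value (in kB) from a meminfo-style file.
# Defaults to /proc/meminfo when no file is given.
pagetables_kb() {
    awk '/^PageTables:/ { print $2 }' "${1:-/proc/meminfo}"
}

# Values reported above for the two hosts, in kB.
without_hugepages=44808   # crmdbL-172
with_hugepages=5372       # node-207
saved_kb=$(( without_hugepages - with_hugepages ))
echo "saved: ${saved_kb} kB (about $(( saved_kb / 1024 )) MiB)"
# saved: 39436 kB (about 38 MiB)
```

On a live system, calling pagetables_kb with no argument before and after enabling huge pages gives the same kind of before/after pair.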

2. Configure MySQL to use huge pages
Now set up the huge pages themselves. The relevant my.cnf settings are:

    innodb_buffer_pool_size = 16384M
    innodb_additional_mem_pool_size = 16M

(16384M + 16M) / 2M = 8200 pages. From past experience configuring huge pages for Oracle, the pool has to be somewhat larger than this, so I set 8211 pages. (mysqld must also be started with the large-pages option, otherwise InnoDB does not attempt a HugeTLB allocation at all.)

    vim /etc/sysctl.conf

    # Number of huge pages
    vm.nr_hugepages=8211
    # GID of the group allowed to use huge pages (the value reported by `id mysql`)
    vm.hugetlb_shm_group=3306

To apply the settings to the running system, reload them:

    sysctl -p

    vim /etc/security/limits.conf

    * soft nofile 65535
    * soft nproc 65535
    * hard nofile 65535
    * hard nproc 65535
    #* soft core 0
    #* hard rss 10000
    #@student hard nproc 20
    #@faculty soft nproc 20
    #@faculty hard nproc 50
    #ftp hard nproc 0
    #@student - maxlogins 4
    # Allow mysql to use HugePages
    @mysql soft memlock unlimited
    @mysql hard memlock unlimited

Oracle is configured the same way for huge pages. Restart MySQL and check the error log:

    150728 16:37:43 mysqld_safe mysqld from pid file /data/3306/tmp/mysql.pid ended
    150728 16:37:44 mysqld_safe Starting mysqld daemon with databases from /data/3306/data
    2015-07-28 16:37:45 0 [Note] /opt/app/mysql/bin/mysqld (mysqld 5.6.24-log) starting as process 13420 ...
    2015-07-28 16:37:45 13420 [Note] Plugin 'FEDERATED' is disabled.
    2015-07-28 16:37:45 7f56f311d740 InnoDB: Warning: Using innodb_additional_mem_pool_size is DEPRECATED. This option may be removed in future releases, together with the option innodb_use_sys_malloc and with the InnoDB's internal memory allocator.
    2015-07-28 16:37:45 13420 [Note] InnoDB: Using atomics to ref count buffer pool pages
    2015-07-28 16:37:45 13420 [Note] InnoDB: The InnoDB memory heap is disabled
    2015-07-28 16:37:45 13420 [Note] InnoDB: Mutexes and rw_locks use GCC atomic builtins
    2015-07-28 16:37:45 13420 [Note] InnoDB: Memory barrier is not used
    2015-07-28 16:37:45 13420 [Note] InnoDB: Compressed tables use zlib 1.2.3
    2015-07-28 16:37:45 13420 [Note] InnoDB: Using Linux native AIO
    2015-07-28 16:37:45 13420 [Note] InnoDB: Using CPU crc32 instructions
    2015-07-28 16:37:45 13420 [Note] InnoDB: Initializing buffer pool, size = 16.0G
    InnoDB: HugeTLB: Warning: Failed to allocate 2197815296 bytes. errno 12
    InnoDB HugeTLB: Warning: Using conventional memory pool

Two warnings, unexpectedly:

    InnoDB: HugeTLB: Warning: Failed to allocate 2197815296 bytes. errno 12
    InnoDB HugeTLB: Warning: Using conventional memory pool

About "Using conventional memory pool": huge pages are pre-allocated, cannot be taken by other processes, and are never swapped out. The pool configured here was too small. InnoDB asked for more than was available, so it fell back to conventional memory. Oracle has had the same kind of incident: when the huge page pool is smaller than the SGA, the reserved pages sit unused, and the wasted memory pushes the system into swap.

Since the warning says the pool is too small, I checked the official documentation page (see the user comments quoted at the end of this post) and learned that the huge page pool has to be larger than (innodb_buffer_pool_size + innodb_additional_mem_pool_size + innodb_log_buffer_size + tmp_table_size). The value configured above is clearly not enough.

So let's be generous and configure 9300 pages (9300 * 2M = 18600M, roughly 18.2 GB) to see what the log looks like when huge pages are enabled properly. Starting MySQL again, the warnings are gone:

    150728 16:55:33 mysqld_safe mysqld from pid file /data/3306/tmp/mysql.pid ended
    150728 16:56:04 mysqld_safe Starting mysqld daemon with databases from /data/3306/data
    2015-07-28 16:56:05 0 [Note] /opt/app/mysql/bin/mysqld (mysqld 5.6.24-log) starting as process 17256 ...
    2015-07-28 16:56:05 17256 [Note] Plugin 'FEDERATED' is disabled.
    2015-07-28 16:56:05 7fa0048e5740 InnoDB: Warning: Using innodb_additional_mem_pool_size is DEPRECATED. This option may be removed in future releases, together with the option innodb_use_sys_malloc and with the InnoDB's internal memory allocator.
    2015-07-28 16:56:05 17256 [Note] InnoDB: Using atomics to ref count buffer pool pages
    2015-07-28 16:56:05 17256 [Note] InnoDB: The InnoDB memory heap is disabled
    2015-07-28 16:56:05 17256 [Note] InnoDB: Mutexes and rw_locks use GCC atomic builtins
    2015-07-28 16:56:05 17256 [Note] InnoDB: Memory barrier is not used
    2015-07-28 16:56:05 17256 [Note] InnoDB: Compressed tables use zlib 1.2.3
    2015-07-28 16:56:05 17256 [Note] InnoDB: Using Linux native AIO
    2015-07-28 16:56:05 17256 [Note] InnoDB: Using CPU crc32 instructions
    2015-07-28 16:56:05 17256 [Note] InnoDB: Initializing buffer pool, size = 16.0G
    2015-07-28 16:56:06 17256 [Note] InnoDB: Completed initialization of buffer pool
    2015-07-28 16:56:06 17256 [Note] InnoDB: Highest supported file format is Barracuda.
    2015-07-28 16:56:06 17256 [Note] InnoDB: 128 rollback segment(s) are active.
    2015-07-28 16:56:06 17256 [Note] InnoDB: Waiting for purge to start
    2015-07-28 16:56:07 17256 [Note] InnoDB: 5.6.24 started; log sequence number 26564145028
    2015-07-28 16:56:07 17256 [Note] Server hostname (bind-address): '*'; port: 3306
    2015-07-28 16:56:07 17256 [Note] IPv6 is available.
    2015-07-28 16:56:07 17256 [Note] - '::' resolves to '::';
    2015-07-28 16:56:07 17256 [Note] Server socket created on IP: '::'.
    2015-07-28 16:56:07 17256 [Warning] Recovery from master pos 155925988 and file mysql-bin.000025.
    2015-07-28 16:56:07 17256 [Warning] Storing MySQL user name or password information in the master info repository is not secure and is therefore not recommended. Please consider using the USER and PASSWORD connection options for START SLAVE; see the 'START SLAVE Syntax' in the MySQL Manual for more information.
    2015-07-28 16:56:07 17256 [Note] Slave SQL thread initialized, starting replication in log 'mysql-bin.000025' at position 155925988, relay log '/data/3306/logs/relay-bin.000058' position: 4
    2015-07-28 16:56:07 17256 [Note] Slave I/O thread: connected to master 'slave@172.16.117.247:3306',replication started in log 'mysql-bin.000025' at position 155925988
    2015-07-28 16:56:07 17256 [Note] Event Scheduler: Loaded 0 events
    2015-07-28 16:56:07 17256 [Note] /opt/app/mysql/bin/mysqld: ready for connections.
    Version: '5.6.24-log' socket: '/data/3306/tmp/mysql.sock' port: 3306 MySQL Community Server (GPL)

    [root@node-207 ~]# cat /proc/meminfo | grep ^HugePages
    HugePages_Total: 9300
    HugePages_Free: 9067
    HugePages_Rsvd: 8178
    HugePages_Surp: 0
    Hugepagesize: 2048 kB

Huge pages are reserved exclusively, so anything beyond what is needed is wasted. Use the formula to pick a sensible page count. Recalculating with the formula:

    innodb_buffer_pool_size = 16384M
    innodb_additional_mem_pool_size = 16M
    innodb_log_buffer_size = 32M
    tmp_table_size=512M
    max_heap_table_size=512M

16384 + 16 + 32 + 512 = 16944 MB, and 16944 / 2 = 8472 pages. The pool has to be larger than this, so I set 8476, four pages more. Because the pages are exclusive, any surplus cannot be used for anything else. I usually add more than 2 and fewer than 5 extra pages.

Note that the temporary table term here refers to the value of max_heap_table_size, which is the maximum size allowed for in-memory (MEMORY engine) temporary tables.

Now check that the startup log looks normal:

    150728 17:14:23 mysqld_safe Starting mysqld daemon with databases from /data/3306/data
    2015-07-28 17:14:23 0 [Note] /opt/app/mysql/bin/mysqld (mysqld 5.6.24-log) starting as process 18569 ...
    2015-07-28 17:14:23 18569 [Note] Plugin 'FEDERATED' is disabled.
    2015-07-28 17:14:23 7fee7b559740 InnoDB: Warning: Using innodb_additional_mem_pool_size is DEPRECATED. This option may be removed in future releases, together with the option innodb_use_sys_malloc and with the InnoDB's internal memory allocator.
    2015-07-28 17:14:23 18569 [Note] InnoDB: Using atomics to ref count buffer pool pages
    2015-07-28 17:14:23 18569 [Note] InnoDB: The InnoDB memory heap is disabled
    2015-07-28 17:14:23 18569 [Note] InnoDB: Mutexes and rw_locks use GCC atomic builtins
    2015-07-28 17:14:23 18569 [Note] InnoDB: Memory barrier is not used
    2015-07-28 17:14:23 18569 [Note] InnoDB: Compressed tables use zlib 1.2.3
    2015-07-28 17:14:23 18569 [Note] InnoDB: Using Linux native AIO
    2015-07-28 17:14:23 18569 [Note] InnoDB: Using CPU crc32 instructions
    2015-07-28 17:14:23 18569 [Note] InnoDB: Initializing buffer pool, size = 16.0G
    2015-07-28 17:14:24 18569 [Note] InnoDB: Completed initialization of buffer pool
    2015-07-28 17:14:24 18569 [Note] InnoDB: Highest supported file format is Barracuda.
    2015-07-28 17:14:25 18569 [Note] InnoDB: 128 rollback segment(s) are active.
    2015-07-28 17:14:25 18569 [Note] InnoDB: Waiting for purge to start
    2015-07-28 17:14:25 18569 [Note] InnoDB: 5.6.24 started; log sequence number 26585446708
    2015-07-28 17:14:25 18569 [Note] Server hostname (bind-address): '*'; port: 3306
    2015-07-28 17:14:25 18569 [Note] IPv6 is available.
    2015-07-28 17:14:25 18569 [Note] - '::' resolves to '::';
    2015-07-28 17:14:25 18569 [Note] Server socket created on IP: '::'.
    2015-07-28 17:14:25 18569 [Warning] Recovery from master pos 166617263 and file mysql-bin.000025.
    2015-07-28 17:14:25 18569 [Warning] Storing MySQL user name or password information in the master info repository is not secure and is therefore not recommended. Please consider using the USER and PASSWORD connection options for START SLAVE; see the 'START SLAVE Syntax' in the MySQL Manual for more information.
    2015-07-28 17:14:25 18569 [Note] Slave SQL thread initialized, starting replication in log 'mysql-bin.000025' at position 166617263, relay log '/data/3306/logs/relay-bin.000060' position: 4
    2015-07-28 17:14:25 18569 [Note] Slave I/O thread: connected to master 'slave@172.16.117.247:3306',replication started in log 'mysql-bin.000025' at position 166617263
    2015-07-28 17:14:25 18569 [Note] Event Scheduler: Loaded 0 events
    2015-07-28 17:14:25 18569 [Note] /opt/app/mysql/bin/mysqld: ready for connections.
    Version: '5.6.24-log' socket: '/data/3306/tmp/mysql.sock' port: 3306 MySQL Community Server (GPL)

OK, all good. Now let's see how many huge pages are in use:

    [root@node-207 ~]# cat /proc/meminfo | grep ^HugePages
    HugePages_Total: 8476
    HugePages_Free: 8202
    HugePages_Rsvd: 8137
    HugePages_Surp: 0
    Hugepagesize: 2048 kB

Only a few pages have been used so far:

    HugePages_Total: 8476
    HugePages_Free: 8202
    HugePages_Rsvd: 8137
    Hugepagesize: 2048 kB
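The counters above can be read a little more precisely. In the kernel's accounting, HugePages_Free still includes pages that are reserved (HugePages_Rsvd) but not yet touched, so the pages a process has claimed are (Total - Free) + Rsvd. The sketch below applies that to the figures printed above; hugepage_usage is a hypothetical helper name, not an existing tool.

```shell
# Read HugePages_* counters from meminfo-style text on stdin and report
# how many pages are touched, claimed in total, and genuinely spare.
hugepage_usage() {
    awk '
        /^HugePages_Total:/ { total = $2 }
        /^HugePages_Free:/  { free_pages = $2 }
        /^HugePages_Rsvd:/  { rsvd = $2 }
        END {
            touched = total - free_pages   # pages already faulted in
            claimed = touched + rsvd       # plus pages reserved but not yet touched
            printf "touched=%d claimed=%d spare=%d\n", touched, claimed, total - claimed
        }'
}

# Counters from the output above
printf 'HugePages_Total: 8476\nHugePages_Free: 8202\nHugePages_Rsvd: 8137\n' | hugepage_usage
# touched=274 claimed=8411 spare=65

# On a live system: hugepage_usage < /proc/meminfo
```

Feeding in the counters from the earlier 9300-page run (Free 9067, Rsvd 8178) also gives 8411 claimed pages, which suggests that figure is what this instance actually asks for with these settings.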
Now run a count(primary key) on a large table and look again:

    [root@node-207 ~]# cat /proc/meminfo | grep ^HugePages
    HugePages_Total: 8476
    HugePages_Free: 8123
    HugePages_Rsvd: 8058
    HugePages_Surp: 0
    Hugepagesize: 2048 kB
    [root@node-207 ~]# cat /proc/meminfo | grep ^HugePages
    HugePages_Total: 8476
    HugePages_Free: 7233
    HugePages_Rsvd: 7201
    HugePages_Surp: 0
    Hugepagesize: 2048 kB

As you can see, the huge pages are now being used: HugePages_Free went from 8123 to 7233.

Looking at the InnoDB status as well, the memory in use works out to about that much. At this point huge pages are configured.

    ---BUFFER POOL 7
    Buffer pool size 131072
    Free buffers 113960
    Database pages 17102
    Old database pages 8571
    Modified db pages 1164
    Pending reads 0
    Pending writes: LRU 0, flush list 0, single page 0
    Pages made young 55, not young 0
    0.13 youngs/s, 0.00 non-youngs/s
    Pages read 17080, created 22, written 1056
    0.80 reads/s, 0.00 creates/s, 3.33 writes/s
    Buffer pool hit rate 974 / 1000, young-making rate 4 / 1000 not 0 / 1000
    Pages read ahead 0.00/s, evicted without access 0.00/s, Random read ahead 0.00/s
    LRU len: 17102, unzip_LRU len: 0
    I/O sum[0]:cur[4], unzip sum[0]:cur[0]
    --------------
    ROW OPERATIONS
    --------------
    0 queries inside InnoDB, 0 queries in queue
    0 read views open inside InnoDB
    Main thread process no. 18569, id 140659839203072, state: sleeping
    Number of rows inserted 12222, updated 10955, deleted 184, read 1819484
    10.53 inserts/s, 7.67 updates/s, 0.40 deletes/s, 8.07 reads/s
    ----------------------------
    END OF INNODB MONITOR OUTPUT
    ============================
    1 row in set (0.00 sec)

References:
https://dev.mysql.com/doc/refman/5.0/en/large-page-support.html

User comments from that page:

I hope this comment will save several hours and white nights on production launching... After following every How-to and all the documentation over Google, to enable huge pages... I must give you this post.
For enabling huge pages with Linux Debian 6.0.5 on Linux 2.6.32-5-amd64 #x86_64 GNU/Linux (64Bits) and MySQL 5.1, you have to add this to your /etc/sysctl.conf :

    # Total of allowed memory
    vm.nr_hugepages = YYYYYY
    # total amount of memory that can be allocated to shared memory, huge pages or not, on the box
    kernel.shmall = XXXXXXXXXX
    # maximum single shared memory segment, which for me was basically innodb_buffer_pool+1%
    kernel.shmmax = XXXXXXXXXX
    # Authorized group (use the number printed by `id -g mysql`)
    vm.hugetlb_shm_group = `id -g mysql`

XXXXX is given by this bash shell script :

    ##### SCRIPT START #########
    #!/bin/bash
    # keep 2 GB of memory for the system (value in kB)
    # (I have 68 GB of RAM on this one and 128 GB on another one)
    keep_for_system=2097152
    # total memory in kB, minus the part kept for the system
    mem=$(free|grep Mem|awk '{print$2}')
    mem=$(echo "$mem-$keep_for_system"|bc)
    # convert to bytes
    totmem=$(echo "$mem*1024"|bc)
    # huge page size in kB
    huge=$(grep Hugepagesize /proc/meminfo|awk '{print $2}')
    # allow 75% of the remaining memory for shared memory
    max=$(echo "$totmem*75/100"|bc)
    all=$(echo "$max/$huge"|bc)
    echo "kernel.shmmax = $max"
    echo "kernel.shmall = $all"
    ######### SCRIPT END #########

Check memory usage before reboot with the command :

    cat /proc/meminfo | grep -i huge

Reboot your system, and check memory usage again. It works ! ;-)

Posted by John Anderson on May 13 2015 11:09am

A bit of a note on the math here, some articles and blogs say that you should add your innodb_buffer_pool size to your innodb_additional_mem_pool_size, and divide that by your hugetlb page size. Then add a few on to that. Unfortunately, that doesn't seem to be the whole story. For those who want to allocate as little RAM as possible to HugeTLB while still satisfying the requirements outlined in my.cnf, this formula might be a little better. This is after some experimentation led me to put some effort behind finding out why I always had to allocate many more pages than the math suggested.
The real formula should be:

    (innodb_buffer_pool_size in kb + innodb_additional_mem_pool_size in kb + tmp_table_size in kb + innodb_log_buffer_size in kb) / hugetlb size in kb

Then to that, add an additional 11 - 15 pages until MySQL starts. I give my best guess as to why these pages are unaccounted for below.

First, a note on why tmp_table_size is included: I'm not sure if it *should* be tmp_table_size * max_tmp_tables, but MySQL starts and runs with only tmp_table_size included. I think this only applies if default_tmp_storage_engine is InnoDB. If a tmp table needs to be created for a sort or order, and that table is going to be InnoDB in RAM, then hugetlb will need to be used.

Secondly, I noticed in the source code that the InnoDB buffer log uses the 'os_mem_alloc_large' function. So I think that should be included in the calculation as well. In my experimentation, I had 22 pages unaccounted for until I found that, then my unaccounted for pages went down to 11.

As for the pages which don't seem to be accounted for, I think that is the overhead cost of the nature of pages. For instance, if you have an innodb_buffer_pool size of 256 MB, and you have 8 buffer instances then you have: (268435456 bytes / 8 instances ) = 33554.4 kilobytes to allocate per page. At 2048 KB per page, that comes to 16.4 pages per buffer. That .4 of a page means an entire page must be allocated, or 17 pages per buffer instead of 16.4. That would account for 8 pages right there. So if one is really picky, declaring buffer sizes that meet the page size exactly would theoretically leave no overhead to absorb.

I don't know why but MySQL and google convert have differing opinions on how to convert megabytes to bytes, and vice versa. So if you want to cut it as close as possible, fill out your my.cnf. Start mysql without large-pages, and take note of the values of these 4 variables. Then convert those values into kilobytes for the page count calculation.
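As a rough worked example, the commenter's formula can be applied to the settings used in this post. This is a sketch, not a definitive sizing tool: ceil_div is a hypothetical helper, the 11 to 15 extra pages are the commenter's rule of thumb, and the figure of 8 buffer pool instances is an inference from the "BUFFER POOL 7" header in the InnoDB status output above (instances are numbered from 0).

```shell
# Round-up division: how many units of size $2 are needed to hold $1.
ceil_div() {
    echo $(( ($1 + $2 - 1) / $2 ))
}

page_kb=2048   # Hugepagesize reported by /proc/meminfo in this post

# Settings from this post, converted to kB
buffer_pool_kb=$(( 16384 * 1024 ))
additional_mem_pool_kb=$(( 16 * 1024 ))
log_buffer_kb=$(( 32 * 1024 ))
tmp_table_kb=$(( 512 * 1024 ))

total_kb=$(( buffer_pool_kb + additional_mem_pool_kb + log_buffer_kb + tmp_table_kb ))
base_pages=$(ceil_div "$total_kb" "$page_kb")
echo "formula: ${base_pages} pages, with 11-15 extra: $(( base_pages + 11 ))-$(( base_pages + 15 ))"
# formula: 8472 pages, with 11-15 extra: 8483-8487

# Per-instance view: the request that failed in the first startup log
# was 2197815296 bytes. Assumption: 8 buffer pool instances.
instance_pages=$(ceil_div 2197815296 $(( page_kb * 1024 )))
echo "per instance: ${instance_pages} pages, 8 instances: $(( instance_pages * 8 )) pages"
# per instance: 1048 pages, 8 instances: 8384 pages
```

The per-instance figure is consistent with the first failed start: eight requests of 1048 pages come to 8384 pages, more than the 8211 that were configured at that point.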

 

posted @ 2015-07-30 16:55  文采飞扬