MySQL优化index merge引起的死锁怎么解决

背景生产环境出现死锁流水，通过查看死锁日志，看到造成死锁的是两条一样的update语句（只有where条件中的值不同），
如下：
update test_table set `status` = 1 where `trans_id` = 'xxx1' and `status` = 0;update test_table set `status` = 1 where `trans_id` = 'xxx2' and `status` = 0;
一开始比较费解，通过大量查询跟学习后，分析出了死锁形成的具体原理，特分享给大家，希望能帮助到遇到同样问题的朋友。
因为mysql知识点较多，这里对很多名词不进行过多介绍，有兴趣的朋友，可以后续进行专项深入学习。
死锁日志*** (1) transaction:transaction 791913819, active 0 sec starting index read, thread declared inside innodb 4999mysql tables in use 3, locked 3lock wait 4 lock struct(s), heap size 1184, 3 row lock(s)mysql thread id 462005230, os thread handle 0x7f55d5da3700, query id 2621313306 x.x.x.x test_user searching rows for updateupdate test_table set `status` = 1 where `trans_id` = 'xxx1' and `status` = 0;*** (1) waiting for this lock to be granted:record locks space id 110 page no 39167 n bits 1056 index `idx_status` of table `test`.`test_table` trx id 791913819 lock_mode x waitingrecord lock, heap no 495 physical record: n_fields 2; compact format; info bits 0*** (2) transaction:transaction 791913818, active 0 sec starting index read, thread declared inside innodb 4999mysql tables in use 3, locked 35 lock struct(s), heap size 1184, 4 row lock(s)mysql thread id 462005231, os thread handle 0x7f55cee63700, query id 2621313305 x.x.x.x test_user searching rows for updateupdate test_table set `status` = 1 where `trans_id` = 'xxx2' and `status` = 0;*** (2) holds the lock(s):record locks space id 110 page no 39167 n bits 1056 index `idx_status` of table `test`.`test_table` trx id 791913818 lock_mode xrecord lock, heap no 495 physical record: n_fields 2; compact format; info bits 0*** (2) waiting for this lock to be granted:record locks space id 110 page no 41569 n bits 88 index `primary` of table `test`.`test_table` trx id 791913818 lock_mode x locks rec but not gap waitingrecord lock, heap no 14 physical record: n_fields 30; compact format; info bits 0*** we roll back transaction (1)
简要分析下上边的死锁日志：
1、第一块内容（第1行到第9行）中，第6行为事务(1)执行的sql语句，第7和第8行意思为事务(1)在等待 idx_status 索引上的x锁；
2、第二块内容（第11行到第19行）中，第16行为事务(2)执行的sql语句，第17和第18行意思为事务(2)持有 idx_status 索引上的x锁；
意思为：事务(2)正在等待在 primary 索引上获取 x 锁。（but not gap指不是间隙锁）
4、最后一句的意思即为，mysql将事务(1)进行了回滚操作。
表结构create table `test_table` (`id` int(11) not null auto_increment,`trans_id` varchar(21) not null,`status` int(11) not null,primary key (`id`),unique key `uniq_trans_id` (`trans_id`) using btree,key `idx_status` (`status`) using btree) engine=innodb auto_increment=1 default charset=utf8
通过表结构可以看出，trans_id 列上有一个唯一索引uniq_trans_id ，status 列上有一个普通索引idx_status ，id列为主键索引 primary 。
innodb引擎中有两种索引：
聚簇索引：将数据存储与索引放到了一块，索引结构的叶子节点保存了行数据。
辅助索引：辅助索引叶子节点存储的是主键值，也就是聚簇索引的键值。
主键索引 primary 就是聚簇索引，叶子节点中会保存行数据。uniq_trans_id 索引和idx_status 索引为辅助索引，叶子节点中保存的是主键值，也就是id列值。
当我们通过辅助索引查找行数据时，先通过辅助索引找到主键id，再通过主键索引进行二次查找（也叫回表），最终找到行数据。
执行计划
通过看执行计划，可以发现，update语句用到了索引合并，也就是这条语句既用到了 uniq_trans_id 索引，又用到了 idx_status 索引，using intersect(uniq_trans_id,idx_status)的意思是通过两个索引获取交集。
为什么会用 index_merge（索引合并）mysql5.0之前，一个表一次只能使用一个索引，无法同时使用多个索引分别进行条件扫描。但是从5.1开始，引入了 index merge 优化技术，对同一个表可以使用多个索引分别进行条件扫描。
如执行计划中的语句：
update test_table set `status` = 1 where `trans_id` = '38' and `status` = 0 ;
mysql会根据 trans_id = ‘38’这个条件，利用 uniq_trans_id 索引找到叶子节点中保存的id值；同时会根据 status = 0这个条件，利用 idx_status 索引找到叶子节点中保存的id值；然后将找到的两组id值取交集，最终通过交集后的id回表，也就是通过 primary 索引找到叶子节点中保存的行数据。
这里可能很多人会有疑问了，uniq_trans_id 已经是一个唯一索引了，通过这个索引最终只能找到最多一条数据，那mysql优化器为啥还要用两个索引取交集，再回表进行查询呢，这样不是多了一次 idx_status 索引查找的过程么。我们来分析一下这两种情况执行过程。
第一种只用uniq_trans_id索引：
根据 trans_id = ‘38’查询条件，利用uniq_trans_id 索引找到叶子节点中保存的id值；
通过找到的id值，利用primary索引找到叶子节点中保存的行数据；
再通过 status = 0 条件对找到的行数据进行过滤。
第二种用到索引合并 using intersect(uniq_trans_id,idx_status)：
根据 trans_id = ‘38’ 查询条件，利用 uniq_trans_id 索引找到叶子节点中保存的id值；
根据 status = 0 查询条件，利用 idx_status 索引找到叶子节点中保存的id值；
将1/2中找到的id值取交集，然后利用primary索引找到叶子节点中保存的行数据
上边两种情况，主要区别在于，第一种是先通过一个索引把数据找到后，再用其它查询条件进行过滤；第二种是先通过两个索引查出的id值取交集，如果取交集后还存在id值，则再去回表将数据取出来。
当优化器认为第二种情况执行成本比第一种要小时，就会出现索引合并。（生产环境流水表中 status = 0 的数据非常少，这也是优化器考虑用第二种情况的原因之一）。
为什么用了 index_merge 就死锁了
上面简要画了一下两个update事务加锁的过程，从图中可以看到，在idx_status 索引和 primary （聚簇索引）上都存在重合交叉的部分，这样就为死锁造成了条件。
如，当遇到以下时序时，就会出现死锁：
事务1等待事务2释放锁，事务2等待事务1释放锁，这样就造成了死锁。
mysql检测到死锁后，会自动回滚代价更低的那个事务，如上边的时序图中，事务1持有的锁比事务2少，则mysql就将事务1进行了回滚。
解决方案一、从代码层面where 查询条件中，只传 trans_id ，将数据查询出来后，在代码层面判断 status 状态是否为0；
使用 force index(uniq_trans_id) 强制查询语句使用 uniq_trans_id 索引；
where 查询条件后边直接用 id 字段，通过主键去更新。
二、从mysql层面删除 idx_status 索引或者建一个包含这俩列的联合索引；
将mysql优化器的index merge优化关闭。
以上就是mysql优化index merge引起的死锁怎么解决的详细内容。

MySQL优化index merge引起的死锁怎么解决

推荐信息