One of the most commonly misunderstood areas of MySQL configuration is parallel replication threads, and, more importantly, how to monitor a replica to ensure it is configured effectively.
Background on parallel replication
Traditionally, MySQL replication was single threaded: changes applied on a writer database are fetched by a replica and applied serially by a single applier thread, in the same order they were executed on the writer. Over time, optimizations were introduced to allow replicas to apply non-dependent transactions in parallel.
To decide whether a change is dependent, MySQL first introduced database-level dependency tracking, where changes applied to different databases could be applied in parallel on a replica.
Then MySQL introduced the "logical clock". When an event is written to the binary log on a MySQL database, whether a replica is connected or not, each transaction/event is also associated with a logical timestamp. The logical timestamp is calculated according to the type of dependency tracking configured on the source, and consists of two values: `last_committed` and `sequence_number`.
- `sequence_number` - each transaction in a binlog file, starting with the first, is assigned a sequence number, starting from 1 and monotonically increasing. In other words, the Nth transaction in a file will have `sequence_number == N`.
- `last_committed` - this field of the timestamp is the sequence number of the most recent transaction which the current transaction depends on. In other words, this is the last transaction which the current transaction cannot be applied in parallel with.
Before MySQL 5.7 there was one mechanism for determining dependencies based on the logical timestamp: COMMIT_ORDER. The core concept here is that transactions can safely run in parallel if they held their locks simultaneously at commit time on the source. This is implemented by having each transaction track its dependencies through a commit_parent variable, which updates as locks are acquired. After each non-COMMIT statement, this variable is updated to match the latest transaction in the binlog, and at commit time this value becomes the last_committed number. The mechanism works because once a transaction has executed its last pre-COMMIT statement, it holds all the locks it needs. Any transactions that commit during this window can safely run in parallel on a replica, since they would have held their locks concurrently. This approach ensures data consistency while allowing parallel execution, though the actual parallelism gains may be modest depending on workload. There are some caveats: for example, while a DDL is executing, no other replica worker threads can be active, which effectively blocks all other threads from scheduling work, because all transactions are treated as dependent on the executing DDL.
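To make the scheduling rule concrete, here is a minimal Python sketch (my own illustration, not the actual server code) of how a replica coordinator can use these two timestamp values: a transaction may start applying once every transaction with `sequence_number <= last_committed` has finished.

```python
# Illustrative sketch of the logical-clock scheduling rule on a replica
# (not MySQL's actual implementation).

def can_schedule(last_committed: int, lowest_unfinished_seq: int) -> bool:
    """lowest_unfinished_seq is the smallest sequence_number still being
    applied by any worker (use a large sentinel if all workers are idle).
    The transaction can start only if everything it depends on is done."""
    return last_committed < lowest_unfinished_seq

# A trx with last_committed = 5 can run while trx 6 and 7 are in flight:
assert can_schedule(5, 6)
# A trx with last_committed = 6 must wait until trx 6 finishes:
assert not can_schedule(6, 6)
```

When no transaction can be scheduled this way, the coordinator waits, and that wait is exactly what the "waited at clock conflicts" counter discussed later in this post accumulates.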
In this example, we have parsed the binary log using the mysqlbinlog tool.

| Statement | last_committed | sequence_number | Note |
|---|---|---|---|
| create database test_db | 0 | 1 | This is the first statement in a new binary log file, so last_committed is zero, and the first statement is assigned a sequence number of 1 |
| create table test_db.t1(id int) | 1 | 2 | This is the second transaction/event in this binary log file, so it is assigned a sequence number of 2. last_committed is set to 1, the sequence number of the previous transaction on which this one depends. Logically, it would not make sense to execute create table (sequence number 2) in a database which does not exist (sequence number 1) |
This implementation (COMMIT_ORDER) was heavily dependent on the concurrency/throughput of the source system. The timestamps generated depend heavily on the commit rate, and low-concurrency sources commonly produced dependency information that was not granular enough. As a result, MySQL introduced a new mechanism for determining dependencies called WRITESET dependency tracking, which can be enabled via `binlog_transaction_dependency_tracking` (WL#9556). Rather than relying on timing-dependent COMMIT_ORDER tracking, WRITESET tracks actual data dependencies between transactions. When WRITESET is used, MySQL creates transaction "writesets", which are collections of hashes. Each hash represents a modified database row on the MySQL source database.
The algorithm tracks:
- sequence_number (unique transaction ID)
- last_committed (most recent dependent transaction)
- Overlapping writesets (indicating dependencies)
Using this information, MySQL is able to determine dependencies based on writesets, rather than just locks held at commit time. Two transactions are deemed dependent if their writesets overlap. In the binary log from the source, last_committed now points to the most recent transaction with shared data. Using writesets, independent transactions can execute in parallel even if:
- They were sequential on the source
- Their group commit/commit windows didn't overlap
- They were in different sessions
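To make the overlap rule concrete, here is a minimal Python sketch of writeset bookkeeping (my own illustration; the server's actual hashing and history management differ): each transaction hashes the rows it modified, and last_committed becomes the newest earlier transaction that shares a row hash.

```python
import hashlib

def row_hash(schema: str, table: str, pk: str) -> int:
    """Hash identifying one modified row (illustrative, not MySQL's hash)."""
    digest = hashlib.md5(f"{schema}.{table}#{pk}".encode()).hexdigest()
    return int(digest[:16], 16)

history = {}  # row hash -> sequence_number of the last transaction to write it

def assign_last_committed(seq, writeset):
    """last_committed = newest earlier trx whose writeset overlaps ours."""
    last_committed = max((history.get(h, 0) for h in writeset), default=0)
    for h in writeset:
        history[h] = seq
    return last_committed

t1 = {row_hash("shop", "orders", "id=1")}
t2 = {row_hash("shop", "orders", "id=2")}  # disjoint rows
t3 = {row_hash("shop", "orders", "id=1")}  # rewrites t1's row

assert assign_last_committed(1, t1) == 0  # no dependencies
assert assign_last_committed(2, t2) == 0  # can run in parallel with trx 1
assert assign_last_committed(3, t3) == 1  # must wait for trx 1
```

Note how transactions 1 and 2 get last_committed = 0 even though they were sequential on the source: under COMMIT_ORDER they would only have been marked independent if their commit windows overlapped.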
Since I didn't want to write "yet another" blog on dependency tracking I may have skipped over some details above but if you want to read more into this I would highly recommend reading Daniel Nichter's blog series here:
- Group commit and trx dependency tracking
- Replica preserve commit order (RPCO)
- Monitoring MTR lag with the Performance Schema
New Performance Schema Replication Tables
Replication: This release adds the MySQL Replication Applier Metrics component, which provides users with statistical information about replication formerly logged in the error log. The component adds two tables containing this information to the MySQL Performance Schema:

- `replication_applier_metrics`: provides replication applier metrics for a given replication channel.
- `replication_applier_progress_by_worker`: provides similar metrics for a specific worker.

This enhances observability of replication by gathering statistics from the entire replication pipeline and unifying their presentation. As part of this work, some metrics which were not especially helpful have been replaced with more useful ones.

For more information about this component, see Replication Applier Metrics Component. (WL#15620)

References: See also: Bug #32587480.
Multi-threaded replica statistics
$ grep "Multi-threaded replica statistics" node1/data/msandbox.err | head -n 4
2025-01-27T18:07:46.516639Z 6 [Note] [MY-010559] [Repl] Multi-threaded replica statistics for channel '': seconds elapsed = 250; events assigned = 13313; worker queues filled over overrun level = 0; waited due a Worker queue full = 0; waited due the total size = 0; waited at clock conflicts = 7565719600 waited (count) when Workers occupied = 44 waited when Workers occupied = 1489733600
2025-01-27T18:09:46.150048Z 6 [Note] [MY-010559] [Repl] Multi-threaded replica statistics for channel '': seconds elapsed = 120; events assigned = 466945; worker queues filled over overrun level = 0; waited due a Worker queue full = 0; waited due the total size = 0; waited at clock conflicts = 55769034800 waited (count) when Workers occupied = 15175 waited when Workers occupied = 69850249500
2025-01-27T18:12:04.031062Z 16 [Note] [MY-010559] [Repl] Multi-threaded replica statistics for channel '': seconds elapsed = 120; events assigned = 976897; worker queues filled over overrun level = 0; waited due a Worker queue full = 0; waited due the total size = 0; waited at clock conflicts = 103952309000 waited (count) when Workers occupied = 0 waited when Workers occupied = 0
2025-01-27T18:15:09.087390Z 38 [Note] [MY-010559] [Repl] Multi-threaded replica statistics for channel '': seconds elapsed = 120; events assigned = 1163265; worker queues filled over overrun level = 0; waited due a Worker queue full = 0; waited due the total size = 0; waited at clock conflicts = 111850198700 waited (count) when Workers occupied = 0 waited when Workers occupied = 0
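These lines are grep-able, but the counters can also be pulled apart programmatically. Here is a small Python sketch (field labels copied from the log format above; it mirrors what the SQL view in the appendix does with SUBSTRING_INDEX):

```python
import re

# One MY-010559 line, taken from the error log excerpt above.
LINE = ("Multi-threaded replica statistics for channel '': "
        "seconds elapsed = 120; events assigned = 466945; "
        "worker queues filled over overrun level = 0; "
        "waited due a Worker queue full = 0; waited due the total size = 0; "
        "waited at clock conflicts = 55769034800 "
        "waited (count) when Workers occupied = 15175 "
        "waited when Workers occupied = 69850249500")

PATTERN = re.compile(
    r"channel '(?P<channel>[^']*)': "
    r"seconds elapsed = (?P<seconds_elapsed>\d+); "
    r"events assigned = (?P<events_assigned>\d+); "
    r"worker queues filled over overrun level = (?P<queues_filled>\d+); "
    r"waited due a Worker queue full = (?P<queue_full>\d+); "
    r"waited due the total size = (?P<total_size>\d+); "
    r"waited at clock conflicts = (?P<clock_conflicts>\d+) "
    r"waited \(count\) when Workers occupied = (?P<occupied_count>\d+) "
    r"waited when Workers occupied = (?P<occupied_time>\d+)")

stats = {k: int(v) if v.isdigit() else v
         for k, v in PATTERN.search(LINE).groupdict().items()}
assert stats["events_assigned"] == 466945
assert stats["occupied_count"] == 15175
```

A parser like this is handy for shipping these counters to an external monitoring system, which I come back to in the final thoughts.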
You can find all the definitions in the MySQL docs, but at first glance we can see this prints information on worker thread utilization (waited when Workers occupied) and clock conflicts (waited at clock conflicts)! So what's the catch?
- This only gets printed when `log_error_verbosity=3`. Since MySQL defaults to 2, a lot of users have probably never seen it. This is a dynamic parameter which can be changed online.
- When introduced, there were no views exposing this information at the database level, requiring grepping and wrangling of log files. This, coupled with the `log_error_verbosity` default, probably led to this log message being neglected by MySQL operators over the years. Luckily, MySQL 8.0 changes this with the introduction of the `performance_schema.error_log` table. More on this below.
- Messages are not printed in real time. As per the docs, "The statistics are printed depending on the volume of events that the coordinator thread has assigned to applier worker threads, with a maximum frequency of once every 120 seconds". So we only get output, at most, every two minutes. This shouldn't be an issue on most replication setups, though, as we should be generating enough replication events in every two-minute window.
New replication views
As described above, from MySQL 8.0 we can use the `performance_schema.error_log` table to query the error log directly from the database. To demonstrate this I will load some sysbench sample data and run some tests. After a couple of minutes with `log_error_verbosity=3` on the replica, you should see some output.
slave1 [*********:23244] {msandbox} (performance_schema) > SELECT * FROM
performance_schema.error_log
WHERE
SUBSYSTEM = 'Repl'
AND ERROR_CODE = 'MY-010559'
AND DATA LIKE '%Multi-threaded % statistics for channel%'\G
*************************** 1. row ***************************
LOGGED: 2025-04-25 21:28:05.406046
THREAD_ID: 6
PRIO: Note
ERROR_CODE: MY-010559
SUBSYSTEM: Repl
DATA: Multi-threaded replica statistics for channel '': seconds elapsed = 132; events assigned = 5121; worker queues filled over overrun level = 0; waited due a Worker queue full = 0; waited due the total size = 0; waited at clock conflicts = 883466700 waited (count) when Workers occupied = 8 waited when Workers occupied = 86657300
[...]
*************************** 30. row ***************************
LOGGED: 2025-04-25 22:37:53.021052
THREAD_ID: 16
PRIO: Note
ERROR_CODE: MY-010559
SUBSYSTEM: Repl
DATA: Multi-threaded replica statistics for channel '': seconds elapsed = 120; events assigned = 58414081; worker queues filled over overrun level = 0; waited due a Worker queue full = 0; waited due the total size = 0; waited at clock conflicts = 316849996600 waited (count) when Workers occupied = 55876 waited when Workers occupied = 31639630000
30 rows in set (0.00 sec)
Now that we have a way of getting the log messages, we can parse the DATA column to present it in a neater way. To do this I've created a view called `binlog_replication_coordinator_stats`, which you can see in the appendix below.
Column | Description |
---|---|
timestamp | When these statistics were logged |
channel | Replication channel name ('default' if unnamed channel) |
seconds_elapsed | Time period since last stats output in error log |
events_assigned | Number of events the coordinator assigned to worker threads |
avg_events_assigned | Average events assigned per second (events_assigned/seconds_elapsed) |
queues_filled | Number of times worker queues exceeded the overrun level in this period. High numbers indicate workers can't keep up with coordinator |
waited_worker_queue_full | Times coordinator waited because a worker's queue was full |
waited_pending_total_size | Times coordinator waited due to total pending jobs size limit |
clock_conflict_waits | Times the coordinator waited due to commit order conflicts. High numbers can indicate: 1. Transactions touching the same databases (when replica_parallel_type=DATABASE) 2. Dependency conflicts between transactions (when replica_parallel_type=LOGICAL_CLOCK) 3. Commit order preservation overhead with replica_preserve_commit_order=ON: - Workers must commit in the same order as the source binlog - An earlier transaction in a worker queue blocks later ones - Critical for preventing data inconsistency. Can be influenced by: - the binlog_transaction_dependency_tracking setting on the source (COMMIT_ORDER, WRITESET, or WRITESET_SESSION) - the replica_parallel_type setting on the replica (DATABASE or LOGICAL_CLOCK) - transaction writeset contents - transaction size and complexity |
workers_occupied_count | Number of times coordinator found all workers busy. Indicates potential need for more worker threads |
workers_occupied_time | Total microseconds waited for busy workers. Long waits suggest workers are the bottleneck |
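To summarize the table above, here is a rough triage heuristic of my own (not an official interpretation) for reading one row of these stats, based on the two dominant wait counters:

```python
def suggest_next_step(clock_conflict_waits: int, workers_occupied_count: int) -> str:
    """My own rough heuristic for picking the first tuning lever to try."""
    if clock_conflict_waits == 0 and workers_occupied_count == 0:
        return "no applier-side bottleneck visible in this window"
    if clock_conflict_waits == 0:
        return "workers saturated: try raising replica_parallel_workers"
    if workers_occupied_count == 0:
        return "clock conflicts dominate: revisit dependency tracking on the source"
    return "both: add workers first, then revisit dependency tracking"

# A row with huge clock waits and busy workers points at both levers:
assert suggest_next_step(100624635100, 8256).startswith("both")
# A row with clock waits but workers never all busy points at the source:
assert "dependency tracking" in suggest_next_step(90936245100, 0)
```

This is exactly the decision process the troubleshooting flow below walks through by hand.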
Example troubleshooting flow:
slave1 [*********:23244] {msandbox} (repl_mon) > pager grep "Seconds_Behind_Master"
PAGER set to 'grep "Seconds_Behind_Master"'
slave1 [*********:23244] {msandbox} (repl_mon) > show slave status\G
Seconds_Behind_Master: 51
1 row in set, 1 warning (0.00 sec)
slave1 [*********:23244] {msandbox} (repl_mon) > select * from binlog_replication_coordinator_stats;
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
| timestamp | channel | seconds_elapsed | events_assigned | avg_events_assigned | queues_filled | waited_worker_queue_full | waited_pending_total_size | clock_conflict_waits | workers_occupied_count | workers_occupied_time |
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
| 2025-04-28 21:25:44.328627 | default | 345 | 1025 | 3 | 0 | 0 | 0 | 24208500 | 4 | 8697300 |
| 2025-04-28 21:27:44.012512 | default | 120 | 5208064 | 43401 | 0 | 0 | 0 | 100624635100 | 8256 | 4027435500 |
| 2025-04-28 21:35:57.464912 | default | 493 | 4176896 | 8472 | 0 | 0 | 0 | 69082991700 | 5879 | 2292737600 |
| 2025-04-28 21:37:57.020855 | default | 120 | 5331968 | 44433 | 0 | 0 | 0 | 100571333200 | 7642 | 3408264300 |
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
4 rows in set (0.00 sec)
From the above, we can see some waits on Workers occupied and also clock conflicts. In show slave status we can see `Seconds_Behind_Master` is 51 seconds. This is an indicator that I may have underprovisioned `@@replica_parallel_workers`, but also that I may need to look into dependency tracking.

Since changing `replica_parallel_workers` is low-hanging fruit, let's start there.
To test this out I will:

1. Increase `replica_parallel_workers` to a higher value and restart the replication threads.
2. Observe replication lag to see if it decreases.
3. Tune `replica_parallel_workers` further, or continue troubleshooting.
Step 1: Change replica_parallel_workers and restart replication:
slave1 [*********:23244] {msandbox} (repl_mon) > select @@replica_parallel_workers;
+----------------------------+
| @@replica_parallel_workers |
+----------------------------+
| 4 |
+----------------------------+
1 row in set (0.00 sec)
slave1 [*********:23244] {msandbox} (repl_mon) > set global replica_parallel_workers=20;
Query OK, 0 rows affected (0.00 sec)
slave1 [*********:23244] {msandbox} (repl_mon) > select @@replica_parallel_workers;
+----------------------------+
| @@replica_parallel_workers |
+----------------------------+
| 20 |
+----------------------------+
1 row in set (0.00 sec)
slave1 [*********:23244] {msandbox} (repl_mon) > stop slave;start slave;
Query OK, 0 rows affected, 1 warning (0.00 sec)
Query OK, 0 rows affected, 1 warning (0.09 sec)
Step 2: Wait 4-5 minutes to observe changes in `workers_occupied_count` and `seconds_behind_master`.

After a couple of minutes we can see that `workers_occupied_count` dropped to zero, which is great! But our replication lag has not improved.
slave1 [*********:23244] {msandbox} (repl_mon) > select * from binlog_replication_coordinator_stats;
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
| timestamp | channel | seconds_elapsed | events_assigned | avg_events_assigned | queues_filled | waited_worker_queue_full | waited_pending_total_size | clock_conflict_waits | workers_occupied_count | workers_occupied_time |
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
| 2025-04-28 21:25:44.328627 | default | 345 | 1025 | 3 | 0 | 0 | 0 | 24208500 | 4 | 8697300 |
| 2025-04-28 21:27:44.012512 | default | 120 | 5208064 | 43401 | 0 | 0 | 0 | 100624635100 | 8256 | 4027435500 |
| 2025-04-28 21:35:57.464912 | default | 493 | 4176896 | 8472 | 0 | 0 | 0 | 69082991700 | 5879 | 2292737600 |
| 2025-04-28 21:37:57.020855 | default | 120 | 5331968 | 44433 | 0 | 0 | 0 | 100571333200 | 7642 | 3408264300 |
| 2025-04-28 21:42:53.061556 | default | 120 | 4802561 | 40021 | 0 | 0 | 0 | 90936245100 | 0 | 0 |
| 2025-04-28 21:44:53.014055 | default | 120 | 5512192 | 45935 | 0 | 0 | 0 | 103869504400 | 0 | 0 |
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
6 rows in set (0.00 sec)
slave1 [*********:23244] {msandbox} (repl_mon) > pager grep "Seconds_Behind_Master"
PAGER set to 'grep "Seconds_Behind_Master"'
slave1 [*********:23244] {msandbox} (repl_mon) > show slave status\G
Seconds_Behind_Master: 75
1 row in set, 1 warning (0.00 sec)
Step 3: Now we should look into dependency tracking changes. Here I will take Daniel's advice and enable WRITESET on the source instance. This can be done online, so let's try:
master [*********:23243] {msandbox} ((none)) > set global binlog_transaction_dependency_tracking='writeset';
Query OK, 0 rows affected, 1 warning (0.00 sec)
master [*********:23243] {msandbox} ((none)) > select @@binlog_transaction_dependency_tracking,@@version;
+------------------------------------------+-----------+
| @@binlog_transaction_dependency_tracking | @@version |
+------------------------------------------+-----------+
| WRITESET | 8.0.42 |
+------------------------------------------+-----------+
1 row in set, 1 warning (0.00 sec)
Now we need to wait a few minutes to allow the change to propagate. Remember that dependency tracking is applied on the source, so only binlog events written after `binlog_transaction_dependency_tracking` is changed will use writesets to record dependencies in the binary log; the replica needs to catch up to the change point before it can take advantage.
slave1 [*********:23244] {msandbox} (repl_mon) > show slave status\G
Seconds_Behind_Master: 143
1 row in set, 1 warning (0.00 sec)
slave1 [*********:23244] {msandbox} (repl_mon) > \n
PAGER set to stdout
slave1 [*********:23244] {msandbox} (repl_mon) > select * from binlog_replication_coordinator_stats;
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
| timestamp | channel | seconds_elapsed | events_assigned | avg_events_assigned | queues_filled | waited_worker_queue_full | waited_pending_total_size | clock_conflict_waits | workers_occupied_count | workers_occupied_time |
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
| 2025-04-28 21:25:44.328627 | default | 345 | 1025 | 3 | 0 | 0 | 0 | 24208500 | 4 | 8697300 |
| 2025-04-28 21:27:44.012512 | default | 120 | 5208064 | 43401 | 0 | 0 | 0 | 100624635100 | 8256 | 4027435500 |
| 2025-04-28 21:35:57.464912 | default | 493 | 4176896 | 8472 | 0 | 0 | 0 | 69082991700 | 5879 | 2292737600 |
| 2025-04-28 21:37:57.020855 | default | 120 | 5331968 | 44433 | 0 | 0 | 0 | 100571333200 | 7642 | 3408264300 |
| 2025-04-28 21:42:53.061556 | default | 120 | 4802561 | 40021 | 0 | 0 | 0 | 90936245100 | 0 | 0 |
| 2025-04-28 21:44:53.014055 | default | 120 | 5512192 | 45935 | 0 | 0 | 0 | 103869504400 | 0 | 0 |
| 2025-04-28 21:46:53.018936 | default | 120 | 5500928 | 45841 | 0 | 0 | 0 | 104048594800 | 0 | 0 |
| 2025-04-28 21:48:53.020615 | default | 120 | 5524480 | 46037 | 0 | 0 | 0 | *********300 | 0 | 0 |
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
8 rows in set (0.00 sec)
Step 4: Verify the effect of binlog_transaction_dependency_tracking=writeset.

After a few minutes, we can see replication lag has disappeared!
slave1 [*********:23244] {msandbox} (repl_mon) > pager grep "Seconds_Behind_Master"
PAGER set to 'grep "Seconds_Behind_Master"'
slave1 [*********:23244] {msandbox} (repl_mon) > show slave status\G
Seconds_Behind_Master: 0
1 row in set, 1 warning (0.00 sec)
This is great, and shows the impact that clock conflicts were having on our replication throughput.
However, if we look at binlog_replication_coordinator_stats we can now see the bottleneck has moved back to the number of worker threads!
Can we achieve more parallelism? Let's increase `replica_parallel_workers` again and see:
slave1 [*********:23244] {msandbox} (repl_mon) > select * from binlog_replication_coordinator_stats;
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
| timestamp | channel | seconds_elapsed | events_assigned | avg_events_assigned | queues_filled | waited_worker_queue_full | waited_pending_total_size | clock_conflict_waits | workers_occupied_count | workers_occupied_time |
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
| 2025-04-28 21:25:44.328627 | default | 345 | 1025 | 3 | 0 | 0 | 0 | 24208500 | 4 | 8697300 |
| 2025-04-28 21:27:44.012512 | default | 120 | 5208064 | 43401 | 0 | 0 | 0 | 100624635100 | 8256 | 4027435500 |
| 2025-04-28 21:35:57.464912 | default | 493 | 4176896 | 8472 | 0 | 0 | 0 | 69082991700 | 5879 | 2292737600 |
| 2025-04-28 21:37:57.020855 | default | 120 | 5331968 | 44433 | 0 | 0 | 0 | 100571333200 | 7642 | 3408264300 |
| 2025-04-28 21:42:53.061556 | default | 120 | 4802561 | 40021 | 0 | 0 | 0 | 90936245100 | 0 | 0 |
| 2025-04-28 21:44:53.014055 | default | 120 | 5512192 | 45935 | 0 | 0 | 0 | 103869504400 | 0 | 0 |
| 2025-04-28 21:46:53.018936 | default | 120 | 5500928 | 45841 | 0 | 0 | 0 | 104048594800 | 0 | 0 |
| 2025-04-28 21:48:53.020615 | default | 120 | 5524480 | 46037 | 0 | 0 | 0 | 104187049300 | 0 | 0 |
| 2025-04-28 21:50:53.017684 | default | 120 | 13111296 | 109261 | 0 | 0 | 0 | 38211462000 | 148868 | 34451537500 |
| 2025-04-28 21:52:53.016574 | default | 120 | 9456640 | 78805 | 0 | 0 | 0 | 265316200 | 52396 | 14188590700 |
| 2025-04-28 21:54:53.014725 | default | 120 | 7398400 | 61653 | 0 | 0 | 0 | 43328300 | 4493 | 3769165700 |
+----------------------------+---------+-----------------+-----------------+---------------------+---------------+--------------------------+---------------------------+----------------------+------------------------+-----------------------+
11 rows in set (0.00 sec)
slave1 [*********:23244] {msandbox} (repl_mon) > select @@replica_parallel_workers;
+----------------------------+
| @@replica_parallel_workers |
+----------------------------+
| 20 |
+----------------------------+
1 row in set (0.00 sec)
slave1 [*********:23244] {msandbox} (repl_mon) > set global replica_parallel_workers=40;
Query OK, 0 rows affected (0.00 sec)
slave1 [*********:23244] {msandbox} (repl_mon) > stop slave;start slave;
Query OK, 0 rows affected, 1 warning (0.00 sec)
Query OK, 0 rows affected, 1 warning (0.10 sec)
Step 5: Wait 4-5 minutes to observe changes in `workers_occupied_count`.

Now we can see results have improved. For fun, we then increased all the way to 100 to see how things would go. After doing so, the bottleneck swung back to `clock_conflict_waits` and `workers_occupied_count` dropped to zero.
slave1 [*********:23244] {msandbox} (repl_mon) > select * from binlog_replication_coordinator_stats;
[... previous output truncated for brevity ...]
# For fun, lets increase even more
slave1 [*********:23244] {msandbox} (repl_mon) > select @@replica_parallel_workers;
+----------------------------+
| @@replica_parallel_workers |
+----------------------------+
| 100 |
+----------------------------+
1 row in set (0.00 sec)
Final thoughts:
- Replication is a first-class citizen in MySQL; it would be nice to have native CE views for this information instead of relying on error log hacks.
- The performance_schema.error_log table is limited in size, so I would suggest periodically publishing metrics to your preferred monitoring solution so logs are not deleted or rotated accidentally.
- log_error_verbosity is a dynamic variable, so if replication logging is too noisy for your liking, you can enable it only when needed; however, historical info can be useful in such investigations, so it's a tradeoff :).
- Starting in MySQL 8.4, writeset is enabled by default (and is the only configuration allowed); on 8.0 you should really look into enabling it.
- Parallel replication threads defaults to 4 from MySQL version 8.0.27. Prior to this it was disabled by default.
- Tuning worker threads (and replication in general!) should be done with caution; there are always trade-offs and you should optimize based on your requirements. In the above example, while provisioning 100 replica_parallel_workers removed the bottleneck on worker threads, my problem was solved at ~20 threads. Be very careful that you are not shifting deck chairs around on the Titanic: sensible values can help avoid fires elsewhere and act as a throttling mechanism, and you don't want to saturate other parts of your system unnecessarily at peak times.
- It's not always the replication configuration, so be careful and verify as you change things. For example:
- There can be other bottlenecks, such as missing primary keys, DDLs, and long-running transactions blocking replication applier threads, which can lead to lag too. See Daniel's blog for advice on monitoring these. I will try to do a follow-up here too if I can find the time.
- Increasing replica_parallel_workers can introduce bottlenecks elsewhere; in my case I started to see the following messages in the error log, indicating my redo log size was not adequate. So be careful to test any such changes at both peak and regular load.
slave1 [*********:23244] {msandbox} (repl_mon) > select * from performance_schema.error_log where error_code='MY-013865' order by 1 desc limit 4;
+----------------------------+-----------+---------+------------+-----------+---------------------------------------------------------------------------------------------------+
| LOGGED | THREAD_ID | PRIO | ERROR_CODE | SUBSYSTEM | DATA |
+----------------------------+-----------+---------+------------+-----------+---------------------------------------------------------------------------------------------------+
| 2025-04-28 22:12:12.303168 | 0 | Warning | MY-013865 | InnoDB | Redo log writer is waiting for a new redo log file. Consider increasing innodb_redo_log_capacity. |
| 2025-04-28 22:11:59.514332 | 0 | Warning | MY-013865 | InnoDB | Redo log writer is waiting for a new redo log file. Consider increasing innodb_redo_log_capacity. |
| 2025-04-28 22:11:47.038615 | 0 | Warning | MY-013865 | InnoDB | Redo log writer is waiting for a new redo log file. Consider increasing innodb_redo_log_capacity. |
| 2025-04-28 22:11:39.048435 | 0 | Warning | MY-013865 | InnoDB | Redo log writer is waiting for a new redo log file. Consider increasing innodb_redo_log_capacity. |
+----------------------------+-----------+---------+------------+-----------+---------------------------------------------------------------------------------------------------+
4 rows in set (0.01 sec)
Appendix
View Definition
Below is the definition for the view used in this blog post. Use/test with caution, created for demonstration purposes :)
CREATE OR REPLACE VIEW repl_mon.binlog_replication_coordinator_stats AS
WITH repl_stats AS (
SELECT
LOGGED as timestamp,
SUBSTRING_INDEX(SUBSTRING_INDEX(DATA, 'channel \'', -1), '\':', 1) as channel,
CAST(SUBSTRING_INDEX(SUBSTRING_INDEX(DATA, 'seconds elapsed = ', -1), ';', 1) AS UNSIGNED) as seconds_elapsed,
CAST(SUBSTRING_INDEX(SUBSTRING_INDEX(DATA, 'events assigned = ', -1), ';', 1) AS UNSIGNED) as events_assigned,
CAST(SUBSTRING_INDEX(SUBSTRING_INDEX(DATA, 'worker queues filled over overrun level = ', -1), ';', 1) AS UNSIGNED) as queues_filled,
CAST(SUBSTRING_INDEX(SUBSTRING_INDEX(DATA, 'waited due a Worker queue full = ', -1), ';', 1) AS UNSIGNED) as waited_worker_queue_full,
CAST(SUBSTRING_INDEX(SUBSTRING_INDEX(DATA, 'waited due the total size = ', -1), ';', 1) AS UNSIGNED) as waited_pending_total_size,
CAST(SUBSTRING_INDEX(SUBSTRING_INDEX(DATA, 'waited at clock conflicts = ', -1), ' waited', 1) AS UNSIGNED) as clock_conflict_waits,
CAST(SUBSTRING_INDEX(SUBSTRING_INDEX(DATA, 'waited (count) when Workers occupied = ', -1), ' waited', 1) AS UNSIGNED) as workers_occupied_count,
CAST(SUBSTRING_INDEX(DATA, 'when Workers occupied = ', -1) AS UNSIGNED) as workers_occupied_time
FROM performance_schema.error_log
WHERE
SUBSYSTEM = 'Repl'
AND ERROR_CODE = 'MY-010559'
AND DATA LIKE '%Multi-threaded % statistics for channel%'
)
SELECT
timestamp,
CASE WHEN channel = '' THEN 'default' ELSE channel END as channel,
seconds_elapsed,
events_assigned,
ROUND(events_assigned/seconds_elapsed) as avg_events_assigned,
queues_filled,
waited_worker_queue_full,
waited_pending_total_size,
clock_conflict_waits,
workers_occupied_count,
workers_occupied_time
FROM repl_stats;
Quick notes on common issues and interpretations (feel free to add comments as you see them!):
- High waited_worker_queue_full: Consider increasing replica_pending_jobs_size_max
- High waited_pending_total_size: Check overall replication memory usage and limits
- High clock_conflict_waits: Could be improved by:
- Using WRITESET dependency tracking on master
- Using LOGICAL_CLOCK as replica_parallel_type instead of DATABASE
- Better database partitioning (if using DATABASE parallel type)
- Optimizing transaction batching
Note: If replica_preserve_commit_order=ON:
- Conflicts are necessary for consistency
- Disabling it may improve performance but:
- Can break data consistency
- May cause temporary read inconsistencies
- Not suitable if applications depend on commit order
- See https://hackmysql.com/replica-preserve-commit-order/ for more details
- High workers_occupied counts/times: Consider increasing replica_parallel_workers
Related Configuration Variables
- replica_parallel_workers: Number of worker threads (per channel)
- replica_parallel_type: DATABASE or LOGICAL_CLOCK
- replica_preserve_commit_order: Affects commit order preservation
- replica_pending_jobs_size_max: Per-worker queue size limit
- binlog_transaction_dependency_tracking: How master tracks dependencies (COMMIT_ORDER, WRITESET, or WRITESET_SESSION)
Write-set Based Dependency Tracking
Write-set based dependency tracking can significantly reduce conflicts by:
- Tracking actual row-level write conflicts instead of schema-level
- Allowing parallel execution of non-conflicting transactions
- Supporting better transaction scheduling
- Reducing false dependencies in LOGICAL_CLOCK mode
Note: Each replication channel has its own coordinator thread and worker threads, allowing different parallel replication configurations per channel.
Related Reading
- Group commit and trx dependency tracking - Daniel Nichter
- Replica preserve commit order (RPCO) - Daniel Nichter
- Monitoring MTR lag with the Performance Schema - Daniel Nichter
- MySQL Documentation: replica_parallel_workers
- MySQL Documentation: binlog_transaction_dependency_tracking
Version Information
The examples in this blog post were tested with:
- MySQL Version: 8.0.42
Last updated: April 2025