MySQL relay log corrupted, how do I fix it? Tried but failedReplication doesn't resume once it fails to...
Taking headphones when quitting job
Possible issue with my W4 and tax return
How to politely refuse in-office gym instructor for steroids and protein
Custom shape shows unwanted extra line
Are the positive and negative planes inner or outer planes in the Great Wheel cosmology model?
Square Root Distance from Integers
Memory usage: #define vs. static const for uint8_t
A fantasy book with seven white haired women on the cover
Integration of two exponential multiplied by each other
Do authors have to be politically correct in article-writing?
Why do all the books in Game of Thrones library have their covers facing the back of the shelf?
A Missing Symbol for This Logo
Why is it that Bernie Sanders is always called a "socialist"?
What's this assembly doing?
"Starve to death" Vs. "Starve to the point of death"
Does it take energy to move something in a circle?
Plausible reason for gold-digging ant
Which scales have a major chord built on second note? Which scales have a minor chord built on the seventh note?
Why is the "Domain users" group missing from this PowerShell AD query?
Book where a space ship journeys to the center of the galaxy to find all the stars had gone supernova
Why maximum length of IP, TCP, UDP packet is not suit?
Does the US government have any planning in place to ensure there's no shortages of food, fuel, steel and other commodities?
Crack the bank account's password!
What senses are available to a corpse subjected to a Speak with Dead spell?
MySQL relay log corrupted, how do I fix it? Tried but failed
Replication doesn't resume once it fails to connectLooking for an efficient way to fix “Could not parse relay log event entry…” errorWhat max_allowed_packet is big enough, and why do I need to change it?Broken mysql replication on slave server (error 1236)Max allowed packet changed still replication not startingMySQL slave replicates changes that are in neither binlog_do_db nor replicate_do_dbmysql master/master replication problemReplication doesn't resume once it fails to connect1236 On Slave After Master Master Failureerror connecting to master 'replica@192.168.57.10:3306' mac-ubuntuCreate a mysql replication from multiple master and single slave server?
A MySQL v5.1.61 relay got corrupted when the machine suddenly shut down. I tried to fix it but it didn't work.
— How do I fix it? Did I do something wrong?
As far as I've read, corrupted MySQL relay logs are easily fixed:
change master to master_log_file='<Relay_Master_Log_File>',
master_log_pos=<Exec_Master_Log_Pos>;
where Relay_Master_Log_File
and Exec_Master_Log_Pos
are listed by:mysql> show slave status;
However when I did change master status ...
, I got a primary key violation error. How is that possible? Is the above procedure no correct, or is e.g. some +1 missing?
(For now I've simply re-imported a --master-data mysqldump from the master to the slave, and this solved the problem. However, in the future, doing that
might not be appropriate.)
Here follows details about my particular problem:
mysql> show slave status G
*************************** 1. row ***************************
Slave_IO_State: Waiting for master to send event
Master_Host: the-master-host
Master_User: replication
Master_Port: 3306
Connect_Retry: 60
Master_Log_File: mysql-bin.000021
Read_Master_Log_Pos: 33639968
Relay_Log_File: mysql-relay-bin.000271
Relay_Log_Pos: 2031587
Relay_Master_Log_File: mysql-bin.000020
Slave_IO_Running: Yes
Slave_SQL_Running: No
Replicate_Do_DB: the_database
Replicate_Ignore_DB:
Replicate_Do_Table:
Replicate_Ignore_Table:
Replicate_Wild_Do_Table:
Replicate_Wild_Ignore_Table:
Last_Errno: 1594
Last_Error: Relay log read failure: Could not parse relay log event entry. The possible reasons are: the master's binary log is corrupted (you can check this by running 'mysqlbinlog' on the binary log), the slave's relay log is corrupted (you can check this by running 'mysqlbinlog' on the relay log), a network problem, or a bug in the master's or slave's MySQL code. If you want to check the master's binary log or slave's relay log, you will be able to know their names by issuing 'SHOW SLAVE STATUS' on this slave.
Skip_Counter: 0
Exec_Master_Log_Pos: 66395191
Relay_Log_Space: 36559177
Until_Condition: None
Until_Log_File:
Until_Log_Pos: 0
Master_SSL_Allowed: No
Master_SSL_CA_File:
Master_SSL_CA_Path:
Master_SSL_Cert:
Master_SSL_Cipher:
Master_SSL_Key:
Seconds_Behind_Master: NULL
Master_SSL_Verify_Server_Cert: No
Last_IO_Errno: 0
Last_IO_Error:
Last_SQL_Errno: 1594
Last_SQL_Error: Relay log read failure: Could not parse relay log event entry. The possible reasons are: the master's binary log is corrupted (you can check this by running 'mysqlbinlog' on the binary log), the slave's relay log is corrupted (you can check this by running 'mysqlbinlog' on the relay log), a network problem, or a bug in the master's or slave's MySQL code. If you want to check the master's binary log or slave's relay log, you will be able to know their names by issuing 'SHOW SLAVE STATUS' on this slave.
And this is what I did:
mysql> stop slave;
mysql> reset slave;
mysql> change master to master_host='the-master-host', master_user='replication', master_password='the-password', master_log_file='mysql-bin.000020', master_log_pos=66395191;
mysql> start slave;
And this is what happened, a PK error:
131122 15:17:29 [Note] Slave I/O thread: connected to master 'replication@the-master-host:3306',replication started in log 'mysql-bin.000020' at position 66395191
131122 15:17:29 [ERROR] Slave SQL: Error 'Duplicate entry '71373' for key 'PRIMARY'' on query. Default database: 'the_database'. Query: 'insert into ... values ...', Error_code: 1062
131122 15:17:29 [Warning] Slave: Data truncated for column 'date' at row 1 Error_code: 1265
131122 15:17:29 [Warning] Slave: Duplicate entry '71373' for key 'PRIMARY' Error_code: 1062
I think I followed the recommended procedure (see links just below), still there was a PK error :-( ?
http://bugs.mysql.com/bug.php?id=26489, search for "Workarounds".
http://mhbarr.wordpress.com/2013/07/26/mysql-slave-corrupted-relay-log/
https://stackoverflow.com/a/14438408
mysql replication corruption
add a comment |
A MySQL v5.1.61 relay got corrupted when the machine suddenly shut down. I tried to fix it but it didn't work.
— How do I fix it? Did I do something wrong?
As far as I've read, corrupted MySQL relay logs are easily fixed:
change master to master_log_file='<Relay_Master_Log_File>',
master_log_pos=<Exec_Master_Log_Pos>;
where Relay_Master_Log_File
and Exec_Master_Log_Pos
are listed by:mysql> show slave status;
However when I did change master status ...
, I got a primary key violation error. How is that possible? Is the above procedure no correct, or is e.g. some +1 missing?
(For now I've simply re-imported a --master-data mysqldump from the master to the slave, and this solved the problem. However, in the future, doing that
might not be appropriate.)
Here follows details about my particular problem:
mysql> show slave status G
*************************** 1. row ***************************
Slave_IO_State: Waiting for master to send event
Master_Host: the-master-host
Master_User: replication
Master_Port: 3306
Connect_Retry: 60
Master_Log_File: mysql-bin.000021
Read_Master_Log_Pos: 33639968
Relay_Log_File: mysql-relay-bin.000271
Relay_Log_Pos: 2031587
Relay_Master_Log_File: mysql-bin.000020
Slave_IO_Running: Yes
Slave_SQL_Running: No
Replicate_Do_DB: the_database
Replicate_Ignore_DB:
Replicate_Do_Table:
Replicate_Ignore_Table:
Replicate_Wild_Do_Table:
Replicate_Wild_Ignore_Table:
Last_Errno: 1594
Last_Error: Relay log read failure: Could not parse relay log event entry. The possible reasons are: the master's binary log is corrupted (you can check this by running 'mysqlbinlog' on the binary log), the slave's relay log is corrupted (you can check this by running 'mysqlbinlog' on the relay log), a network problem, or a bug in the master's or slave's MySQL code. If you want to check the master's binary log or slave's relay log, you will be able to know their names by issuing 'SHOW SLAVE STATUS' on this slave.
Skip_Counter: 0
Exec_Master_Log_Pos: 66395191
Relay_Log_Space: 36559177
Until_Condition: None
Until_Log_File:
Until_Log_Pos: 0
Master_SSL_Allowed: No
Master_SSL_CA_File:
Master_SSL_CA_Path:
Master_SSL_Cert:
Master_SSL_Cipher:
Master_SSL_Key:
Seconds_Behind_Master: NULL
Master_SSL_Verify_Server_Cert: No
Last_IO_Errno: 0
Last_IO_Error:
Last_SQL_Errno: 1594
Last_SQL_Error: Relay log read failure: Could not parse relay log event entry. The possible reasons are: the master's binary log is corrupted (you can check this by running 'mysqlbinlog' on the binary log), the slave's relay log is corrupted (you can check this by running 'mysqlbinlog' on the relay log), a network problem, or a bug in the master's or slave's MySQL code. If you want to check the master's binary log or slave's relay log, you will be able to know their names by issuing 'SHOW SLAVE STATUS' on this slave.
And this is what I did:
mysql> stop slave;
mysql> reset slave;
mysql> change master to master_host='the-master-host', master_user='replication', master_password='the-password', master_log_file='mysql-bin.000020', master_log_pos=66395191;
mysql> start slave;
And this is what happened, a PK error:
131122 15:17:29 [Note] Slave I/O thread: connected to master 'replication@the-master-host:3306',replication started in log 'mysql-bin.000020' at position 66395191
131122 15:17:29 [ERROR] Slave SQL: Error 'Duplicate entry '71373' for key 'PRIMARY'' on query. Default database: 'the_database'. Query: 'insert into ... values ...', Error_code: 1062
131122 15:17:29 [Warning] Slave: Data truncated for column 'date' at row 1 Error_code: 1265
131122 15:17:29 [Warning] Slave: Duplicate entry '71373' for key 'PRIMARY' Error_code: 1062
I think I followed the recommended procedure (see links just below), still there was a PK error :-( ?
http://bugs.mysql.com/bug.php?id=26489, search for "Workarounds".
http://mhbarr.wordpress.com/2013/07/26/mysql-slave-corrupted-relay-log/
https://stackoverflow.com/a/14438408
mysql replication corruption
1
Yes, it looks like it should have worked, and actually it looks like it probably actually did work, as perhaps the original relay log, before the corrupt section, had already done the insert at that master log position, but couldn't advance the displayed master position to the next pointer, since that pointer is stored in the relay log (which was corrupt.) So you might have gotten away with skipping that event and moving to the next event, then verifying that master and slave actually had identical data... I haven't had an opportunity to review the question in sufficient detail, yet.
– Michael - sqlbot
Nov 25 '13 at 15:09
Thanks @Michael-sqlbot, then I think that if this problem happens again, I'll doSET GLOBAL sql_slave_skip_counter = 1; START SLAVE;
and skip one event on the slave, and hope that helps — does that make sense? If it doesn't help (if there's still a PK error), I'll import a dump with--master-data
again.
– KajMagnus
Nov 25 '13 at 15:44
add a comment |
A MySQL v5.1.61 relay got corrupted when the machine suddenly shut down. I tried to fix it but it didn't work.
— How do I fix it? Did I do something wrong?
As far as I've read, corrupted MySQL relay logs are easily fixed:
change master to master_log_file='<Relay_Master_Log_File>',
master_log_pos=<Exec_Master_Log_Pos>;
where Relay_Master_Log_File
and Exec_Master_Log_Pos
are listed by:mysql> show slave status;
However when I did change master status ...
, I got a primary key violation error. How is that possible? Is the above procedure no correct, or is e.g. some +1 missing?
(For now I've simply re-imported a --master-data mysqldump from the master to the slave, and this solved the problem. However, in the future, doing that
might not be appropriate.)
Here follows details about my particular problem:
mysql> show slave status G
*************************** 1. row ***************************
Slave_IO_State: Waiting for master to send event
Master_Host: the-master-host
Master_User: replication
Master_Port: 3306
Connect_Retry: 60
Master_Log_File: mysql-bin.000021
Read_Master_Log_Pos: 33639968
Relay_Log_File: mysql-relay-bin.000271
Relay_Log_Pos: 2031587
Relay_Master_Log_File: mysql-bin.000020
Slave_IO_Running: Yes
Slave_SQL_Running: No
Replicate_Do_DB: the_database
Replicate_Ignore_DB:
Replicate_Do_Table:
Replicate_Ignore_Table:
Replicate_Wild_Do_Table:
Replicate_Wild_Ignore_Table:
Last_Errno: 1594
Last_Error: Relay log read failure: Could not parse relay log event entry. The possible reasons are: the master's binary log is corrupted (you can check this by running 'mysqlbinlog' on the binary log), the slave's relay log is corrupted (you can check this by running 'mysqlbinlog' on the relay log), a network problem, or a bug in the master's or slave's MySQL code. If you want to check the master's binary log or slave's relay log, you will be able to know their names by issuing 'SHOW SLAVE STATUS' on this slave.
Skip_Counter: 0
Exec_Master_Log_Pos: 66395191
Relay_Log_Space: 36559177
Until_Condition: None
Until_Log_File:
Until_Log_Pos: 0
Master_SSL_Allowed: No
Master_SSL_CA_File:
Master_SSL_CA_Path:
Master_SSL_Cert:
Master_SSL_Cipher:
Master_SSL_Key:
Seconds_Behind_Master: NULL
Master_SSL_Verify_Server_Cert: No
Last_IO_Errno: 0
Last_IO_Error:
Last_SQL_Errno: 1594
Last_SQL_Error: Relay log read failure: Could not parse relay log event entry. The possible reasons are: the master's binary log is corrupted (you can check this by running 'mysqlbinlog' on the binary log), the slave's relay log is corrupted (you can check this by running 'mysqlbinlog' on the relay log), a network problem, or a bug in the master's or slave's MySQL code. If you want to check the master's binary log or slave's relay log, you will be able to know their names by issuing 'SHOW SLAVE STATUS' on this slave.
And this is what I did:
mysql> stop slave;
mysql> reset slave;
mysql> change master to master_host='the-master-host', master_user='replication', master_password='the-password', master_log_file='mysql-bin.000020', master_log_pos=66395191;
mysql> start slave;
And this is what happened, a PK error:
131122 15:17:29 [Note] Slave I/O thread: connected to master 'replication@the-master-host:3306',replication started in log 'mysql-bin.000020' at position 66395191
131122 15:17:29 [ERROR] Slave SQL: Error 'Duplicate entry '71373' for key 'PRIMARY'' on query. Default database: 'the_database'. Query: 'insert into ... values ...', Error_code: 1062
131122 15:17:29 [Warning] Slave: Data truncated for column 'date' at row 1 Error_code: 1265
131122 15:17:29 [Warning] Slave: Duplicate entry '71373' for key 'PRIMARY' Error_code: 1062
I think I followed the recommended procedure (see links just below), still there was a PK error :-( ?
http://bugs.mysql.com/bug.php?id=26489, search for "Workarounds".
http://mhbarr.wordpress.com/2013/07/26/mysql-slave-corrupted-relay-log/
https://stackoverflow.com/a/14438408
mysql replication corruption
A MySQL v5.1.61 relay got corrupted when the machine suddenly shut down. I tried to fix it but it didn't work.
— How do I fix it? Did I do something wrong?
As far as I've read, corrupted MySQL relay logs are easily fixed:
change master to master_log_file='<Relay_Master_Log_File>',
master_log_pos=<Exec_Master_Log_Pos>;
where Relay_Master_Log_File
and Exec_Master_Log_Pos
are listed by:mysql> show slave status;
However when I did change master status ...
, I got a primary key violation error. How is that possible? Is the above procedure no correct, or is e.g. some +1 missing?
(For now I've simply re-imported a --master-data mysqldump from the master to the slave, and this solved the problem. However, in the future, doing that
might not be appropriate.)
Here follows details about my particular problem:
mysql> show slave status G
*************************** 1. row ***************************
Slave_IO_State: Waiting for master to send event
Master_Host: the-master-host
Master_User: replication
Master_Port: 3306
Connect_Retry: 60
Master_Log_File: mysql-bin.000021
Read_Master_Log_Pos: 33639968
Relay_Log_File: mysql-relay-bin.000271
Relay_Log_Pos: 2031587
Relay_Master_Log_File: mysql-bin.000020
Slave_IO_Running: Yes
Slave_SQL_Running: No
Replicate_Do_DB: the_database
Replicate_Ignore_DB:
Replicate_Do_Table:
Replicate_Ignore_Table:
Replicate_Wild_Do_Table:
Replicate_Wild_Ignore_Table:
Last_Errno: 1594
Last_Error: Relay log read failure: Could not parse relay log event entry. The possible reasons are: the master's binary log is corrupted (you can check this by running 'mysqlbinlog' on the binary log), the slave's relay log is corrupted (you can check this by running 'mysqlbinlog' on the relay log), a network problem, or a bug in the master's or slave's MySQL code. If you want to check the master's binary log or slave's relay log, you will be able to know their names by issuing 'SHOW SLAVE STATUS' on this slave.
Skip_Counter: 0
Exec_Master_Log_Pos: 66395191
Relay_Log_Space: 36559177
Until_Condition: None
Until_Log_File:
Until_Log_Pos: 0
Master_SSL_Allowed: No
Master_SSL_CA_File:
Master_SSL_CA_Path:
Master_SSL_Cert:
Master_SSL_Cipher:
Master_SSL_Key:
Seconds_Behind_Master: NULL
Master_SSL_Verify_Server_Cert: No
Last_IO_Errno: 0
Last_IO_Error:
Last_SQL_Errno: 1594
Last_SQL_Error: Relay log read failure: Could not parse relay log event entry. The possible reasons are: the master's binary log is corrupted (you can check this by running 'mysqlbinlog' on the binary log), the slave's relay log is corrupted (you can check this by running 'mysqlbinlog' on the relay log), a network problem, or a bug in the master's or slave's MySQL code. If you want to check the master's binary log or slave's relay log, you will be able to know their names by issuing 'SHOW SLAVE STATUS' on this slave.
And this is what I did:
mysql> stop slave;
mysql> reset slave;
mysql> change master to master_host='the-master-host', master_user='replication', master_password='the-password', master_log_file='mysql-bin.000020', master_log_pos=66395191;
mysql> start slave;
And this is what happened, a PK error:
131122 15:17:29 [Note] Slave I/O thread: connected to master 'replication@the-master-host:3306',replication started in log 'mysql-bin.000020' at position 66395191
131122 15:17:29 [ERROR] Slave SQL: Error 'Duplicate entry '71373' for key 'PRIMARY'' on query. Default database: 'the_database'. Query: 'insert into ... values ...', Error_code: 1062
131122 15:17:29 [Warning] Slave: Data truncated for column 'date' at row 1 Error_code: 1265
131122 15:17:29 [Warning] Slave: Duplicate entry '71373' for key 'PRIMARY' Error_code: 1062
I think I followed the recommended procedure (see links just below), still there was a PK error :-( ?
http://bugs.mysql.com/bug.php?id=26489, search for "Workarounds".
http://mhbarr.wordpress.com/2013/07/26/mysql-slave-corrupted-relay-log/
https://stackoverflow.com/a/14438408
mysql replication corruption
mysql replication corruption
edited May 23 '17 at 12:40
Community♦
1
1
asked Nov 25 '13 at 9:33
KajMagnusKajMagnus
4242719
4242719
1
Yes, it looks like it should have worked, and actually it looks like it probably actually did work, as perhaps the original relay log, before the corrupt section, had already done the insert at that master log position, but couldn't advance the displayed master position to the next pointer, since that pointer is stored in the relay log (which was corrupt.) So you might have gotten away with skipping that event and moving to the next event, then verifying that master and slave actually had identical data... I haven't had an opportunity to review the question in sufficient detail, yet.
– Michael - sqlbot
Nov 25 '13 at 15:09
Thanks @Michael-sqlbot, then I think that if this problem happens again, I'll doSET GLOBAL sql_slave_skip_counter = 1; START SLAVE;
and skip one event on the slave, and hope that helps — does that make sense? If it doesn't help (if there's still a PK error), I'll import a dump with--master-data
again.
– KajMagnus
Nov 25 '13 at 15:44
add a comment |
1
Yes, it looks like it should have worked, and actually it looks like it probably actually did work, as perhaps the original relay log, before the corrupt section, had already done the insert at that master log position, but couldn't advance the displayed master position to the next pointer, since that pointer is stored in the relay log (which was corrupt.) So you might have gotten away with skipping that event and moving to the next event, then verifying that master and slave actually had identical data... I haven't had an opportunity to review the question in sufficient detail, yet.
– Michael - sqlbot
Nov 25 '13 at 15:09
Thanks @Michael-sqlbot, then I think that if this problem happens again, I'll doSET GLOBAL sql_slave_skip_counter = 1; START SLAVE;
and skip one event on the slave, and hope that helps — does that make sense? If it doesn't help (if there's still a PK error), I'll import a dump with--master-data
again.
– KajMagnus
Nov 25 '13 at 15:44
1
1
Yes, it looks like it should have worked, and actually it looks like it probably actually did work, as perhaps the original relay log, before the corrupt section, had already done the insert at that master log position, but couldn't advance the displayed master position to the next pointer, since that pointer is stored in the relay log (which was corrupt.) So you might have gotten away with skipping that event and moving to the next event, then verifying that master and slave actually had identical data... I haven't had an opportunity to review the question in sufficient detail, yet.
– Michael - sqlbot
Nov 25 '13 at 15:09
Yes, it looks like it should have worked, and actually it looks like it probably actually did work, as perhaps the original relay log, before the corrupt section, had already done the insert at that master log position, but couldn't advance the displayed master position to the next pointer, since that pointer is stored in the relay log (which was corrupt.) So you might have gotten away with skipping that event and moving to the next event, then verifying that master and slave actually had identical data... I haven't had an opportunity to review the question in sufficient detail, yet.
– Michael - sqlbot
Nov 25 '13 at 15:09
Thanks @Michael-sqlbot, then I think that if this problem happens again, I'll do
SET GLOBAL sql_slave_skip_counter = 1; START SLAVE;
and skip one event on the slave, and hope that helps — does that make sense? If it doesn't help (if there's still a PK error), I'll import a dump with --master-data
again.– KajMagnus
Nov 25 '13 at 15:44
Thanks @Michael-sqlbot, then I think that if this problem happens again, I'll do
SET GLOBAL sql_slave_skip_counter = 1; START SLAVE;
and skip one event on the slave, and hope that helps — does that make sense? If it doesn't help (if there's still a PK error), I'll import a dump with --master-data
again.– KajMagnus
Nov 25 '13 at 15:44
add a comment |
4 Answers
4
active
oldest
votes
Error: Last_SQL_Errno: 1594
Last_SQL_Error: Relay log read failure: Could not parse relay log event entry.
This error means that either the master log file is corrupted or the relay log file is corrupted.
- Before doing anything backup all your databases, logs, image servers, repeat, several times, and only continue at your own risk.
First run "show slave status G" on the slave and note:
Master_Log_File: mysql-bin.000026
Read_Master_Log_Pos: 2377104
Relay_Log_File: mysqld-relay-bin.000056
Relay_Log_Pos: 1097303
Relay_Master_Log_File: mysql-bin.000026
Exec_Master_Log_Pos: 1097157
First we want to make sure that the master log file is intact, so jump onto the master server and find the Relay_Master_Log_File (check /var/log/mysql) and run the following command:
mysqlbinlog mysql-bin.000026
The log will be displayed but hopefully you won't see any error messages. If you see error messages then the master logs are corrupt and you'll likely have to re-image.
Next run the same command on the slave relay log (often in /var/lib/mysql)
mysqlbinlog mysqld-relay-bin.000056
You will likely see some errors showing the corruption that has stopped replication, like this:
ERROR: Error in Log_event::read_log_event(): 'read error', data_len: 336, event_type: 2
ERROR: Could not read entry at offset 1097414: Error in log format or read error.
DELIMITER ;
# End of log file
ROLLBACK /* added by mysqlbinlog */;
/*!50003 SET COMPLETION_TYPE=@OLD_COMPLETION_TYPE*/;
/*!50530 SET @@SESSION.PSEUDO_SLAVE_MODE=0*/;
root@db:/var/lib/mysql#
If you see any errors then the log is fine on the master and only the slave's relay log is corrupt. This is good news, we can reset the slave and tell it the masters details and where to continue from. If you don't see any errors then stop reading now, you have a different problem.
If the slave relay log has errors, run the following commands to reset the slave and corrupted logs reconnect to the master, get the ok logs and start slaving again. Note that MASTER_LOG_POS is the Exec_Master_Log_Pos
, and MASTER_LOG_FILE is the Relay_Master_Log_File
(NOT the first one, which matches relay logs that have been fetched and need to be thrown away) both from the first command.
mysql> stop slave;
Query OK, 0 rows affected (0.14 sec)
mysql> reset slave all;
Query OK, 0 rows affected (0.43 sec)
mysql> CHANGE MASTER TO MASTER_HOST='master.host.com', MASTER_USER='masteruser', MASTER_PASSWORD='masterpass', MASTER_LOG_FILE='mysql-bin.000026', MASTER_LOG_POS=1097157;
Query OK, 0 rows affected (0.93 sec)
mysql> start slave;
Query OK, 0 rows affected (0.00 sec)
2
Hi, thanks for your answer. If you read the question carefully, you'll notice it says "Relay log corrupted" — that's because we had already usedmysqlbinlog
in the manner you suggest, and found out that the relay log (not the master log) had been corrupted. Concenring the fix you suggest — if you read the question carefully, you'll notice that the fix you suggest is exactly what we had already attempted. But that did not work, and that is what the question is about. — But your answer might be useful for other people with a similar problem.
– KajMagnus
Aug 28 '14 at 18:28
2
It should probably be noted, thatMASTER_LOG_FILE
inCHANGE MASTER
should be taken fromRelay_Master_Log_File
and not fromMaster_Log_File
. Usually they will be the same but it might not be the case always (see percona.com/blog/2008/07/07/… ).
– brablc
Feb 3 '16 at 10:07
@brablc is right.Relay_Master_Log_File
must be used, notMaster_Log_File
. See also: percona.com/blog/2008/07/07/…
– Mircea Vutcovici
Sep 6 '16 at 15:31
in most cases, there's no need forreset slave all
because the master settings don't need to be changed (e.g. master_host, master_user, master_password), only MASTER_LOG_FILE and MASTER_LOG_POS, then areset_slave
should suffice
– ympostor
Jul 14 '17 at 3:15
add a comment |
I know it's been over a year, but here is what may have happened to this particular problem.
mysql> stop slave;
mysql> reset slave;
mysql> change master to master_host='the-master-host', master_user='replication', master_password='the-password', master_log_file='mysql-bin.000020', master_log_pos=66395191;
mysql> start slave;
That looks like that should have fixed it because it removed the corrupt relay log.
Then, you got a PK error 1062. Why ?
There is an outstanding bug (http://bugs.mysql.com/bug.php?id=60847) that still active in MySQL 5.5
Although the bug relates to using mysql --single-transaction --flush-logs, a related quirk exists.
I have seen that quirk on some EC2 servers running as Slaves for a client just last week in MySQL 5.5.15
On the Master, there was a weird multiple row extended INSERT where each tuple being inserted was a SELECT. What happened was that the LAST_INSERT_ID in the relay log, which forms the next auto increment to assign, was already in use on the Slave because of multiple-row inserts beforehand.
The serialized INSERT in the relay log looked like
INSERT INTO tablname (column,column) VALUES (value,value,...)
The column list did not include the numeric primary key. When the 1062 error came back, I would use the same query it failed on, run the query manually. It did not hit the 1062 error. Then, I ran the usual skip slave commands:
STOP SLAVE;
SET GLOBAL SQL_SLAVE_SKIP_COUNTER=1;
START SLAVE;
SET @sleepnumber = SLEEP(3);
SHOW SLAVE STATUSG
Then, replication caught up.
My advice would be to properly serialize your INSERTs on the Master because this bug-like situation is actually quite avoidable.
add a comment |
[Fixing MySQL replication after slaves' relay log was corrupted]
MySQL replication on slave (version 5.XX) has stopped. Slave_IO_Running was marked as Yes, but Slave_SQL_Running as No. Simple stop/start slave didn’t help so further problem analysis was needed. It seemed that current slave’s relay log was corrupted because testing with “mysqlbinlog” has printed out an error. Therefore, the solution was to discard current relay binlogs and to point slave to the last master binlog position.
To fix the error, current binlog files on slave should be discarded and set new position. Before setting new binlog position it’s important to remember Relay_Master_Log_File and Exec_Master_Log_Pos values:
Relay_Master_Log_File: mysql-bin.002045
Exec_Master_Log_Pos: 103641119
OK, with this values, new binlog position can be set:
# stop slave
mysql> stop slave;
# make slave forget its replication position in the master's binary log
mysql> reset slave;
# change slave to start reading from stopped position
mysql> change master to master_log_file='mysql-bin.002045', master_log_pos=103641119;
# start slave
mysql> start slave;
Just to note that
reset slave
will deletemaster.info
,relay-log.info
and all the relay log files, so it's not needed to clean leftovers in/var/lib/mysql
directory.
add a comment |
You have done it quite right (as other already said).
The only issue is with master.info file (contains information about position in master's mysql-bin.log) because this file is not synced to disc after each query processed.
So your info about positions in master's log is outdated and you are processing already processed queries which needs to be skipped with SET GLOBAL SQL_SLAVE_SKIP_COUNTER=1;
.
Unfortunately, if you use some queries like UPDATE table SET counter=counter+1 WHERE id = 12345
and using binlog_format=STATEMENT
your databases may go out of sync, I think.
You can tell MySQL server to sync master.info after every event by setting up variable sync_master_info but it will probably have huge performance consequences.
add a comment |
Your Answer
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "182"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdba.stackexchange.com%2fquestions%2f53893%2fmysql-relay-log-corrupted-how-do-i-fix-it-tried-but-failed%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
4 Answers
4
active
oldest
votes
4 Answers
4
active
oldest
votes
active
oldest
votes
active
oldest
votes
Error: Last_SQL_Errno: 1594
Last_SQL_Error: Relay log read failure: Could not parse relay log event entry.
This error means that either the master log file is corrupted or the relay log file is corrupted.
- Before doing anything backup all your databases, logs, image servers, repeat, several times, and only continue at your own risk.
First run "show slave status G" on the slave and note:
Master_Log_File: mysql-bin.000026
Read_Master_Log_Pos: 2377104
Relay_Log_File: mysqld-relay-bin.000056
Relay_Log_Pos: 1097303
Relay_Master_Log_File: mysql-bin.000026
Exec_Master_Log_Pos: 1097157
First we want to make sure that the master log file is intact, so jump onto the master server and find the Relay_Master_Log_File (check /var/log/mysql) and run the following command:
mysqlbinlog mysql-bin.000026
The log will be displayed but hopefully you won't see any error messages. If you see error messages then the master logs are corrupt and you'll likely have to re-image.
Next run the same command on the slave relay log (often in /var/lib/mysql)
mysqlbinlog mysqld-relay-bin.000056
You will likely see some errors showing the corruption that has stopped replication, like this:
ERROR: Error in Log_event::read_log_event(): 'read error', data_len: 336, event_type: 2
ERROR: Could not read entry at offset 1097414: Error in log format or read error.
DELIMITER ;
# End of log file
ROLLBACK /* added by mysqlbinlog */;
/*!50003 SET COMPLETION_TYPE=@OLD_COMPLETION_TYPE*/;
/*!50530 SET @@SESSION.PSEUDO_SLAVE_MODE=0*/;
root@db:/var/lib/mysql#
If you see any errors then the log is fine on the master and only the slave's relay log is corrupt. This is good news, we can reset the slave and tell it the masters details and where to continue from. If you don't see any errors then stop reading now, you have a different problem.
If the slave relay log has errors, run the following commands to reset the slave and corrupted logs reconnect to the master, get the ok logs and start slaving again. Note that MASTER_LOG_POS is the Exec_Master_Log_Pos
, and MASTER_LOG_FILE is the Relay_Master_Log_File
(NOT the first one, which matches relay logs that have been fetched and need to be thrown away) both from the first command.
mysql> stop slave;
Query OK, 0 rows affected (0.14 sec)
mysql> reset slave all;
Query OK, 0 rows affected (0.43 sec)
mysql> CHANGE MASTER TO MASTER_HOST='master.host.com', MASTER_USER='masteruser', MASTER_PASSWORD='masterpass', MASTER_LOG_FILE='mysql-bin.000026', MASTER_LOG_POS=1097157;
Query OK, 0 rows affected (0.93 sec)
mysql> start slave;
Query OK, 0 rows affected (0.00 sec)
2
Hi, thanks for your answer. If you read the question carefully, you'll notice it says "Relay log corrupted" — that's because we had already usedmysqlbinlog
in the manner you suggest, and found out that the relay log (not the master log) had been corrupted. Concenring the fix you suggest — if you read the question carefully, you'll notice that the fix you suggest is exactly what we had already attempted. But that did not work, and that is what the question is about. — But your answer might be useful for other people with a similar problem.
– KajMagnus
Aug 28 '14 at 18:28
2
It should probably be noted, thatMASTER_LOG_FILE
inCHANGE MASTER
should be taken fromRelay_Master_Log_File
and not fromMaster_Log_File
. Usually they will be the same but it might not be the case always (see percona.com/blog/2008/07/07/… ).
– brablc
Feb 3 '16 at 10:07
@brablc is right.Relay_Master_Log_File
must be used, notMaster_Log_File
. See also: percona.com/blog/2008/07/07/…
– Mircea Vutcovici
Sep 6 '16 at 15:31
in most cases, there's no need forreset slave all
because the master settings don't need to be changed (e.g. master_host, master_user, master_password), only MASTER_LOG_FILE and MASTER_LOG_POS, then areset_slave
should suffice
– ympostor
Jul 14 '17 at 3:15
add a comment |
Error: Last_SQL_Errno: 1594
Last_SQL_Error: Relay log read failure: Could not parse relay log event entry.
This error means that either the master log file is corrupted or the relay log file is corrupted.
- Before doing anything backup all your databases, logs, image servers, repeat, several times, and only continue at your own risk.
First run "show slave status G" on the slave and note:
Master_Log_File: mysql-bin.000026
Read_Master_Log_Pos: 2377104
Relay_Log_File: mysqld-relay-bin.000056
Relay_Log_Pos: 1097303
Relay_Master_Log_File: mysql-bin.000026
Exec_Master_Log_Pos: 1097157
First we want to make sure that the master log file is intact, so jump onto the master server and find the Relay_Master_Log_File (check /var/log/mysql) and run the following command:
mysqlbinlog mysql-bin.000026
The log will be displayed but hopefully you won't see any error messages. If you see error messages then the master logs are corrupt and you'll likely have to re-image.
Next run the same command on the slave relay log (often in /var/lib/mysql)
mysqlbinlog mysqld-relay-bin.000056
You will likely see some errors showing the corruption that has stopped replication, like this:
ERROR: Error in Log_event::read_log_event(): 'read error', data_len: 336, event_type: 2
ERROR: Could not read entry at offset 1097414: Error in log format or read error.
DELIMITER ;
# End of log file
ROLLBACK /* added by mysqlbinlog */;
/*!50003 SET COMPLETION_TYPE=@OLD_COMPLETION_TYPE*/;
/*!50530 SET @@SESSION.PSEUDO_SLAVE_MODE=0*/;
root@db:/var/lib/mysql#
If you see any errors then the log is fine on the master and only the slave's relay log is corrupt. This is good news, we can reset the slave and tell it the masters details and where to continue from. If you don't see any errors then stop reading now, you have a different problem.
If the slave relay log has errors, run the following commands to reset the slave and corrupted logs reconnect to the master, get the ok logs and start slaving again. Note that MASTER_LOG_POS is the Exec_Master_Log_Pos
, and MASTER_LOG_FILE is the Relay_Master_Log_File
(NOT the first one, which matches relay logs that have been fetched and need to be thrown away) both from the first command.
mysql> stop slave;
Query OK, 0 rows affected (0.14 sec)
mysql> reset slave all;
Query OK, 0 rows affected (0.43 sec)
mysql> CHANGE MASTER TO MASTER_HOST='master.host.com', MASTER_USER='masteruser', MASTER_PASSWORD='masterpass', MASTER_LOG_FILE='mysql-bin.000026', MASTER_LOG_POS=1097157;
Query OK, 0 rows affected (0.93 sec)
mysql> start slave;
Query OK, 0 rows affected (0.00 sec)
2
Hi, thanks for your answer. If you read the question carefully, you'll notice it says "Relay log corrupted" — that's because we had already usedmysqlbinlog
in the manner you suggest, and found out that the relay log (not the master log) had been corrupted. Concenring the fix you suggest — if you read the question carefully, you'll notice that the fix you suggest is exactly what we had already attempted. But that did not work, and that is what the question is about. — But your answer might be useful for other people with a similar problem.
– KajMagnus
Aug 28 '14 at 18:28
2
It should probably be noted, thatMASTER_LOG_FILE
inCHANGE MASTER
should be taken fromRelay_Master_Log_File
and not fromMaster_Log_File
. Usually they will be the same but it might not be the case always (see percona.com/blog/2008/07/07/… ).
– brablc
Feb 3 '16 at 10:07
@brablc is right.Relay_Master_Log_File
must be used, notMaster_Log_File
. See also: percona.com/blog/2008/07/07/…
– Mircea Vutcovici
Sep 6 '16 at 15:31
in most cases, there's no need forreset slave all
because the master settings don't need to be changed (e.g. master_host, master_user, master_password), only MASTER_LOG_FILE and MASTER_LOG_POS, then areset_slave
should suffice
– ympostor
Jul 14 '17 at 3:15
add a comment |
Error: Last_SQL_Errno: 1594
Last_SQL_Error: Relay log read failure: Could not parse relay log event entry.
This error means that either the master log file is corrupted or the relay log file is corrupted.
- Before doing anything backup all your databases, logs, image servers, repeat, several times, and only continue at your own risk.
First run "show slave status G" on the slave and note:
Master_Log_File: mysql-bin.000026
Read_Master_Log_Pos: 2377104
Relay_Log_File: mysqld-relay-bin.000056
Relay_Log_Pos: 1097303
Relay_Master_Log_File: mysql-bin.000026
Exec_Master_Log_Pos: 1097157
First we want to make sure that the master log file is intact, so jump onto the master server and find the Relay_Master_Log_File (check /var/log/mysql) and run the following command:
mysqlbinlog mysql-bin.000026
The log will be displayed but hopefully you won't see any error messages. If you see error messages then the master logs are corrupt and you'll likely have to re-image.
Next run the same command on the slave relay log (often in /var/lib/mysql)
mysqlbinlog mysqld-relay-bin.000056
You will likely see some errors showing the corruption that has stopped replication, like this:
ERROR: Error in Log_event::read_log_event(): 'read error', data_len: 336, event_type: 2
ERROR: Could not read entry at offset 1097414: Error in log format or read error.
DELIMITER ;
# End of log file
ROLLBACK /* added by mysqlbinlog */;
/*!50003 SET COMPLETION_TYPE=@OLD_COMPLETION_TYPE*/;
/*!50530 SET @@SESSION.PSEUDO_SLAVE_MODE=0*/;
root@db:/var/lib/mysql#
If you see any errors then the log is fine on the master and only the slave's relay log is corrupt. This is good news, we can reset the slave and tell it the masters details and where to continue from. If you don't see any errors then stop reading now, you have a different problem.
If the slave relay log has errors, run the following commands to reset the slave and corrupted logs reconnect to the master, get the ok logs and start slaving again. Note that MASTER_LOG_POS is the Exec_Master_Log_Pos
, and MASTER_LOG_FILE is the Relay_Master_Log_File
(NOT the first one, which matches relay logs that have been fetched and need to be thrown away) both from the first command.
mysql> stop slave;
Query OK, 0 rows affected (0.14 sec)
mysql> reset slave all;
Query OK, 0 rows affected (0.43 sec)
mysql> CHANGE MASTER TO MASTER_HOST='master.host.com', MASTER_USER='masteruser', MASTER_PASSWORD='masterpass', MASTER_LOG_FILE='mysql-bin.000026', MASTER_LOG_POS=1097157;
Query OK, 0 rows affected (0.93 sec)
mysql> start slave;
Query OK, 0 rows affected (0.00 sec)
Error: Last_SQL_Errno: 1594
Last_SQL_Error: Relay log read failure: Could not parse relay log event entry.
This error means that either the master log file is corrupted or the relay log file is corrupted.
- Before doing anything backup all your databases, logs, image servers, repeat, several times, and only continue at your own risk.
First run "show slave status G" on the slave and note:
Master_Log_File: mysql-bin.000026
Read_Master_Log_Pos: 2377104
Relay_Log_File: mysqld-relay-bin.000056
Relay_Log_Pos: 1097303
Relay_Master_Log_File: mysql-bin.000026
Exec_Master_Log_Pos: 1097157
First we want to make sure that the master log file is intact, so jump onto the master server and find the Relay_Master_Log_File (check /var/log/mysql) and run the following command:
mysqlbinlog mysql-bin.000026
The log will be displayed but hopefully you won't see any error messages. If you see error messages then the master logs are corrupt and you'll likely have to re-image.
Next run the same command on the slave relay log (often in /var/lib/mysql)
mysqlbinlog mysqld-relay-bin.000056
You will likely see some errors showing the corruption that has stopped replication, like this:
ERROR: Error in Log_event::read_log_event(): 'read error', data_len: 336, event_type: 2
ERROR: Could not read entry at offset 1097414: Error in log format or read error.
DELIMITER ;
# End of log file
ROLLBACK /* added by mysqlbinlog */;
/*!50003 SET COMPLETION_TYPE=@OLD_COMPLETION_TYPE*/;
/*!50530 SET @@SESSION.PSEUDO_SLAVE_MODE=0*/;
root@db:/var/lib/mysql#
If you see any errors then the log is fine on the master and only the slave's relay log is corrupt. This is good news, we can reset the slave and tell it the masters details and where to continue from. If you don't see any errors then stop reading now, you have a different problem.
If the slave relay log has errors, run the following commands to reset the slave and corrupted logs reconnect to the master, get the ok logs and start slaving again. Note that MASTER_LOG_POS is the Exec_Master_Log_Pos
, and MASTER_LOG_FILE is the Relay_Master_Log_File
(NOT the first one, which matches relay logs that have been fetched and need to be thrown away) both from the first command.
mysql> stop slave;
Query OK, 0 rows affected (0.14 sec)
mysql> reset slave all;
Query OK, 0 rows affected (0.43 sec)
mysql> CHANGE MASTER TO MASTER_HOST='master.host.com', MASTER_USER='masteruser', MASTER_PASSWORD='masterpass', MASTER_LOG_FILE='mysql-bin.000026', MASTER_LOG_POS=1097157;
Query OK, 0 rows affected (0.93 sec)
mysql> start slave;
Query OK, 0 rows affected (0.00 sec)
edited Jul 14 '17 at 6:43
ympostor
1053
1053
answered Aug 28 '14 at 13:10
A.BadgerA.Badger
38136
38136
2
Hi, thanks for your answer. If you read the question carefully, you'll notice it says "Relay log corrupted" — that's because we had already usedmysqlbinlog
in the manner you suggest, and found out that the relay log (not the master log) had been corrupted. Concenring the fix you suggest — if you read the question carefully, you'll notice that the fix you suggest is exactly what we had already attempted. But that did not work, and that is what the question is about. — But your answer might be useful for other people with a similar problem.
– KajMagnus
Aug 28 '14 at 18:28
2
It should probably be noted, thatMASTER_LOG_FILE
inCHANGE MASTER
should be taken fromRelay_Master_Log_File
and not fromMaster_Log_File
. Usually they will be the same but it might not be the case always (see percona.com/blog/2008/07/07/… ).
– brablc
Feb 3 '16 at 10:07
@brablc is right.Relay_Master_Log_File
must be used, notMaster_Log_File
. See also: percona.com/blog/2008/07/07/…
– Mircea Vutcovici
Sep 6 '16 at 15:31
in most cases, there's no need forreset slave all
because the master settings don't need to be changed (e.g. master_host, master_user, master_password), only MASTER_LOG_FILE and MASTER_LOG_POS, then areset_slave
should suffice
– ympostor
Jul 14 '17 at 3:15
add a comment |
2
Hi, thanks for your answer. If you read the question carefully, you'll notice it says "Relay log corrupted" — that's because we had already usedmysqlbinlog
in the manner you suggest, and found out that the relay log (not the master log) had been corrupted. Concenring the fix you suggest — if you read the question carefully, you'll notice that the fix you suggest is exactly what we had already attempted. But that did not work, and that is what the question is about. — But your answer might be useful for other people with a similar problem.
– KajMagnus
Aug 28 '14 at 18:28
2
It should probably be noted, thatMASTER_LOG_FILE
inCHANGE MASTER
should be taken fromRelay_Master_Log_File
and not fromMaster_Log_File
. Usually they will be the same but it might not be the case always (see percona.com/blog/2008/07/07/… ).
– brablc
Feb 3 '16 at 10:07
@brablc is right.Relay_Master_Log_File
must be used, notMaster_Log_File
. See also: percona.com/blog/2008/07/07/…
– Mircea Vutcovici
Sep 6 '16 at 15:31
in most cases, there's no need forreset slave all
because the master settings don't need to be changed (e.g. master_host, master_user, master_password), only MASTER_LOG_FILE and MASTER_LOG_POS, then areset_slave
should suffice
– ympostor
Jul 14 '17 at 3:15
2
2
Hi, thanks for your answer. If you read the question carefully, you'll notice it says "Relay log corrupted" — that's because we had already used
mysqlbinlog
in the manner you suggest, and found out that the relay log (not the master log) had been corrupted. Concenring the fix you suggest — if you read the question carefully, you'll notice that the fix you suggest is exactly what we had already attempted. But that did not work, and that is what the question is about. — But your answer might be useful for other people with a similar problem.– KajMagnus
Aug 28 '14 at 18:28
Hi, thanks for your answer. If you read the question carefully, you'll notice it says "Relay log corrupted" — that's because we had already used
mysqlbinlog
in the manner you suggest, and found out that the relay log (not the master log) had been corrupted. Concenring the fix you suggest — if you read the question carefully, you'll notice that the fix you suggest is exactly what we had already attempted. But that did not work, and that is what the question is about. — But your answer might be useful for other people with a similar problem.– KajMagnus
Aug 28 '14 at 18:28
2
2
It should probably be noted, that
MASTER_LOG_FILE
in CHANGE MASTER
should be taken from Relay_Master_Log_File
and not from Master_Log_File
. Usually they will be the same but it might not be the case always (see percona.com/blog/2008/07/07/… ).– brablc
Feb 3 '16 at 10:07
It should probably be noted, that
MASTER_LOG_FILE
in CHANGE MASTER
should be taken from Relay_Master_Log_File
and not from Master_Log_File
. Usually they will be the same but it might not be the case always (see percona.com/blog/2008/07/07/… ).– brablc
Feb 3 '16 at 10:07
@brablc is right.
Relay_Master_Log_File
must be used, not Master_Log_File
. See also: percona.com/blog/2008/07/07/…– Mircea Vutcovici
Sep 6 '16 at 15:31
@brablc is right.
Relay_Master_Log_File
must be used, not Master_Log_File
. See also: percona.com/blog/2008/07/07/…– Mircea Vutcovici
Sep 6 '16 at 15:31
in most cases, there's no need for
reset slave all
because the master settings don't need to be changed (e.g. master_host, master_user, master_password), only MASTER_LOG_FILE and MASTER_LOG_POS, then a reset_slave
should suffice– ympostor
Jul 14 '17 at 3:15
in most cases, there's no need for
reset slave all
because the master settings don't need to be changed (e.g. master_host, master_user, master_password), only MASTER_LOG_FILE and MASTER_LOG_POS, then a reset_slave
should suffice– ympostor
Jul 14 '17 at 3:15
add a comment |
I know it's been over a year, but here is what may have happened to this particular problem.
mysql> stop slave;
mysql> reset slave;
mysql> change master to master_host='the-master-host', master_user='replication', master_password='the-password', master_log_file='mysql-bin.000020', master_log_pos=66395191;
mysql> start slave;
That looks like that should have fixed it because it removed the corrupt relay log.
Then, you got a PK error 1062. Why ?
There is an outstanding bug (http://bugs.mysql.com/bug.php?id=60847) that still active in MySQL 5.5
Although the bug relates to using mysql --single-transaction --flush-logs, a related quirk exists.
I have seen that quirk on some EC2 servers running as Slaves for a client just last week in MySQL 5.5.15
On the Master, there was a weird multiple row extended INSERT where each tuple being inserted was a SELECT. What happened was that the LAST_INSERT_ID in the relay log, which forms the next auto increment to assign, was already in use on the Slave because of multiple-row inserts beforehand.
The serialized INSERT in the relay log looked like
INSERT INTO tablname (column,column) VALUES (value,value,...)
The column list did not include the numeric primary key. When the 1062 error came back, I would use the same query it failed on, run the query manually. It did not hit the 1062 error. Then, I ran the usual skip slave commands:
STOP SLAVE;
SET GLOBAL SQL_SLAVE_SKIP_COUNTER=1;
START SLAVE;
SET @sleepnumber = SLEEP(3);
SHOW SLAVE STATUSG
Then, replication caught up.
My advice would be to properly serialize your INSERTs on the Master because this bug-like situation is actually quite avoidable.
add a comment |
I know it's been over a year, but here is what may have happened to this particular problem.
mysql> stop slave;
mysql> reset slave;
mysql> change master to master_host='the-master-host', master_user='replication', master_password='the-password', master_log_file='mysql-bin.000020', master_log_pos=66395191;
mysql> start slave;
That looks like that should have fixed it because it removed the corrupt relay log.
Then, you got a PK error 1062. Why ?
There is an outstanding bug (http://bugs.mysql.com/bug.php?id=60847) that still active in MySQL 5.5
Although the bug relates to using mysql --single-transaction --flush-logs, a related quirk exists.
I have seen that quirk on some EC2 servers running as Slaves for a client just last week in MySQL 5.5.15
On the Master, there was a weird multiple row extended INSERT where each tuple being inserted was a SELECT. What happened was that the LAST_INSERT_ID in the relay log, which forms the next auto increment to assign, was already in use on the Slave because of multiple-row inserts beforehand.
The serialized INSERT in the relay log looked like
INSERT INTO tablname (column,column) VALUES (value,value,...)
The column list did not include the numeric primary key. When the 1062 error came back, I would use the same query it failed on, run the query manually. It did not hit the 1062 error. Then, I ran the usual skip slave commands:
STOP SLAVE;
SET GLOBAL SQL_SLAVE_SKIP_COUNTER=1;
START SLAVE;
SET @sleepnumber = SLEEP(3);
SHOW SLAVE STATUSG
Then, replication caught up.
My advice would be to properly serialize your INSERTs on the Master because this bug-like situation is actually quite avoidable.
add a comment |
I know it's been over a year, but here is what may have happened to this particular problem.
mysql> stop slave;
mysql> reset slave;
mysql> change master to master_host='the-master-host', master_user='replication', master_password='the-password', master_log_file='mysql-bin.000020', master_log_pos=66395191;
mysql> start slave;
That looks like that should have fixed it because it removed the corrupt relay log.
Then, you got a PK error 1062. Why ?
There is an outstanding bug (http://bugs.mysql.com/bug.php?id=60847) that still active in MySQL 5.5
Although the bug relates to using mysql --single-transaction --flush-logs, a related quirk exists.
I have seen that quirk on some EC2 servers running as Slaves for a client just last week in MySQL 5.5.15
On the Master, there was a weird multiple row extended INSERT where each tuple being inserted was a SELECT. What happened was that the LAST_INSERT_ID in the relay log, which forms the next auto increment to assign, was already in use on the Slave because of multiple-row inserts beforehand.
The serialized INSERT in the relay log looked like
INSERT INTO tablname (column,column) VALUES (value,value,...)
The column list did not include the numeric primary key. When the 1062 error came back, I would use the same query it failed on, run the query manually. It did not hit the 1062 error. Then, I ran the usual skip slave commands:
STOP SLAVE;
SET GLOBAL SQL_SLAVE_SKIP_COUNTER=1;
START SLAVE;
SET @sleepnumber = SLEEP(3);
SHOW SLAVE STATUSG
Then, replication caught up.
My advice would be to properly serialize your INSERTs on the Master because this bug-like situation is actually quite avoidable.
I know it's been over a year, but here is what may have happened to this particular problem.
mysql> stop slave;
mysql> reset slave;
mysql> change master to master_host='the-master-host', master_user='replication', master_password='the-password', master_log_file='mysql-bin.000020', master_log_pos=66395191;
mysql> start slave;
That looks like that should have fixed it because it removed the corrupt relay log.
Then, you got a PK error 1062. Why ?
There is an outstanding bug (http://bugs.mysql.com/bug.php?id=60847) that still active in MySQL 5.5
Although the bug relates to using mysql --single-transaction --flush-logs, a related quirk exists.
I have seen that quirk on some EC2 servers running as Slaves for a client just last week in MySQL 5.5.15
On the Master, there was a weird multiple row extended INSERT where each tuple being inserted was a SELECT. What happened was that the LAST_INSERT_ID in the relay log, which forms the next auto increment to assign, was already in use on the Slave because of multiple-row inserts beforehand.
The serialized INSERT in the relay log looked like
INSERT INTO tablname (column,column) VALUES (value,value,...)
The column list did not include the numeric primary key. When the 1062 error came back, I would use the same query it failed on, run the query manually. It did not hit the 1062 error. Then, I ran the usual skip slave commands:
STOP SLAVE;
SET GLOBAL SQL_SLAVE_SKIP_COUNTER=1;
START SLAVE;
SET @sleepnumber = SLEEP(3);
SHOW SLAVE STATUSG
Then, replication caught up.
My advice would be to properly serialize your INSERTs on the Master because this bug-like situation is actually quite avoidable.
answered Jan 2 '15 at 2:04
RolandoMySQLDBARolandoMySQLDBA
142k24223379
142k24223379
add a comment |
add a comment |
[Fixing MySQL replication after slaves' relay log was corrupted]
MySQL replication on slave (version 5.XX) has stopped. Slave_IO_Running was marked as Yes, but Slave_SQL_Running as No. Simple stop/start slave didn’t help so further problem analysis was needed. It seemed that current slave’s relay log was corrupted because testing with “mysqlbinlog” has printed out an error. Therefore, the solution was to discard current relay binlogs and to point slave to the last master binlog position.
To fix the error, current binlog files on slave should be discarded and set new position. Before setting new binlog position it’s important to remember Relay_Master_Log_File and Exec_Master_Log_Pos values:
Relay_Master_Log_File: mysql-bin.002045
Exec_Master_Log_Pos: 103641119
OK, with this values, new binlog position can be set:
# stop slave
mysql> stop slave;
# make slave forget its replication position in the master's binary log
mysql> reset slave;
# change slave to start reading from stopped position
mysql> change master to master_log_file='mysql-bin.002045', master_log_pos=103641119;
# start slave
mysql> start slave;
Just to note that
reset slave
will deletemaster.info
,relay-log.info
and all the relay log files, so it's not needed to clean leftovers in/var/lib/mysql
directory.
add a comment |
[Fixing MySQL replication after slaves' relay log was corrupted]
MySQL replication on slave (version 5.XX) has stopped. Slave_IO_Running was marked as Yes, but Slave_SQL_Running as No. Simple stop/start slave didn’t help so further problem analysis was needed. It seemed that current slave’s relay log was corrupted because testing with “mysqlbinlog” has printed out an error. Therefore, the solution was to discard current relay binlogs and to point slave to the last master binlog position.
To fix the error, current binlog files on slave should be discarded and set new position. Before setting new binlog position it’s important to remember Relay_Master_Log_File and Exec_Master_Log_Pos values:
Relay_Master_Log_File: mysql-bin.002045
Exec_Master_Log_Pos: 103641119
OK, with this values, new binlog position can be set:
# stop slave
mysql> stop slave;
# make slave forget its replication position in the master's binary log
mysql> reset slave;
# change slave to start reading from stopped position
mysql> change master to master_log_file='mysql-bin.002045', master_log_pos=103641119;
# start slave
mysql> start slave;
Just to note that
reset slave
will deletemaster.info
,relay-log.info
and all the relay log files, so it's not needed to clean leftovers in/var/lib/mysql
directory.
add a comment |
[Fixing MySQL replication after slaves' relay log was corrupted]
MySQL replication on slave (version 5.XX) has stopped. Slave_IO_Running was marked as Yes, but Slave_SQL_Running as No. Simple stop/start slave didn’t help so further problem analysis was needed. It seemed that current slave’s relay log was corrupted because testing with “mysqlbinlog” has printed out an error. Therefore, the solution was to discard current relay binlogs and to point slave to the last master binlog position.
To fix the error, current binlog files on slave should be discarded and set new position. Before setting new binlog position it’s important to remember Relay_Master_Log_File and Exec_Master_Log_Pos values:
Relay_Master_Log_File: mysql-bin.002045
Exec_Master_Log_Pos: 103641119
OK, with this values, new binlog position can be set:
# stop slave
mysql> stop slave;
# make slave forget its replication position in the master's binary log
mysql> reset slave;
# change slave to start reading from stopped position
mysql> change master to master_log_file='mysql-bin.002045', master_log_pos=103641119;
# start slave
mysql> start slave;
Just to note that
reset slave
will deletemaster.info
,relay-log.info
and all the relay log files, so it's not needed to clean leftovers in/var/lib/mysql
directory.
[Fixing MySQL replication after slaves' relay log was corrupted]
MySQL replication on slave (version 5.XX) has stopped. Slave_IO_Running was marked as Yes, but Slave_SQL_Running as No. Simple stop/start slave didn’t help so further problem analysis was needed. It seemed that current slave’s relay log was corrupted because testing with “mysqlbinlog” has printed out an error. Therefore, the solution was to discard current relay binlogs and to point slave to the last master binlog position.
To fix the error, current binlog files on slave should be discarded and set new position. Before setting new binlog position it’s important to remember Relay_Master_Log_File and Exec_Master_Log_Pos values:
Relay_Master_Log_File: mysql-bin.002045
Exec_Master_Log_Pos: 103641119
OK, with this values, new binlog position can be set:
# stop slave
mysql> stop slave;
# make slave forget its replication position in the master's binary log
mysql> reset slave;
# change slave to start reading from stopped position
mysql> change master to master_log_file='mysql-bin.002045', master_log_pos=103641119;
# start slave
mysql> start slave;
Just to note that
reset slave
will deletemaster.info
,relay-log.info
and all the relay log files, so it's not needed to clean leftovers in/var/lib/mysql
directory.
edited 8 mins ago
answered Nov 23 '17 at 7:46
Mohamed AyasMohamed Ayas
312
312
add a comment |
add a comment |
You have done it quite right (as other already said).
The only issue is with master.info file (contains information about position in master's mysql-bin.log) because this file is not synced to disc after each query processed.
So your info about positions in master's log is outdated and you are processing already processed queries which needs to be skipped with SET GLOBAL SQL_SLAVE_SKIP_COUNTER=1;
.
Unfortunately, if you use some queries like UPDATE table SET counter=counter+1 WHERE id = 12345
and using binlog_format=STATEMENT
your databases may go out of sync, I think.
You can tell MySQL server to sync master.info after every event by setting up variable sync_master_info but it will probably have huge performance consequences.
add a comment |
You have done it quite right (as other already said).
The only issue is with master.info file (contains information about position in master's mysql-bin.log) because this file is not synced to disc after each query processed.
So your info about positions in master's log is outdated and you are processing already processed queries which needs to be skipped with SET GLOBAL SQL_SLAVE_SKIP_COUNTER=1;
.
Unfortunately, if you use some queries like UPDATE table SET counter=counter+1 WHERE id = 12345
and using binlog_format=STATEMENT
your databases may go out of sync, I think.
You can tell MySQL server to sync master.info after every event by setting up variable sync_master_info but it will probably have huge performance consequences.
add a comment |
You have done it quite right (as other already said).
The only issue is with master.info file (contains information about position in master's mysql-bin.log) because this file is not synced to disc after each query processed.
So your info about positions in master's log is outdated and you are processing already processed queries which needs to be skipped with SET GLOBAL SQL_SLAVE_SKIP_COUNTER=1;
.
Unfortunately, if you use some queries like UPDATE table SET counter=counter+1 WHERE id = 12345
and using binlog_format=STATEMENT
your databases may go out of sync, I think.
You can tell MySQL server to sync master.info after every event by setting up variable sync_master_info but it will probably have huge performance consequences.
You have done it quite right (as other already said).
The only issue is with master.info file (contains information about position in master's mysql-bin.log) because this file is not synced to disc after each query processed.
So your info about positions in master's log is outdated and you are processing already processed queries which needs to be skipped with SET GLOBAL SQL_SLAVE_SKIP_COUNTER=1;
.
Unfortunately, if you use some queries like UPDATE table SET counter=counter+1 WHERE id = 12345
and using binlog_format=STATEMENT
your databases may go out of sync, I think.
You can tell MySQL server to sync master.info after every event by setting up variable sync_master_info but it will probably have huge performance consequences.
answered May 28 '18 at 8:55
DragonnDragonn
112
112
add a comment |
add a comment |
Thanks for contributing an answer to Database Administrators Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdba.stackexchange.com%2fquestions%2f53893%2fmysql-relay-log-corrupted-how-do-i-fix-it-tried-but-failed%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
1
Yes, it looks like it should have worked, and actually it looks like it probably actually did work, as perhaps the original relay log, before the corrupt section, had already done the insert at that master log position, but couldn't advance the displayed master position to the next pointer, since that pointer is stored in the relay log (which was corrupt.) So you might have gotten away with skipping that event and moving to the next event, then verifying that master and slave actually had identical data... I haven't had an opportunity to review the question in sufficient detail, yet.
– Michael - sqlbot
Nov 25 '13 at 15:09
Thanks @Michael-sqlbot, then I think that if this problem happens again, I'll do
SET GLOBAL sql_slave_skip_counter = 1; START SLAVE;
and skip one event on the slave, and hope that helps — does that make sense? If it doesn't help (if there's still a PK error), I'll import a dump with--master-data
again.– KajMagnus
Nov 25 '13 at 15:44