Deleting duplicate records in join table

I have a HABTM association between user and role.

A user can have the admin role (role_id = 1) or the regular user role (role_id = 2).

In the join table, roles_users, I have some redundant records, where the same role/user pair is stored more than once.

I want to remove the duplicate records such as 1:1, 2:4.

Two questions:

  1. Where's the best place to execute the sql script that removes the dups -- migration? script?

  2. What is the sql query to remove the dups?


CREATE TABLE roles_users2 LIKE roles_users; -- this ensures indexes are preserved
INSERT INTO roles_users2 SELECT DISTINCT * FROM roles_users; 
DROP TABLE roles_users;
RENAME TABLE roles_users2 TO roles_users;
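
After the rename, it may be worth confirming that no duplicate pairs are left. The query below is only a sketch and assumes the join table holds just the role_id and user_id columns mentioned in the question. If it has extra columns (timestamps, for example), SELECT DISTINCT * compares whole rows and can leave the same pair in place more than once.

```sql
-- Lists every (role_id, user_id) pair that occurs more than once.
-- An empty result means the table is free of duplicate pairs.
SELECT role_id, user_id, COUNT(*) AS copies
FROM roles_users
GROUP BY role_id, user_id
HAVING COUNT(*) > 1;
```

Running the same query against the original table beforehand shows which pairs are affected and how many copies of each exist.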

And for the future, to prevent duplicate rows:

ALTER TABLE roles_users ADD UNIQUE INDEX (role_id, user_id);

Or, on older MySQL versions, you can do all of it in one step with ALTER IGNORE TABLE (the IGNORE clause was deprecated in MySQL 5.6.17 and removed in 5.7.4):

ALTER IGNORE TABLE roles_users ADD UNIQUE INDEX (role_id, user_id);

IGNORE is a MySQL extension to standard SQL. It controls how ALTER TABLE works if there are duplicates on unique keys in the new table or if warnings occur when strict mode is enabled. If IGNORE is not specified, the copy is aborted and rolled back if duplicate-key errors occur. If IGNORE is specified, only the first row is used of rows with duplicates on a unique key. The other conflicting rows are deleted. Incorrect values are truncated to the closest matching acceptable value.


The simplest is to copy the data into a new table, minus the duplicates:

CREATE TABLE roles_users2 AS
SELECT DISTINCT * FROM roles_users;

You can then choose one of the following:

  • Drop the old table, rename the new table to the old name and add indexes.
  • Truncate the old table and insert the rows from roles_users2 back into roles_users.
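
The second option could look roughly like this. It is a sketch, assuming roles_users2 was created as shown above and has the same column order as roles_users:

```sql
-- Empty the original table; its column definitions and indexes stay in place.
TRUNCATE TABLE roles_users;

-- Copy the de-duplicated rows back from the temporary copy.
INSERT INTO roles_users SELECT * FROM roles_users2;

-- The copy is no longer needed.
DROP TABLE roles_users2;
```

Keeping the original table means its indexes do not have to be recreated. Note that TRUNCATE TABLE causes an implicit commit in MySQL and cannot be rolled back, so it is safer to check the contents of roles_users2 before running it.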