Duplicate Entries MySQL with large data?

I'm using the following query to try to identify duplicate data in a table with around 10 million rows:

SELECT *
FROM db.tablename x
JOIN db.tablename z
  ON x.columnA = z.columnA
WHERE x.columnB > z.columnB

The problem is that the table doesn't have a primary key, because the column that would be the primary key contains duplicated values. The above query is hugely slow and I can't figure out any way to make it more efficient.

Adding LIMIT 100 still doesn't seem to help.

Any ideas?


To find the duplicate values you could:

select   columnA
from     table
group by columnA
having   count(*) > 1

Depending on what you then want to do, you could put the results in a temporary table, as sketched below.
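For instance, a rough sketch of that idea, assuming MySQL and the table/column names from the question (the dup_keys name is just for illustration):

-- materialise the duplicated key values once
create temporary table dup_keys as
select   columnA
from     db.tablename
group by columnA
having   count(*) > 1;

-- an index on the temp table keeps the join back cheap
alter table dup_keys add index (columnA);

-- then pull the full rows for just those keys
select t.*
from   db.tablename t
join   dup_keys d on d.columnA = t.columnA;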

But it seems strange that you have a concept of identity (even if it is not 100% correct all the time) and yet no index on that data - would you not want to do lookups on this field fairly often? Perhaps you could create a non-unique index on columnA, at least while you run your query.
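Something along these lines, again using the names from the question (the composite index is my own suggestion, so the columnB comparison in the self-join can use the index as well):

-- non-unique index on the join column
create index idx_columnA on db.tablename (columnA);

-- or, if the self-join above is the main use case, a composite index
-- also helps with the x.columnB > z.columnB comparison
create index idx_columnA_columnB on db.tablename (columnA, columnB);

You could then run the original query through EXPLAIN to confirm the join is actually using the new index.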

