what is the quickest way to run a query to find where 2 fields are the same

2023-01-08 19:49 问答作者：

i have a table with id, first, last and i want to run a query that says

give me every record where the combination of first and last exists more than once

(i am trying to find duplicate records)

EDIT

Concatenation will give out false answers as pointed out in the comments ('Roberto Neil' vs 'Robert ONeil'.

Here is an answer that eliminates the concatenation issue. I found out the non duplicates and eliminated them from the final answer.

WITH MyTable AS
(
    SELECT 1 as ID, 'John' as FirstName, 'Doe' as LastName
    UNION
    SELECT 2 as ID, 'John' as FirstName, 'Doe' as LastName
    UNION
    SELECT 3 as ID, 'Tim' as FirstName, 'Doe' as LastName
    UNION
    SELECT 4 as ID, 'Jane' as FirstName, 'Doe' as LastName
    UNION
    SELECT 5 as ID, 'Jane' as FirstName, 'Doe' as LastName
)
SELECT Id, FirstName, LastName
FROM MyTable SelectTable
WHERE Id Not In
(
    SELECT Min (Id)
    From MyTable SearchTable
    GROUP BY FirstName, LastName
    HAVING COUNT (*) = 1
)

OLD SOLUTION

Use GROUP BY and HAVING.. check out this working sample

WITH MyTable AS
(
SELECT 1 as ID, 'John' as FirstName, 'Doe' as LastName
UNION
SELECT 2 as ID, 'John' as FirstName, 'Doe' as LastName
UNION
SELECT 3 as ID, 'Time' as FirstName, 'Doe' as LastName
UNION
SELECT 4 as ID, 'Jane' as FirstName, 'Doe' as LastName
)
SELECT ID, FirstName, LastName
FROM MyTable
WHERE FirstName + LastName IN
(
    SELECT FirstName + LastName
    FROM MyTable
    GROUP BY FirstName + LastName
    HAVING COUNT (*) > 1
)

This will result in the following

ID          FirstName LastName
----------- --------- --------
1           John      Doe
2           John      Doe

You can also use windowing functions. This will perform slightly better than Raj More's solution:

with MyTable as
(
    select 1 as ID, 'John' as FirstName, 'Doe' as LastName
    union
    select 2 as ID, 'John' as FirstName, 'Doe' as LastName
    union
    select 3 as ID, 'Time' as FirstName, 'Doe' as LastName
    union
    select 4 as ID, 'Jane' as FirstName, 'Doe' as LastName
)
select * 
from (
    select *, cnt = count(*) over ( partition by FirstName, LastName )
    from MyTable
) x
where x.cnt > 1

Here are two possible solutions. Which is faster will likely depend on your indexes and data, so try both and see which works better for you. In most cases though, the first query will be faster I believe.

SELECT
    T1.id
FROM
    My_Table T1
INNER JOIN
(
    SELECT
        first_name,
        last_name
    FROM
        My_Table T2
    GROUP BY
        first_name,
        last_name
    HAVING
        COUNT(*) > 1
) SQ ON
    SQ.first_name = T1.first_name AND
    SQ.last_name = T1.last_name

SELECT
    T1.id
FROM
    My_Table T1
WHERE
    EXISTS
    (
        SELECT *
        FROM
            My_Table T2
        WHERE
            T2.first_name = T1.first_name AND
            T2.last_name = T1.last_name AND
            T2.id <> T1.id
    )

SELECT count(*) FROM table HAVING count(*) > 1 GROUP BY concat(first, last)

Untested:

SELECT name, count(*) from (
   SELECT id, first+last as [name]
   from table) t
HAVING count(*) >1

继续阅读：sql sql-server

what is the quickest way to run a query to find where 2 fields are the same

EDIT

OLD SOLUTION

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？

EDIT

OLD SOLUTION

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集 河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？