Sql server 2008 r2 - redundant rows

2023-03-23 06:14 问答作者：

The first query

   --645 rows
    SELECT  *
            FROM    (
                      SELECT DISTINCT
                                cu.*,
                                ROW_NUMBER() OVER ( ORDER BY cu.Id ) AS RowNum
                      FROM      Customers cu
                                LEFT JOIN dbo.CustomerCategories cc
                                ON cc.CustomerId = cu.Id
                                LEFT JOIN dbo.CustomerServices cs
                                 ON cs.CustomerId = cu.Id
                      WHERE     ( @FullName IS NULL
                                  OR cu.FullName LIKE @FullName
                                )
                                AND ( @CategoriesIdXml IS   NULL
                                      OR cc.CategoryId IN ( SELECT  *
                                                            FROM    @CategoriesList )
                                    )
                                AND ( @ServicesIdXml IS NULL
                                      OR cs.ServiceId IN ( SELECT   *
                                                           FROM     @ServicesList )
                                    ) 

                                    ) AS _
            WHERE   RowNum BETWEEN ( @PageIndex - 1 ) * @PageSize + 1
                           AND     @PageIndex * @PageSiz开发者_JAVA技巧e

The second query

 --41 rows
    SELECT  *
        FROM    (
                  SELECT DISTINCT
                            cu.*,
                            ROW_NUMBER() OVER ( ORDER BY cu.Id ) AS RowNum
                  FROM      Customers cu
                  --          LEFT JOIN dbo.CustomerCategories cc
                  --          ON cc.CustomerId = cu.Id
                  --          LEFT JOIN dbo.CustomerServices cs
                  --           ON cs.CustomerId = cu.Id
                  --WHERE     ( @FullName IS NULL
                  --            OR cu.FullName LIKE @FullName
                  --          )
                  --          AND ( @CategoriesIdXml IS   NULL
                  --                OR cc.CategoryId IN ( SELECT  *
                  --                                      FROM    @CategoriesList )
                  --              )
                  --          AND ( @ServicesIdXml IS NULL
                  --                OR cs.ServiceId IN ( SELECT   *
                  --                                     FROM     @ServicesList )
                  --              ) 

                                ) AS _
        WHERE   RowNum BETWEEN ( @PageIndex - 1 ) * @PageSize + 1
                       AND     @PageIndex * @PageSize

The second query returns right result set (41 rows), but the first returns 645 rows which is wrong. But I use DISTINCT in both queries and I wonder why first returns too much rows.

How do I avoid it?

The DISTINCT is being applied after the creation of the ROW_NUMBER()

As ROW_NUMBER() is different for every row, every row is unique by definition. This means that you appear to have a few options.

Apply the Distinct in one query, then wrap another around it for ROW_NUMBER()

SELECT
  *
FROM
(
  SELECT
    *,
    ROW_NUMBER() OVER (ORDER BY id) AS row_num
  FROM
  (
    SELECT DISTINCT
      cu.*
    FROM
      <your query>
  )
    AS raw_data
)
  AS ordered_data
WHERE
  RowNum BETWEEN ( @PageIndex - 1 ) * @PageSize + 1
             AND   @PageIndex       * @PageSize

Use GROUP BY instead of DISTINCT

SELECT
  *
FROM
(
  SELECT DISTINCT
    cu.*,
    ROW_NUMBER() OVER (ORDER BY id) AS row_num
  FROM
    <your query>
  GROUP BY
    cu.id,
    cu.field1,
    cu.field2,
    etc, etc
)
  AS ordered_data
WHERE
  RowNum BETWEEN ( @PageIndex - 1 ) * @PageSize + 1
             AND   @PageIndex       * @PageSize

ROW_NUMBER is not right, use DENSE_RANK instead.

You can see difference here : Difference between ROW_NUMBER, RANK and DENSE_RANK

ROW_NUMBER will give you different number for the same Customer, and this is not what you want, you need the same value so that your distinct could work.

继续阅读：sql-server sql-server-2008-r2

Sql server 2008 r2 - redundant rows

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？