How do I remove rows where some key values have occurred 'before'?

2023-01-14 12:13 问答作者：

I have views which look something like this:

mysql> select * from p2;
+---+---------+---------+
| k | measure | time_id |
+---+---------+---------+
| D |     200 |       2 |
| E |     201 |       2 |
| F |     203 |       2 |
| A |      20 |       1 |
| B |      22 |       1 |
| C |      23 |       1 |
| D |     100 |       1 |
| E |     101 |       1 |
| F |     103 |       1 |
| G |       4 |       1 |
| H |       7 |       1 |
| I |      10 |       1 |
+---+---------+---------+

(k, time_id) is a unique key, and the above is greatly simplified (there w开发者_运维知识库ill be many more values of time_id and k). The sort order is time_id DESC (followed by k ASC, but that's not so important).

I want to find a SELECT statement that will filter it to this:

+---+---------+---------+
| k | measure | time_id |
+---+---------+---------+
| D |     200 |       2 |
| E |     201 |       2 |
| F |     203 |       2 |
| A |      20 |       1 |
| B |      22 |       1 |
| C |      23 |       1 |
| G |       4 |       1 |
| H |       7 |       1 |
| I |      10 |       1 |
+---+---------+---------+

I want to make sure that values for column k are unique, by filtering out rows where the k value has already been used before.

In this example, in the original view rows 0, 1, 2 contained k values D, E and F, but so did rows 6, 7, 8, so rows 6-8 are removed to make the second view.

Is there a SELECT statement that can do this? It feels like it should be straightforward, but I can't figure out how to do it.

  select * from p2 e
      join (select k, Max(time_id) time_id 
            from p2
            group by k) t
      ON (e.k = t.k and e.time_id = t.time_id)

You may want to use a derived table:

SELECT p2.* 
FROM   p2
JOIN   (
           SELECT    MAX(time_id) max_time, k 
           FROM      p2 
           GROUP BY  k
       ) d_p2 ON (d_p2.k = p2.k AND d_p2.max_time = p2.time_id);

Or you could also use the "null-self-join" method:

SELECT    p2.*
FROM      p2
LEFT JOIN p2 AS d_p2 ON d_p2.k = p2.k AND d_p2.time_id > p2.time_id
WHERE     d_p2.k IS NULL;

These should work fine as long as you are sure that time_id is unique for each k. Otherwise you could still get duplicate rows.

Test case:

CREATE TABLE p2 (k char(1), measure int, time_id int);

INSERT INTO p2 VALUES ('D', 200, 2);
INSERT INTO p2 VALUES ('E', 201, 2);
INSERT INTO p2 VALUES ('F', 203, 2);
INSERT INTO p2 VALUES ('A',  20, 1);
INSERT INTO p2 VALUES ('B',  22, 1);
INSERT INTO p2 VALUES ('C',  23, 1);
INSERT INTO p2 VALUES ('D', 100, 1);
INSERT INTO p2 VALUES ('E', 101, 1);
INSERT INTO p2 VALUES ('F', 103, 1);
INSERT INTO p2 VALUES ('G',   4, 1);
INSERT INTO p2 VALUES ('H',   7, 1);
INSERT INTO p2 VALUES ('I',  10, 1);

Result:

+------+---------+---------+
| k    | measure | time_id |
+------+---------+---------+
| D    |     200 |       2 |
| E    |     201 |       2 |
| F    |     203 |       2 |
| A    |      20 |       1 |
| B    |      22 |       1 |
| C    |      23 |       1 |
| G    |       4 |       1 |
| H    |       7 |       1 |
| I    |      10 |       1 |
+------+---------+---------+
9 rows in set (0.00 sec)

继续阅读：sql

How do I remove rows where some key values have occurred 'before'?

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？