Can using the GROUP BY columns as bucketing columns speed up queries?


Suppose a table is created with the following statement:

CREATE TABLE xxx
(
    a int,
    b varchar,
    c varchar,
    d int
)
DISTRIBUTED BY HASH(a,b,c)

Will the following statements run faster as a result, and does the order of a, b, and c matter?

select x from xxx group by a,b,c

select x from xxx group by a,b

select x from xxx group by a,c

1 Answer

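Using the GROUP BY columns as the bucketing columns does give some speedup. How much depends on the workload, so it is best to measure it against your actual business scenario.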
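
One way to reason about the three queries in the question is to check whether all rows of a group are guaranteed to land in the same bucket. The sketch below is a toy model under stated assumptions: `toy_hash`, `NUM_BUCKETS`, and the sample rows are hypothetical and are not the engine's real hash function or planner. It only illustrates that with HASH(a,b,c), grouping by a,b,c (in any order) keeps each group inside one bucket, while grouping by a,b or a,c may spread a group across buckets, so the data probably has to be redistributed before the final aggregation.

```python
from collections import defaultdict

NUM_BUCKETS = 4                   # hypothetical bucket count
BUCKET_KEYS = ["a", "b", "c"]     # mirrors DISTRIBUTED BY HASH(a,b,c)

def toy_hash(values):
    # Toy stand-in for the engine's hash: sum of character codes.
    return sum(ord(ch) for v in values for ch in str(v))

def bucket_of(row, bucket_keys):
    # Bucket that a row is assigned to under hash distribution.
    return toy_hash(row[k] for k in bucket_keys) % NUM_BUCKETS

def is_local(rows, bucket_keys, group_keys):
    # True if every GROUP BY group lives entirely inside one bucket.
    buckets = defaultdict(set)
    for row in rows:
        group = tuple(row[k] for k in group_keys)
        buckets[group].add(bucket_of(row, bucket_keys))
    return all(len(b) == 1 for b in buckets.values())

# Hypothetical sample data for table xxx.
ROWS = [
    {"a": 1, "b": "x", "c": "p", "d": 10},
    {"a": 1, "b": "x", "c": "q", "d": 20},
    {"a": 1, "b": "x", "c": "p", "d": 30},
    {"a": 2, "b": "y", "c": "p", "d": 40},
    {"a": 1, "b": "z", "c": "p", "d": 50},
]

if __name__ == "__main__":
    for keys in (["a", "b", "c"], ["c", "b", "a"], ["a", "b"], ["a", "c"]):
        print(keys, is_local(ROWS, BUCKET_KEYS, keys))
```

In this model the order of the GROUP BY columns makes no difference; what matters is whether they cover every bucketing column. Whether a real optimizer takes advantage of this is something to confirm from the query plan and from measurement.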