doris 查询每次hive数据得到的结果不一致

Viewed 57

doris版本:3.0.2
hive版本:3.1.2

hive_catalog 创建语句
CREATE CATALOG hive PROPERTIES (
'type'='hms',
'hive.metastore.uris' = 'thrift://172.0.0.1:9083',
'hadoop.username' = 'hive',
'dfs.nameservices'='your-nameservice',
'dfs.ha.namenodes.your-nameservice'='nn1,nn2',
'dfs.namenode.rpc-address.your-nameservice.nn1'='172.21.0.2:8088',
'dfs.namenode.rpc-address.your-nameservice.nn2'='172.21.0.3:8088',
'dfs.client.failover.proxy.provider.your-nameservice'='org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider'
);

hive表结构
create table A(
vid string
)partitioned by(datekey string,event string)
stored as parquet;

查询语句,得到的结果有时会不一致
select datekey,count(1) from A where datekey>='20250301' and datekey<='20250331' and vid ='xxxxxx' group by datekey order by datekey;

通过explain查看执行计划,发现approximate每次结果不一致。
image.png

image.png
注:该表数据没有更新

1 Answers

麻烦补齐下脱敏后的hive catalog语句以及对比描述