2.1.5,k8s版本偶发性错误: failed to get file size xxx

Viewed 137

版本信息:
k8s版本 2.1.5 在一台机器上安装了1个fe节点,2个be节点

错误信息:

SQL 错误 [1105] [HY000]: errCode = 2, detailMessage = (doriscluster-be-1.doriscluster-be-internal.default.svc.cluster.local)[CANCELLED]failed to initialize storage reader. tablet=281090, res=[NOT_FOUND]failed to get file size /opt/apache-doris/be/storage/data/503/281090/1023168322/0200000000000cf55e42960219c4e8766e8993773e2b4e91_0.dat, backend=doriscluster-be-1.doriscluster-be-internal.default.svc.cluster.local
4 Answers

刚重新执行查询命令

SELECT x.* FROM doris_db.c_0ac9c1161bc6467692e691bb0132785b x LIMIT 0, 200

还是抛出异常,但是异常中的tablet变了

SQL 错误 [1105] [HY000]: errCode = 2, detailMessage = (doriscluster-be-0.doriscluster-be-internal.default.svc.cluster.local)[CANCELLED]
failed to get file size /opt/apache-doris/be/storage/data/20/282917/362329311/02000000000019ed5d4912c24f3efc12b3d19a2983f56f80_0.dat

而且不固定,有的时候是282914,有的时候是282917:


show tablet:

DetailCMD

目录下文件情况:

从页面看该表情况:


please convert the issue to a kernel developer to fix it.

你的表都是几副本的,这块应该是查询的副本出现问题了,看看这个文件还在吗
/opt/apache-doris/be/storage/data/503/281090/1023168322/0200000000000cf55e42960219c4e8766e8993773e2b4e91_0.dat

然后可以看看show tablet 281090 ,然后执行最后的detailcmd 看看这个tablet的状态

刚重新执行查询命令

SELECT x.* FROM doris_db.c_0ac9c1161bc6467692e691bb0132785b x LIMIT 0, 200

还是抛出异常,但是异常中的tablet变了

SQL 错误 [1105] [HY000]: errCode = 2, detailMessage = (doriscluster-be-0.doriscluster-be-internal.default.svc.cluster.local)[CANCELLED]
failed to get file size /opt/apache-doris/be/storage/data/20/282917/362329311/02000000000019ed5d4912c24f3efc12b3d19a2983f56f80_0.dat

而且不固定,有的时候是282914,有的时候是282917:


show tablet:

DetailCMD

目录下文件情况:

从页面看该表情况: