【已记录】doris从2.0.3升级到2.1.0后,一致报警告信息

Viewed 236

doris从2.0.3升级到2.1.0后,be后台一致报如下错误,感觉是report_runtime_query_statistics收集信息连接不上fe似的,请问是哪里配置不对吗?

W20240319 13:31:41.643570 849447 doris_main.cpp:123] thrift internal message: TSocket::open() getaddrinfo() <Host: Port: 0>Name or service not known
W20240319 13:31:41.643606 849447 status.h:380] meet error status: [THRIFT_RPC_ERROR]Couldn't open transport for :0 (Could not resolve host for client socket.)

    0#  doris::ThriftClientImpl::open() at /var/local/ldb_toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187
    1#  doris::ClientCacheHelper::_create_client(doris::TNetworkAddress const&, std::function<doris::ThriftClientImpl* (doris::TNetworkAddress const&, void**)>&, void**, int) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:449
    2#  doris::ClientCacheHelper::get_client(doris::TNetworkAddress const&, std::function<doris::ThriftClientImpl* (doris::TNetworkAddress const&, void**)>&, void**, int) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:449
    3#  doris::ClientConnection<doris::FrontendServiceClient>::ClientConnection(doris::ClientCache<doris::FrontendServiceClient>*, doris::TNetworkAddress const&, int, doris::Status*, int) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:345
    4#  doris::RuntimeQueryStatiticsMgr::report_runtime_query_statistics() at /home/zcp/repo_center/doris_release/doris/be/src/runtime/runtime_query_statistics_mgr.cpp:83
    5#  doris::Daemon::report_runtime_query_statistics_thread() at /var/local/ldb_toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/chrono:510
    6#  doris::Thread::supervise_thread(void*) at /var/local/ldb_toolchain/bin/../usr/include/pthread.h:562
    7#  ?
    8#  ?
5 Answers

是否有配host,使用ip是否正常;
为更具体地了解问题(处理后会更新回帖),可以➕我一下W:yz-jayhua

thrift连接有问题,其他正常查询正常吗?

这个问题还在吗?

在,一致存在,查询正常。不知道哪个配置有问题,我be是配置的hostname

我试了下2.1.1的最新的rc版本,没有复现这个问题
2.1.0有多个rc版本,可以提供下具体的rc版本号,我这边复现下
另外可以通过执行以下语句确认是否有汇报成功的情况

mysql [information_schema]>select * from information_schema.backend_active_tasks\G;
*************************** 1. row ***************************
                    BE_ID: 10013
                  FE_HOST: 
                 QUERY_ID: 5facfdaee4734365-b7b7a13511d0cdf8
             TASK_TIME_MS: 4
         TASK_CPU_TIME_MS: 0
                SCAN_ROWS: 0
               SCAN_BYTES: 0
     BE_PEAK_MEMORY_BYTES: 4096
CURRENT_USED_MEMORY_BYTES: 4096
       SHUFFLE_SEND_BYTES: 0
        SHUFFLE_SEND_ROWS: 0
1 row in set (0.02 sec)

如果这个列表有数据,说明有汇报成功的情况,说明可能有部分查询确实没有带fe的信息因此汇报失败。
如果这个表一直为空,说明就没有汇报成功过。