Doris2.1.5版本 FE无法启动,频繁报:wait catalog to be ready. feType:UNKNOWN isReady:false

Viewed 520
  1. 手动、单机、部署2.1.5版本,fe和be都未设置priority_networks参数,第一次运行正常,可建表等。
  2. 后来在这台机器上安装了docker(未使用docker环境,仅启动了docker的服务,原本想做其他测试)。
  3. 重启机器后fe无法启动,据说是由于fe主节点ip变化,导致无法启动。
  4. 修改并设置了priority_networks参数,同时在fe中设置了:metadata_failure_recovery=true。
    priority_networks = 192.168.174.0/24
    metadata_failure_recovery=true
  5. 再次启动fe:sh bin/start_fe.sh。
  6. fe依然无法正确启动。
  7. 经过1-5以后,fe的日志如下:
    2024-09-05 20:16:27,389 INFO (main|1) [DorisFE.start():158] Doris FE starting...
    2024-09-05 20:16:27,404 INFO (main|1) [FrontendOptions.analyzePriorityCidrs():194] configured prior_cidrs value: 192.168.174.0/24
    2024-09-05 20:16:27,410 INFO (main|1) [FrontendOptions.initAddrUseIp():101] local address: /192.168.174.88.
    2024-09-05 20:16:27,796 INFO (main|1) [ConsistencyChecker.initWorkTime():105] consistency checker will work from 23:00 to 23:00
    2024-09-05 20:16:27,991 ERROR (main|1) [Util.report():128] SLF4J: Class path contains multiple SLF4J bindings.
    2024-09-05 20:16:27,992 ERROR (main|1) [Util.report():128] SLF4J: Found binding in [jar:file:/home/doris/software/apache-doris/fe/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
    2024-09-05 20:16:27,992 ERROR (main|1) [Util.report():128] SLF4J: Found binding in [jar:file:/home/doris/software/apache-doris/fe/lib/log4j-slf4j-impl-2.18.0.jar!/org/slf4j/impl/StaticLoggerBinder.class]
    2024-09-05 20:16:27,993 ERROR (main|1) [Util.report():128] SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
    2024-09-05 20:16:28,008 ERROR (main|1) [Util.report():128] SLF4J: Actual binding is of type [org.slf4j.impl.Reload4jLoggerFactory]
    2024-09-05 20:16:28,943 INFO (main|1) [PrivTable.addEntry():89] add priv entry: Node_priv,Admin_priv
    2024-09-05 20:16:28,944 INFO (main|1) [PrivTable.addEntry():89] add priv entry: Admin_priv
    2024-09-05 20:16:28,967 INFO (main|1) [PrivTable.addEntry():89] add priv entry: database privilege.ctl: internal, db: information_schema, priv: Select_priv
    2024-09-05 20:16:28,967 INFO (main|1) [PrivTable.addEntry():89] add priv entry: database privilege.ctl: internal, db: mysql, priv: Select_priv
    2024-09-05 20:16:28,968 INFO (main|1) [PrivTable.addEntry():89] add priv entry: origWorkloadGroup:normal, priv:Usage_priv
    2024-09-05 20:16:28,968 INFO (main|1) [Auth.createUserInternal():498] finished to create user: 'root'@'%', is replay: true
    2024-09-05 20:16:28,969 INFO (main|1) [PrivTable.addEntry():89] add priv entry: database privilege.ctl: internal, db: information_schema, priv: Select_priv
    2024-09-05 20:16:28,969 INFO (main|1) [PrivTable.addEntry():89] add priv entry: database privilege.ctl: internal, db: mysql, priv: Select_priv
    2024-09-05 20:16:28,969 INFO (main|1) [PrivTable.addEntry():89] add priv entry: origWorkloadGroup:normal, priv:Usage_priv
    2024-09-05 20:16:28,969 INFO (main|1) [Auth.createUserInternal():498] finished to create user: 'admin'@'%', is replay: true
    2024-09-05 20:16:28,972 INFO (main|1) [AuthenticatorManager.():42] authenticate type: DEFAULT
    2024-09-05 20:16:29,161 INFO (main|1) [MTMVService.registerHook():66] registerHook: MTMVJobManager
    2024-09-05 20:16:29,162 INFO (main|1) [MTMVService.registerHook():66] registerHook: MTMVRelationManager
    2024-09-05 20:16:29,179 INFO (main|1) [Env.getSelfHostPort():1372] get self node: HostInfo{host='192.168.174.88', port=9010}
    2024-09-05 20:16:29,180 INFO (main|1) [Env.getHelperNodes():1426] get helper nodes: [HostInfo{host='192.168.174.88', port=9010}]
    2024-09-05 20:16:29,192 INFO (main|1) [Env.getClusterIdAndRole():1297] finished to get cluster id: 305254781, isElectable: true, role: FOLLOWER and node name: fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1
    2024-09-05 20:16:29,206 INFO (main|1) [MetaReader.read():72] start load image from /home/doris/software/apache-doris/fe/doris-meta/image/image.37. is ckpt: false
    2024-09-05 20:16:29,550 INFO (main|1) [MetaHeader.read():74] Image header length: 21, format: COR1.
    2024-09-05 20:16:29,552 INFO (main|1) [MetaFooter.read():84] Image footer length: 778, indices: [header:29, masterInfo:50, frontends:122, backends:265, datasource:269, db:349, alterJob:512, recycleBin:548, globalVariable:564, cluster:10442, broker:10442, resources:10446, exportJob:10476, syncJob:10480, backupHandler:10484, paloAuth:10492, transactionState:15068, colocateTableIndex:15084, routineLoadJobs:15092, loadJobV2:15096, smallFiles:15100, plugins:15104, deleteHandler:15108, sqlBlockRule:15108, policy:15145, globalFunction:15431, workloadGroups:15451, binlogs:15798, resourceGroups:15798, AnalysisMgrV2:16145, AsyncJobManager:16157, workloadSchedPolicy:16161, insertOverwrite:16187, plsql:16271]
    2024-09-05 20:16:29,559 INFO (main|1) [Env.loadHeaderCOR1():1986] finished replay header from image
    2024-09-05 20:16:29,627 INFO (main|1) [Env.loadMasterInfo():1995] finished replay masterInfo from image
    2024-09-05 20:16:29,640 INFO (main|1) [Env.replayAddFrontend():3905] replay add frontend: name: fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1, role: FOLLOWER, 192.168.122.1:9010, is alive: false
    2024-09-05 20:16:29,641 INFO (main|1) [Env.loadFrontends():2012] finished replay frontends from image
    2024-09-05 20:16:29,941 INFO (main|1) [Env.loadCatalog():2251] finished replay catalog from image
    2024-09-05 20:16:29,952 INFO (main|1) [InternalCatalog.loadDb():3470] finished replay databases from image
    2024-09-05 20:16:29,953 INFO (main|1) [Env.loadAlterJob():2055] finished replay alterJob from image
    2024-09-05 20:16:29,953 INFO (main|1) [Env.loadRecycleBin():2144] finished replay recycleBin from image
    2024-09-05 20:16:29,971 INFO (main|1) [Env.loadGlobalVariable():2151] finished replay globalVariable from image
    2024-09-05 20:16:29,972 INFO (main|1) [MetaReader.read():96] Skip cluster module since empty meta length.
    2024-09-05 20:16:29,972 INFO (main|1) [Env.loadBrokers():5371] finished replay brokerMgr from image
    2024-09-05 20:16:29,972 INFO (main|1) [Env.loadResources():2194] finished replay resources from image
    2024-09-05 20:16:29,972 INFO (main|1) [Env.loadExportJob():2038] finished replay exportJob from image
    2024-09-05 20:16:29,973 INFO (main|1) [Env.loadSyncJobs():2046] finished replay syncJobMgr from image
    2024-09-05 20:16:29,973 INFO (main|1) [Env.loadBackupHandler():2110] finished replay backupHandler from image
    2024-09-05 20:16:29,977 INFO (main|1) [PrivTable.addEntry():89] add priv entry: database privilege.ctl: internal, db: mysql, priv: Select_priv
    2024-09-05 20:16:29,978 INFO (main|1) [PrivTable.addEntry():89] add priv entry: database privilege.ctl: internal, db: information_schema, priv: Select_priv
    2024-09-05 20:16:29,978 INFO (main|1) [PrivTable.addEntry():89] add priv entry: origWorkloadGroup:normal, priv:Usage_priv
    2024-09-05 20:16:29,978 INFO (main|1) [PrivTable.addEntry():89] add priv entry: Admin_priv
    2024-09-05 20:16:29,979 INFO (main|1) [PrivTable.addEntry():89] add priv entry: database privilege.ctl: internal, db: mysql, priv: Select_priv
    2024-09-05 20:16:29,980 INFO (main|1) [PrivTable.addEntry():89] add priv entry: database privilege.ctl: internal, db: information_schema, priv: Select_priv
    2024-09-05 20:16:29,980 INFO (main|1) [PrivTable.addEntry():89] add priv entry: origWorkloadGroup:normal, priv:Usage_priv
    2024-09-05 20:16:29,981 INFO (main|1) [PrivTable.addEntry():89] add priv entry: Node_priv,Admin_priv
    2024-09-05 20:16:29,986 INFO (main|1) [Env.loadAuth():2123] finished replay auth from image
    2024-09-05 20:16:29,986 INFO (main|1) [Env.loadTransactionState():2131] finished replay transactionState from image
    2024-09-05 20:16:29,986 INFO (main|1) [Env.loadColocateTableIndex():2164] finished replay colocateTableIndex from image
    2024-09-05 20:16:29,986 INFO (main|1) [Env.loadRoutineLoadJobs():2170] finished replay routineLoadJobs from image
    2024-09-05 20:16:29,986 INFO (main|1) [Env.loadLoadJobsV2():2176] finished replay loadJobsV2 from image
    2024-09-05 20:16:29,987 INFO (main|1) [Env.loadSmallFiles():2218] finished replay smallFiles from image
    2024-09-05 20:16:29,987 INFO (main|1) [Env.loadPlugins():5725] finished replay plugins from image
    2024-09-05 20:16:29,987 INFO (main|1) [MetaReader.read():96] Skip deleteHandler module since empty meta length.
    2024-09-05 20:16:29,989 INFO (main|1) [Env.loadSqlBlockRule():2226] finished replay sqlBlockRule from image
    2024-09-05 20:16:29,995 INFO (main|1) [Env.loadPolicy():2237] finished replay policy from image
    2024-09-05 20:16:29,996 INFO (main|1) [Env.loadGlobalFunction():2260] finished replay global function from image
    2024-09-05 20:16:30,006 INFO (main|1) [Env.loadWorkloadGroups():2200] finished replay workload groups from image
    2024-09-05 20:16:30,006 INFO (main|1) [MetaReader.read():96] Skip binlogs module since empty meta length.
    2024-09-05 20:16:30,007 INFO (main|1) [Env.loadWorkloadGroups():2200] finished replay workload groups from image
    2024-09-05 20:16:30,008 INFO (main|1) [Env.loadAnalysisManager():2266] finished replay AnalysisMgr from image
    2024-09-05 20:16:30,008 INFO (main|1) [Env.loadAsyncJobManager():2182] finished replay asyncJobMgr from image
    2024-09-05 20:16:30,015 INFO (main|1) [Env.loadWorkloadSchedPolicy():2206] finished replay workload sched policy from image
    2024-09-05 20:16:30,017 INFO (main|1) [Env.loadInsertOverwrite():2272] finished replay iot from image
    2024-09-05 20:16:30,020 INFO (main|1) [Env.loadPlsqlProcedure():2212] finished replay plsql procedure from image
    2024-09-05 20:16:30,020 INFO (main|1) [MetaReader.read():136] finished to load image in 813 ms
    2024-09-05 20:16:50,817 WARN (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.notifyNewFETypeTransfer():2667] notify new FE type transfer: UNKNOWN
    2024-09-05 20:16:50,817 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [LogUtils.stdout():50] StdoutLogger 2024-09-05 20:16:50,817 notify new FE type transfer: UNKNOWN
    2024-09-05 20:16:50,854 INFO (stateListener|88) [Env$5.runOneCycle():2690] begin to transfer FE type from INIT to UNKNOWN
    2024-09-05 20:16:50,854 INFO (stateListener|88) [Env$5.runOneCycle():2777] finished to transfer FE type to UNKNOWN
    2024-09-05 20:16:50,961 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:1 reason:
    2024-09-05 20:17:01,429 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:101 reason:
    2024-09-05 20:17:11,910 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:201 reason:
    2024-09-05 20:17:22,381 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:301 reason:
    2024-09-05 20:17:37,575 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:401 reason:
    2024-09-05 20:17:48,130 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:501 reason:
    2024-09-05 20:17:58,729 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:601 reason:
    2024-09-05 20:18:09,351 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:701 reason:
    2024-09-05 20:18:19,843 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:801 reason:
    2024-09-05 20:18:34,116 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:901 reason:
    2024-09-05 20:18:45,101 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:1001 reason:
    2024-09-05 20:18:55,720 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:1101 reason:
    2024-09-05 20:19:06,056 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:1201 reason:
    2024-09-05 20:19:16,368 INFO (UNKNOWN fe_ae692d73_7c0d_447b_aa84_1f1333a7daa1(-1)|1) [Env.waitForReady():1082] wait catalog to be ready. feType:UNKNOWN isReady:false, counter:1301 reason:
2 Answers

priority_networks = 机器具体ip/32
设置成这样试试呢

日志:notify new FE type transfer: UNKNOWN
看这块日志显示无法确定当前FE的类型,正常这块应该显示为FOLLOWER,如下
notify new FE type transfer: FOLLOWER

怀疑是没有加FE_ID,让其无法辨识自己角色,官网中,如果是单机模式,使用docker启动的话,会加这一参数,意思是让FE知道自己是Master,不用做之后选举操作。

如下图,建议添加该启动参数
image.png

官网地址:https://doris.apache.org/zh-CN/docs/2.1/install/cluster-deployment/run-docker-cluster