【已解决】doris fe隔段时间就会宕机

Viewed 90

be宕机会导致fe宕机吗?错误信息如下:

2024-04-28 13:07:49,375 WARN (replayer|104) [ExternalCatalog.getDbNullable():373] failed to get db middle in catalog hive
java.lang.RuntimeException: failed to forward init catalog hive operation to master.
at org.apache.doris.common.util.Util.logAndThrowRuntimeException(Util.java:629) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.datasource.ExternalCatalog.makeSureInitialized(ExternalCatalog.java:168) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.datasource.ExternalCatalog.getDbNullable(ExternalCatalog.java:371) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.datasource.ExternalCatalog.getDbNullable(ExternalCatalog.java:72) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.datasource.CatalogIf.getDbOrException(CatalogIf.java:104) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.datasource.CatalogIf.getDbOrAnalysisException(CatalogIf.java:140) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.catalog.external.ExternalTable.makeSureInitialized(ExternalTable.java:115) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.catalog.external.HMSExternalTable.makeSureInitialized(HMSExternalTable.java:132) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.catalog.external.HMSExternalTable.getPartitionColumnTypes(HMSExternalTable.java:202) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.datasource.ExternalMetaCacheMgr.addPartitionsCache(ExternalMetaCacheMgr.java:171) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.datasource.CatalogMgr.replayAddExternalPartitions(CatalogMgr.java:961) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.persist.EditLog.loadJournal(EditLog.java:994) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.catalog.Env.replayJournal(Env.java:2525) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.catalog.Env$3.runOneCycle(Env.java:2302) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.common.util.Daemon.run(Daemon.java:116) ~[doris-fe.jar:1.2-SNAPSHOT]
Caused by: java.lang.NullPointerException
at org.apache.doris.qe.MasterCatalogExecutor.forward(MasterCatalogExecutor.java:69) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.datasource.ExternalCatalog.makeSureInitialized(ExternalCatalog.java:166) ~[doris-fe.jar:1.2-SNAPSHOT]
... 13 more
2024-04-28 13:07:49,375 WARN (replayer|104) [Util.logAndThrowRuntimeException():628] Exception to get db middle
org.apache.doris.common.AnalysisException: errCode = 2, detailMessage = Unknown database 'middle'

org.apache.thrift.transport.TTransportException: java.net.ConnectException: 拒绝连接 (Connection refused)
at org.apache.thrift.transport.TSocket.open(TSocket.java:255) ~[libthrift-0.16.0.jar:0.16.0]
at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:143) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:126) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.commons.pool2.BaseKeyedPooledObjectFactory.makeObject(BaseKeyedPooledObjectFactory.java:62) ~[commons-pool2-2.2.jar:2.2]
at org.apache.commons.pool2.impl.GenericKeyedObjectPool.create(GenericKeyedObjectPool.java:1012) ~[commons-pool2-2.2.jar:2.2]
at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:356) ~[commons-pool2-2.2.jar:2.2]
at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:277) ~[commons-pool2-2.2.jar:2.2]
at org.apache.doris.common.GenericPool.borrowObject(GenericPool.java:95) ~[doris-fe.jar:1.2-SNAPSHOT]
at org.apache.doris.catalog.TabletStatMgr.lambda$null$0(TabletStatMgr.java:64) ~[doris-fe.jar:1.2-SNAPSHOT]
at java.util.stream.ForEachOps$ForEachOp$OfRef.accept(ForEachOps.java:184) ~[?:1.8.0_181]
at com.google.common.collect.CollectSpliterators$1WithCharacteristics.lambda$forEachRemaining$1(CollectSpliterators.java:72) ~[guava-32.1.2-jre
.jar:?]
at java.util.stream.Streams$RangeIntSpliterator.forEachRemaining(Streams.java:110) ~[?:1.8.0_181]
at com.google.common.collect.CollectSpliterators$1WithCharacteristics.forEachRemaining(CollectSpliterators.java:72) ~[guava-32.1.2-jre.jar:?]
at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:481) ~[?:1.8.0_181]
at java.util.stream.ForEachOps$ForEachTask.compute(ForEachOps.java:291) ~[?:1.8.0_181]
at java.util.concurrent.CountedCompleter.exec(CountedCompleter.java:731) ~[?:1.8.0_181]
at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) ~[?:1.8.0_181]
at java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056) ~[?:1.8.0_181]
at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692) ~[?:1.8.0_181]
at java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:157) ~[?:1.8.0_181]
Caused by: java.net.ConnectException: 拒绝连接 (Connection refused)
at java.net.PlainSocketImpl.socketConnect(Native Method) ~[?:1.8.0_181]
at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350) ~[?:1.8.0_181]
at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206) ~[?:1.8.0_181]
at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188) ~[?:1.8.0_181]
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[?:1.8.0_181]
at java.net.Socket.connect(Socket.java:589) ~[?:1.8.0_181]
at org.apache.thrift.transport.TSocket.open(TSocket.java:250) ~[libthrift-0.16.0.jar:0.16.0]
... 19 more
2 Answers

一般 be宕机 不会导致fe宕机;

为更具体地了解问题(处理后会更新回帖),可以➕我一下W:yz-jayhua