doris中所有的doris_be 突然出现因segcompaction引发crash的问题,不知道大家遇到过没有?如何解决

Viewed 51

版本是:version doris-2.1.6-rc04(SSE4.2) RELEASE (build git://vm-99@653e315ba5441707168c9c9a64f421037f305a4c) Built on Sat, 07 Sep 2024 03:21:08 CST by vm-99

下面是日志be.info
I20241210 18:02:23.149176 2752525 schema_change.cpp:1085] schema change type, sc_sorting: 0, sc_directly: 1, base_tablet=2855646, new_tablet=2985274
I20241210 18:02:24.422439 2752204 wal_manager.cpp:486] Scheduled(every 10s) WAL info: [/doris/be/storage/wal: limit 1101054420992 Bytes, used 0 Bytes, estimated wal bytes 0 Bytes, available 1101054420992 Bytes.];
I20241210 18:02:26.337330 2752526 beta_rowset_writer.cpp:440] submit segcompaction task, tablet_id:2985262 rowset_id:02000000000000c0684466cd3749f1dd859d6eabec4bbe8e segment num:10, segcompacted_point:0
I20241210 18:02:26.337409 2752320 olap_server.cpp:1100] segcompaction thread pool queue time(ms): 0
I20241210 18:02:26.515355 2752526 schema_change.h:126] all row nums. source_rows=7657995, source_filtered_rows=459582, source_merged_rows=0, merged_rows=0, filtered_rows=0, new_index_rows=7198413, writer_filtered_rows=0
I20241210 18:02:26.639236 2753294 mem_info.cpp:211] Refresh cgroup memory win, refresh again after 10s, cgroup mem limit: 9223372036854771712, cgroup mem usage: 3071709184
I20241210 18:06:23.436573 2754996 doris_main.cpp:382] version doris-2.1.6-rc04(SSE4.2) RELEASE (build git://vm-99@653e315ba5441707168c9c9a64f421037f305a4c)
Built on Sat, 07 Sep 2024 03:21:08 CST by vm-99

be.out日志:

*** Query id: 0-0 ***
*** is nereids: 0 ***
*** tablet id: 0 ***
*** Aborted at 1733828236 (unix time) try "date -d @1733828236" if you are using GNU date ***
*** Current BE git commitID: 653e315ba5 ***
*** SIGSEGV address not mapped to object (@0x40) received by PID 2776507 (TID 2777301 OR 0x7f3b39b6e700) from PID 64; stack trace: ***
0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /home/zcp/repo_center/doris_release/doris/be/src/common/signal_handler.h:421
1# os::Linux::chained_handler(int, siginfo_t*, void*) in /usr/lib/jvm/java-1.8.0-openjdk/jre/lib/amd64/server/libjvm.so
2# JVM_handle_linux_signal in /usr/lib/jvm/java-1.8.0-openjdk/jre/lib/amd64/server/libjvm.so
3# signalHandler(int, siginfo_t*, void*) in /usr/lib/jvm/java-1.8.0-openjdk/jre/lib/amd64/server/libjvm.so
4# 0x00007F3D04E10400 in /lib64/libc.so.6
5# doris::SegcompactionWorker::_do_compact_segments(std::shared_ptr<std::vector<std::shared_ptr, std::allocator<std::shared_ptr > > >) at /home/zcp/repo_center/doris_release/doris/be/src/olap/rowset/segcompaction.cpp:249
6# doris::SegcompactionWorker::compact_segments(std::shared_ptr<std::vector<std::shared_ptr, std::allocator<std::shared_ptr > > >) at /home/zcp/repo_center/doris_release/doris/be/src/olap/rowset/segcompaction.cpp:335
7# doris::StorageEngine::_handle_seg_compaction(std::shared_ptr, std::shared_ptr<std::vector<std::shared_ptr, std::allocator<std::shared_ptr > > >, unsigned long) at /home/zcp/repo_center/doris_release/doris/be/src/olap/olap_server.cpp:1101
8# doris::Status std::__invoke_impl<doris::Status, doris::Status (doris::StorageEngine::&)(std::shared_ptr, std::shared_ptr<std::vector<std::shared_ptr, std::allocator<std::shared_ptr > > >, unsigned long), doris::StorageEngine&, std::shared_ptr&, std::shared_ptr<std::vector<std::shared_ptr, std::allocator<std::shared_ptr > > >&, unsigned long&>(std::__invoke_memfun_deref, doris::Status (doris::StorageEngine::&)(std::shared_ptr, std::shared_ptr<std::vector<std::shared_ptr, std::allocator<std::shared_ptr > > >, unsigned long), doris::StorageEngine&, std::shared_ptr&, std::shared_ptr<std::vector<std::shared_ptr, std::allocator<std::shared_ptr > > >&, unsigned long&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/invoke.h:74
9# std::_Function_handler<void (), std::_Bind_result<void, doris::Status (doris::StorageEngine::(doris::StorageEngine, std::shared_ptr, std::shared_ptr<std::vector<std::shared_ptr, std::allocator<std::shared_ptr > > >, unsigned long))(std::shared_ptr, std::shared_ptr<std::vector<std::shared_ptr<dor@@@

1 Answers

这是问题是偶发的,还是经常会有?