Check rocksdb closed before operating #4243

AnonHxy · 2024-03-24T09:30:03Z

Descriptions of the changes in this PR:

Motivation

From the issue above we can see that a core dump can happen when RocksDB is operated on after it has been closed, so we should check whether it is closed before operating on it.

Changes

Check whether RocksDB is closed before operating on it.
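
For readers without the diff at hand, the idea can be sketched roughly as below. This is a minimal illustration, not the PR's actual code: `FakeNativeHandle` and `GuardedStorage` are hypothetical stand-ins (the real class is KeyValueStorageRocksDB and the real handle is org.rocksdb.RocksDB), and the use of a ReentrantReadWriteLock is an assumption based on the `readLock.lock()` line in the diff.

```java
import java.io.IOException;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical stand-in for the native RocksDB handle (not the org.rocksdb API).
class FakeNativeHandle {
    final AtomicInteger puts = new AtomicInteger();
    volatile boolean open = true;

    void put(byte[] key, byte[] value) {
        // In the real library, touching a closed handle can crash the JVM.
        if (!open) {
            throw new IllegalStateException("native handle used after close");
        }
        puts.incrementAndGet();
    }

    void close() {
        open = false;
    }
}

// Hypothetical wrapper: operations share the read lock, close() takes the write lock.
class GuardedStorage {
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
    private final FakeNativeHandle db = new FakeNativeHandle();
    private boolean closed = false; // guarded by lock

    public void put(byte[] key, byte[] value) throws IOException {
        lock.readLock().lock();
        try {
            if (closed) {
                throw new IOException("RocksDB is closed");
            }
            db.put(key, value);
        } finally {
            lock.readLock().unlock();
        }
    }

    public void close() {
        lock.writeLock().lock(); // waits for in-flight operations to finish
        try {
            if (closed) {
                return; // closing twice is a no-op
            }
            closed = true;
            db.close();
        } finally {
            lock.writeLock().unlock();
        }
    }

    public int putCount() {
        return db.puts.get();
    }
}
```

With this shape, a late caller gets an IOException instead of reaching the native handle after it has been released.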

dlg99

LGTM

lhotari · 2025-03-05T14:44:22Z

@AnonHxy Thanks, I recently also came across this in #4558

lhotari · 2025-03-05T14:52:21Z

@AnonHxy
Another issue that I noticed in closing the RocksDB database is that it's not closing the database properly.

The javadoc of org.rocksdb.RocksDB#close says:
"This will not fsync the WAL files. If syncing is required, the caller must first call syncWal() or write(WriteOptions, WriteBatch) using an empty write batch with WriteOptions.setSync(boolean) set to true."

Would it make sense to also take this into account by adding db.syncWal() just after closed = true;?
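
A rough sketch of what that ordering could look like, under stated assumptions: `WalBackedDb`, `RecordingDb` and `SyncingStorage` are hypothetical names used only for illustration, and the stand-in `syncWal()` does not declare a checked exception the way the real org.rocksdb method may.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical minimal view of the database: only the two calls discussed here.
interface WalBackedDb {
    void syncWal();
    void close();
}

// Test double that records the order of calls instead of touching disk.
class RecordingDb implements WalBackedDb {
    final List<String> calls = new ArrayList<>();

    @Override
    public void syncWal() {
        calls.add("syncWal");
    }

    @Override
    public void close() {
        calls.add("close");
    }
}

// Hypothetical wrapper showing the suggested ordering inside close().
class SyncingStorage {
    private final WalBackedDb db;
    private boolean closed = false;

    SyncingStorage(WalBackedDb db) {
        this.db = db;
    }

    public synchronized void close() {
        if (closed) {
            return;
        }
        closed = true;
        try {
            db.syncWal(); // flush the WAL to disk before releasing the handle
        } finally {
            db.close();   // still release the handle if the sync fails
        }
    }
}
```

The try/finally is a design choice of this sketch: it keeps a failed sync from leaking the handle, at the cost of the sync error surfacing only after the database has been closed.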

merlimat · 2025-03-05T16:36:30Z

...er-server/src/main/java/org/apache/bookkeeper/bookie/storage/ldb/KeyValueStorageRocksDB.java


    @Override
    public void put(byte[] key, byte[] value) throws IOException {
+        readLock.lock();


Acquiring the read lock for each put/get operation might be quite expensive, since the lock has to maintain the state of threads and the order they try to acquire the lock.

We could use a different approach, like a reference counter on the entire object. Once everyone is done using it, RocksDB is really closed.
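
One possible reading of the reference-counter idea, as a minimal sketch. `RefCountedDb` and its methods are hypothetical, the native call is replaced by a constant, and nothing here is taken from the PR or from #4581.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch: the owner holds one reference, each operation borrows one.
class RefCountedDb {
    private final AtomicInteger refs = new AtomicInteger(1); // the owner's reference
    private final AtomicBoolean closeRequested = new AtomicBoolean(false);
    private final AtomicBoolean nativeClosed = new AtomicBoolean(false);

    // Take a reference unless close has been requested or the count already hit zero.
    boolean tryAcquire() {
        if (closeRequested.get()) {
            return false;
        }
        while (true) {
            int n = refs.get();
            if (n == 0) {
                return false;
            }
            if (refs.compareAndSet(n, n + 1)) {
                return true;
            }
        }
    }

    // Drop a reference; whoever drops the last one closes the native handle.
    void release() {
        if (refs.decrementAndGet() == 0) {
            nativeClosed.set(true); // stand-in for the real db.close()
        }
    }

    // Request close: gives up the owner's reference exactly once.
    void close() {
        if (closeRequested.compareAndSet(false, true)) {
            release();
        }
    }

    long count() {
        if (!tryAcquire()) {
            throw new IllegalStateException("RocksDB is closed");
        }
        try {
            return 42L; // stand-in for the native call
        } finally {
            release();
        }
    }

    boolean isNativeClosed() {
        return nativeClosed.get();
    }
}
```

In this sketch close() never blocks: if an operation is still in flight, the native close is deferred to the thread that releases the last reference.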

@AnonHxy It seems that a StampedLock could be sufficient here since there doesn't seem to be a need for re-entrancy of the lock. @merlimat would using a StampedLock address your concern?
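
For comparison, a minimal sketch of the StampedLock variant. `StampedGuardedStorage` is a hypothetical class and the native call is replaced by a constant; the sketch says nothing about whether this is actually cheaper, which would need measuring.

```java
import java.util.concurrent.locks.StampedLock;

// Hypothetical sketch of guarding operations with a non-reentrant StampedLock.
class StampedGuardedStorage {
    private final StampedLock lock = new StampedLock();
    private boolean closed = false; // guarded by lock

    long count() {
        long stamp = lock.readLock(); // shared mode, many readers may hold it
        try {
            if (closed) {
                throw new IllegalStateException("RocksDB is closed");
            }
            return 7L; // stand-in for the native call
        } finally {
            lock.unlockRead(stamp);
        }
    }

    void close() {
        long stamp = lock.writeLock(); // exclusive mode, waits for readers to drain
        try {
            closed = true; // the real code would also release the native handle here
        } finally {
            lock.unlockWrite(stamp);
        }
    }
}
```

The sketch uses a full read lock rather than tryOptimisticRead(): an optimistic read is only validated after the fact, which is too late to stop a native call from running against a handle that was closed in the meantime. Because StampedLock is not reentrant, a guarded method must not call another guarded method while holding the lock.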

There's an alternative PR #4581 which simply handles close and count

…t after rocksdb has been closed (#4581)
* Fix the coredump that occurs when calling KeyValueStorageRocksDB.count() (possibly triggered by Prometheus) after RocksDB has been closed(#4243)
* fix race when count op in process and db gets closed.
---------
Co-authored-by: zhaizhibo <[email protected]>

…t after rocksdb has been closed (#4581)
* Fix the coredump that occurs when calling KeyValueStorageRocksDB.count() (possibly triggered by Prometheus) after RocksDB has been closed(#4243)
* fix race when count op in process and db gets closed.
---------
Co-authored-by: zhaizhibo <[email protected]>
(cherry picked from commit 2831ed3)

hangc0276 · 2025-06-04T18:01:34Z

Since #4581 has fixed this issue, I'm closing this PR. @AnonHxy If you encounter this issue again, feel free to reopen it.

…t after rocksdb has been closed (apache#4581)
* Fix the coredump that occurs when calling KeyValueStorageRocksDB.count() (possibly triggered by Prometheus) after RocksDB has been closed(apache#4243)
* fix race when count op in process and db gets closed.
---------
Co-authored-by: zhaizhibo <[email protected]>
(cherry picked from commit 2831ed3)
(cherry picked from commit 9d067f4)

AnonHxy added 2 commits March 24, 2024 17:24

Check rocksdb closed before operating

9d98e11

Fix

f362987

AnonHxy force-pushed the operate_db_after_close branch from 8c6e5ff to f362987 Compare March 24, 2024 09:34

checkstyle

b8d25bf

dlg99 approved these changes May 30, 2024

lhotari requested review from hangc0276, merlimat and zymap March 5, 2025 14:42

lhotari mentioned this pull request Mar 5, 2025

RocksDB causes Bookkeeper to crash #4558

Closed

merlimat reviewed Mar 5, 2025

zhaizhibo pushed a commit to zhaizhibo/bookkeeper that referenced this pull request Apr 16, 2025

Fix the coredump that occurs when calling KeyValueStorageRocksDB.coun…

e48ca2b

…t() (possibly triggered by Prometheus) after RocksDB has been closed(apache#4243)

zhaizhibo pushed a commit to zhaizhibo/bookkeeper that referenced this pull request Apr 16, 2025

Fix the coredump that occurs when calling KeyValueStorageRocksDB.coun…

2a73596

…t() (possibly triggered by Prometheus) after RocksDB has been closed(apache#4243)

lhotari mentioned this pull request Apr 16, 2025

Fix the coredump that occurs when calling KeyValueStorageRocksDB.count after rocksdb has been closed #4581

Merged

hangc0276 closed this Jun 4, 2025
