feat: actor messages & queues #3989

NathanFlurry · 2026-01-21T00:59:42Z

No description provided.

railway-app · 2026-01-21T00:59:49Z

🚅 Deployed to the rivet-pr-3989 environment in rivet-frontend

Service	Status	Web	Updated (UTC)
frontend-inspector	❌ Build Failed (View Logs)	Web	Jan 21, 2026 at 1:44 am
frontend-cloud	❌ Build Failed (View Logs)	Web	Jan 21, 2026 at 1:43 am
website	😴 Sleeping (View Logs)	Web	Jan 21, 2026 at 1:09 am

NathanFlurry · 2026-01-21T01:00:15Z

feat: actor messages & queues #3989 👈 (View in Graphite)
main

How to use the Graphite Merge Queue

Add the label merge-queue to this PR to add it to the merge queue.

You must have a Graphite account in order to use the merge queue. Sign up using this link.

_{An organization admin has enabled the Graphite Merge Queue in this repository.} _{Please do not merge from GitHub as this will restart CI on PRs being processed by the merge queue.}

This stack of pull requests is managed by Graphite. Learn more about stacking.

claude · 2026-01-21T01:03:44Z

Review of PR #3989: Actor Messages & Queues

This PR adds a message queue system to RivetKit actors. Overall, the implementation is well-structured with good separation of concerns. Here are my findings:

✅ Strengths

Clean Architecture: Queue functionality is properly separated into QueueManager (internal) and ActorQueue (user-facing API)
Comprehensive Testing: Good test coverage in actor-queue.ts including edge cases (timeouts, aborts, size limits)
Versioning: Proper schema versioning with v4 of actor-persist and v3 of client-protocol
Error Handling: Well-defined custom errors (QueueFull, QueueMessageTooLarge, QueueMessageInvalid, ActorAborted)
Type Safety: Good use of TypeScript generics and overloaded signatures in ActorQueue.next()

🐛 Potential Issues

High Priority

Race Condition in Metadata Updates (queue-manager.ts:129-138)

The code updates in-memory metadata before the KV write completes
If the batch write fails, in-memory state becomes inconsistent with storage
Fix: Only update in-memory metadata after successful write, or revert on failure

// Current (risky):
this.#metadata.nextId = id + 1n;
this.#metadata.size += 1;
const encodedMetadata = this.#serializeMetadata();
await this.#driver.kvBatchPut(this.#actor.id, [...]);

// Better:
const newMetadata = {
    nextId: id + 1n,
    size: this.#metadata.size + 1
};
const encodedMetadata = this.#serializeMetadata(newMetadata);
await this.#driver.kvBatchPut(this.#actor.id, [...]);
this.#metadata = newMetadata; // Only update after success

Message Ordering Not Guaranteed (queue-manager.ts:246-278)
- kvListPrefix returns entries, but sorting relies on BigInt ID comparison
- No guarantee that IDs are assigned in the order messages arrive if there are concurrent enqueues
- Consider documenting this behavior or adding sequence numbers
Unbounded Waiter Growth (queue-manager.ts:45)
- No limit on number of concurrent waiters
- A malicious actor could create thousands of waiters and cause memory issues
- Recommendation: Add maxConcurrentWaiters config option

Medium Priority

Missing Cleanup on Actor Stop (queue-manager.ts)
- No explicit cleanup method to reject pending waiters when actor stops unexpectedly
- The abort signal handling (lines 204-209) helps, but there's no cleanup of timeout handles if the manager is destroyed
- Recommendation: Add a destroy() method that clears all waiters and their timeouts
Inefficient Queue Draining (queue-manager.ts:228-244)
- #drainMessages loads ALL messages from storage, then filters in memory
- For large queues (up to 1000 messages), this is wasteful
- Optimization: Consider adding a way to filter at the storage layer, or at least batch the loading
Error Handling in Message Loading (queue-manager.ts:266-271)
- Corrupted messages are silently logged but not tracked
- Could lead to confusion if messages are "lost" due to corruption
- Recommendation: Consider exposing corruption metrics or warnings to the user
Metadata Rebuild Logic (queue-manager.ts:327-353)
- Only scans for max ID, doesn't validate message integrity
- The size is based on entry count, which could include corrupted entries
- Consider validating at least one message to ensure the schema is correct

Low Priority

Timestamp Precision (queue-manager.ts:107)
- Uses Date.now() which returns milliseconds
- Per CLAUDE.md conventions, timestamps should be "i64 epoch timestamps in milliseconds" ✓
- This is correct, no issue
Magic Number (keys.ts:10)
- QUEUE_ID_BYTES = 8 is defined separately
- Consider adding a comment explaining why 8 bytes (supports 2^64 messages)
Proxy Implementation (queue.ts:87-124)
- The queue proxy is clever but the has trap always returns true for strings
- This could lead to confusing behavior with in operator
- Consider being more selective about what properties are claimed to exist

🔒 Security Considerations

Message Size Validation: ✅ Properly validates message size before storage
CBOR Serialization Check: ✅ Validates serializability before encoding
Queue Size Limits: ✅ Enforces configurable limits
Path Traversal: N/A - queue names are not used in file paths
Injection: ✅ Queue names are strings, properly encoded

⚡ Performance Considerations

Batch Operations: ✅ Good use of kvBatchPut and kvBatchDelete to minimize round trips
Metadata Caching: ✅ Keeps size in memory, good for frequent checks
Message Filtering: ⚠️ See issue [SVC-2358] Enable Redis memory overcommit #5 above - could be optimized
Waiter Resolution: ✅ Efficiently iterates only active waiters

📋 Code Quality

Follows CLAUDE.md:
- ✅ Uses lowercase log messages
- ✅ Uses structured logging with ? prefix
- ✅ Timestamp named createdAt (correct convention)
- ✅ Complete sentences in comments
- ❌ Import issue: router-endpoints.ts:45 imports after imports (minor)
Type Safety: ✅ Excellent use of TypeScript
Documentation: ⚠️ User-facing API has JSDoc, but internal methods lack comments explaining complex logic
Testing: ✅ Good test coverage of happy paths and error cases

📝 Recommendations

Fix the metadata race condition (issue [SVC-2555] Set up issue templates #1) before merging - this could cause data loss
Add waiter limits to prevent DoS scenarios
Document message ordering guarantees (or lack thereof) in the API docs
Add cleanup/destroy method for proper resource management
Consider optimizing the queue draining for large queues
Add JSDoc comments to complex internal methods like #maybeResolveWaiters

📊 Test Coverage

The test suite covers:

✅ Basic send/receive
✅ Multiple queue names
✅ Timeout behavior
✅ Abort signals
✅ Size limits
✅ Message size limits

Missing tests:

❌ Concurrent enqueue/dequeue scenarios
❌ Metadata corruption recovery
❌ Actor restart with pending messages
❌ Multiple waiters on same queue

Overall Assessment

This is a solid implementation with good architecture and test coverage. The main concern is the metadata race condition which should be addressed before merging. The other issues are mostly optimization opportunities and defensive programming improvements.

Recommendation: Request changes for issue #1, recommend addressing issues #3-4 before merge.

railway-app bot requested a deployment to rivet-frontend / rivet-pr-3989 January 21, 2026 00:59 In progress

feat: actor messages & queues

ed996b2

NathanFlurry force-pushed the actor-messages branch from 20d84dd to ed996b2 Compare January 21, 2026 01:42

railway-app bot had a problem deploying to rivet-frontend / rivet-pr-3989 January 21, 2026 01:42 Failure

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: actor messages & queues #3989

feat: actor messages & queues #3989

Uh oh!

NathanFlurry commented Jan 21, 2026

Uh oh!

railway-app bot commented Jan 21, 2026 •

edited

Loading

Uh oh!

NathanFlurry commented Jan 21, 2026

Uh oh!

claude bot commented Jan 21, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: actor messages & queues #3989

Are you sure you want to change the base?

feat: actor messages & queues #3989

Uh oh!

Conversation

NathanFlurry commented Jan 21, 2026

Uh oh!

railway-app bot commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NathanFlurry commented Jan 21, 2026

How to use the Graphite Merge Queue

Uh oh!

claude bot commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review of PR #3989: Actor Messages & Queues

✅ Strengths

🐛 Potential Issues

High Priority

Medium Priority

Low Priority

🔒 Security Considerations

⚡ Performance Considerations

📋 Code Quality

📝 Recommendations

📊 Test Coverage

Overall Assessment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

railway-app bot commented Jan 21, 2026 •

edited

Loading

claude bot commented Jan 21, 2026 •

edited

Loading