Implement write operations #16

h4l · 2024-12-24T06:45:57Z

This PR implements support for writing to KV databases, using KV Connect / datapath protocol's atomic_write operation. It implements support for both the raw operation in denokv.datapath.atomic_write(), and high level APIs on the Kv type:

Kv.set(), Kv.delete(), Kv.sum(), Kv.min(), Kv.max(), Kv.enqueue() and Kv.check().
These methods are available on Kv itself for one-off operations, and
Kv.atomic() can chain these methods to group write operations to apply together in a transaction.

test/denokv_testing.py

The dev service can be used for ad-hoc dev tasks in a specific Python version environment. e.g: docker compose run --rm dev-py39

Python versions before 3.11 don't allow TypedDict subclasses to also inherit from Generic at runtime.

We'll use this instead of ellipsis to detect arguments that are not set in a call. This is needed to distinguish overloads in some cases.

We could get by without a functioning mock of the DB, but I find it valuable to implement it to make sure I'm not missing something about the possible functionality. Plus it'll make testing the Kv API easier.

This is the low-level API to invoke Atomic Write requests without any hand-holding.

It makes Enums use 'EnumName.FIELD' as the repr rather than the default syntax with < ... >.

KvEntry, VersionStamp and KvU64 are now in denokv._kv_values. These types are needed by other submodules that I'd like to separate from the denokv.kv module.

PlannedWrite is the central type, it represents the collection of mutation operations to be applied to the DB in a single atomic write. We have a KvWriter protocol which PlannedWrite implements, it'll be used by Kv so that we don't have a circular dependency between denokv.kv.Kv and denokv._kv_writes.PlannedWrite. In addition, we now have a ProtobufMessageRepresentation[MessageT] type which can be used to generalise the types representing datapath protocol types, like Sum, Max, Min, etc; to avoid coupling PlannedWrite to our default implementations.

We need a slightly customised V8 serialization format encoder to handle int and float consistently in write sum/min/max operations.

This allows making atomic_write datapath requests to make changes to the database, by translating PlannedWrite (& related types) into AtomicWrite protobuf messages.

We need to use it in both test_kv and test_datapath.

Now in denokv_testing so use it in test_kv.

It's now make_database_metadata() and a separate meta_endpoint() fn to unpack it. We'll use make_database_metadata() to replace the similar and overlapping mk_db_meta() function.

make_database_metadata() covers its use-case now.

We were mocking the snapshot_read() function to return results from the mock db implementation, but now we access the mock db via the unmocked datapath module's HTTP requests to an aiohttp mock HTTP server.

We now use a customised V8 decoder for the MockKvDb API that mirrors the default encoder used by Kv (that encodes int as BigInt): it always decodes int32 values as float rather than the v8serialize default of decoding int32 as int and Double as float. We don't rely on this being in place as we currently encode all int32 range values as Double (which are always decoded as float), but it seems prudent to ensure that we don't mistakenly treat small ints as BigInt instead of Number.

CommittedWrite satisfies is_ok() and ConflictedWrite satisfies is_err().

The ResponseUnsuccessful exception held details of the error message from the db server, but was not showing it in the default string representation; now it does.

DenoKvError was extending BaseException instead of Exception, which meant that catch-all Exception handlers didn't catch it.

We now provide a custom assertion failure message for comparisons involving two protobuf Message objects. The property values are diffed, similar to how pytest diffs dicts, etc.

We'll add __notes__ to exceptions in older versions, but in practice they won't be visible in 3.9's tracebacks. Based on: h4l/v8serialize@dec9d82

It disables setattr and delattr, like @DataClass(frozen=True)

mypy thought it was Any.

MockKvDb's atomic sum/min/max operations now allow int/float to be used for float sum operations. Previously sum operations had to use Python float values, because int values were used to distinguish bigint values. Now that v8serialize uses JSBigInt for bigint and decodes int-safe floats as int, it's necessary to allow int in float operations. We now segregate bigint/float/u64 number types according to the datapath protobuf encoding rather than Python type, which avoids clashes between bigint and float using Python int.

The Mapping needs overloads as its params are invariant.

@overload

The PlannedWrite and Mutation types now have much better support and APIs for performing atomic number operations, like sum, min and max. The first main feature is that the 3 number types (bigint, u64, float) can be selected explicitly with a number_type parameter, or implicitly via the type of the number value, and both options are type-safe. The second is that we support min/max mutations on bigint and float types, not just u64 (KvU64), and we support sum on KvU64 with both wrapping and clamp limits. The underlying datapath protocol (and Deno's own KV API) do not support these. This makes the atomic number APIs more intuitive, as otherwise the user needs to know about the arbitrary restriction that only certain number types can use Min/Max/Sum. I don't love the massive amount of @overload annotations it takes to type-annotate these functions well, but I don't think there's a better alternative. The way that PlannedWrite interfaces with the main Kv class is now decoupled with a general interface, so Kv and PlannedWrite do not have explicit dependencies on each other. This makes it easier to test and modularise the codebase, and also allows alternative writer implementations to be created for special cases.

@overload

The set()/sum()/delete() etc methods of PlannedWrite are now implemented in individual mixin classes. This will allow us to implement the same set of methods on the Kv class, without duplicating all the horrific @overload annotations.

It used to always require a message string, which made it awkward to subclass for errors that don't want to define a message at init.

Rather than throwing the lower-level errors from datapath, we'll now use one of these two higher-level errors.

Checks for unbound variable use.

The denokv self-hosted implementation does not return indexes of failed checks, so we need to allow this. The spec does not say that servers MUST report indexes of failed checks, only that clients SHOULD report failed indexes. See: denoland/denokv#110

CommittedWrite, ConflictedWrite and FailedWrite now all have a has_unknown_conflicts property to indicate whether the conflicts/failed_checks is populated with the specific checks that failed. This is necessary because databases don't have to report which checks failed.

Kv now has the check(), set(), sum(), min(), max(), delete() and enqueue() methods of PlannedWrite, using the same mixins that define these methods on PlannedWrite. On Kv they execute immediately instead of accumulating operations to execute later.

Python 3.9's datetime.fromisoformat() does not parse datetimes with a UTC Z indicator.

Some mixin/interface types did not have empty __slots__ defined, which caused types using slots to still have dicts.

We now use ruff/flake8-tidy-imports to ban importing typing/typing_extensions. Unfortunately it doesn't seem to support banning everything except allow-listed items, so we have to ignore places that need to import Literal and overload from typing (due to ruff mis-handling them when they're imported from our typing module).

We now configure prettier to wrap markdown automatically.

github-advanced-security bot found potential problems Dec 24, 2024

View reviewed changes

test/denokv_testing.py Dismissed Show dismissed Hide dismissed

test/denokv_testing.py Dismissed Show dismissed Hide dismissed

h4l force-pushed the kv-write branch 3 times, most recently from a19e7f6 to 684f188 Compare February 4, 2025 03:51

h4l added 26 commits February 4, 2025 03:53

chore: remove black extension from devcontainer config

c988ef0

build: add dev image target and compose service

723c8d8

The dev service can be used for ad-hoc dev tasks in a specific Python version environment. e.g: docker compose run --rm dev-py39

feat: support Generic TypedDict in _pycompat.typing

ed55251

Python versions before 3.11 don't allow TypedDict subclasses to also inherit from Generic at runtime.

feat: add explicit NotSet constant

61ca0f2

We'll use this instead of ellipsis to detect arguments that are not set in a call. This is needed to distinguish overloads in some cases.

test: implement atomic_write on MockKvDb

e5c7d8e

We could get by without a functioning mock of the DB, but I find it valuable to implement it to make sure I'm not missing something about the possible functionality. Plus it'll make testing the Kv API easier.

feat: implement datapath.atomic_write()

3a21ea0

This is the low-level API to invoke Atomic Write requests without any hand-holding.

feat: add EvalEnumRepr Enum mixin

fc49e55

It makes Enums use 'EnumName.FIELD' as the repr rather than the default syntax with < ... >.

refactor: move standalone KV value types into submodule

f8af2b7

KvEntry, VersionStamp and KvU64 are now in denokv._kv_values. These types are needed by other submodules that I'd like to separate from the denokv.kv module.

feat: add create_default_v8_encoder()

7d77861

We need a slightly customised V8 serialization format encoder to handle int and float consistently in write sum/min/max operations.

feat: implement Kv.write() API

fcc8fd7

This allows making atomic_write datapath requests to make changes to the database, by translating PlannedWrite (& related types) into AtomicWrite protobuf messages.

refactor: move mock Data Path API into denokv_testing

f7945b1

We need to use it in both test_kv and test_datapath.

refactor: use denokv_testing's mock HTTP API in test_kv

cd9b0d7

test: replace deprecated body= arg for HTTP exceptions

2a01ecb

refactor: move make_database_metadata_for_endpoint

f539d8f

Now in denokv_testing so use it in test_kv.

refactor: generalise make_database_metadata_for_endpoint

c0e429a

It's now make_database_metadata() and a separate meta_endpoint() fn to unpack it. We'll use make_database_metadata() to replace the similar and overlapping mk_db_meta() function.

refactor: remove mk_db_meta()

7b99acd

make_database_metadata() covers its use-case now.

refactor: use mock db via datapath to test Kv.list()

659ad69

We were mocking the snapshot_read() function to return results from the mock db implementation, but now we access the mock db via the unmocked datapath module's HTTP requests to an aiohttp mock HTTP server.

feat: support is_ok/is_err for Kv write() result type

6ec537e

CommittedWrite satisfies is_ok() and ConflictedWrite satisfies is_err().

feat: include error details in ResponseUnsuccessful str

1c205a8

The ResponseUnsuccessful exception held details of the error message from the db server, but was not showing it in the default string representation; now it does.

fix: don't use BaseException for DenoKvError

c2d4a49

DenoKvError was extending BaseException instead of Exception, which meant that catch-all Exception handlers didn't catch it.

feat: make ConsistencyLevel enum ordered

a040e1f

test: provide assertion errors for protobuf Message

ee0bd70

We now provide a custom assertion failure message for comparisons involving two protobuf Message objects. The property values are diffed, similar to how pytest diffs dicts, etc.

test: add mocked(...) helper to access Mock methods

4069c9b

chore: support __notes__ < py3.11

18896ce

We'll add __notes__ to exceptions in older versions, but in practice they won't be visible in 3.9's tracebacks. Based on: h4l/v8serialize@dec9d82

h4l added 20 commits February 4, 2025 03:54

test: add typeval() util function

6b417f5

feat: add @Frozen class decorator

db1fb35

It disables setattr and delattr, like @DataClass(frozen=True)

fix: annotate KvU64.RANGE

d0a848f

mypy thought it was Any.

test: configure hypothesis profiles to run more examples

f9d2118

feat: add Result.or_raise(), Result.value_or_raise()

9744433

test: allow AnyKvKey in denokv_testing.add_entries()

e15f97a

The Mapping needs overloads as its params are invariant.

test: only include denokv module in coverage reports

1b510fd

refactor: extract PlannedWrite methods as mixins

8169df9

The set()/sum()/delete() etc methods of PlannedWrite are now implemented in individual mixin classes. This will allow us to implement the same set of methods on the Kv class, without duplicating all the horrific @overload annotations.

feat: allow DenoKvError to have no message argument

c85705f

It used to always require a message string, which made it awkward to subclass for errors that don't want to define a message at init.

feat: make FailedWrite, ConflictedWrite exceptions

1517bb0

Rather than throwing the lower-level errors from datapath, we'll now use one of these two higher-level errors.

chore: enable mypy possibly-undefined check

d13ba06

Checks for unbound variable use.

chore: clarify ambiguously-defined variable

a8694de

refactor: adjust type annotations to support py39

e8431f6

refactor: use parse_rfc3339_datetime in tests

3fde257

Python 3.9's datetime.fromisoformat() does not parse datetimes with a UTC Z indicator.

fix: prevent __dict__ creation on types with slots

45f1517

Some mixin/interface types did not have empty __slots__ defined, which caused types using slots to still have dicts.

h4l force-pushed the kv-write branch from 684f188 to 08d86c3 Compare February 4, 2025 04:43

h4l force-pushed the kv-write branch from 08d86c3 to 96c4e05 Compare February 4, 2025 04:45

h4l added 2 commits February 4, 2025 05:47

style: auto-format markdown files

d07120c

We now configure prettier to wrap markdown automatically.

docs: update README and CHANGELOG for write support

9e18b91

h4l merged commit 9e18b91 into main Feb 4, 2025
23 checks passed

h4l changed the title ~~wip: implement write operations~~ Implement write operations Feb 4, 2025

h4l deleted the kv-write branch February 4, 2025 06:03

h4l deployed to release February 4, 2025 06:04 — with GitHub Actions Active

h4l mentioned this pull request Feb 4, 2025

Support writing to the database #5

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement write operations #16

Implement write operations #16

Uh oh!

h4l commented Dec 24, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Implement write operations #16

Implement write operations #16

Uh oh!

Conversation

h4l commented Dec 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

h4l commented Dec 24, 2024 •

edited

Loading