Separating out the Policy Store, Policy Reasoner and API #270

Lut99 · 2025-05-05T09:50:51Z

This is the Brane counterpart to BraneFramework/policy-reasoner#61.

This PR sees a complete rework of brane-chk to incorporate the changes introduced by the policy-reasoner refactor. Specifically:

- Moved the interface part (API) to the brane-chk code. As such, the code here has become more complex (as in, there is some now), but this removes the co-dependency between the Brane and reasoner repos.
- Separated out the reasoner and store APIs, as per the new split in the reasoner itself. This mirrors the old deliberation/management API split.
  - However, note that the reasoner context is the odd one out here. It is a reasoner concept but is used in the store API, and as such, the interface on the Brane side does a little wizardry to inject that concept into the store API.
- Moved to the eFLINT Haskell backend. This means the brane-chk container no longer compiles Go but Haskell instead.

The merge is almost complete. See #269 and BraneFramework/manual#5 for things left to do.

In addition:

- Removed schemes (e.g., http://) from endpoints in config files, as these were always fixed to the same value anyway and caused confusion when writing the files. Instead, the code using the addresses infers the scheme itself.
- Docker containers now run as non-root (as the brane user instead) for added security (save for brane-job, as it needs to access the Docker socket, also on Mac).
- Tested & runs on an ARM Mac.
- Added the make brane-let-docker command, which compiles the branelet executable in a container to work around branelet incompatibility issues between Linux and macOS.
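
To illustrate the scheme inference mentioned in the first point: the snippet below is only a minimal sketch, not the actual Brane code. The helper name `with_scheme`, its signature and the example address are all hypothetical; the idea is simply that config files carry a bare `host:port` and the consuming code prepends whatever scheme it needs.

```rust
/// Hypothetical helper (not part of Brane): returns `endpoint` with a scheme.
///
/// If the endpoint already carries a scheme (anything containing `://`), it is
/// returned unchanged; otherwise `default_scheme` is prepended.
fn with_scheme(endpoint: &str, default_scheme: &str) -> String {
    if endpoint.contains("://") {
        // A scheme is already present, so leave the endpoint as it is.
        endpoint.to_string()
    } else {
        // Bare `host:port` from the config file: let the caller decide the scheme.
        format!("{default_scheme}://{endpoint}")
    }
}
```

With this, a config entry such as `worker.example.com:50053` (a made-up address) would become `http://worker.example.com:50053` when passed through `with_scheme(..., "http")`, while an address that already has a scheme passes through untouched.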

IMPORTANT: Before this is merged, merge the equivalent PR at the policy-reasoner side first. Then, update the dependencies in this repo to remove the branch = "lib-refactor"-keys. Then this PR may be merged.

(Also fix the rebase issues first lel)

Lut99 · 2025-05-05T09:51:33Z

Oh no, I see what went wrong with the rebase now

WHY DID IT PUT MAIN ON MY CHANGES AND THEN ADDED THESE CHANGES ATOP THE MAIN ONES AGAIN???

Daniel it's a mess 😂 help

Lut99 · 2025-05-05T10:08:15Z

OK, I completely see what I did wrong. I got confused after I finished the rebase and then I merged the remote back into it 🤦‍♂️ Clearly, since the histories now diverge, I should've just force pushed.

Is there an easy way to fix this? I think I added commits since that happened whoop

Lut99 · 2025-05-06T10:34:54Z

I'm having a bad time fixing this xD man I hate rebases (of this size)

I tried to rebase the last couple of commits (7f33db8 - 5b77c67) onto the commit where this starts to differ from main. Completely failed. 50% of my changes didn't even make it to the final product (which took me well over an hour). Idk man, I don't think I quite get how to do rebases just yet xD

Whatever's in this branch now works. I just want to end up with this but then without these double-changes commits. Should I just squash the whole branch to get it over with? Would be a shame, because it's an important refactor and a lot of history would be lost. But at least it'd be a straightforward history.

Oh yeah, PS, this would need an additional commit syncing with the latest version of policy-reasoner s.t. we work around the Gitlab syncing issue. Fails to do anything meaningful with Cargo otherwise.

DanielVoogsgerd · 2025-05-06T10:44:06Z

Yeah, this is not a fun rebase. I can take a look tomorrow, I just need a working version of the working tree so I know what we are working towards.

Lut99 · 2025-05-06T11:50:10Z

The current state of lib-refactor passes my "integration tests" (i.e., can remotely run workflows and do data transfers, both to other workers and end users). If that's what you mean. Would be great if you can take a look :) but don't rush your deadline of course. Let me know if it's too much work, then I'll see what I can do on my end.

github-advanced-security

devskim found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

DanielVoogsgerd

I am going to need a couple of passes through this code if I want to catch most of the "issues". This first pass is very coarse and more focussed on style and blatant issues. Once I am happy with where the code is at, I will move on to testing and running it to see what has happened from an architectural perspective.

I hope I can get a mental model of it all, but such a large changeset without documentation is really pushing/exceeding my boundaries if I want to form a proper idea of what has changed and what all the implications are.

Feel free to leave the changes to me. A lot of this stuff is trivial and sometimes subjective. I will gladly make the changes myself, but maybe it is nice to get some feedback on this stuff.

Finally, most of it actually looks quite good, and I really like some structural changes I am seeing, so I really feel like this is a change for the better.

Dockerfile.let

Dockerfile.rls

specifications/src/wir/builtins.rs

specifications/src/wir/merge_strategy.rs

tests/wir/arrays.json

Lut99 · 2025-05-12T09:15:17Z

OK; see my comments. What do you think?

DanielVoogsgerd

A last couple of comments so I can fix them in #276

brane-chk/src/state.rs

DanielVoogsgerd · 2025-06-04T11:18:25Z

brane-chk/src/stateresolver.rs

+struct CallFinder<'w> {
+    /// The workflow ID (for debugging)
+    wf_id: &'w str,
+    /// The task to find.
+    call:  &'w str,
+    /// Whether we already found it or not.
+    found: bool,
+}
+impl<'w> CallFinder<'w> {
+    /// Constructor for the CallFinder.
+    ///
+    /// # Arguments
+    /// - `wf_id`: The ID of the workflow we're asserting.
+    /// - `call`: The ID of the call to find.
+    ///
+    /// # Returns
+    /// A new instance of Self, ready to sniff out the call!
+    #[inline]
+    fn new(wf_id: &'w str, call: &'w str) -> Self { Self { wf_id, call, found: false } }
+}
+impl<'w> Visitor<'w> for CallFinder<'w> {
+    type Error = Error;
+
+    #[inline]
+    fn visit_call(&mut self, elem: &'w ElemCall) -> Result<Option<&'w Elem>, Self::Error> {
+        // Check if it's the one
+        if self.call == elem.id {
+            if !self.found {
+                self.found = true;
+            } else {
+                return Err(Error::DuplicateCallId { workflow: self.wf_id.into(), call: elem.id.clone() });
+            }
+        }
+
+        // OK, continue
+        Ok(Some(&elem.next))
+    }
+}
+
+/// Asserts that the given task occurs exactly once in the workflow and that it has exactly one
+/// input with the given name.
+#[derive(Debug)]
+struct CallInputFinder<'w> {
+    /// The workflow ID (for debugging)
+    wf_id: &'w str,
+    /// The task to find.
+    call: &'w str,
+    /// The input to find.
+    input: &'w str,
+    /// Whether we already found the call or not.
+    found_call: bool,
+}
+impl<'w> CallInputFinder<'w> {
+    /// Constructor for the CallInputFinder.
+    ///
+    /// # Arguments
+    /// - `wf_id`: The ID of the workflow we're asserting.
+    /// - `call`: The ID of the call to find.
+    /// - `input`: The ID of the input to the given call to find.
+    ///
+    /// # Returns
+    /// A new instance of Self, ready to scooby the input to call.
+    #[inline]
+    fn new(wf_id: &'w str, call: &'w str, input: &'w str) -> Self { Self { wf_id, call, input, found_call: false } }
+}
+impl<'w> Visitor<'w> for CallInputFinder<'w> {
+    type Error = Error;
+
+    #[inline]
+    fn visit_call(&mut self, elem: &'w ElemCall) -> Result<Option<&'w Elem>, Self::Error> {
+        // Check if it's the one
+        if self.call == elem.id {
+            // It is, so mark it (or complain we've seen it before)
+            if !self.found_call {
+                self.found_call = true;
+            } else {
+                return Err(Error::DuplicateCallId { workflow: self.wf_id.into(), call: elem.id.clone() });
+            }
+
+            // Also verify the input exists in this call
+            let mut found_input: bool = false;
+            for i in &elem.input {
+                if self.input == i.id {
+                    if !found_input {
+                        found_input = true;
+                    } else {
+                        return Err(Error::DuplicateInputId { workflow: self.wf_id.into(), call: elem.id.clone(), input: i.id.clone() });
+                    }
+                }
+            }
+            if !found_input {
+                return Err(Error::UnknownInputToCall { workflow: self.wf_id.into(), call: elem.id.clone(), input: self.input.into() });
+            }
+        }
+
+        // OK, continue
+        Ok(Some(&elem.next))
+    }
+}


Is this a Brane thing or a policy reasoner thing? I am not sure whether the policy reasoner wants to make these assertions about a workflow in general, or whether this is specific to our use case.

Also, the name "finder" doesn't match what these visitors seem to do: they assert that something occurs exactly once rather than find it.
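
To make the naming point concrete, here is a minimal, self-contained sketch of the same exactly-once check under a name that reflects what it does. Everything in it is an assumption for illustration: `Elem`/`ElemCall` are heavily simplified stand-ins for the real workflow types, the `Visitor` trait is left out, and `CallAsserter`, `assert_call_once` and the `UnknownCall` variant are hypothetical names that do not appear in the diff.

```rust
/// Simplified, hypothetical stand-in for the workflow graph.
#[derive(Debug)]
enum Elem {
    Call(ElemCall),
    Stop,
}

/// Simplified, hypothetical stand-in for a call element.
#[derive(Debug)]
struct ElemCall {
    id: String,
    next: Box<Elem>,
}

#[derive(Debug, PartialEq)]
enum Error {
    DuplicateCallId { workflow: String, call: String },
    /// Hypothetical variant: the call never occurred at all.
    UnknownCall { workflow: String, call: String },
}

/// Asserts (rather than "finds") that a call occurs exactly once.
struct CallAsserter<'w> {
    wf_id: &'w str,
    call: &'w str,
    found: bool,
}

impl<'w> CallAsserter<'w> {
    fn new(wf_id: &'w str, call: &'w str) -> Self { Self { wf_id, call, found: false } }

    /// Same logic as `visit_call` in the diff: mark the first match, reject a second one.
    fn visit_call(&mut self, elem: &'w ElemCall) -> Result<Option<&'w Elem>, Error> {
        if self.call == elem.id {
            if self.found {
                return Err(Error::DuplicateCallId { workflow: self.wf_id.into(), call: elem.id.clone() });
            }
            self.found = true;
        }
        Ok(Some(&*elem.next))
    }
}

/// Walks a linear workflow and checks that `call` occurs exactly once.
fn assert_call_once(wf_id: &str, start: &Elem, call: &str) -> Result<(), Error> {
    let mut visitor = CallAsserter::new(wf_id, call);
    let mut cur = Some(start);
    while let Some(elem) = cur {
        cur = match elem {
            Elem::Call(c) => visitor.visit_call(c)?,
            Elem::Stop => None,
        };
    }
    if visitor.found {
        Ok(())
    } else {
        Err(Error::UnknownCall { workflow: wf_id.into(), call: call.into() })
    }
}
```

The driver function also shows why the check reads as an assertion: the caller only ever gets `Ok(())` or an error, never the element itself. Whether a missing call should be an error at this point is a guess on my part; the diff does not show how `found` is consumed.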

brane-chk/src/workflow/compile.rs

brane-tsk/src/errors.rs

specifications/Cargo.toml

specifications/src/checking.rs

specifications/src/wir/builtins.rs

brane-tsk/src/errors.rs


codecov · 2025-07-30T19:25:04Z

Codecov Report

❌ Patch coverage is 31.81990% with 1802 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| specifications/src/checking.rs | 0.00% | 293 Missing ⚠️ |
| brane-chk/src/workflow/eflint.rs | 0.00% | 206 Missing ⚠️ |
| brane-ctl/src/policies.rs | 0.00% | 206 Missing ⚠️ |
| brane-chk/src/workflow/preprocess.rs | 72.24% | 121 Missing and 45 partials ⚠️ |
| specifications/src/address.rs | 0.00% | 134 Missing ⚠️ |
| brane-job/src/worker.rs | 0.00% | 98 Missing ⚠️ |
| brane-chk/src/apis/deliberation.rs | 0.00% | 84 Missing ⚠️ |
| brane-chk/src/stateresolver.rs | 0.00% | 71 Missing ⚠️ |
| brane-chk/src/workflow/compile.rs | 71.72% | 34 Missing and 33 partials ⚠️ |
| brane-cli/src/repl.rs | 0.00% | 67 Missing ⚠️ |

... and 46 more


Lut99 · 2025-07-30T23:48:09Z

OK hold your breath fellas

Lut99 · 2025-07-30T23:49:25Z

(Assuming you can hold your breath for 15+ minutes)

Lut99 · 2025-07-30T23:53:44Z

NO

Lut99 · 2025-07-30T23:55:01Z

Of course now nightly clippy wakes up. Great.

Lut99 · 2025-07-30T23:56:49Z

Time to hold your breath again

Lut99 · 2025-07-31T00:11:06Z

This runner cannot be having a good time

Lut99 · 2025-07-31T00:18:16Z

W-What...? It literally failed again after 19 minutes of compilation...?

Lut99 · 2025-07-31T00:18:26Z

Guess I'll never go to sleep, huh

Lut99 · 2025-07-31T00:21:19Z

... (I locally tested building build-eflint-repl only, not brane-chk. Take a lesson from that, kids, or something)

Lut99 · 2025-07-31T00:21:33Z

Cue breath holding time *again*

Lut99 · 2025-07-31T00:53:53Z

Time to close this chapter 😎

DanielVoogsgerd · 2025-07-31T08:27:04Z

I will bookmark this under the name: Tim ~~slowly~~ rapidly losing his mind. Thanks for carrying this over the finish line.

It was a horrible refactor to land, but I think it is worth the struggle for Brane in the end. Both in removing the circular dependency (thank god), but also in the scope creep changes that inevitably made it into the PR. Thanks for all the hard work! ❤️

Lut99 · 2025-07-31T12:56:25Z

Haha yeah it was an adventure alright. But it's in a really good shape now, and quite the milestone. Thanks to you too for all the hard work, patience and lessons!

Lut99 mentioned this pull request May 5, 2025

Separating out the Policy Store, Policy Reasoner and API BraneFramework/policy-reasoner#61

Merged

Lut99 requested a review from DanielVoogsgerd May 5, 2025 09:53

DanielVoogsgerd force-pushed the lib-refactor branch from 5b77c67 to bb6af07 Compare May 7, 2025 14:08

github-advanced-security bot found potential problems May 7, 2025

DanielVoogsgerd requested changes May 8, 2025

Lut99 mentioned this pull request May 12, 2025

Adding qualification assertions to brane-chk #271

Open

DanielVoogsgerd force-pushed the lib-refactor branch 2 times, most recently from 1928365 to 6cc01c2 Compare June 10, 2025 16:03

DanielVoogsgerd approved these changes Jun 10, 2025

DanielVoogsgerd mentioned this pull request Jun 16, 2025

Enable brane build for Windows #277

Open

Lut99 and others added 14 commits July 30, 2025 17:40

refactor: Separating out the Policy Store, Policy Reasoner and API

202b867

Added an actually working Dockerfile.let now (at least on Linux)

8bfc9f4

Fixed dangling links in docs

fee4e8c

Clippy fixes

a3db81c

Fixed serde to equal minimum version everywhere

d9a697b

fixup! refactor: Separating out the Policy Store, Policy Reasoner and…

9a12649

… API

temp: Patch policy-reasoner and policy-store to use WIP branches

b491b17

chore(deps): Update rand

798acbc

chore(deps): Lower minimal versions

975a60d

fix: Replace manual async methods with native async methods

797de2b

Also created a reference struct and fixed tracing issues

refactor: Remove From<&?String> implementations for Mergestrategy

c65f05f

These were redundant and took unnecessary ownership

chore: Remove littered commented code

cf036f1

style: Format macros

0a2ec3b

style: Spelling corrections

c2880e2

DanielVoogsgerd and others added 5 commits July 30, 2025 17:40

refactor: Small idiomatic rust stuff

cbe2584

fix: Replace Reference::as_str with Deref impl

9e16baf

chore(deps): Bump axum to version 0.8

d131717

Furthermore, in order to do so: - Move to main branch for policy store - Remove removed generic from instantiated_path

chore: Upgrade tonic to version 0.13

f2812e6

Removed [patch] section

942fc1b

Required an insert of a missing `Send` bound over in `policy-reasoner` land. Also, unintentionally, got a bunch of TOML formats now I switched editors since last time, whoops

DanielVoogsgerd force-pushed the lib-refactor branch from 0226e8b to 942fc1b Compare July 30, 2025 15:40

DanielVoogsgerd and others added 5 commits July 30, 2025 17:43

fix(ci): add lockfile

49cb75a

Pushed tokio-stream to 0.1.16

956adb8

Because `tonic` `0.13.0` requires it, if I read this right

Bumped MSRV for many crates to 1.82

841864d

This because `policy-store` (which `specifications`, among others, depends on) requires it.

(Theoretically) fixed problem with Haskell not compiling

8dfaabf

See the comment, which links to <haskell/hackage-server#547 (comment)> that explains

Fixed tests not passing

f1f0a9f

It was both testing an unsupported test and using a wrong (and brittle) test dir path

Lut99 added 2 commits July 31, 2025 01:34

Really fixed brane-chk compilation this time

904c406

At least, it compiles 🎉

Fixed nightly mismatched_lifetime_syntaxes warnings

c60ebc7

Fixed nightly Clippy warnings

20a0e58

Fixed brane-chk copying eflint-repl from the wrong dir

fbc6fb4

Lut99 merged commit ad288df into main Jul 31, 2025
22 checks passed
