thoughts on effects and typestates

# Thoughts on how to represent stateful utxos

_NOTE:_ This is not about proving, but I'm using the same idea underneath.

## Background

Let's start by defining an object model based on effect systems, inspired by
[the objects in effekt](https://effekt-lang.org/tour/objects). Which is similar
to the way utxos methods work in Starstream's original design.

Note however that we can return these objects instead of having to write everything in CPS.

Basically we want to implement "methods" as effect handlers:

```sc
interface Object {
  foo(): u32
  bar(): String
}

def object = {
  def foo() = {
    1
  }

  def bar() = {
    "bar"
  }
}

let x = object.foo()
let y = object.bar()
```

Where this is the same as:

```sc
try {
  object.foo()
  object.bar()
}
with object: Object {
  def foo() = {
    1
  }

  def bar() = {
    "bar"
  }
}
```

In Starstream, we can implement that with something like this (in rusty
pseudo-code, but imagine wasm):

```rust
let mut (r, k) = yield();
loop {
  (r, k) = match r { // k is the caller
    Object::Foo => {
      resume(k, 1);
    }
    Object::Bar => {
      resume(k, "bar");
    }
  }
}
```

Where `k` is some handle to the coroutine that did call the method (it's
suspened on a `resume` call, and the input here is the output).

## State

With that framework, let's say we want to implement a utxo with a typestate-like
interface.

Let's say with have a utxo (coroutine) with states A and B. Each one with its
own interface. A goes to B, and B goes to A.

```sc
interface A {
  a(): String
}

interface B {
  b(): String
}
```

On the runtime side, this can be implemented by chaining two handlers (TCO can
eliminate recursion):

```rust
struct Storage {}

fn utxo() {
  let storage = Storage::new();

  let (r, k) = yield();

  utxo_a(r, k, &storage);
}

fn utxo_a(r, k, storage: &Storage) {
    let (r, k) = match r {
      A::a => {
        resume(k, "a");
      }
      Star::Supports(i) => {
        resume(k, i == A::HASH);
      } 
      _ => abort("unsupported operation")
    };

    utxo_b(r, k, storage)
}

fn utxo_b(r, k, storage: &Storage) {
    let (r, k) = match r {
      B::b => {
        resume(k, "b");
      }
      Star::Supports(i) => {
        resume(k, i == B::HASH);
      } 
      _ => abort("unsupported operation")
    }

    utxo_a(r, k, storage)
}
```

On the coordination script side you could just do:

```ts
let s1 = utxo.a(); // moves the coroutine to the second yield
let s2 = utxo.b(); // goes back to the first yield
let s1 = utxo.a();
```

Where invoking an unavailable operation will just fail and abort the
transaction.

The main question then becomes how to describe the type of such utxo, for the
purposes of static checking mainly, and also how to make it ergonomic, since it
has to be checked in any case.

## Par-inspired idea

Extend the effect system with trees (for typestates), inspired by session
types and [Par](https://faiface.github.io/par-lang/types_and_expressions.html)
(although the following is not session types, that also requires duality for
deadlock freedom).

I don't think the indexed types idea necessarily applies only to channels,
although it's simpler with named handlers like the boxed computations described
above, than with just contextual effects.

```rust
fn utxo(): Coroutine<Rec{ A.B.Self }> { ... }
```

Where the `.` (dot) means chaining, `Rec` is used to type the `loop` operator
(there could be a different one).

Then this could be used like this:

```rust
let u = utxo();
// u: Coroutine<Rec{ A.B.Self }>
let s = u.a();
// unfold Rec
// u: Coroutine<B.Rec{ A.B.Self }>
let s = u.b();
// back to the beginning
// u: Coroutine<Rec { A.B.Self }>
let s = u.a();
```

I think this is not super hard to type-check. The type of `u` changes across the
function, but I think that could be implemented by updating the type environment
(Γ), like some sort of flow typing, or maybe by desugaring `u` into unique
variables.

This also subsumes affine types, since you could just have `Coroutine<A>`
instead of `Coroutine<Rec{A.Self}>`. Linearity requires also branching types,
which also can be added.

Type inference on the other hand may be tricker though, since (I think) the
usual unification method leads to something like this:

```rust
let s = u.a();
// infer(u): Coroutine<A>
let s = u.b();
// infer(u): Coroutine<A.B>
let s = u.a();
// infer(u): Coroutine<A.B.A>
```

And this may require some sort of subtyping (doable though).

On the other hand though, we may not need to infer this and just get away with
bidirectional typechecking.

Another feature that I think could be useful is some sugar for chaining with
variable binding:

```
let u
  .a() [s1]
  .b() [s2]
```

Instead of

```
let s1 = u.a();
let s2 = u.b();
```

### State

A quick note about the coroutine state. The example above has the same state for
both interfaces. I think probably an ADT would work if each state has different
storage (kind of what async functions are compiled to in rust, where each yield
point is a variant). I'm not sure if we need this though.

### Syntax

Maybe something like this?

I think the hard part is separating lifetime from implementation, mostly.

One limitation is what would happen if you need two different impls of the same
interface but in different states. This is solved by labelling though, just ugly.

```
utxo Utxo {
  main {
    loop {
      with A {
        yield;
      }

      with B {
        yield;
      }
    }
  }

  impl Rec { A.B.Self } {
    fn a() -> String {
      "a"
    }

    fn b() -> String {
      "b"
    }
  }
}

script {
  fn driver(u: Coroutine<Recv { A.B.Self }>) {
    let s1 = u.a();
    let s2 = u.b();
  }
}
```

### Branching:

The weird thing about branching is that it doesn't play that well with the
nominal interfaces.

But let's start with plain unions. Let's say we have this type `Rec { A.End | B.Self }`.

I think it's weird to express this while keeping the idea of separating
lifetime from implementations, if we really want to keep that model. Writing the
coroutine with the matches explicitly inside directly is simpler to read in my
opinion, but if we want to do that, I think we could do something like this:

Perhaps something like this:

```
utxo Utxo {
  main {
    loop {
      offer { // yields
        A => break,
        B => continue,
      }
    }

    return
  }

  impl Rec { (A.End | B.Self) } {
    fn a() {
      "a"
    }

    fn b() {
      "b"
    }
  }
}
```

Where the coroutine gets compiled to:

```rust
let storage = Storage::new();

loop {
  let (r, k) = yield();

  (r, k) = match r {
    A::a => {
      resume(r, k, "a");

      // note that this requires that the utxo to be resumed once more for the
      // coroutine to really end
      //
      // this means End is also an "interface" in a way (the empty one).

      // the other option is to have a `resume` that also deletes the coroutine.

      // return probably needs to be a host call to prove linearity
      return();
    },
    B::b => {
      resume(r, k, "b")
    }
  }
}
```

Also probably want to support an anonymous interface notation, by just using
labels directly or something (see the choice types in Par).

## Simpler constructions?

I think there also simpler constructions if the above is too hard to implement
(although I think it may be worth it eventually since it's more about ergonomics
than safety, since we have to do runtime checking anyway).

The obvious option is to just type the utxo/coroutine with both interfaces (or
the union of all the handlers), and then just do the runtime checking.

Allowing boxed handlers (objects) to return other interfaces could also work in
some cases:

```sc
interface A[C] {
  a(): (Coroutine<C>, String)
}

interface B[C] {
  b(): (Coroutine<C>, String)
}

interface End {}
```

```
fn utxo(): Coroutine<A[B[End]]> { ... }
```

This also requires linear types.

In the case of a utxo, it would just return itself as the next coroutine:

** NOTE ** however that this is not recursive like the example before, since
that's harder to represent without adding some nominal recursion.

It could however be done for fixed interfaces with named loops, where `b`
returns `Coroutine<A[B[End]>`.

```rust
struct Storage {
  self_k: CoroutineId,
}

fn utxo() {
  let self_k = kid();

  let (r, k) = yield();

  utxo_a(r, k);
}

fn utxo_a(storage: &Storage) {
    let (r, k) = match r {
      A::a => {
        resume(k, (storage.self_k, "a"));
      }
      Star::Supports(i) => {
        resume(k, i == A::HASH);
      } 
      _ => abort("unsupported operation")
    };

    utxo_b()
}

fn utxo_b(storage: &Storage) {
    let (r, k) = match r {
      B::b => {
        resume(k, (storage.self_k, "b"));
      }
      Star::Supports(i) => {
        resume(k, i == B::HASH);
      } 
      _ => abort("unsupported operation")
    }
}
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

thoughts on effects and typestates #95

Thoughts on how to represent stateful utxos

Background

State

Par-inspired idea

State

Syntax

Branching:

Simpler constructions?

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

thoughts on effects and typestates #95

Description

Thoughts on how to represent stateful utxos

Background

State

Par-inspired idea

State

Syntax

Branching:

Simpler constructions?

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions