Component Model AST Explainer

This explainer walks through the grammar of a component and the proposed embedding of components into native JavaScript runtimes. For a more user-focused explanation, take a look at the Component Model Documentation.

Gated features
Grammar
Component invariants
JavaScript embedding
- JS API
- ESM-integration
Examples

Gated Features

By default, the features described in this explainer (as well as the supporting Binary.md, WIT.md and CanonicalABI.md) have been implemented and are included in the WASI Preview 2 stability milestone. Features that are not part of Preview 2 are demarcated by one of the emoji symbols listed below; these emojis will be removed once they are implemented, considered stable and included in a future milestone:

🪙: value imports/exports and component-level start function
🪺: nested namespaces and packages in import/export names
🔀: async
- 🚝: marking some builtins as async
- 🚟: using async with canon lift without callback (stackful lift)
🧵: threading built-ins
- 🧵②: shared-everything-threads-based threading built-ins
🔧: fixed-length lists
📝: the error-context type
🔗: canonical interface names

(Based on the previous scoping and layering proposal to the WebAssembly CG, this repo merges and supersedes the module-linking and interface-types proposals, pushing some of their original features into the post-MVP future feature backlog.)

Grammar

This section defines components using an EBNF grammar that parses something in between a pure Abstract Syntax Tree (like the Core WebAssembly spec's Structure Section) and a complete text format (like the Core WebAssembly spec's Text Format Section). The goal is to balance completeness with succinctness, with just enough detail to write examples and define a binary format in the style of the Binary Format Section, deferring full precision to the formal specification.

The main way the grammar hand-waves is regarding definition uses, where indices referring to X definitions (written <Xidx>) should, in the real text format, explicitly allow identifiers (<id>), checking at parse time that the identifier resolves to an X definition and then embedding the resolved index into the AST.

Additionally, standard abbreviations defined by the Core WebAssembly text format (e.g., inline export definitions) are assumed but not explicitly defined below.

Component Definitions

At the top-level, a component is a sequence of definitions of various kinds:

component  ::= (component <id>? <definition>*)
definition ::= core-prefix(<core:module>)
             | core-prefix(<core:instance>)
             | core-prefix(<core:type>)
             | <component>
             | <instance>
             | <alias>
             | <type>
             | <canon>
             | <start> 🪺
             | <import>
             | <export>
             | <value> 🪙

where core-prefix(X) parses '(' 'core' Y ')' when X parses '(' Y ')'

Components are like Core WebAssembly modules in that their contained definitions are acyclic: definitions can only refer to preceding definitions (in the AST, text format and binary format). However, unlike modules, components can arbitrarily interleave different kinds of definitions.

The core-prefix meta-function transforms a grammatical rule for parsing a Core WebAssembly definition into a grammatical rule for parsing the same definition, but with a core token added right after the leftmost paren. For example, core:module accepts (module (func)) so core-prefix(<core:module>) accepts (core module (func)). Note that the inner func doesn't need a core prefix; the core token is used to mark the transition from parsing component definitions into core definitions.

The core:module production is unmodified by the Component Model and thus components embed Core WebAssembly (text and binary format) modules as currently standardized, allowing reuse of an unmodified Core WebAssembly implementation. The next production, core:instance, is not currently included in Core WebAssembly, but would be if Core WebAssembly adopted the module-linking proposal. This new core definition is introduced below, alongside its component-level counterpart. Finally, the existing core:type production is extended below to add core module types as proposed for module-linking. Thus, the overall idea is to represent core definitions (in the AST, binary and text format) as-if they had already been added to Core WebAssembly so that, if they eventually are, the implementation of decoding and validation can be shared in a layered fashion.

The next kind of definition is, recursively, a component itself. Thus, components form trees with all other kinds of definitions only appearing at the leaves. For example, with what's defined so far, we can write the following component:

(component
  (component
    (core module (func (export "one") (result i32) (i32.const 1)))
    (core module (func (export "two") (result f32) (f32.const 2)))
  )
  (core module (func (export "three") (result i64) (i64.const 3)))
  (component
    (component
      (core module (func (export "four") (result f64) (f64.const 4)))
    )
  )
  (component)
)

This top-level component roots a tree with 4 modules and 1 component as leaves. However, in the absence of any instance definitions (introduced next), nothing will be instantiated or executed at runtime; everything here is dead code.

Index Spaces

Like Core WebAssembly, the Component Model places each definition into one of a fixed set of index spaces, allowing the definition to be referred to by subsequent definitions (in the text and binary format) via a nonnegative integral index. When defining, validating and executing a component, there are 5 component-level index spaces:

(component) functions
(component) values 🪙
(component) types
component instances
components

5 core index spaces that also exist in WebAssembly 1.0:

(core) functions
(core) tables
(core) memories
(core) globals
(core) types

and 2 additional core index spaces that contain core definition introduced by the Component Model that are not in WebAssembly 1.0 (yet: the module-linking proposal would add them):

module instances
modules

for a total of 12 index spaces that need to be maintained by an implementation when, e.g., validating a component. These 12 index spaces correspond 1:1 with the terminals of the sort production defined below and thus "sort" and "index space" can be used interchangeably.

Also like Core WebAssembly, the Component Model text format allows identifiers to be used in place of these indices, which are resolved when parsing into indices in the AST (upon which validation and execution is defined). Thus, the following two components are equivalent:

(component
  (core module (; empty ;))
  (component   (; empty ;))
  (core module (; empty ;))
  (export "C" (component 0))
  (export "M1" (core module 0))
  (export "M2" (core module 1))
)

(component
  (core module $M1 (; empty ;))
  (component $C    (; empty ;))
  (core module $M2 (; empty ;))
  (export "C" (component $C))
  (export "M1" (core module $M1))
  (export "M2" (core module $M2))
)

Instance Definitions

Whereas modules and components represent immutable code, instances associate code with potentially-mutable state (e.g., linear memory) and thus are necessary to create before being able to run the code. Instance definitions create module or component instances by selecting a module or component to instantiate and then supplying a set of named arguments which satisfy all the named imports of the selected module or component. This low-level instantiation mechanism allows the Component Model to simultaneously support multiple different styles of traditional linking.

The syntax for defining a core module instance is:

core:instance       ::= (instance <id>? <core:instancexpr>)
core:instanceexpr   ::= (instantiate <core:moduleidx> <core:instantiatearg>*)
                      | <core:inlineexport>*
core:instantiatearg ::= (with <core:name> (instance <core:instanceidx>))
                      | (with <core:name> (instance <core:inlineexport>*))
core:sortidx        ::= (<core:sort> <u32>)
core:sort           ::= func
                      | table
                      | memory
                      | global
                      | type
                      | module
                      | instance
core:inlineexport   ::= (export <core:name> <core:sortidx>)

When instantiating a module via instantiate, the two-level imports of the core modules are resolved as follows:

The first core:name of the import is looked up in the named list of core:instantiatearg to select a core module instance. (In the future, other core:sorts could be allowed if core wasm adds single-level imports.)
The second core:name of the import is looked up in the named list of exports of the core module instance found by the first step to select the imported core definition.

Each core:sort corresponds 1:1 with a distinct index space that contains only core definitions of that sort. The u32 field of core:sortidx indexes into the sort's associated index space to select a definition.

Based on this, we can link two core modules $A and $B together with the following component:

(component
  (core module $A
    (func (export "one") (result i32) (i32.const 1))
  )
  (core module $B
    (func (import "a" "one") (result i32))
  )
  (core instance $a (instantiate $A))
  (core instance $b (instantiate $B (with "a" (instance $a))))
)

To see examples of other sorts, we'll need alias definitions, which are introduced in the next section.

The <core:inlineexport>* form of core:instanceexpr allows module instances to be created by directly tupling together preceding definitions, without the need to instantiate a helper module. The <core:inlineexport>* form of core:instantiatearg is syntactic sugar that is expanded during text format parsing into an out-of-line instance definition referenced by with. To show an example of these, we'll also need the alias definitions introduced in the next section.

The syntax for defining component instances is symmetric to core module instances, but with an expanded component-level definition of sort:

instance       ::= (instance <id>? <instanceexpr>)
instanceexpr   ::= (instantiate <componentidx> <instantiatearg>*)
                 | <inlineexport>*
instantiatearg ::= (with <name> <sortidx>)
                 | (with <name> (instance <inlineexport>*))
name           ::= <core:name>
sortidx        ::= (<sort> <u32>)
sort           ::= core <core:sort>
                 | func
                 | value 🪙
                 | type
                 | component
                 | instance
inlineexport   ::= (export "<exportname>" <sortidx>)
                 | (export "<exportname>" <versionsuffix> <sortidx>) 🔗

Because component-level function, type and instance definitions are different than core-level function, type and instance definitions, they are put into disjoint index spaces which are indexed separately. Components may import and export various core definitions (when they are compatible with the shared-nothing model, which currently means only module, but may in the future include data). Thus, component-level sort injects the full set of core:sort, so that they may be referenced (leaving it up to validation rules to throw out the core sorts that aren't allowed in various contexts).

The name production reuses the core:name quoted-string-literal syntax of Core WebAssembly (which appears in core module imports and exports and can contain any valid UTF-8 string).

🪙 The value sort refers to a value that is provided and consumed during instantiation. How this works is described in the value definitions section.

To see a non-trivial example of component instantiation, we'll first need to introduce a few other definitions below that allow components to import, define and export component functions.

Alias Definitions

Alias definitions project definitions out of other components' index spaces and into the current component's index spaces. As represented in the AST below, there are three kinds of "targets" for an alias: the export of a component instance, the core export of a core module instance and a definition of an outer component (containing the current component):

alias            ::= (alias <aliastarget> (<sort> <id>?))
aliastarget      ::= export <instanceidx> <name>
                   | core export <core:instanceidx> <core:name>
                   | outer <u32> <u32>

If present, the id of the alias is bound to the new index added by the alias and can be used anywhere a normal id can be used.

In the case of export aliases, validation ensures name is an export in the target instance and has a matching sort.

In the case of outer aliases, the u32 pair serves as a de Bruijn index, with first u32 being the number of enclosing components/modules to skip and the second u32 being an index into the target's sort's index space. In particular, the first u32 can be 0, in which case the outer alias refers to the current component. To maintain the acyclicity of module instantiation, outer aliases are only allowed to refer to preceding outer definitions.

Components containing outer aliases effectively produce a closure at instantiation time, including a copy of the outer-aliased definitions. Because of the prevalent assumption that components are immutable values, outer aliases are restricted to only refer to immutable definitions: non-resource types, modules and components. (In the future, outer aliases to all sorts of definitions could be allowed by recording the statefulness of the resulting component in its type via some kind of "stateful" type attribute.)

Both kinds of aliases come with syntactic sugar for implicitly declaring them inline:

For export aliases, the inline sugar extends the definition of sortidx and the various sort-specific indices:

sortidx     ::= (<sort> <u32>)          ;; as above
              | <inlinealias>
Xidx        ::= <u32>                   ;; as above
              | <inlinealias>
inlinealias ::= (<sort> <u32> <name>+)

If <sort> refers to a <core:sort>, then the <u32> of inlinealias is a <core:instanceidx>; otherwise it's an <instanceidx>. For example, the following snippet uses two inline function aliases:

(instance $j (instantiate $J (with "f" (func $i "f"))))
(export "x" (func $j "g" "h"))

which are desugared into:

(alias export $i "f" (func $f_alias))
(instance $j (instantiate $J (with "f" (func $f_alias))))
(alias export $j "g" (instance $g_alias))
(alias export $g_alias "h" (func $h_alias))
(export "x" (func $h_alias))

For outer aliases, the inline sugar is simply the identifier of the outer definition, resolved using normal lexical scoping rules. For example, the following component:

(component
  (component $C ...)
  (component
    (instance (instantiate $C))
  )
)

is desugared into:

(component $Parent
  (component $C ...)
  (component
    (alias outer $Parent $C (component $Parent_C))
    (instance (instantiate $Parent_C))
  )
)

Lastly, for symmetry with imports, aliases can be written in an inverted form that puts the sort first:

    (func $f (import "i" "f") ...type...) ≡ (import "i" "f" (func $f ...type...))   (WebAssembly 1.0)
          (func $f (alias export $i "f")) ≡ (alias export $i "f" (func $f))
   (core module $m (alias export $i "m")) ≡ (alias export $i "m" (core module $m))
(core func $f (alias core export $i "f")) ≡ (alias core export $i "f" (core func $f))

With what's defined so far, we're able to link modules with arbitrary renamings:

(component
  (core module $A
    (func (export "one") (result i32) (i32.const 1))
    (func (export "two") (result i32) (i32.const 2))
    (func (export "three") (result i32) (i32.const 3))
  )
  (core module $B
    (func (import "a" "one") (result i32))
  )
  (core instance $a (instantiate $A))
  (core instance $b1 (instantiate $B
    (with "a" (instance $a))                      ;; no renaming
  ))
  (core func $a_two (alias core export $a "two")) ;; ≡ (alias core export $a "two" (core func $a_two))
  (core instance $b2 (instantiate $B
    (with "a" (instance
      (export "one" (func $a_two))                ;; renaming, using out-of-line alias
    ))
  ))
  (core instance $b3 (instantiate $B
    (with "a" (instance
      (export "one" (func $a "three"))            ;; renaming, using <inlinealias>
    ))
  ))
)

To show analogous examples of linking components, we'll need component-level type and function definitions which are introduced in the next two sections.

Type Definitions

The syntax for defining core types extends the existing core type definition syntax, adding a module type constructor:

core:rectype     ::= ... from the Core WebAssembly spec
core:typedef     ::= ... from the Core WebAssembly spec
core:subtype     ::= ... from the Core WebAssembly spec
core:comptype    ::= ... from the Core WebAssembly spec
                   | <core:moduletype>
core:moduletype  ::= (module <core:moduledecl>*)
core:moduledecl  ::= <core:importdecl>
                   | <core:type>
                   | <core:alias>
                   | <core:exportdecl>
core:alias       ::= (alias <core:aliastarget> (<core:sort> <id>?))
core:aliastarget ::= outer <u32> <u32>
core:importdecl  ::= (import <core:name> <core:name> <core:importdesc>)
core:exportdecl  ::= (export <core:name> <core:exportdesc>)
core:exportdesc  ::= strip-id(<core:importdesc>)

where strip-id(X) parses '(' sort Y ')' when X parses '(' sort <id>? Y ')'

Here, core:comptype (short for "composite type") as defined in the GC proposal is extended with a module type constructor. The GC proposal also adds recursion and explicit subtyping between core wasm types. Owing to their different requirements and intended modes of usage, module types support implicit subtyping and are not recursive. Thus, the existing core validation rules would require the declared supertypes of module types to be empty and disallow recursive use of module types.

In the MVP, validation will also reject core:moduletype defining or aliasing other core:moduletypes, since, before module-linking, core modules cannot themselves import or export other core modules.

The body of a module type contains an ordered list of "module declarators" which describe, at a type level, the imports and exports of the module. In a module-type context, import and export declarators can both reuse the existing core:importdesc production defined in WebAssembly 1.0, with the only difference being that, in the text format, core:importdesc can bind an identifier for later reuse while core:exportdesc cannot.

With the Core WebAssembly type-imports, module types will need the ability to define the types of exports based on the types of imports. In preparation for this, module types start with an empty type index space that is populated by type declarators, so that, in the future, these type declarators can refer to type imports local to the module type itself. For example, in the future, the following module type would be expressible:

(component $C
  (core type $M (module
    (import "" "T" (type $T))
    (type $PairT (struct (field (ref $T)) (field (ref $T))))
    (export "make_pair" (func (param (ref $T)) (result (ref $PairT))))
  ))
)

In this example, $M has a distinct type index space from $C, where element 0 is the imported type, element 1 is the struct type, and element 2 is an implicitly-created func type referring to both.

Lastly, the core:alias module declarator allows a module type definition to reuse (rather than redefine) type definitions in the enclosing component's core type index space via outer type alias. In the MVP, validation restricts core:alias module declarators to only allow outer type aliases (into an enclosing component's or component-type's core type index space). In the future, more kinds of aliases would be meaningful and allowed.

As an example, the following component defines two semantically-equivalent module types, where the former defines the function type via type declarator and the latter refers via alias declarator.

(component $C
  (core type $C1 (module
    (type (func (param i32) (result i32)))
    (import "a" "b" (func (type 0)))
    (export "c" (func (type 0)))
  ))
  (core type $F (func (param i32) (result i32)))
  (core type $C2 (module
    (alias outer $C $F (type))
    (import "a" "b" (func (type 0)))
    (export "c" (func (type 0)))
  ))
)

Component-level type definitions are symmetric to core-level type definitions, but use a completely different set of value types. Unlike core:valtype which is low-level and assumes a shared linear memory for communicating compound values, component-level value types assume no shared memory and must therefore be high-level, describing entire compound values.

type          ::= (type <id>? <deftype>)
deftype       ::= <defvaltype>
                | <resourcetype>
                | <functype>
                | <componenttype>
                | <instancetype>
defvaltype    ::= bool
                | s8 | u8 | s16 | u16 | s32 | u32 | s64 | u64
                | f32 | f64
                | char | string
                | error-context 📝
                | (record (field "<label>" <valtype>)+)
                | (variant (case "<label>" <valtype>?)+)
                | (list <valtype>)
                | (list <valtype> <u32>) 🔧
                | (tuple <valtype>+)
                | (flags "<label>"+)
                | (enum "<label>"+)
                | (option <valtype>)
                | (result <valtype>? (error <valtype>)?)
                | (own <typeidx>)
                | (borrow <typeidx>)
                | (stream <typeidx>?) 🔀
                | (future <typeidx>?) 🔀
valtype       ::= <typeidx>
                | <defvaltype>
resourcetype  ::= (resource (rep i32) (dtor <funcidx>)?)
                | (resource (rep i32) (dtor async <funcidx> (callback <funcidx>)?)?) 🚝
functype      ::= (func async? (param "<label>" <valtype>)* (result <valtype>)?)
componenttype ::= (component <componentdecl>*)
instancetype  ::= (instance <instancedecl>*)
componentdecl ::= <importdecl>
                | <instancedecl>
instancedecl  ::= core-prefix(<core:type>)
                | <type>
                | <alias>
                | <exportdecl>
                | <value> 🪙
importdecl    ::= (import "<importname>" bind-id(<externdesc>))
                | (import "<importname>" <versionsuffix> bind-id(<externdesc>)) 🔗
exportdecl    ::= (export "<exportname>" bind-id(<externdesc>))
                | (export "<exportname>" <versionsuffix> bind-id(<externdesc>)) 🔗
externdesc    ::= (<sort> (type <u32>) )
                | core-prefix(<core:moduletype>)
                | <functype>
                | <componenttype>
                | <instancetype>
                | (value <valuebound>) 🪙
                | (type <typebound>)
typebound     ::= (eq <typeidx>)
                | (sub resource)
valuebound    ::= (eq <valueidx>) 🪙
                | <valtype> 🪙

where bind-id(X) parses '(' sort <id>? Y ')' when X parses '(' sort Y ')'

Because there is nothing in this type grammar analogous to the gc proposal's rectype, none of these types are recursive.

Fundamental value types

The value types in valtype can be broken into two categories: fundamental value types and specialized value types, where the latter are defined by expansion into the former. The fundamental value types have the following sets of abstract values:

Type	Values
`bool`	`true` and `false`
`s8`, `s16`, `s32`, `s64`	integers in the range [-2^N-1, 2^N-1-1]
`u8`, `u16`, `u32`, `u64`	integers in the range [0, 2^N-1]
`f32`, `f64`	IEEE754 floating-point numbers, with a single NaN value
`char`	Unicode Scalar Values
`error-context` 📝	an immutable, non-deterministic, host-defined value meant to aid in debugging
`record`	heterogeneous tuples of named values
`variant`	heterogeneous tagged unions of named values
`list`	homogeneous, variable- or fixed-length sequences of values
`own`	a unique, opaque address of a resource that will be destroyed when this value is dropped
`borrow`	an opaque address of a resource that must be dropped before the current export call returns
`stream` 🔀	an asynchronously-passed list of homogeneous values
`future` 🔀	an asynchronously-passed single value

How these abstract values are produced and consumed from Core WebAssembly values and linear memory is configured by the component via canonical lifting and lowering definitions, which are introduced below. For example, while abstract variants contain a list of cases labelled by name, canonical lifting and lowering map each case to an i32 value starting at 0.

Numeric types

While core numeric types are defined in terms of sets of bit-patterns and operations that interpret the bits in various ways, component-level numeric types are defined in terms of sets of values. This allows the values to be translated between source languages and protocols that use different value representations.

Core integer types are just bit-patterns that don't distinguish between signed and unsigned, while component-level integer types are sets of integers that either include negative values or don't. Core floating-point types have many distinct NaN bit-patterns, while component-level floating-point types have only a single NaN value. And boolean values in core wasm are usually represented as i32s where operations interpret all-zeros as false, while at the component-level there is a bool type with true and false values.

📝 Error Context type

Values of error-context type are immutable, non-deterministic, host-defined and meant to be propagated from failure sources to callers in order to aid in debugging. Currently error-context values contain only a "debug message" string whose contents are determined by the host. Core wasm can create error-context values given a debug string, but the host is free to arbitrarily transform (discard, preserve, prefix or suffix) this wasm-provided string. In the future, error-context could be enhanced with other additional or more-structured context (like a backtrace or a chain of originating error contexts).

The intention of this highly-non-deterministic semantics is to provide hosts the full range of flexibility to:

append a basic callstack suitable for forensic debugging in production;
optimize for performance in high-volume production scenarios by slicing or discarding debug messages;
optimize for developer experience in debugging scenarios when debug metadata is present by appending expensive-to-produce symbolicated callstacks.

A consequence of this, however, is that components must not depend on the contents of error-context values for behavioral correctness. In particular, case analysis of the contents of an error-context should not determine error recovery; explicit result or variant types must be used in the function return type instead (e.g., (func (result (tuple (stream u8) (future $my-error)))).

Container types

The record, variant, and list types allow for grouping, categorizing, and sequencing contained values.

🔧 When the optional <u32> immediate of the list type constructor is present, the list has a fixed length and the representation of the list in memory is specialized to this length. Note that the fixed length must be larger than 0.

Handle types

The own and borrow value types are both handle types. Handles logically contain the opaque address of a resource and avoid copying the resource when passed across component boundaries. By way of metaphor to operating systems, handles are analogous to file descriptors, which are stored in a table and may only be used indirectly by untrusted user-mode processes via their integer index in the table.

In the Component Model, handles are lifted-from and lowered-into i32 values that index an encapsulated per-component-instance table that is maintained by the canonical function definitions described below. In the future, handles could be backwards-compatibly lifted and lowered from reference types (via the addition of a new canonopt, as introduced below).

The uniqueness and dropping conditions mentioned above are enforced at runtime by the Component Model through these canonical definitions. The typeidx immediate of a handle type must refer to a resource type (described below) that statically classifies the particular kinds of resources the handle can point to.

Asynchronous value types

The stream and future value types are both asynchronous value types that are used to deliver values incrementally over the course of a single async function call, instead of copying the values all-at-once as with other (synchronous) value types like list. The mechanism for performing these incremental copies avoids the need for intermediate buffering inside the stream or future value itself and instead uses buffers of memory whose size and allocation is controlled by the core wasm in the source and destination components. Thus, in the abstract, stream and future can be thought of as inter-component control-flow or synchronization mechanisms.

Just like with handles, in the Component Model, async value types are lifted-from and lowered-into i32 values that index an encapsulated per-component-instance table that is maintained by the canonical ABI built-ins below. The Component-Model-defined ABI for creating, writing-to and reading-from stream and future values is meant to be bound to analogous source-language features like promises, futures, streams, iterators, generators and channels so that developers can use these familiar high-level concepts when working directly with component types, without the need to manually write low-level async glue code. For languages like C without language-level concurrency support, these ABIs (described in detail in the Canonical ABI explainer) can be exposed directly as function imports and used like normal low-level Operation System I/O APIs.

A stream<T> asynchronously passes zero or more T values in one direction between a source and destination, batched in chunks for efficiency. Streams are useful for:

improving latency by incrementally processing values as they arrive;
delivering potentially-large lists of values that might OOM wasm if passed as a list<T>;
long-running or infinite streams of events.

A future<T> asynchronously delivers exactly one T value from a source to a destination, unless the destination signals that it doesn't want the T value any more. When function types contain the async effect, the return value is returned asynchronously if the call blocks (and is returned synchronously otherwise) and thus there is no need to additionally return a future<T> value unless additional asynchronous signalling is required.

The T element type of stream and future is an optional valtype. As with variant-case payloads and function results, when T is absent, the "value(s)" being asynchronously passed can be thought of as unit values. In such cases, there is no representation of the value in Core WebAssembly (pointers into linear memory are ignored) however the timing of completed reads and writes and the number of elements they contain are observable and meaningful. Thus, empty futures and streams can be useful for timing-related APIs.

Currently, validation rejects (stream T) and (future T) when T transitively contains a borrow. This restriction could be relaxed in the future by extending the call-scoping rules of borrow to streams and futures. Additionally, (stream char) is temporarily rejected with a future TODO to allow and properly use the string-encoding, taking care to not split code points.

Specialized value types

The sets of values allowed for the remaining specialized value types are defined by the following mapping:

                    (tuple <valtype>*) ↦ (record (field "𝒊" <valtype>)*) for 𝒊=0,1,...
                    (flags "<label>"*) ↦ (record (field "<label>" bool)*)
                     (enum "<label>"+) ↦ (variant (case "<label>")+)
                    (option <valtype>) ↦ (variant (case "none") (case "some" <valtype>))
(result <valtype>? (error <valtype>)?) ↦ (variant (case "ok" <valtype>?) (case "error" <valtype>?))
                                string ↦ (list char)

Specialized value types have the same set of semantic values as their corresponding despecialized types, but have distinct type constructors (which are not type-equal to the unspecialized type constructors) and thus have distinct binary encodings. This allows specialized value types to convey a more specific intent. For example, result isn't just a variant, it's a variant that means success or failure, so source-code bindings can expose it via idiomatic source-language error reporting. Additionally, this can sometimes allow values to be represented differently. For example, string in the Canonical ABI uses various Unicode encodings while list<char> uses a sequence of 4-byte char code points. Similarly, flags in the Canonical ABI uses a bit-vector while an equivalent record of boolean fields uses a sequence of boolean-valued bytes.

Note that, at least initially, variants are required to have a non-empty list of cases. This could be relaxed in the future to allow an empty list of cases, with the empty (variant) effectively serving as an empty type and indicating unreachability.

Definition types

The remaining 4 type constructors in deftype use valtype to describe shared-nothing functions, resources, components, and component instances:

The func type constructor describes a component-level function definition that takes a list of valtype parameters with strongly-unique names and optionally returns a valtype. The optional async effect type indicates that calling the function may block (because the function transitively calls a blocking built-in like waitable-set.wait or blocks on an imported async function).

The resource type constructor creates a fresh type for each instance of the containing component (with "freshness" and its interaction with general type-checking described in more detail below). Resource types can be referred to by handle types (such as own and borrow) as well as the canonical built-ins described below. The rep immediate of a resource type specifies its core representation type, which is currently fixed to i32, but will be relaxed in the future (to at least include i64, but also potentially other types). When the last handle to a resource is dropped, the resource's destructor function specified by the dtor immediate will be called (if present), allowing the implementing component to perform clean-up like freeing linear memory allocations. Destructors can be declared async, with the same meaning for the async and callback immediates as described below for canon lift.

The instance type constructor describes a list of named, typed definitions that can be imported or exported by a component. Informally, instance types correspond to the usual concept of an "interface" and instance types thus serve as static interface descriptions. In addition to the S-Expression text format defined here, which is meant to go inside component definitions, interfaces can also be defined as standalone, human-friendly text files in the wit Interface Definition Language.

The component type constructor is symmetric to the core module type constructor and contains two lists of named definitions for the imports and exports of a component, respectively. As suggested above, instance types can show up in both the import and export types of a component type.

Both instance and component type constructors are built from a sequence of "declarators", of which there are four kinds—type, alias, import and export—where only component type constructors can contain import declarators. The meanings of these declarators is basically the same as the core module declarators introduced above, but expanded to cover the additional capabilities of the component model.

Declarators

The importdecl and exportdecl declarators correspond to component import and export definitions, respectively, allowing an identifier to be bound for use by subsequent declarators. The definitions of label, importname and exportname are given in the imports and exports section below. Following the precedent of core:typeuse, the text format allows both references to out-of-line type definitions (via (type <typeidx>)) and inline type expressions that the text format desugars into out-of-line type definitions.

🪙 The value case of externdesc describes a runtime value that is imported or exported at instantiation time as described in the value definitions section below.

The type case of externdesc describes an imported or exported type along with its "bound":

The sub bound declares that the imported/exported type is an abstract type which is a subtype of some other type. Currently, the only supported bound is resource which (following the naming conventions of the GC proposal) means "any resource type". Thus, only resource types can be imported/exported abstractly, not arbitrary value types. This allows type imports to always be compiled independently of their arguments using a "universal representation" for handle values (viz., i32, as defined by the Canonical ABI). In the future, sub may be extended to allow referencing other resource types, thereby allowing abstract resource subtyping.

The eq bound says that the imported/exported type must be structurally equal to some preceding type definition. This allows:

an imported abstract type to be re-exported;
components to introduce another label for a preceding abstract type (which can be necessary when implementing multiple independent interfaces with the same resource); and
components to attach transparent type aliases to structural types to be reflected in source-level bindings (e.g., (export "bytes" (type (eq (list u64)))) could generate in C++ a typedef std::vector<uint64_t> bytes or in JS an exported field named bytes that aliases Uint64Array.

Relaxing the restrictions of core:alias declarators mentioned above, alias declarators allow both outer and export aliases of type and instance sorts. This allows the type exports of instance-typed import and export declarators to be used by subsequent declarators in the type:

(component
  (import "fancy-fs" (instance $fancy-fs
    (export "fs" (instance $fs
      (export "file" (type (sub resource)))
      ;; ...
    ))
    (alias export $fs "file" (type $file))
    (export "fancy-op" (func (param "f" (borrow $file))))
  ))
)

The type declarator is restricted by validation to disallow resource type definitions, thereby preventing "private" resource type definitions from appearing in component types and avoiding the avoidance problem. Thus, the only resource types possible in an instancetype or componenttype are introduced by importdecl or exportdecl.

With what's defined so far, we can define component types using a mix of type definitions:

(component $C
  (type $T (list (tuple string bool)))
  (type $U (option $T))
  (type $G (func (param "x" (list $T)) (result $U)))
  (type $D (component
    (alias outer $C $T (type $C_T))
    (type $L (list $C_T))
    (import "f" (func (param "x" $L) (result (list u8))))
    (import "g" (func (type $G)))
    (export "g2" (func (type $G)))
    (export "h" (func (result $U)))
    (import "T" (type $T (sub resource)))
    (import "i" (func (param "x" (list (own $T)))))
    (export "T2" (type $T' (eq $T)))
    (export "U" (type $U' (sub resource)))
    (export "j" (func (param "x" (borrow $T')) (result (own $U'))))
  ))
)

Note that the inline use of $G and $U are syntactic sugar for outer aliases.

Type Checking

Like core modules, components have an up-front validation phase in which the definitions of a component are checked for basic consistency. Type checking is a central part of validation and, e.g., occurs when validating that the with arguments of an instantiate expression are type-compatible with the imports of the component being instantiated.

To incrementally describe how type-checking works, we'll start by asking how type equality works for non-resource, non-handle, local type definitions and build up from there.

Type equality for almost all types (except as described below) is purely structural. In a structural setting, types are considered to be Abstract Syntax Trees whose nodes are type constructors with types like u8 and string considered to be "nullary" type constructors that appear at leaves and non-nullary type constructors like list and record appearing at parent nodes. Then, type equality is defined to be AST equality. Importantly, these type ASTs do not contain any type indices or depend on index space layout; these binary format details are consumed by decoding to produce the AST. For example, in the following compound component:

(component $A
  (type $ListString1 (list string))
  (type $ListListString1 (list $ListString1))
  (type $ListListString2 (list $ListString1))
  (component $B
    (type $ListString2 (list string))
    (type $ListListString3 (list $ListString2))
    (type $ListString3 (alias outer $A $ListString1))
    (type $ListListString4 (list $ListString3))
    (type $ListListString5 (alias outer $A $ListListString1))
  )
)

all 5 variations of $ListListStringX are considered equal since, after decoding, they all have the same AST.

Next, the type equality relation on ASTs is relaxed to a more flexible subtyping relation. Currently, subtyping is only relaxed for instance and component types, but may be relaxed for more type constructors in the future to better support API Evolution (being careful to understand how subtyping manifests itself in the wide variety of source languages so that subtype-compatible updates don't inadvertently break source-level clients).

Component and instance subtyping allows a subtype to export more and import less than is declared by the supertype, ignoring the exact order of imports and exports and considering only names. For example, here, $I1 is a subtype of $I2:

(component
  (type $I1 (instance
    (export "foo" (func))
    (export "bar" (func))
    (export "baz" (func))
  ))
  (type $I2 (instance
    (export "bar" (func))
    (export "foo" (func))
  ))
)

and $C1 is a subtype of $C2:

(component
  (type $C1 (component
    (import "a" (func))
    (export "x" (func))
    (export "y" (func))
  ))
  (type $C2 (component
    (import "a" (func))
    (import "b" (func))
    (export "x" (func))
  ))
)

🔗 Note that canonical interface names may be annotated with a versionsuffix which is ignored for type checking except to improve diagnostic messages.

When we next consider type imports and exports, there are two distinct subcases of typebound to consider: eq and sub.

The eq bound adds a type equality rule (extending the built-in set of subtyping rules mentioned above) saying that the imported type is structurally equivalent to the type referenced in the bound. For example, in the component:

(component
  (type $L1 (list u8))
  (import "L2" (type $L2 (eq $L1)))
  (import "L3" (type $L2 (eq $L1)))
  (import "L4" (type $L2 (eq $L3)))
)

all four $L* types are equal (in subtyping terms, they are all subtypes of each other).

In contrast, the sub bound introduces a new abstract type which the rest of the component must conservatively assume can be any type that is a subtype of the bound. What this means for type-checking is that each subtype-bound type import/export introduces a fresh abstract type that is unequal to every preceding type definition. Currently (and likely in the MVP), the only supported type bound is resource (which means "any resource type") and thus the only abstract types are abstract resource types. As an example, in the following component:

(component
  (import "T1" (type $T1 (sub resource)))
  (import "T2" (type $T2 (sub resource)))
)

the types $T1 and $T2 are not equal.

Once a type is imported, it can be referred to by subsequent equality-bound type imports, thereby adding more types that it is equal to. For example, in the following component:

(component $C
  (import "T1" (type $T1 (sub resource)))
  (import "T2" (type $T2 (sub resource)))
  (import "T3" (type $T3 (eq $T2)))
  (type $ListT1 (list (own $T1)))
  (type $ListT2 (list (own $T2)))
  (type $ListT3 (list (own $T3)))
)

the types $T2 and $T3 are equal to each other but not to $T1. By the above transitive structural equality rules, the types $List2 and $List3 are equal to each other but not to $List1.

Handle types (own and borrow) are structural types (like list) but, since they refer to resource types, transitively "inherit" the freshness of abstract resource types. For example, in the following component:

(component
  (import "T" (type $T (sub resource)))
  (import "U" (type $U (sub resource)))
  (type $Own1 (own $T))
  (type $Own2 (own $T))
  (type $Own3 (own $U))
  (type $ListOwn1 (list $Own1))
  (type $ListOwn2 (list $Own2))
  (type $ListOwn3 (list $Own3))
  (type $Borrow1 (borrow $T))
  (type $Borrow2 (borrow $T))
  (type $Borrow3 (borrow $U))
  (type $ListBorrow1 (list $Borrow1))
  (type $ListBorrow2 (list $Borrow2))
  (type $ListBorrow3 (list $Borrow3))
)

the types $Own1 and $Own2 are equal to each other but not to $Own3 or any of the $Borrow*. Similarly, $Borrow1 and $Borrow2 are equal to each other but not $Borrow3. Transitively, the types $ListOwn1 and $ListOwn2 are equal to each other but not $ListOwn3 or any of the $ListBorrow*. These type-checking rules for type imports mirror the introduction rule of universal types (∀T).

The above examples all show abstract types in terms of imports, but the same "freshness" condition applies when aliasing the exports of another component as well. For example, in this component:

(component
  (import "C" (component $C
    (export "T1" (type (sub resource)))
    (export "T2" (type $T2 (sub resource)))
    (export "T3" (type (eq $T2)))
  ))
  (instance $c (instantiate $C))
  (alias export $c "T1" (type $T1))
  (alias export $c "T2" (type $T2))
  (alias export $c "T3" (type $T3))
)

the types $T2 and $T3 are equal to each other but not to $T1. These type-checking rules for aliases of type exports mirror the elimination rule of existential types (∃T).

Next, we consider resource type definitions which are a third source of abstract types. Unlike the abstract types introduced by type imports and exports, resource type definitions provide canonical built-ins for setting and getting a resource's private representation value (that are introduced below). These built-ins are necessarily scoped to the component instance that generated the resource type, thereby hiding access to a resource type's representation from the outside world. Because each component instantiation generates fresh resource types distinct from all preceding instances of the same component, resource types are ["generative"].

For example, in the following example component:

(component
  (type $R1 (resource (rep i32)))
  (type $R2 (resource (rep i32)))
  (func $f1 (result (own $R1)) (canon lift ...))
  (func $f2 (param (own $R2)) (canon lift ...))
)

the types $R1 and $R2 are unequal and thus the return type of $f1 is incompatible with the parameter type of $f2.

The generativity of resource type definitions matches the abstract typing rules of type exports mentioned above, which force all clients of the component to bind a fresh abstract type. For example, in the following component:

(component
  (component $C
    (type $r1 (export "r1") (resource (rep i32)))
    (type $r2 (export "r2") (resource (rep i32)))
  )
  (instance $c1 (instantiate $C))
  (instance $c2 (instantiate $C))
  (type $c1r1 (alias export $c1 "r1"))
  (type $c1r2 (alias export $c1 "r2"))
  (type $c2r1 (alias export $c2 "r1"))
  (type $c2r2 (alias export $c2 "r2"))
)

all four types aliases in the outer component are unequal, reflecting the fact that each instance of $C generates two fresh resource types.

If a single resource type definition is exported more than once, the exports after the first are equality-bound to the first export. For example, the following component:

(component
  (type $r (resource (rep i32)))
  (export "r1" (type $r))
  (export "r2" (type $r))
)

is assigned the following componenttype:

(component
  (export "r1" (type $r1 (sub resource)))
  (export "r2" (type (eq $r1)))
)

Thus, from an external perspective, r1 and r2 are two labels for the same type.

If a component wants to hide this fact and force clients to assume r1 and r2 are distinct types (thereby allowing the implementation to actually use separate types in the future without breaking clients), an explicit type can be ascribed to the export that replaces the eq bound with a less-precise sub bound (using syntax introduced below).

(component
  (type $r (resource (rep i32)))
  (export "r1" (type $r))
  (export "r2" (type $r) (type (sub resource)))
)

This component is assigned the following componenttype:

(component
  (export "r1" (type (sub resource)))
  (export "r2" (type (sub resource)))
)

The assignment of this type to the above component mirrors the introduction rule of existential types (∃T).

When supplying a resource type (imported or defined) to a type import via instantiate, type checking performs a substitution, replacing all uses of the import in the instantiated component with the actual type supplied via with. For example, the following component validates:

(component $P
  (import "C1" (component $C1
    (import "T" (type $T (sub resource)))
    (export "foo" (func (param (own $T))))
  ))
  (import "C2" (component $C2
    (import "T" (type $T (sub resource)))
    (import "foo" (func (param (own $T))))
  ))
  (type $R (resource (rep i32)))
  (instance $c1 (instantiate $C1 (with "T" (type $R))))
  (alias export $c1 "foo" (func $foo))
  (instance $c2 (instantiate $C2 (with "T" (type $R)) (with "foo" (func $foo))))
)

This depends critically on the T imports of $C1 and $C2 having been replaced by $R when validating the instantiations of $c1 and $c2. These type-checking rules for instantiating type imports mirror the elimination rule of universal types (∀T).

Importantly, this type substitution performed by the parent is not visible to the child at validation- or run-time. In particular, there are no runtime casts that can "see through" to the original type parameter, avoiding avoiding the usual type-exposure problems with dynamic casts.

In summary: all type constructors are structural with the exception of resource, which is abstract and generative. Type imports and exports that have a subtype bound also introduce abstract types and follow the standard introduction and elimination rules of universal and existential types.

Lastly, since "nominal" is often taken to mean "the opposite of structural", a valid question is whether any of the above is "nominal typing". Inside a component, resource types act "nominally": each resource type definition produces a new local "name" for a resource type that is distinct from all preceding resource types. The interesting case is when resource type equality is considered from outside the component, particularly when a single component is instantiated multiple times. In this case, a single resource type definition that is exported with a single exportname will get a fresh type with each component instance, with the abstract typing rules mentioned above ensuring that each of the component's instance's resource types are kept distinct. Thus, in a sense, the generativity of resource types generalizes traditional name-based nominal typing, providing a finer granularity of isolation than otherwise achievable with a shared global namespace.

Canonical Definitions

From the perspective of Core WebAssembly running inside a component, the Component Model is an embedder. As such, the Component Model defines the Core WebAssembly imports passed to module_instantiate and how Core WebAssembly exports are called via func_invoke. This allows the Component Model to specify how core modules are linked together (as shown above) but it also allows the Component Model to arbitrarily synthesize Core WebAssembly functions (via func_alloc) that are imported by Core WebAssembly. These synthetic core functions are created via one of several canonical definitions defined below.

Canonical ABI

To implement or call a component-level function, we need to cross a shared-nothing boundary. Traditionally, this problem is solved by defining a serialization format. The Component Model MVP uses roughly this same approach, defining a linear-memory-based ABI called the "Canonical ABI" which specifies, for any functype, a corresponding core:functype and rules for copying values into and out of linear memory. The Component Model differs from traditional approaches, though, in that the ABI is configurable, allowing multiple different memory representations of the same abstract value. In the MVP, this configurability is limited to the small set of canonopt shown below. However, Post-MVP, adapter functions could be added to allow far more programmatic control.

The Canonical ABI is explicitly applied to "wrap" existing functions in one of two directions:

lift wraps a core function (of type core:functype) to produce a component function (of type functype) that can be passed to other components.
lower wraps a component function (of type functype) to produce a core function (of type core:functype) that can be imported and called from Core WebAssembly code inside the current component.

Canonical definitions specify one of these two wrapping directions, the function to wrap and a list of configuration options:

canon    ::= (canon lift core-prefix(<core:funcidx>) <canonopt>* bind-id(<externdesc>))
           | (canon lower <funcidx> <canonopt>* (core func <id>?))
canonopt ::= string-encoding=utf8
           | string-encoding=utf16
           | string-encoding=latin1+utf16
           | (memory <core:memidx>)
           | (realloc <core:funcidx>)
           | (post-return <core:funcidx>)
           | async 🔀
           | (callback <core:funcidx>) 🔀

While the production externdesc accepts any sort, the validation rules for canon lift would only allow the func sort. In the future, other sorts may be added (viz., types), hence the explicit sort.

The string-encoding option specifies the encoding the Canonical ABI will use for the string type. The latin1+utf16 encoding captures a common string encoding across Java, JavaScript and .NET VMs and allows a dynamic choice between either Latin-1 (which has a fixed 1-byte encoding, but limited Code Point range) or UTF-16 (which can express all Code Points, but uses either 2 or 4 bytes per Code Point). If no string-encoding option is specified, the default is utf8. It is a validation error to include more than one string-encoding option.

The (memory ...) option specifies the memory that the Canonical ABI will use to load and store values. If the Canonical ABI needs to load or store, validation requires this option to be present (there is no default).

The (realloc ...) option specifies a core function that is validated to have the following core function type:

(func (param $originalPtr i32)
      (param $originalSize i32)
      (param $alignment i32)
      (param $newSize i32)
      (result i32))

The Canonical ABI will use realloc both to allocate (passing 0 for the first two parameters) and reallocate. If the Canonical ABI needs realloc, validation requires this option to be present (there is no default).

The (post-return ...) option may only be present in canon lift when async is not present and specifies a core function to be called with the original return values after they have finished being read, allowing memory to be deallocated and destructors called. This immediate is always optional but, if present, is validated to have parameters matching the callee's return type and empty results.

🔀 The async option specifies that the component wants to make (for imports) or support (for exports) multiple concurrent (asynchronous) calls. This option can be applied to any component-level function type and changes the derived Canonical ABI significantly. See the concurrency explainer for more details. When a function signature contains a future or stream, validation of canon lower requires the async option to be set (since a synchronous call to a function using these types is highly likely to deadlock).

🔀 The (callback ...) option may only be present in canon lift when the async option has also been set and specifies a core function that is validated to have the following core function type:

(func (param $ctx i32)
      (param $event i32)
      (param $payload i32)
      (result $done i32))

Again, see the concurrency explainer for more details.

Based on this description of the AST, the Canonical ABI explainer gives a detailed walkthrough of the static and dynamic semantics of lift and lower.

One high-level consequence of the dynamic semantics of canon lift given in the Canonical ABI explainer is that component functions are different from core functions in that all control flow transfer is explicitly reflected in their type. For example, with Core WebAssembly exception-handling and stack-switching, a core function with type (func (result i32)) can return an i32, throw, suspend or trap. In contrast, a component function with type (func (result string)) may only return a string or trap. To express failure, component functions can return result and languages with exception handling can bind exceptions to the error case.

Similar to the import and alias abbreviations shown above, canon definitions can also be written in an inverted form that puts the sort first:

(func $f (import "i" "f") ...type...) ≡ (import "i" "f" (func $f ...type...))       (WebAssembly 1.0)
(func $g ...type... (canon lift ...)) ≡ (canon lift ... (func $g ...type...))
(core func $h (canon lower ...))      ≡ (canon lower ... (core func $h))

Note: in the future, canon may be generalized to define other sorts than functions (such as types), hence the explicit sort.

Using canonical function definitions, we can finally write a non-trivial component that takes a string, does some logging, then returns a string.

(component
  (import "logging" (instance $logging
    (export "log" (func (param string)))
  ))
  (import "libc" (core module $Libc
    (export "mem" (memory 1))
    (export "realloc" (func (param i32 i32) (result i32)))
  ))
  (core instance $libc (instantiate $Libc))
  (core func $log (canon lower
    (func $logging "log")
    (memory (core memory $libc "mem")) (realloc (func $libc "realloc"))
  ))
  (core module $Main
    (import "libc" "memory" (memory 1))
    (import "libc" "realloc" (func (param i32 i32) (result i32)))
    (import "logging" "log" (func $log (param i32 i32)))
    (func (export "run") (param i32 i32) (result i32)
      ... (call $log) ...
    )
  )
  (core instance $main (instantiate $Main
    (with "libc" (instance $libc))
    (with "logging" (instance (export "log" (func $log))))
  ))
  (func $run (param string) (result string) (canon lift
    (core func $main "run")
    (memory (core memory $libc "mem")) (realloc (func $libc "realloc"))
  ))
  (export "run" (func $run))
)

This example shows the pattern of splitting out a reusable language runtime module ($Libc) from a component-specific, non-reusable module ($Main). In addition to reducing code size and increasing code-sharing in multi-component scenarios, this separation allows $libc to be created first, so that its exports are available for reference by canon lower. Without this separation (if $Main contained the memory and allocation functions), there would be a cyclic dependency between canon lower and $Main that would have to be broken using an auxiliary module performing call_indirect.

Canonical Built-ins

In addition to the lift and lower canonical function definitions which adapt existing functions, there are also a set of canonical "built-ins" that define core functions out of nothing that can be imported by core modules to dynamically interact with Canonical ABI entities like resources, threads, tasks, subtasks, waitable sets, streams and futures.

canon ::= ...
        | (canon resource.new <typeidx> (core func <id>?))
        | (canon resource.drop <typeidx> (core func <id>?))
        | (canon resource.rep <typeidx> (core func <id>?))
        | (canon context.get <valtype> <u32> (core func <id>?)) 🔀
        | (canon context.set <valtype> <u32> (core func <id>?)) 🔀
        | (canon backpressure.set (core func <id>?)) 🔀✕
        | (canon backpressure.inc (core func <id>?)) 🔀
        | (canon backpressure.dec (core func <id>?)) 🔀
        | (canon task.return (result <valtype>)? <canonopt>* (core func <id>?)) 🔀
        | (canon task.cancel (core func <id>?)) 🔀
        | (canon yield cancellable? (core func <id>?)) 🔀❌ (renamed to 'thread.yield')
        | (canon waitable-set.new (core func <id>?)) 🔀
        | (canon waitable-set.wait cancellable? (memory <core:memidx>) (core func <id>?)) 🔀
        | (canon waitable-set.poll cancellable? (memory <core:memidx>) (core func <id>?)) 🔀
        | (canon waitable-set.drop (core func <id>?)) 🔀
        | (canon waitable.join (core func <id>?)) 🔀
        | (canon subtask.cancel async? (core func <id>?)) 🔀
        | (canon subtask.drop (core func <id>?)) 🔀
        | (canon stream.new <typeidx> (core func <id>?)) 🔀
        | (canon stream.read <typeidx> <canonopt>* (core func <id>?)) 🔀
        | (canon stream.write <typeidx> <canonopt>* (core func <id>?)) 🔀
        | (canon stream.cancel-read <typeidx> async? (core func <id>?)) 🔀
        | (canon stream.cancel-write <typeidx> async? (core func <id>?)) 🔀
        | (canon stream.drop-readable <typeidx> (core func <id>?)) 🔀
        | (canon stream.drop-writable <typeidx> (core func <id>?)) 🔀
        | (canon future.new <typeidx> (core func <id>?)) 🔀
        | (canon future.read <typeidx> <canonopt>* (core func <id>?)) 🔀
        | (canon future.write <typeidx> <canonopt>* (core func <id>?)) 🔀
        | (canon future.cancel-read <typeidx> async? (core func <id>?)) 🔀
        | (canon future.cancel-write <typeidx> async? (core func <id>?)) 🔀
        | (canon future.drop-readable <typeidx> (core func <id>?)) 🔀
        | (canon future.drop-writable <typeidx> (core func <id>?)) 🔀
        | (canon thread.index (core func <id>?)) 🧵
        | (canon thread.new-indirect <typeidx> <core:tableidx> (core func <id>?)) 🧵
        | (canon thread.switch-to cancellable? (core func <id>?)) 🧵
        | (canon thread.suspend cancellable? (core func <id>?)) 🧵
        | (canon thread.resume-later (core func <id>?) 🧵
        | (canon thread.yield-to cancellable? (core func <id>?) 🧵
        | (canon thread.yield cancellable? (core func <id>?) 🧵
        | (canon error-context.new <canonopt>* (core func <id>?)) 📝
        | (canon error-context.debug-message <canonopt>* (core func <id>?)) 📝
        | (canon error-context.drop (core func <id>?)) 📝
        | (canon thread.spawn-ref shared? <typeidx> (core func <id>?)) 🧵②
        | (canon thread.spawn-indirect shared? <typeidx> <core:tableidx> (core func <id>?)) 🧵②
        | (canon thread.available-parallelism (core func <id>?)) 🧵②

Resource built-ins

`resource.new`

Synopsis
Approximate WIT signature	`func<T>(rep: T.rep) -> T`
Canonical ABI signature	`[rep:i32] -> [i32]`

The resource.new built-in creates a new resource (of resource type T) with rep as its representation, and returns a new handle pointing to the new resource. Validation only allows resource.rep T to be used within the component that defined T.

In the Canonical ABI, T.rep is defined to be the $rep in the (type $T (resource (rep $rep) ...)) type definition that defined T. While it's designed to allow different types in the future, it is currently hard-coded to always be i32.

For details, see canon_resource_new in the Canonical ABI explainer.

`resource.drop`

Synopsis
Approximate WIT signature	`func<T>(t: T)`
Canonical ABI signature	`[t:i32] -> []`

The resource.drop built-in drops a resource handle t (with resource type T). If the dropped handle owns the resource, the resource's dtor is called, if present. Validation only allows resource.rep T to be used within the component that defined T.

For details, see canon_resource_drop in the Canonical ABI explainer.

`resource.rep`

Synopsis
Approximate WIT signature	`func<T>(t: T) -> T.rep`
Canonical ABI signature	`[t:i32] -> [i32]`

The resource.rep built-in returns the representation of the resource (with resource type T) pointed to by the handle t. Validation only allows resource.rep T to be used within the component that defined T.

As an example, the following component imports the resource.new built-in, allowing it to create and return new resources to its client:

(component
  (import "Libc" (core module $Libc ...))
  (core instance $libc (instantiate $Libc))
  (type $R (resource (rep i32) (dtor (func $libc "free"))))
  (core func $R_new (param i32) (result i32)
    (canon resource.new $R)
  )
  (core module $Main
    (import "canon" "R_new" (func $R_new (param i32) (result i32)))
    (func (export "make_R") (param ...) (result i32)
      (return (call $R_new ...))
    )
  )
  (core instance $main (instantiate $Main
    (with "canon" (instance (export "R_new" (func $R_new))))
  ))
  (export $R' "r" (type $R))
  (func (export "make-r") (param ...) (result (own $R'))
    (canon lift (core func $main "make_R"))
  )
)

Here, the i32 returned by resource.new, which is an index into the current component instance's table, is immediately returned by make_R, thereby transferring ownership of the newly-created resource to the export's caller.

For details, see canon_resource_rep in the Canonical ABI explainer.

🔀🧵 Concurrency built-ins

See the concurrency explainer for background.

🔀 `context.get`

Synopsis
Approximate WIT signature	`func<T,i>() -> T`
Canonical ABI signature	`[] -> [i32]`

The context.get built-in returns the ith element of the current thread's thread-local storage array. Validation currently restricts i to be less than 2 and t to be i32, but these restrictions may be relaxed in the future.

For details, see Thread-Local Storage in the concurrency explainer and canon_context_get in the Canonical ABI explainer.

🔀 `context.set`

Synopsis
Approximate WIT signature	`func<T,i>(v: T)`
Canonical ABI signature	`[i32] -> []`

The context.set built-in sets the ith element of the current thread's thread-local storage array to the value v. Validation currently restricts i to be less than 2 and t to be i32, but these restrictions may be relaxed in the future.

For details, see Thread-Local Storage in the concurrency explainer and canon_context_set in the Canonical ABI explainer.

🔀✕ `backpressure.set`

This built-in is deprecated in favor of backpressure.{inc,dec} and will be removed once producer tools have transitioned.

Synopsis
Approximate WIT signature	`func(enable: bool)`
Canonical ABI signature	`[enable:i32] -> []`

The backpressure.set built-in allows the async-lifted callee to toggle a per-component-instance flag that, when set, prevents new incoming export calls to the component (until the flag is unset). This allows the component to exert backpressure.

For details, see canon_backpressure_set in the Canonical ABI explainer.

🔀 `backpressure.inc` and `backpressure.dec`

Synopsis
Approximate WIT signature	`func()`
Canonical ABI signature	`[] -> []`

The backpressure.{inc,dec} built-ins allow code running in a component to prevent new incoming export calls to the component by enabling backpressure. These built-ins increment and decrement a per-component-instance counter that, when greater than zero, enables backpressure.

If these built-ins would overflow or underflow a 16-bit unsigned integer, they trap instead. As a composable convention, each piece of code that calls backpressure.inc must take responsibility for calling backpressure.dec exactly once when the source of backpressure subsides.

For details, see Backpressure in the concurrency explainer and canon_backpressure_{inc,dec} in the Canonical ABI explainer.

🔀 `task.return`

Synopsis
Approximate WIT signature	`func<FuncT>(results: FuncT.results)`
Canonical ABI signature	`[lower(FuncT.results)*] -> []`

The task.return built-in takes as parameters the result values of the current task. One of task.return or task.cancel must be called exactly once from any of a task's threads.

The canon task.return definition takes component-level return type and the list of canonopt to be used to lift the return value. When called, the declared return type and the string-encoding and memory canonopts are checked to exactly match those of the current task.

For details, see Returning in the concurrency explainer and canon_task_return in the Canonical ABI explainer.

🔀 `task.cancel`

Synopsis
Approximate WIT signature	`func()`
Canonical ABI signature	`[] -> []`

The task.cancel built-in indicates that the current task is now resolved and has dropped all borrowed handles lent to it during the call (trapping if otherwise). task.cancel can only be called after the task-cancelled event has been received (via callback, waitable-set.{wait,poll} or thread.*) to indicate that the supertask has requested cancellation and thus is not expecting a return value. Once this request is received, any of the task's threads can call task.cancel or task.return.

For details, see Cancellation in the concurrency explainer and canon_task_cancel in the Canonical ABI explainer.

🔀 `waitable-set.new`

Synopsis
Approximate WIT signature	`func() -> waitable-set`
Canonical ABI signature	`[] -> [i32]`

The waitable-set.new built-in returns the i32 index of a new waitable set. The waitable-set type is not a true WIT-level type but instead serves to document associated built-ins below. Waitable sets start out empty and are populated explicitly with waitables by waitable.join.

For details, see Waitables and Waitable Sets in the concurrency explainer and canon_waitable_set_new in the Canonical ABI explainer.

🔀 `waitable-set.wait`

Synopsis
Approximate WIT signature	`func<cancellable?>(s: waitable-set) -> event`
Canonical ABI signature	`[s:i32 payload-addr:i32] -> [event-code:i32]`

where event is defined in WIT as:

variant event {
    none,
    subtask(subtask-index, subtask-state),
    stream-read(stream-index, stream-result),
    stream-write(stream-index, stream-result),
    future-read(future-index, future-read-result),
    future-write(future-index, future-write-result),
    task-cancelled,
}

enum subtask-state {
    starting,
    started,
    returned,
    cancelled-before-started,
    cancelled-before-returned,
}

The waitable-set.wait built-in suspends the current thread in a "pending" state until any one of the waitables in the given waitable set s has an event to deliver. At that point, the thread is in the "ready" state and can be nondeterministically resumed by the runtime's scheduler at which point waitable-set.wait will return the event. (The none event is used by waitable-set.poll and never returned by waitable-set.wait.)

Waitable sets may be waited upon when empty, in which case the caller will necessarily block until another thread adds a waitable to the set.

If cancellable is set, waitable-set.wait may return task-cancelled (6) if the caller requests cancellation of the current task. If cancellable is not set, task-cancelled is never returned. task-cancelled is returned at most once for a given task and thus must be propagated once received.

If waitable-set.wait is called from a synchronous- or async callback-lifted export, no other threads that were implicitly created by a separate synchronous- or async callback-lifted export call can start or progress in the current component instance until waitable-set.wait returns (thereby ensuring non-reentrance of the core wasm code). However, explicitly-created threads and threads implicitly created by non-callback async-lifted ("stackful async") exports may start or progress at any time.

A subtask event notifies the supertask that its subtask is now in the given state (the meanings of which are described by the concurrency explainer).

The meanings of the {stream,future}-{read,write} events/payloads are given as part stream.read and stream.write and future.read and future.write below.

In the Canonical ABI, the event-code return value provides the event discriminant and the case payloads are stored as two contiguous i32s at the 8-byte-aligned address payload-addr.

For details, see Waitables and Waitable Sets in the concurrency explainer and canon_waitable_set_wait in the Canonical ABI explainer.

🔀 `waitable-set.poll`

Synopsis
Approximate WIT signature	`func<cancellable?>(s: waitable-set) -> event`
Canonical ABI signature	`[s:i32 payload-addr:i32] -> [event-code:i32]`

where event is defined as in waitable-set.wait.

The waitable-set.poll built-in returns either an event from one of the waitables in s or, if there is none, the none event.

If cancellable is set, waitable-set.poll may return task-cancelled (6) if the caller requests cancellation of the current task. If cancellable is not set, task-cancelled is never returned. task-cancelled is returned at most once for a given task and thus must be propagated once received.

The Canonical ABI of waitable-set.poll is the same as waitable-set.wait (with the none case indicated by returning 0).

For details, see Waitables and Waitable Sets in the concurrency explainer and canon_waitable_set_poll in the Canonical ABI explainer.

🔀 `waitable-set.drop`

Synopsis
Approximate WIT signature	`func(s: waitable-set)`
Canonical ABI signature	`[s:i32] -> []`

The waitable-set.drop built-in removes the indicated waitable set from the current component instance's table, trapping if the waitable set is not empty or if another thread is concurrently waiting on it.

For details, see Waitables and Waitable Sets in the concurrency explainer and canon_waitable_set_drop in the Canonical ABI explainer.

🔀 `waitable.join`

Synopsis
Approximate WIT signature	`func(w: waitable, maybe_set: option<waitable-set>)`
Canonical ABI signature	`[w:i32, maybe_set:i32] -> []`

The waitable.join built-in may be called given a [waitable] and an optional waitable set. join first removes w from any waitable set that it is a member of and then, if maybe_set is not none, w is added to that set. Thus, join can be used to arbitrarily add, change and remove waitables from waitable sets in the same component instance, preserving the invariant that a waitable can be in at most one set.

In the Canonical ABI, w is an index into the current component instance's table and can be any type of waitable (subtask or {readable,writable}-{stream,future}-end). A value of 0 represents a none maybe_set, since 0 is not a valid table index.

For details, see Waitables and Waitable Sets in the concurrency explainer and canon_waitable_join in the Canonical ABI explainer.

🔀 `subtask.cancel`

Synopsis
Approximate WIT signature	`func<async?>(subtask: subtask) -> option<subtask-state>`
Canonical ABI signature	`[subtask:i32] -> [i32]`

The subtask.cancel built-in requests cancellation of the indicated subtask. If the async is present, none is returned (reprented as -1 in the Canonical ABI) to indicate that the subtask blocked before it was resolved. Otherwise, subtask.cancel returns the subtask-state that the subtask resolved to (which is one of returned, cancelled-before-started or cancelled-before-returned).

The async immediate is gated on 🚝. Without async, the none case is not possible and subtask.cancel synchronously waits until the callee is resolved.

For details, see Cancellation in the concurrency explainer and canon_subtask_cancel in the Canonical ABI explainer.

🔀 `subtask.drop`

Synopsis
Approximate WIT signature	`func(subtask: subtask)`
Canonical ABI signature	`[subtask:i32] -> []`

The subtask.drop built-in removes the indicated subtask from the current component instance's table, trapping if the subtask hasn't returned.

For details, see canon_subtask_drop in the Canonical ABI explainer.

🔀 `stream.new` and `future.new`

Synopsis
Approximate WIT signature for `stream.new`	`func<stream<T?>>() -> tuple<readable-stream-end<T?>, writable-stream-end<T?>>`
Approximate WIT signature for `future.new`	`func<future<T?>>() -> tuple<readable-future-end<T?>, writable-future-end<T?>>`
Canonical ABI signature	`[] -> [packed-ends:i64]`

The stream.new and future.new built-ins return the readable and writable ends of a new stream<T?> or future<T?>. The readable and writable ends are added to the current component instance's table and then the two i32 indices of the two ends are packed into a single i64 return value (with the readable end in the low 32 bits).

The types readable-stream-end<T?> and writable-stream-end<T?> are not WIT types; they are the conceptual lower-level types that describe how the canonical built-ins use the readable and writable ends of a stream<T?>.

An analogous relationship exists among readable-future-end<T?>, writable-future-end<T?>, and the WIT future<T?>.

For details, see Streams and Futures in the concurrency explainer and canon_stream_new in the Canonical ABI explainer.

🔀 `stream.read` and `stream.write`

Synopsis
Approximate WIT signature for `stream.read`	`func<stream<T?>>(e: readable-stream-end<T?>, b: writable-buffer<T>?) -> option<stream-result>`
Approximate WIT signature for `stream.write`	`func<stream<T?>>(e: writable-stream-end<T?>, b: readable-buffer<T>?) -> option<stream-result>`
Canonical ABI signature	`[stream-end:i32 ptr:i32 num:i32] -> [i32]`

where stream-result is defined in WIT as:

record stream-result {
    /// The number of elements read/written.
    progress: u32,

    /// The status of the read/write operation.
    result: copy-result
}

enum copy-result {
    /// The read/write completed successfully.
    ///
    /// The stream remains open for new reads/writes.
    completed,

    /// The other end was dropped and so this end must now be dropped.
    ///
    /// For `stream.read`, this means that the end of the stream was reached.
    ///
    /// For `stream.write`, this means that the consumer has no need for further
    /// data from this stream. This doesn't signify an error; it just instructs
    /// the producer to stop sending data.
    dropped,

    /// The read/write was cancelled by `stream.cancel-{read,write}`.
    ///
    /// The stream remains open for new reads/writes.
    cancelled
}

The stream.read and stream.write built-ins take the matching readable or writable end of a stream as the first parameter and, if T is present, a buffer for the T values to be read from or written to. If T is not present, the buffer parameter is ignored.

If the return value is a stream-result, then the progress field indicates how many T elements were read or written from the given buffer before the copy-result was reached. For example, a return value of {progress: 4, result: dropped} from a stream<u32>.read means that 32 bytes were copied into the given buffer before the writer end dropped the stream. The cancelled case can only arise as the result of a call to stream.cancel-{read,write}.

If the return value is none, then the operation blocked and the caller needs to wait for progress (via waitable set and waitable-set.{wait,poll} or, if using a callback, by returning to the event loop) which will asynchronously produce an event containing a stream-result.

If stream.{read,write} return dropped (synchronously or asynchronously), any subsequent operation on the stream other than stream.drop-{readable,writable} traps.

In the Canonical ABI, the {readable,writable}-stream-end is passed as an i32 index into the component instance's table followed by a pair of i32s describing the linear memory offset and size-in-elements of the {readable,writable}-buffer<T>. The option<stream-result> return value is bit-packed into a single i32 where:

0xffff_ffff represents none.
Otherwise, the result is in the low 4 bits and the progress is in the high 28 bits.

For details, see Streams and Futures in the concurrency explainer and canon_stream_read in the Canonical ABI explainer.

🔀 `future.read` and `future.write`

Synopsis
Approximate WIT signature for `future.read`	`func<future<T?>>(e: readable-future-end<T?>, b: writable-buffer<T; 1>?) -> option<future-read-result>`
Approximate WIT signature for `future.write`	`func<future<T?>>(e: writable-future-end<T?>, v: readable-buffer<T; 1>?) -> option<future-write-result>`
Canonical ABI signature	`[readable-future-end:i32 ptr:i32] -> [i32]`

where future-{read,write}-result are defined in WIT as:

enum future-read-result {
    /// The read completed and this readable end must now be dropped.
    completed,

    /// The read was cancelled by `future.cancel-read`.
    ///
    /// The future remains open for a new `future.read`.
    cancelled
}
enum future-write-result {
    /// The write completed successfully and this writable end must now be dropped.
    completed,

    /// The readable end was dropped and so the writable end must now be dropped.
    dropped,

    /// The write was cancelled by `future.cancel-write`.
    ///
    /// The future remains open for a new `future.write`.
    cancelled
}

future-read-result is the same as the copy-result enum used in stream-result minus the dropped case (since futures do not allow the writer to drop their end before writing a value). future-write-result is the same as copy-result, including the dropped case (since the writer can be notified that the reader signalled loss of interest by dropping their end).

The future.{read,write} built-ins takes the readable or writable end of a future as the first parameter and, if T is present, a single-element buffer that can be used to write or read a single T value.

If the return value is none, then the call blocked and the caller needs to wait for progress (via waitable set and waitable-set.{wait,poll} or, if using a callback, by returning to the event loop) which will asynchronously produce an event containing a future-{read,write}-result.

If future.{read,write} return completed or dropped (synchronously or asynchronously), any subsequent operation on the future other than future.drop-{readable,writable} traps.

A component may call future.drop-readable before successfully reading a value to indicate a loss of interest. future.drop-writable will trap if called before successfully writing a value.

In the Canonical ABI, the {readable,writable}-future-end is passed as an i32 index into the component instance's table followed by a single i32 describing the linear memory offset of the {readable,writable}-buffer<T; 1>. The option<future-{read,write}-result> return value is bit-packed into the single i32 return value where 0xffff_ffff represents none. And, future-read-result.cancelled is encoded as the value of future-write-result.cancelled, rather than the value implied by the enum definition above.

For details, see Streams and Futures in the concurrency explainer and canon_future_read in the Canonical ABI explainer.

🔀 `stream.cancel-read`, `stream.cancel-write`, `future.cancel-read`, and `future.cancel-write`

Synopsis
Approximate WIT signature for `stream.cancel-read`	`func<stream<T?>>(e: readable-stream-end<T?>) -> option<stream-result>`
Approximate WIT signature for `stream.cancel-write`	`func<stream<T?>>(e: writable-stream-end<T?>) -> option<stream-result>`
Approximate WIT signature for `future.cancel-read`	`func<future<T?>>(e: readable-future-end<T?>) -> option<future-read-result>`
Approximate WIT signature for `future.cancel-write`	`func<future<T?>>(e: writable-future-end<T?>) -> option<future-write-result>`
Canonical ABI signature	`[e: i32] -> [i32]`

The {stream,future}.cancel-{read,write} built-ins take the matching readable or writable end of a stream or future that has a pending async {stream,future}.{read,write} (trapping otherwise).

If cancellation finishes without blocking, the return value is a stream-result or future-{read,write}-result. If cancellation blocks, the return value is none and the caller must wait for a corresponding {stream,future}-{read,write} event via waitable-set.{wait,poll} or, when using a callback, returning to the event loop. In either case, the result may be cancelled but may also be completed or dropped, if one of these racily happened first.

In the Canonical ABI, the optional result value is bit-packed into the single i32 result in the same way as {stream,future}.{read,write}.

For details, see Streams and Futures in the concurrency explainer and canon_stream_cancel_read in the Canonical ABI explainer.

🔀 `stream.drop-readable`, `stream.drop-writable`, `future.drop-readable`, and `future.drop-writable`

Synopsis
Approximate WIT signature for `stream.drop-readable`	`func<stream<T?>>(e: readable-stream-end<T?>)`
Approximate WIT signature for `stream.drop-writable`	`func<stream<T?>>(e: writable-stream-end<T?>)`
Approximate WIT signature for `future.drop-readable`	`func<future<T?>>(e: readable-future-end<T?>)`
Approximate WIT signature for `future.drop-writable`	`func<future<T?>>(e: writable-future-end<T?>)`
Canonical ABI signature	`[end:i32 err:i32] -> []`

The {stream,future}.drop-{readable,writable} built-ins remove the indicated stream or future from the current component instance's table, trapping if the stream or future has a mismatched direction or type or are in the middle of a read or write or, in the special case of future.drop-writable, if a value has not already been written.

For details, see Streams and Futures in the concurrency explainer and canon_stream_drop_readable in the Canonical ABI explainer.

🧵 `thread.index`

Synopsis
Approximate WIT signature	`func() -> u32`
Canonical ABI signature	`[] -> [i32]`

The thread.index built-in returns the index of the current thread in the component instance's table. While thread.new-indirect also returns the index of newly-created threads, threads created implicitly for export calls can only learn their index via thread.index.

For details, see Thread Built-ins in the concurrency explainer and canon_thread_index in the Canonical ABI explainer.

🧵 `thread.new-indirect`

Synopsis
Approximate WIT signature	`func<FuncT,tableidx>(fi: u32, c: FuncT.params[0]) -> thread`
Canonical ABI signature	`[fi:i32 c:i32] -> [i32]`

The thread.new-indirect built-in adds a new thread to the current component instance's table, returning the index of the new thread. The function table supplied via core:tableidx is indexed by the fi operand and then dynamically checked to match the type FuncT (in the same manner as call_indirect). Lastly, the indexed function is called in the new thread with c as its first and only parameter.

Currently, FuncT must be (func (param i32)) and thus c must always be an i32, but this restriction can be loosened in the future as the Canonical ABI is extended for memory64 and GC.

As explained in the concurrency explainer, a thread created by thread.new-indirect is initially in a suspended state and must be resumed eagerly or lazily by thread.yield-to or thread.resume-later, resp., to begin execution.

For details, see Thread Built-ins in the concurrency explainer and canon_thread_new_indirect in the Canonical ABI explainer.

🧵 `thread.switch-to`

Synopsis
Approximate WIT signature	`func<cancellable?>(t: thread) -> suspend-result`
Canonical ABI signature	`[t:i32] -> [i32]`

where suspend-result is defined in WIT as:

enum suspend-result { completed, cancelled }

The thread.switch-to built-in suspends the current thread and immediately resumes execution of the thread t, trapping if t is not in a "suspended" state. When the current thread is resumed by some other thread or, if cancellable was set, cancellation, thread.switch-to will return, indicating what happened.

If thread.switch-to is called from a synchronous- or async callback-lifted export, no other threads that were implicitly created by a separate synchronous- or async callback-lifted export call can start or progress in the current component instance until thread.switch-to returns (thereby ensuring non-reentrance of the core wasm code). However, explicitly-created threads and threads implicitly created by non-callback async-lifted ("stackful async") exports may start or progress at any time.

For details, see Thread Built-ins in the concurrency explainer and canon_thread_switch_to in the Canonical ABI explainer.

🧵 `thread.suspend`

Synopsis
Approximate WIT signature	`func<cancellable?>() -> suspend-result`
Canonical ABI signature	`[] -> i32`

The thread.suspend built-in suspends the current thread which, depending on the calling context, will either immediately switch control flow to an async-lowered caller or, if the current task has already suspended before, switch to the runtime's scheduler to find something else to do. When the current thread is resumed by some other thread or, if cancellable was set, cancellation, thread.suspend will return, indicating what happened.

If thread.suspend is called from a synchronous- or async callback-lifted export, no other threads that were implicitly created by a separate synchronous- or async callback-lifted export call can start or progress in the current component instance until thread.suspend returns (thereby ensuring non-reentrance of the core wasm code). However, explicitly-created threads and threads implicitly created by non-callback async-lifted ("stackful async") exports may start or progress at any time.

For details, see Thread Built-ins in the concurrency explainer and canon_thread_suspend in the Canonical ABI explainer.

🧵 `thread.resume-later`

Synopsis
Approximate WIT signature	`func(t: thread)`
Canonical ABI signature	`[t:i32] -> []`

The thread.resume-later built-in changes the state of thread t from "suspended" to "ready" (trapping if t is not in a "suspended" state) so that the runtime can nondeterministically resume t at some point in the future.

For details, see Thread Built-ins in the concurrency explainer and canon_thread_resume_later in the Canonical ABI explainer.

🧵 `thread.yield-to`

Synopsis
Approximate WIT signature	`func<cancellable?>(t: thread)`
Canonical ABI signature	`[t:i32] -> [suspend-result]`

The thread.yield-to built-in immediately resumes execution of the thread t, (trapping if t is not in a "suspended" state) leaving the current thread in a "ready" state so that the runtime can nondeterministically resume the current thread at some point in the future. When the current thread is resumed either due to runtime scheduling or, if cancellable was set, cancellation, thread.yield-to will return, indicating what happened.

If thread.yield-to is called from a synchronous- or async callback-lifted export, no other threads that were implicitly created by a separate synchronous- or async callback-lifted export call can start or progress in the current component instance until thread.yield-to returns (thereby ensuring non-reentrance of the core wasm code). However, explicitly-created threads and threads implicitly created by non-callback async-lifted ("stackful async") exports may start or progress at any time.

For details, see Thread Built-ins in the concurrency explainer and canon_thread_yield_to in the Canonical ABI explainer.

🧵 `thread.yield`

Synopsis
Approximate WIT signature	`func<cancellable?>() -> suspend-result`
Canonical ABI signature	`[] -> [i32]`

The thread.yield built-in allows the runtime to potentially switch to any other thread in the "ready" state, enabling a long-running computation to cooperatively interleave execution without specifically requesting another thread to be resumed (as with thread.yield-to). When the current thread is resumed either due to runtime scheduling or, if cancellable was set, cancellation, thread.yield will return, indicating what happened.

If thread.yield is called from a synchronous- or async callback-lifted export, no other threads that were implicitly created by a separate synchronous- or async callback-lifted export call can start or progress in the current component instance until thread.yield returns (thereby ensuring non-reentrance of the core wasm code). However, explicitly-created threads and threads implicitly created by non-callback async-lifted ("stackful async") exports may start or progress at any time.

For details, see Thread Built-ins in the concurrency explainer and canon_thread_yield in the Canonical ABI explainer.

🧵② `thread.spawn-ref`

Synopsis
Approximate WIT signature	`func<shared?,FuncT>(f: FuncT, c: FuncT.params[0]) -> bool`
Canonical ABI signature	`shared? [f:(ref null (shared (func (param i32))) c:i32] -> [i32]`

The thread.spawn-ref built-in is an optimization, fusing a call to thread.new_ref (assuming thread.new_ref was added as part of adding a GC ABI option to the Canonical ABI) with a call to thread.resume-later. This optimization is more impactful once given shared-everything-threads and thus gated on 🧵②.

For details, see canon_thread_spawn_ref in the Canonical ABI explainer.

🧵② `thread.spawn-indirect`

Synopsis
Approximate WIT signature	`func<shared?,FuncT,tableidx>(i: u32, c: FuncT.params[0]) -> bool`
Canonical ABI signature	`shared? [i:i32 c:i32] -> [i32]`

The thread.spawn-indirect built-in is an optimization, fusing a call to thread.new-indirect with a call to thread.resume-later. This optimization is more impactful once given shared-everything-threads and thus gated on 🧵②.

For details, see canon_thread_spawn_indirect in the Canonical ABI explainer.

🧵② `thread.available-parallelism`

Synopsis
Approximate WIT signature	`func<shared?>() -> u32`
Canonical ABI signature	`shared [] -> [i32]`

The thread.available-parallelism built-in returns the number of threads that can be expected to execute in parallel.

The concept of "available parallelism" corresponds is sometimes referred to as "hardware concurrency", such as in navigator.hardwareConcurrency in JavaScript.

For details, see canon_thread_available_parallelism in the Canonical ABI explainer.

📝 Error Context built-ins

📝 `error-context.new`

Synopsis
Approximate WIT signature	`func(message: string) -> error-context`
Canonical ABI signature	`[ptr:i32 len:i32] -> [i32]`

The error-context.new built-in returns a new error-context value. The given string is non-deterministically transformed to produce the error-context's internal debug message.

In the Canonical ABI, the returned value is an index into the current component instance's table of a new error context value.

For details, see canon_error_context_new in the Canonical ABI explainer.

📝 `error-context.debug-message`

Synopsis
Approximate WIT signature	`func(errctx: error-context) -> string`
Canonical ABI signature	`[errctxi:i32 ptr:i32] -> []`

The error-context.debug-message built-in returns the debug message of the given error-context.

In the Canonical ABI, it writes the debug message into ptr as an 8-byte (ptr, length) pair, according to the Canonical ABI for string, given the <canonopt>* immediates.

For details, see canon_error_context_debug_message in the Canonical ABI explainer.

📝 `error-context.drop`

Synopsis
Approximate WIT signature	`func(errctx: error-context)`
Canonical ABI signature	`[errctxi:i32] -> []`

The error-context.drop built-in drops the given error-context value from the component instance.

In the Canonical ABI, errctxi is an index into the current component instance's table.

For details, see canon_error_context_drop in the Canonical ABI explainer.

🪙 Value Definitions

Value definitions (in the value index space) are like immutable global definitions in Core WebAssembly except that validation requires them to be consumed exactly once at instantiation-time (i.e., they are linear).

Components may define values in the value index space using following syntax:

value    ::= (value <id>? <valtype> <val>)
val      ::= false | true
           | <core:i64>
           | <f64canon>
           | nan
           | '<core:stringchar>'
           | <core:name>
           | (record <val>+)
           | (variant "<label>" <val>?)
           | (list <val>*)
           | (tuple <val>+)
           | (flags "<label>"*)
           | (enum "<label>")
           | none | (some <val>)
           | ok | (ok <val>) | error | (error <val>)
           | (binary <core:datastring>)
f64canon ::= <core:f64> without the `nan:0x` case.

The validation rules for value require the val to match the valtype.

The (binary ...) expression form provides an alternative syntax allowing the binary contents of the value definition to be written directly in the text format, analogous to data segments, avoiding the need to understand type information when encoding or decoding.

For example:

(component
  (value $a bool true)
  (value $b u8  1)
  (value $c u16 2)
  (value $d u32 3)
  (value $e u64 4)
  (value $f s8  5)
  (value $g s16 6)
  (value $h s32 7)
  (value $i s64 8)
  (value $j f32 9.1)
  (value $k f64 9.2)
  (value $l char 'a')
  (value $m string "hello")
  (value $n (record (field "a" bool) (field "b" u8)) (record true 1))
  (value $o (variant (case "a" bool) (case "b" u8)) (variant "b" 1))
  (value $p (list (result (option u8)))
    (list
      error
      (ok (some 1))
      (ok none)
      error
      (ok (some 2))
    )
  )
  (value $q (tuple u8 u16 u32) (tuple 1 2 3))

  (type $abc (flags "a" "b" "c"))
  (value $r $abc (flags "a" "c"))

  (value $s (enum "a" "b" "c") (enum "b"))

  (value $t bool (binary "\00"))
  (value $u string (binary "\07example"))

  (type $complex
    (tuple
      (record
        (field "a" (option string))
        (field "b" (tuple (option u8) string))
      )
      (list char)
      $abc
      string
    )
  )
  (value $complex1 (type $complex)
    (tuple
      (record
        none
        (tuple none "empty")
      )
      (list)
      (flags)
      ""
    )
  )
  (value $complex2 (type $complex)
    (tuple
      (record
        (some "example")
        (tuple (some 42) "hello")
      )
      (list 'a' 'b' 'c')
      (flags "b" "a")
      "hi"
    )
  )
)

As with all definition sorts, values may be imported and exported by components. As an example value import:

(import "env" (value $env (record (field "locale" (option string)))))

As this example suggests, value imports can serve as generalized environment variables, allowing not just string, but the full range of valtype.

Values can also be exported. For example:

(component
  (import "system-port" (value $port u16))
  (value $url string "https://example.com")
  (export "default-url" (value $url))
  (export "default-port" (value $port))
)

The inferred type of this component is:

(component
  (import "system-port" (value $port u16))
  (value $url string "https://example.com")
  (export "default-url" (value (eq $url)))
  (export "default-port" (value (eq $port)))
)

Thus, by default, the precise constant or import being exported is propagated into the component's type and thus its public interface. In this way, value exports can act as semantic configuration data provided by the component to the host or other client tooling. Components can also keep the exact value being exported abstract (so that the precise value is not part of the type and public interface) using the "type ascription" feature mentioned in the imports and exports section below.

🪙 Start Definitions

Like modules, components can have start functions that are called during instantiation. Unlike modules, components can call start functions at multiple points during instantiation with each such call having parameters and results. Thus, start definitions in components look like function calls:

start ::= (start <funcidx> (value <valueidx>)* (result (value <id>?))*)

The (value <valueidx>)* list specifies the arguments passed to funcidx by indexing into the value index space. The arity and types of the two value lists are validated to match the signature of funcidx.

With this, we can define a component that imports a string and computes a new exported string at instantiation time:

(component
  (import "name" (value $name string))
  (import "libc" (core module $Libc
    (export "memory" (memory 1))
    (export "realloc" (func (param i32 i32 i32 i32) (result i32)))
  ))
  (core instance $libc (instantiate $Libc))
  (core module $Main
    (import "libc" ...)
    (func (export "start") (param i32 i32) (result i32)
      ... general-purpose compute
    )
  )
  (core instance $main (instantiate $Main (with "libc" (instance $libc))))
  (func $start (param string) (result string) (canon lift
    (core func $main "start")
    (memory (core memory $libc "mem")) (realloc (func $libc "realloc"))
  ))
  (start $start (value $name) (result (value $greeting)))
  (export "greeting" (value $greeting))
)

As this example shows, start functions reuse the same Canonical ABI machinery as normal imports and exports for getting component-level values into and out of core linear memory.

Import and Export Definitions

Import and export definitions are similar to Core Webssembly import and export definitions, but different in a few ways.

The first is that component-level imports have only a single name. (Two- and even N-level imports can be achieved by importing instance types).

Second, component-level imports and exports can use all of the component-level sorts (func, value 🪙, type, instance, component) but just 1 core sort: module. This restriction is enforced by validation which assigns every component a componenttype (which similarly only allows core types of the module type constructor). This restriction ensures that all cross-component calls transit through a lift/lower trampoline, which allows the Component Model to create a "membrane" around all the core module instances contained by a component instance in order to provide various structural guarantees to the Core WebAssembly code running inside.

A third difference is that not only import definitions, but also export definitions append a new element to the index space of the imported/exported sort which can be optionally bound to an identifier in the text format. In the case of imports, the identifier is bound just like Core WebAssembly, as part of the externdesc (e.g., (import "x" (func $x)) binds the identifier $x). In the case of exports, the <id>? right after the export is bound while the <id> inside the <sortidx> is a reference to the preceding definition being exported (e.g., (export $x "x" (func $f)) binds a new identifier $x).

Given this, the syntax of imports and exports are defined as follows:

import        ::= (import "<importname>" bind-id(<externdesc>))
                | (import "<importname>" <versionsuffix> bind-id(<externdesc>)) 🔗
export        ::= (export <id>? "<exportname>" <sortidx> <externdesc>?)
                | (export <id>? "<exportname>" <versionsuffix> <sortidx> <externdesc>?) 🔗
versionsuffix ::= (versionsuffix "<semversuffix>") 🔗

All import names are required to be strongly-unique. Separately, all export names are also required to be strongly-unique. The rest of the grammar for imports and exports defines a structured syntax for the contents of import and export names. Syntactically, these names appear inside quoted string literals. The grammar thus restricts the contents of these string literals to provide more structured information that can be mechanically interpreted by toolchains and runtimes to support idiomatic developer workflows and source-language bindings. The rules defining this structured name syntax below are to be interpreted as a lexical grammar defining a single token and thus whitespace is not automatically inserted, all terminals are single-quoted, and everything unquoted is a meta-character.

exportname        ::= <plainname>
                    | <interfacename>
importname        ::= <exportname>
                    | <depname>
                    | <urlname>
                    | <hashname>
plainname         ::= <label>
                    | '[constructor]' <label>
                    | '[method]' <label> '.' <label>
                    | '[static]' <label> '.' <label>
                    | '[implements=<' <interfacename> '>]' <label>
label             ::= <first-fragment> ( '-' <fragment> )*
first-fragment    ::= [a-z] <word>
                    | [A-Z] <acronym>
fragment          ::= <word>
                    | <acronym>
word              ::= [0-9a-z]*
acronym           ::= [0-9A-Z]*
interfacename     ::= <namespace> <label> <projection> <interfaceversion>?
                    | <namespace>+ <label> <projection>+ <interfaceversion>? 🪺
namespace         ::= <words> ':'
words             ::= <word>
                    | <words> '-' <word>
projection        ::= '/' <label>
interfaceversion  ::= '@' <valid semver>
                    | '@' <canonversion> 🔗
canonversion      ::= [1-9] [0-9]* 🔗
                    | '0.' [1-9] [0-9]* 🔗
                    | '0.0.' [1-9] [0-9]* 🔗
semversuffix      ::= [0-9A-Za-z.+-]* 🔗
depname           ::= 'unlocked-dep=<' <pkgnamequery> '>'
                    | 'locked-dep=<' <pkgname> '>' ( ',' <hashname> )?
pkgnamequery      ::= <pkgpath> <verrange>?
pkgname           ::= <pkgpath> <pkgversion>?
pkgversion        ::= '@' <valid semver>
pkgpath           ::= <namespace> <words>
                    | <namespace>+ <words> <projection>* 🪺
verrange          ::= '@*'
                    | '@{' <verlower> '}'
                    | '@{' <verupper> '}'
                    | '@{' <verlower> ' ' <verupper> '}'
verlower          ::= '>=' <valid semver>
verupper          ::= '<' <valid semver>
urlname           ::= 'url=<' <nonbrackets> '>' (',' <hashname>)?
nonbrackets       ::= [^<>]*
hashname          ::= 'integrity=<' <integrity-metadata> '>'

Components provide six options for naming imports:

a plain name that leaves it up to the developer to "read the docs" or otherwise figure out what to supply for the import;
an interface name that is assumed to uniquely identify a higher-level semantic contract that the component is requesting an unspecified wasm or native implementation of;
a URL name that the component is requesting be resolved to a particular wasm implementation by fetching the URL.
a hash name containing a content-hash of the bytes of a particular wasm implementation but not specifying location of the bytes.
a locked dependency name that the component is requesting be resolved via some contextually-supplied registry to a particular wasm implementation using the given hierarchical name and version; and
an unlocked dependency name that the component is requesting be resolved via some contextually-supplied registry to one of a set of possible of wasm implementations using the given hierarchical name and version range.

Not all hosts are expected to support all six import naming options and, in general, build tools may need to wrap a to-be-deployed component with an outer component that only uses import names that are understood by the target host. For example:

an offline host may only implement a fixed set of interface names, requiring a build tool to bundle URL, dependency and hash names (replacing the imports with nested definitions);
browsers may only support plain and URL names (with plain names resolved via import map or JS API), requiring the build process to publish or bundle dependencies, converting dependency names into nested definitions or URL names;
a production server environment may only allow deployment of components importing from a fixed set of interface and locked dependency names, thereby requiring all dependencies to be locked and deployed beforehand;
host embeddings without a direct developer interface (such as the JS API or import maps) may reject all plain names, requiring the build process to resolve these beforehand;
hosts without content-addressable storage may reject hash names (as they have no way to locate the contents).

The grammar and validation of URL names allows the embedded URLs to contain any sequence of UTF-8 characters (other than angle brackets, which are used to delimit the URL), leaving the well-formedness of the URL to be checked as part of the process of parsing the URL in preparation for fetching the URL. The base URL operand passed to the URL spec's parsing algorithm is determined by the host and may be absent, thereby disallowing relative URLs. Thus, the parsing and fetching of a URL import are host-defined operations that happen after the decoding and validation of a component, but before instantiation of that component.

When a particular implementation is indicated via URL or dependency name, importname allows the component to additionally specify a cryptographic hash of the expected binary representation of the wasm implementation, reusing the integrity-metadata production defined by the W3C Subresource Integrity specification. When this hash is present, a component can express its intention to reuse another component or core module with the same degree of specificity as if the component or core module was nested directly, thereby allowing components to factor out common dependencies without compromising runtime behavior. When only the hash is present (in a hashname), the host must locate the contents using the hash (e.g., using an OCI Registry).

The "registry" referred to by dependency names serves to map a hierarchical name and version to a particular module, component or exported definition. For example, in the full generality of nested namespaces and packages (🪺), in a registry name a:b:c/d/e/f, a:b:c traverses a path through namespaces a and b to a component c and /d/e/f traverses the exports of c (where d and e must be component exports but f can be anything). Given this abstract definition, a number of concrete data sources can be interpreted by developer tooling as "registries":

a live registry (perhaps accessed via warg)
a local filesystem directory (perhaps containing vendored dependencies)
a fixed set of host-provided functionality (see also the built-in modules proposal)
a programmatically-created tree data structure (such as the importObject parameter of WebAssembly.instantiate())

The valid semver production is as defined by the Semantic Versioning 2.0 spec and is meant to be interpreted according to that specification. The use of valid semver in interfaceversion is temporary for backward compatibility; see Canonical interface name below (🔗). The verrange production embeds a minimal subset of the syntax for version ranges found in common package managers like npm and cargo and is meant to be interpreted with the same semantics. (Mostly this interpretation is the usual SemVer-spec-defined ordering, but note the particular behavior of pre-release tags.)

The plainname production captures several language-neutral syntactic hints that allow bindings generators to produce more idiomatic bindings in their target language. At the top-level, a plainname allows functions to be annotated as being a constructor, method or static function of a preceding resource and/or being asynchronous.

When a function is annotated with constructor, method or static, the first label is the name of the resource and the second label is the logical field name of the function. This additional nesting information allows bindings generators to insert the function into the nested scope of a class, abstract data type, object, namespace, package, module or whatever resources get bound to. For example, a function named [method]C.foo could be bound in C++ to a member function foo in a class C. The JS API below describes how the native JavaScript bindings could look.

To restrict the set of cases that bindings generators need to consider, these annotations trigger additional type-validation rules (listed in Binary.md) such as:

An import or export named [static]R.foo must be a function and R must be the name of an imported or exported resource type in the same instance or component type.
Similarly, an import or export named [constructor]R must be a function whose return type must be (own $R) or (result (own $R) (error <valtype>)?) where $R is the type-index of the resource type named R.
Similarly, an import or export named [method]R.foo must be a function whose first parameter must be (param "self" (borrow $R)).

When an instance import or export is annotated with [implements=<I>]L, it indicates that the instance implements interface I but is given the plain name L. This enables a component to import or export the same interface multiple times with different plain names. For example:

(component
  (import "[implements=<wasi:keyvalue/store>]primary" (instance ...))
  (import "[implements=<wasi:keyvalue/store>]secondary" (instance ...))
)

Here, both imports implement wasi:keyvalue/store but have distinct plain names primary and secondary. Bindings generators can use the [implements=<I>] annotation to know which interface the instance implements, enabling them to share value type bindings across both imports. (Note that resource types defined in the interface, such as bucket, are treated as distinct for each import, since each may have a different implementation.)

The interfacename also helps hosts and clients of a component. A host that sees [implements=<wasi:keyvalue/store>]primary knows to supply a wasi:keyvalue/store implementation for that import, even though the import name is just primary. Similarly, a client composing components can use the annotation to match compatible imports and exports across components.

When a function's type is async, bindings generators are expected to emit whatever asynchronous language construct is appropriate (such as an async function in JS, Python or Rust). See the concurrency explainer for more details.

The label production used inside plainname as well as the labels of record and variant types are required to have kebab case. The reason for this particular form of casing is to unambiguously separate words and acronyms (represented as all-caps words) so that source language bindings can convert a label into the idiomatic casing of that language. (Indeed, because hyphens are often invalid in identifiers, kebab case practically forces language bindings to make such a conversion.) For example, the label is-XML could be mapped to isXML, IsXml, is_XML or is_xml, depending on the target language/convention. The highly-restricted character set ensures that capitalization is trivial and does not require consulting Unicode tables.

Based on the lexical definition of label above, the following are all valid labels:

a, a-b-c, a1-2-3, A, A-B-C and A1-2-3 (but not 1-2-3, since the first fragment must start with a letter)
a11-w0rds, A11-4CR0NYMS and m1x3d-4CR0NYMS

Technically, based on the definitions given, the fragment 2 (and 3) in a1-2-3 is ambiguously either a word or an acronym, but the distinction only relates to casing and numbers aren't cased.

Components provide two options for naming exports, symmetric to the first two options for naming imports:

a plain name that leaves it up to the developer to "read the docs" or otherwise figure out what the export does and how to use it; and
an interface name that is assumed to uniquely identify a higher-level semantic contract that the component is claiming to implement with the given exported definition.

As an example, the following component uses all 9 cases of imports and exports:

(component
  (import "custom-hook" (func (param string) (result string)))
  (import "wasi:http/handler" (instance
    (export "request" (type $request (sub resource)))
    (export "response" (type $response (sub resource)))
    (export "handle" (func (param (own $request)) (result (own $response))))
  ))
  (import "url=<https://mycdn.com/my-component.wasm>" (component ...))
  (import "url=<./other-component.wasm>,integrity=<sha256-X9ArH3k...>" (component ...))
  (import "locked-dep=<my-registry:sqlite@1.2.3>,integrity=<sha256-H8BRh8j...>" (component ...))
  (import "unlocked-dep=<my-registry:imagemagick@{>=1.0.0}>" (instance ...))
  (import "integrity=<sha256-Y3BsI4l...>" (component ...))
  ... impl
  (export "wasi:http/handler" (instance $http_handler_impl))
  (export "get-JSON" (func $get_json_impl))
)

Here, custom-hook and get-JSON are plain names for functions whose semantic contract is particular to this component and not defined elsewhere. In contrast, wasi:http/handler is the name of a separately-defined interface, allowing the component to request the ability to make outgoing HTTP requests (through imports) and receive incoming HTTP requests (through exports) in a way that can be mechanically interpreted by hosts and tooling.

The remaining 4 imports show the different ways that a component can import external implementations. Here, the URL and locked dependency imports use component types, allowing this component to privately create and wire up instances using instance definitions. In contrast, the unlocked dependency import uses an instance type, anticipating a subsequent tooling step (likely the one that performs dependency resolution) to select, instantiate and provide the instance.

Validation of export requires that all transitive uses of resource types in the types of exported functions or values refer to resources that were either imported or exported (concretely, via the type index introduced by an import or export). The optional <externdesc>? in export can be used to explicitly ascribe a type to an export which is validated to be a supertype of the definition's type, thereby allowing a private (non-exported) type definition to be replaced with a public (exported) type definition.

For example, in the following component:

(component
  (import "R1" (type $R1 (sub resource)))
  (type $R2 (resource (rep i32)))
  (export $R2' "R2" (type $R2))
  (func $f1 (result (own $R1)) (canon lift ...))
  (func $f2 (result (own $R2)) (canon lift ...))
  (func $f2' (result (own $R2')) (canon lift ...))
  (export "f1" (func $f1))
  ;; (export "f2" (func $f2)) -- invalid
  (export "f2" (func $f2) (func (result (own $R2'))))
  (export "f2" (func $f2'))
)

the commented-out export is invalid because its type transitively refers to $R2, which is a private type definition. This requirement is meant to address the standard avoidance problem that appears in module systems with abstract types. In particular, it ensures that a client of a component is able to externally define a type compatible with the exports of the component.

Similar to type exports, value exports may also ascribe a type to keep the precise value from becoming part of the type and public interface.

For example:

(component
  (value $url string "https://example.com")
  (export "default-url" (value $url) (value string))
)

The inferred type of this component is:

(component
  (export "default-url" (value string))
)

Note, that the url value definition is absent from the component type

Name Uniqueness

The goal of the label, exportname and importname productions defined and used above is to allow automated bindings generators to map these names into something more idiomatic to the language. For example, the plainname [method]my-resource.my-method might get mapped to a method named myMethod nested inside a class MyResource. To unburden bindings generators from having to consider pathological cases where two unique-in-the-component names get mapped to the same source-language identifier, Component Model validation imposes a stronger form of uniqueness than simple string equality on all the names that appear within the same scope.

To determine whether two names (defined as sequences of Unicode Scalar Values) are strongly-unique:

If one name is l and the other name is [constructor]l (for the same label l), they are strongly-unique.
If one name is l and the other name is [*]l.l (for the same label l and any annotation * with a dotted l.l name), they are not strongly-unique.
Otherwise:
- Lowercase all the acronyms (uppercase letters) in both names.
- Strip any [...] annotation prefix from both names.
- The names are strongly-unique if the resulting strings are unequal.

Thus, the following set of names are strongly-unique and can thus all be imports (or exports) of the same component (or component type or instance type):

foo, foo-bar, [constructor]foo, [method]foo.bar, [method]foo.baz, foo:bar/baz, [implements=<foo:bar/baz>]bar, [implements=<foo:bar/baz>]quux

but attempting to add any of the following names would be a validation error:

foo, foo-BAR, [constructor]foo-BAR, [method]foo.foo, [method]foo.BAR, [implements=<a:b/c>]foo, foo:bar/baz, bar, [implements=<x:y/z>]bar

Note that additional validation rules involving types apply to names with annotations. For example, the validation rules for [constructor]foo require foo to be a resource type. See Binary.md for details.

🔗 Canonical Interface Name

An interfacename (as defined above) is canonical iff it either:

has no interfaceversion
has an interfaceversion matching the canonversion production

The purpose of canonversion is to simplify the matching of compatible import and export versions. For example, if a guest imports some interface from wasi:http/types@0.2.1 and a host provides the (subtype-compatible) interface wasi:http/types@0.2.6, we'd like to make it easy for the host to link with the guest. The canonversion for both of these interfaces would be 0.2, so this linking could be done by matching canonical interface names literally. Symmetrically, if a host provides wasi:http/types@0.2.1 and a guest imports wasi:http/types@0.2.6, so long as the guest only uses the subset of functions defined in wasi:http/types@0.2.1 (which is checked by normal component type validation), linking succeeds. Thus, including only the canonicalized version in the name allows both backwards and (limited) forwards compatibility using only trivial string equality (as well as the type checking already required).

Any valid semver (as used in WIT) can be canonicalized by splitting it into two parts - the canonversion prefix and the remaining semversuffix. Using the <major>.<minor>.<patch> syntax of Semantic Versioning 2.0, the split point is chosen as follows:

if major > 0, split immediately after major
- 1.2.3 → 1 / .2.3
otherwise if minor > 0, split immediately after minor
- 0.2.6-rc.1 → 0.2 / .6-rc.1
otherwise, split immediately after patch
- 0.0.1-alpha → 0.0.1 / -alpha

When a version is canonicalized, any semversuffix that was split off of the version should be preserved in the versionsuffix field of any resulting imports and exports. This gives component runtimes and other tools access to the original version for error messages, documentation, and other development purposes. Where a versionsuffix is present the preceding interfacename must have a canonversion, and the concatenation of the canonversion and versionsuffix must be a valid semver.

For compatibility with older versions of this spec, non-canonical interfacenames (with interfaceversions matching any valid semver) are temporarily permitted. These non-canonical names may trigger warnings and will start being rejected some time after after WASI Preview 3 is released.

Component Invariants

As a consequence of the shared-nothing design described above, all calls into or out of a component instance necessarily transit through a component function definition. Thus, component functions form a "membrane" around the collection of core module instances contained by a component instance, allowing the Component Model to establish invariants that increase optimizability and composability in ways not otherwise possible in the shared-everything setting of Core WebAssembly. The Component Model proposes establishing the following two runtime invariants:

Components define a "lockdown" state that prevents continued execution after a trap. This both prevents continued execution with corrupt state and also allows more-aggressive compiler optimizations (e.g., store reordering). This was considered early in Core WebAssembly standardization but rejected due to the lack of clear trapping boundary. With components, each component instance is given a mutable "lockdown" state that is set upon trap and implicitly checked at every execution step by component functions. Thus, after a trap, it's no longer possible to observe the internal state of a component instance.
The Component Model disallows reentrance by trapping if a callee's component-instance is already on the stack when the call starts. (For details, see call_might_be_recursive in the Canonical ABI explainer.) This default prevents obscure composition-time bugs and also enables more-efficient non-reentrant runtime glue code. This rule will be relaxed by an opt-in function type attribute in the future.

JavaScript Embedding

JS API

The JS API currently provides WebAssembly.compile(Streaming) which take raw bytes from an ArrayBuffer or Response object and produces WebAssembly.Module objects that represent decoded and validated modules. To natively support the Component Model, the JS API would be extended to allow these same JS API functions to accept component binaries and produce new WebAssembly.Component objects that represent decoded and validated components. The binary format of components is designed to allow modules and components to be distinguished by the first 8 bytes of the binary (splitting the 32-bit core:version field into a 16-bit version field and a 16-bit layer field with 0 for modules and 1 for components).

Once compiled, a WebAssembly.Component could be instantiated using the existing JS API WebAssembly.instantiate(Streaming). Since components have the same basic import/export structure as modules, this means extending the read the imports logic to support single-level imports as well as imports of modules, components and instances. Since the results of instantiating a component is a record of JavaScript values, just like an instantiated module, WebAssembly.instantiate would always produce a WebAssembly.Instance object for both module and component arguments.

Types are a new sort of definition that are not (yet) present in Core WebAssembly and so the read the imports and create an exports object steps need to be expanded to cover them:

For type exports, each type definition would export a JS constructor function. This function would be callable iff a [constructor]-annotated function was also exported. All [method]- and [static]-annotated functions would be dynamically installed on the constructor's prototype chain. In the case of re-exports and multiple exports of the same definition, the same constructor function object would be exported (following the same rules as WebAssembly Exported Functions today). In pathological cases (which, importantly, don't concern the global namespace, but involve the same actual type definition being imported and re-exported by multiple components), there can be collisions when installing constructors, methods and statics on the same constructor function object. In such cases, a conservative option is to undo the initial installation and require all clients to instead use the full explicit names as normal instance exports.

For type imports, the constructors created by type exports would naturally be importable. Additionally, certain JS- and Web-defined objects that correspond to types (e.g., the RegExp and ArrayBuffer constructors or any Web IDL interface object) could be imported. The ToWebAssemblyValue checks on handle values mentioned below can then be defined to perform the associated internal slot type test, thereby providing static type guarantees for outgoing handles that can avoid runtime dynamic type tests.

Lastly, when given a component binary, the compile-then-instantiate overloads of WebAssembly.instantiate(Streaming) would inherit the compound behavior of the abovementioned functions (again, using the layer field to eagerly distinguish between modules and components).

For example, the following component:

;; a.wasm
(component
  (import "one" (func))
  (import "two" (value string)) 🪙
  (import "three" (instance
    (export "four" (instance
      (export "five" (core module
        (import "six" "a" (func))
        (import "six" "b" (func))
      ))
    ))
  ))
  ...
)

and module:

;; b.wasm
(module
  (import "six" "a" (func))
  (import "six" "b" (func))
  ...
)

could be successfully instantiated via:

WebAssembly.instantiateStreaming(fetch('./a.wasm'), {
  one: () => (),
  two: "hi", 🪙
  three: {
    four: {
      five: await WebAssembly.compileStreaming(fetch('./b.wasm'))
    }
  }
});

The other significant addition to the JS API would be the expansion of the set of WebAssembly types coerced to and from JavaScript values (by ToJSValue and ToWebAssemblyValue) to include all of valtype. At a high level, the additional coercions would be:

Type	`ToJSValue`	`ToWebAssemblyValue`
`bool`	`true` or `false`	`ToBoolean`
`s8`, `s16`, `s32`	as a Number value	`ToInt8`, `ToInt16`, `ToInt32`
`u8`, `u16`, `u32`	as a Number value	`ToUint8`, `ToUint16`, `ToUint32`
`s64`	as a BigInt value	`ToBigInt64`
`u64`	as a BigInt value	`ToBigUint64`
`f32`, `f64`	as a Number value	`ToNumber`
`char`	same as `USVString`	same as `USVString`, throw if the USV length is not 1
`record`	TBD: maybe a JS Record?	same as `dictionary`
`variant`	see below	see below
`list`	create a typed array copy for number types; otherwise produce a JS array (like `sequence`)	same as `sequence`
`string`	same as `USVString`	same as `USVString`
`tuple`	TBD: maybe a JS Tuple?	TBD
`flags`	TBD: maybe a JS Record?	same as `dictionary` of optional `boolean` fields with default values of `false`
`enum`	same as `enum`	same as `enum`
`option`	same as `T?`	same as `T?`
`result`	same as `variant`, but coerce a top-level `error` return value to a thrown exception	same as `variant`, but coerce uncaught exceptions to top-level `error` return values
`own`, `borrow`	see below	see below
`future`	to a `Promise`	from a `Promise`
`stream`	to a `ReadableStream`	from a `ReadableStream`

Notes:

Function parameter names are ignored since JavaScript doesn't have named parameters.
If a function's result type list is empty, the JavaScript function returns undefined. If the result type list contains a single unnamed result, then the return value is specified by ToJSValue above. Otherwise, the function result is wrapped into a JS object whose field names are taken from the result names and whose field values are specified by ToJSValue above.
In lieu of an existing standard JS representation for variant, the JS API would need to define its own custom binding built from objects. As a sketch, the JS values accepted by (variant (case "a" u32) (case "b" string)) could include { tag: 'a', value: 42 } and { tag: 'b', value: "hi" }.
For option, when Web IDL doesn't support particular type combinations (e.g., (option (option u32))), the JS API would fall back to the JS API of the unspecialized variant (e.g., (variant (case "some" (option u32)) (case "none")), despecializing only the problematic outer option).
When coercing ToWebAssemblyValue, own and borrow handle types would dynamically guard that the incoming JS value's dynamic type was compatible with the imported resource type referenced by the handle type. For example, if a component contains (import "Object" (type $Object (sub resource))) and is instantiated with the JS Object constructor, then (own $Object) and (borrow $Object) could accept JS object values.
When coercing ToJSValue, handle values would be wrapped with JS objects that are instances of the handles' resource type's exported constructor (described above). For own handles, a FinalizationRegistry would be used to drop the own handle (thereby calling the resource destructor) when its wrapper object was unreachable from JS. For borrow handles, the wrapper object would become dynamically invalid (throwing on any access) at the end of the export call.
When an imported JavaScript function is a built-in function wrapping a Web IDL function, the specified behavior should allow the intermediate JavaScript call to be optimized away when the types are sufficiently compatible, falling back to a plain call through JavaScript when the types are incompatible or when the engine does not provide a separate optimized call path.

ESM-integration

Like the JS API, ESM-integration can be extended to load components in all the same places where modules can be loaded today, branching on the layer field in the binary format to determine whether to decode as a module or a component.

For URL import names, the embedded URL would be used as the Module Specifier. For plain names, the whole plain name would be used as the Module Specifier (and an import map would be needed to map the string to a URL). For locked and unlocked dependency names, ESM-integration would likely simply fail loading the module, requiring a bundler to map these registry-relative names to URLs.

TODO: ESM-integration for interface imports and exports is still being worked out in detail.

The main remaining question is how to deal with component imports having a single string as well as the new importable component, module and instance types. Going through these one by one:

For component imports of module type, we need a new way to request that the ESM loader parse or decode a module without also instantiating that module. Recognizing this same need from JavaScript, there is a TC39 proposal called Import Reflection that adds the ability to write, in JavaScript:

import Foo from "./foo.wasm" as "wasm-module";
assert(Foo instanceof WebAssembly.Module);

With this extension to JavaScript and the ESM loader, a component import of module type can be treated the same as import ... as "wasm-module".

Component imports of component type would work the same way as modules, potentially replacing "wasm-module" with "wasm-component".

In all other cases, the (single) string imported by a component is first resolved to a Module Record using the same process as resolving the Module Specifier of a JavaScript import. After this, the handling of the imported Module Record is determined by the import type:

For imports of instance type, the ESM loader would treat the exports of the instance type as if they were the Named Imports of a JavaScript import. Thus, single-level imports of instance type act like the two-level imports of Core WebAssembly modules where the first-level has been factored out. Since the exports of an instance type can themselves be instance types, this process must be performed recursively.

Otherwise, function or value imports are treated like an Imported Default Binding and the Module Record is converted to its default value. This allows the following component:

;; bar.wasm
(component
  (import "./foo.js" (func (result string)))
  ...
)

to be satisfied by a JavaScript module via ESM-integration:

// foo.js
export default () => "hi";

when bar.wasm is loaded as an ESM:

<script src="bar.wasm" type="module"></script>

Examples

For some use-case-focused, worked examples, see:

FilesExpand file tree

Explainer.md

Latest commit

History

Explainer.md

File metadata and controls

Component Model AST Explainer

Gated Features

Grammar

Component Definitions

Index Spaces

Instance Definitions

Alias Definitions

Type Definitions

Fundamental value types

Numeric types

📝 Error Context type

Container types

Handle types

Asynchronous value types

Specialized value types

Definition types

Declarators

Type Checking

Canonical Definitions

Canonical ABI

Canonical Built-ins

Resource built-ins

resource.new

resource.drop

resource.rep

🔀🧵 Concurrency built-ins

🔀 context.get

🔀 context.set

🔀✕ backpressure.set

🔀 backpressure.inc and backpressure.dec

🔀 task.return

🔀 task.cancel

🔀 waitable-set.new

🔀 waitable-set.wait

🔀 waitable-set.poll

🔀 waitable-set.drop

🔀 waitable.join

🔀 subtask.cancel

🔀 subtask.drop

🔀 stream.new and future.new

🔀 stream.read and stream.write

🔀 future.read and future.write

🔀 stream.cancel-read, stream.cancel-write, future.cancel-read, and future.cancel-write

🔀 stream.drop-readable, stream.drop-writable, future.drop-readable, and future.drop-writable

🧵 thread.index

🧵 thread.new-indirect

🧵 thread.switch-to

🧵 thread.suspend

🧵 thread.resume-later

🧵 thread.yield-to

🧵 thread.yield

🧵② thread.spawn-ref

🧵② thread.spawn-indirect

🧵② thread.available-parallelism