diff --git a/DEVELOPER_GUIDE.md b/DEVELOPER_GUIDE.md
new file mode 100644
index 000000000000..5de4a135d721
--- /dev/null
+++ b/DEVELOPER_GUIDE.md
@@ -0,0 +1,178 @@
+# Developer Guide
+
+The aim of this document is to give a general description of the codebase to those who would like to contribute. It will use technical language and will go deep into the various parts of the codebase.
+
+In the most general sense, Svelte works as follows:
+
+- A component is parsed into an [abstract syntax tree (AST)](https://en.m.wikipedia.org/wiki/Abstract_syntax_tree) compatible with the [ESTree spec](https://github.com/estree/estree)
+- The AST is analyzed - defining the scopes, finding stateful variables, etc.
+- The AST is transformed, either into a server component or a client component based on the `generate` option. The transformation produces a JS module and CSS, if there is any
+- A server component imports the server runtime from `svelte/internal/server` and when executed with `render` produces a string of the `body` and a string of the `head`
+- A client component imports the client runtime from `svelte/internal/client` and when executed - either with `mount` or `hydrate` - creates the DOM elements (or retrieves them from the pre-existing DOM in case of hydration), attaches listeners, and creates state and effects that are needed to keep the DOM in sync with the state.
+
+## Phase 1: Parsing
+
+Parsing is the first step to convert the component into a runnable JS file. Your Svelte component is effectively a string and, while we could try to do something with regexes and replacements, the standard way to manipulate code is to first build an AST and then manipulate that. An AST is a structured representation of code. Each language has its own syntax and corresponding AST (based on the parser used).
Every JavaScript part of a Svelte component, be it the script tag or an expression tag in your template, is parsed with `acorn` (`acorn-typescript` in case you use `lang="ts"`) to produce an ESTree compatible tree.
+
+If you want a more in-depth explanation of how a parser works, you can refer to [this video](https://www.youtube.com/watch?v=mwvyKGw2CzU) by @tanhauhau where he builds a mini Svelte 4 from scratch, but the gist of it is that you can basically have three main operations during the parsing phase: `eat`, `read` and `match` (with some variations).
+
+You start from the first character of the string and try to match it with a known symbol in the language. Considering the shape of a Svelte component, you either have an `element` (`
+
+
+
+{count}
+```
+
+Depending on where you read `count` it will refer to a different variable that has been shadowed. The `count` in the template and in `increase` refers to the `count` declared in instance script, while the one in the `log` function will refer to its argument.
+
+This is done by walking the AST and manually creating a `new Scope` every time we encounter a node that creates one.
+
+
+	What does walking the AST mean?
+
+	As we've seen, the AST is basically a giant JavaScript object with a `type` property to indicate the node type and a series of extra properties.
+
+	For example, a `$state(1)` node will look like this (excluding position information):
+
+	```js
+	{
+		type: "CallExpression",
+		callee: {
+			type: "Identifier",
+			name: "$state",
+		},
+		arguments: [{
+			type: "Literal",
+			value: 1,
+			raw: "1",
+		}]
+	}
+	```
+
+	Walking allows you to invoke a function (that's called a visitor) for each of the nodes in the AST, receiving the node itself as an argument.
+
+
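To make the idea of walking more concrete, here is a minimal hand-rolled walker applied to the `$state(1)` node shown above. It is a simplified sketch for illustration only, not the walker the Svelte compiler uses: `AstNode`, `isNode` and `walkSimple` are hypothetical names invented for this example, and the sketch assumes that any object with a string `type` property is a node.

```ts
// Hypothetical, simplified walker for illustration; not Svelte's actual implementation.
type AstNode = { type: string; [key: string]: unknown };
type Visitor = (node: AstNode) => void;

// Assumption: anything that is an object with a string `type` counts as an AST node.
function isNode(value: unknown): value is AstNode {
	return typeof value === 'object' && value !== null && typeof (value as AstNode).type === 'string';
}

function walkSimple(node: AstNode, visitors: Record<string, Visitor>): void {
	// Call the visitor registered for this node type, if there is one.
	visitors[node.type]?.(node);

	// Recurse into every property that holds a node or an array of nodes.
	for (const value of Object.values(node)) {
		if (Array.isArray(value)) {
			for (const item of value) {
				if (isNode(item)) walkSimple(item, visitors);
			}
		} else if (isNode(value)) {
			walkSimple(value, visitors);
		}
	}
}

// The `$state(1)` call expression, without position information.
const ast: AstNode = {
	type: 'CallExpression',
	callee: { type: 'Identifier', name: '$state' },
	arguments: [{ type: 'Literal', value: 1, raw: '1' }]
};

// Record which nodes the visitors were invoked for.
const visited: string[] = [];
walkSimple(ast, {
	Identifier: (node) => visited.push(`Identifier:${node.name}`),
	Literal: (node) => visited.push(`Literal:${node.raw}`)
});
```

After the walk, `visited` holds one entry for the identifier and one for the literal, in the order the walker reached them. A real walker additionally threads state through the traversal and lets a visitor decide when to continue into child nodes, which is what the `state` and `next` arguments in the compiler's visitors are for.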
+
+Let's see an example: when you declare a function in your code, the corresponding AST node is a `FunctionDeclaration`, so if you look into the `create_scopes` function you'll see something like this:
+
+```ts
+walk(ast, state, {
+	// other visitors
+	FunctionDeclaration(node, { state, next }) {
+		if (node.id) state.scope.declare(node.id, 'normal', 'function', node);
+
+		const scope = state.scope.child();
+		scopes.set(node, scope);
+
+		add_params(scope, node.params);
+		next({ scope });
+	}
+	// other visitors
+});
+```
+
+What this snippet of code is doing is:
+
+- checking if the function declaration has an identifier (basically if it's a named or anonymous function)
+- if it has one, declaring a new variable in the current scope
+- creating a new scope (since in JavaScript when you create a function you are creating a new lexical scope) with the current scope as the parent
+- declaring every parameter of the function in the newly created scope
+- invoking the `next` method, which continues the AST traversal with the brand new scope as the current scope
+
+The same is obviously true for Svelte-specific nodes too: the `SnippetBlock` visitor looks basically identical to the `FunctionDeclaration` one:
+
+```ts
+walk(ast, state, {
+	// other visitors
+	SnippetBlock(node, context) {
+		const state = context.state;
+		let scope = state.scope;
+
+		scope.declare(node.expression, 'normal', 'function', node);
+
+		const child_scope = state.scope.child();
+		scopes.set(node, child_scope);
+
+		for (const param of node.parameters) {
+			for (const id of extract_identifiers(param)) {
+				child_scope.declare(id, 'snippet', 'let');
+			}
+		}
+
+		context.next({ scope: child_scope });
+	}
+	// other visitors
+});
+```
+
+After the initial walk to figure out the right scopes, we walk the AST once again. This time we use a generic visitor (one that runs before any visit to a node) to pass down the appropriate scope to the node (and to collect information about the `// svelte-ignore` comments):
+
+```ts
+const visitors = {
+	_(node, { state, next, path }) {
+		const parent = path.at(-1);
+
+		/** @type {string[]} */
+		const ignores = [];
+
+		// logic to collect svelte-ignore excluded for brevity
+
+		const scope = state.scopes.get(node);
+		next(scope !== undefined && scope !== state.scope ? { ...state, scope } : state);
+
+		if (ignores.length > 0) {
+			pop_ignore();
+		}
+	}
+	// rest of the visitors
+};
+```
+
+This means that in every visitor we can access the `scope` property and ask for information about every variable by name.
diff --git a/assets/developer-guide/ast.png b/assets/developer-guide/ast.png
new file mode 100644
index 000000000000..49c5be329190
Binary files /dev/null and b/assets/developer-guide/ast.png differ