
Commit eca1561

lucylq authored and mergennachin committed
Add concepts (#884)
Summary: Pull Request resolved: #884. Add some more concepts.

Reviewed By: mergennachin

Differential Revision: D50250661

fbshipit-source-id: b94643c8878934e1d2435d77033fe9df34db1597
1 parent 5776c73 commit eca1561

File tree: 2 files changed, +28 −5 lines


docs/source/compiler-custom-compiler-passes.md

Lines changed: 1 addition & 1 deletion

@@ -31,7 +31,7 @@ This is an
 way where we execute each node and recreate the graph except with
 transformations specified. This allows us to preserve the IR Spec by ensuring
 that all nodes created while in the pass meet the IR Spec including ensuring that
-metadata such as stack trace, FakeTensor values, and torch.nn.Module heirarchy
+metadata such as stack trace, FakeTensor values, and torch.nn.Module hierarchy
 are preserved and updated depending on the transformations made.

 To implement this pass, we can create a subclass of

docs/source/concepts.md

Lines changed: 27 additions & 4 deletions

@@ -36,7 +36,11 @@ A specific hardware (like GPU, NPU) or a software stack (like XNNPACK) that cons

 Backend dialect is the result of exporting Edge dialect to specific backend. It’s target-aware, and may contain operators or submodules that are only meaningful to the target backend. This dialect allows the introduction of target-specific operators that do not conform to the schema defined in the Core ATen Operator Set and are not shown in ATen or Edge Dialect.

-## [Backend Specific Operator]
+## Backend registry
+
+A table mapping backend names to backend interfaces. This allows backends to be called via name during runtime.
+
+## Backend Specific Operator

 These are operators that are not part of ATen dialect or Edge dialect. Backend specific operators are only introduced by passes that happen after Edge dialect (see Backend dialect). These operators are specific to the target backend and will generally execute faster.
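The backend registry added in this hunk is essentially a name-to-interface lookup table. A minimal plain-Python sketch of the idea follows; the class and function names here are hypothetical illustrations, not ExecuTorch's actual API:

```python
# Sketch of a backend registry: a table mapping backend names to backend
# interfaces, so a backend can be resolved by name at runtime.
# All names below are hypothetical, not ExecuTorch's real types.

class BackendInterface:
    """Hypothetical interface that every registered backend implements."""
    def execute(self, payload):
        raise NotImplementedError

class XnnpackBackend(BackendInterface):
    def execute(self, payload):
        return f"ran {payload!r} on XNNPACK"

BACKEND_REGISTRY = {}

def register_backend(name, backend):
    # Record the backend under a string key.
    BACKEND_REGISTRY[name] = backend

def get_backend(name):
    # Resolve a backend by name; fail loudly if it was never registered.
    backend = BACKEND_REGISTRY.get(name)
    if backend is None:
        raise KeyError(f"no backend registered under {name!r}")
    return backend

register_backend("XnnpackBackend", XnnpackBackend())
print(get_backend("XnnpackBackend").execute("add_graph"))
```

Keying on a plain string is what lets a serialized program refer to a backend without linking against it at export time.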

@@ -48,6 +52,10 @@ An open-source, large scale build system. Used to build ExecuTorch.

 An open-source, cross-platform family of tools designed to build, test and package software. Used to build ExecuTorch.

+## Codegen
+
+In ExecuTorch, code generation is used to generate the [kernel registration library](./kernel-library-selective_build.md).
+
 ## Core ATen Dialect

 Core ATen dialect contains the core ATen operators along with higher order operators (control flow) and registered custom operators.
@@ -122,12 +130,16 @@ An ExecuTorch `Program` maps string names like `forward` to specific ExecuTorch

 ## executor_runner

-The ExecuTorch runtime that executes the exported PyTorch model on-device.
+A sample wrapper around the ExecuTorch runtime which includes all the operators and backends.

 ## [EXIR](./ir-exir.md)

 The **EX**port **I**ntermediate **R**epresentation (IR) from `torch.export`. Contains the computational graph of the model. All EXIR graphs are valid [FX graphs](https://pytorch.org/docs/stable/fx.html#torch.fx.Graph).

+## `ExportedProgram`
+
+The output of `torch.export` that bundles the computational graph of a PyTorch model (usually an `nn.Module`) with the parameters or weights that the model consumes.
+
 ## [flatbuffer](https://github.com/google/flatbuffers)

 Memory efficient, cross platform serialization library. In the context of ExecuTorch, eager mode Pytorch models are exported to flatbuffer, which is the format consumed by the ExecuTorch runtime.
@@ -136,6 +148,10 @@ Memory efficient, cross platform serialization library. In the context of ExecuT

 The cost of various loading and initialization tasks (not inference). For example; loading a program, initializing executor, kernel and backend-delegate dispatch, and runtime memory utilization.

+## Functional ATen operators
+
+ATen operators that do not have any side effects.
+
 ## [Graph](./ir-exir.md)

 An EXIR Graph is a PyTorch program represented in the form of a DAG (directed acyclic graph). Each node in the graph represents a particular computation or operation, and edges of this graph consist of references between nodes. Note: all EXIR graphs are valid [FX graphs](https://pytorch.org/docs/stable/fx.html#torch.fx.Graph).
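The "Functional ATen operators" entry added in this hunk distinguishes side-effect-free operators from mutating ones. A hedged plain-Python analogy (Python lists stand in for tensors; this is not real ATen code, though ATen's `add` vs. `add_` naming follows the same split):

```python
# Sketch of functional vs. mutating operators, using lists as "tensors".

def add(a, b):
    # Functional: no side effects; inputs are untouched, a new value is returned.
    return [x + y for x, y in zip(a, b)]

def add_(a, b):
    # Non-functional: mutates `a` in place (a side effect), mirroring
    # PyTorch's trailing-underscore convention for in-place ops.
    for i, y in enumerate(b):
        a[i] += y
    return a

t = [1.0, 2.0]
out = add(t, [10.0, 20.0])
assert t == [1.0, 2.0]      # functional op left its input unchanged
add_(t, [10.0, 20.0])
assert t == [11.0, 22.0]    # in-place op changed it
```

Side-effect freedom is what makes graph transformations safe to reorder and is why export pipelines prefer the functional forms.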
@@ -156,6 +172,11 @@ A representation of a program between the source and target languages. Generally

 An implementation of an operator. There can be multiple implementations of an operator for different backends/inputs/etc.

+
+## Kernel registry / Operator registry
+
+A table with mappings between kernel names and their implementations. This allows the ExecuTorch runtime to resolve references to kernels during execution.
+
 ## Lowering

 The process of transforming a model to run on various backends. It is called 'lowering' as it is moving code closer to the hardware. In ExecuTorch, lowering is performed as part of backend delegation.
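The kernel/operator registry described in this hunk is a lookup table from operator names to implementations. A minimal sketch in plain Python; the registry and decorator names are hypothetical, not ExecuTorch's real registration mechanism:

```python
# Sketch of a kernel registry: operator name -> kernel implementation.
# The runtime resolves operator references through this table at execution
# time. Names below are illustrative only.

KERNEL_REGISTRY = {}

def register_kernel(op_name):
    # Decorator that records an implementation under an operator name.
    def decorator(fn):
        KERNEL_REGISTRY[op_name] = fn
        return fn
    return decorator

@register_kernel("aten::add.out")
def add_out_kernel(a, b):
    return a + b

def resolve_and_call(op_name, *args):
    # Look the kernel up by name, as a runtime would when executing a graph.
    kernel = KERNEL_REGISTRY.get(op_name)
    if kernel is None:
        raise RuntimeError(f"missing kernel for operator {op_name!r}")
    return kernel(*args)

print(resolve_and_call("aten::add.out", 2, 3))
```

Because the table is populated at registration time, a selective build can shrink the binary by registering only the kernels a given model actually needs.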
@@ -176,9 +197,11 @@ Function on tensors. This is the abstraction; kernels are the implementation. Th

 Operator fusion is the process of combining multiple operators into a single compound operator, resulting in faster computation due to fewer kernel launches and fewer memory read/writes. This is a performance advantage of graph mode vs eager mode.

-## Operator registration
+## Out variant
+
+Instead of allocating returned tensors in kernel implementations, an operator's out variant will take in a pre-allocated tensor to its out kwarg, and store the result there.

-Operators need to be registered with the ExecuTorch runtime. This allows the compiler to resolve references to the operator in code.
+This makes it easier for memory planners to perform tensor lifetime analysis. In ExecuTorch, an out variant pass is performed before memory planning.

 ## [PAL (Platform Abstraction Layer)](./runtime-platform-abstraction-layer.md)
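The "Out variant" entry added in this hunk can be illustrated with a small sketch: the functional form allocates its result, while the out variant writes into a caller-provided buffer whose lifetime a memory planner can see ahead of time. This is a plain-Python analogy (lists as tensors), not real kernel code:

```python
# Functional variant: allocates a fresh "tensor" for the result.
def mul(a, b):
    return [x * y for x, y in zip(a, b)]

# Out variant: writes into a pre-allocated `out` buffer instead of
# allocating, mirroring the `out=` kwarg convention.
def mul_out(a, b, out):
    for i, (x, y) in enumerate(zip(a, b)):
        out[i] = x * y
    return out

buf = [0.0] * 3                       # pre-allocated by the "memory planner"
result = mul_out([1, 2, 3], [4, 5, 6], buf)
assert result is buf                  # no new allocation happened
assert buf == [4, 10, 18]
```

Because every buffer is handed in rather than allocated inside the kernel, all tensor lifetimes are known before execution, which is what the out-variant pass before memory planning exploits.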
