:label: `ch06/ch06-categorize`

### Linear Intermediate Representation

Linear IRs are widely used in compiler design and resemble assembly code
for an abstract machine. They represent the code to be compiled as a
sequentially ordered series of operations, and this ordering is
significant in practice. Linear IRs are popular because the assembly
languages of most processors are themselves linear.

Two common types of linear IRs are stack machine code and three-address
code. Stack machine code, a form of single-address code, offers a
straightforward and compact representation. Instructions in stack
machine code typically consist solely of an opcode that specifies an
operation, with operands stored on a stack. Most instructions retrieve
their operands from the stack and push the results of their operations
back onto it. Three-address code (3AC), on the other hand, emulates the
instruction format used in modern RISC machines. It employs a set of
quadruples, each containing an operator and three addresses (two
operands and one target). Figure
:numref:`ch04/ch04-linearIR` illustrates the stack machine code
and three-address code representations for the expression $a-b*5$.

![Stack machine code and three-address code](../img/ch04/IR-linear_IR.png)
:label: `ch04/ch04-linearIR`
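
As a small, hedged illustration of both forms in Python (CPython's bytecode is itself a stack machine language, so the exact opcode names vary across interpreter versions), the sketch below disassembles the expression and then writes the same computation by hand as three-address quadruples:

```python
import dis

# Stack machine code: the expression compiles to bytecode that pushes
# operands onto a stack and pops them for each operation (opcode names
# differ across Python versions, e.g., BINARY_MULTIPLY vs. BINARY_OP).
dis.dis(compile("a - b * 5", "<expr>", "eval"))

# Three-address code: the same computation written by hand as
# (operator, operand1, operand2, target) quadruples.
three_address_code = [
    ("mul", "b", 5, "t1"),     # t1 <- b * 5
    ("sub", "a", "t1", "t2"),  # t2 <- a - t1
]
print(three_address_code)
```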

### Graphical Intermediate Representation

Graphical IRs store information about the compilation process in the
form of graphs. These graphs utilize nodes, edges, lists, trees, and
other elements to collectively represent an algorithm. Although all
graphical IRs consist of nodes and edges, they differ in terms of
abstraction levels and graph structures. Common examples of graphical
IRs include abstract syntax trees (ASTs), directed acyclic graphs
(DAGs), and control-flow graphs (CFGs).

An AST is a tree-structured IR that closely resembles the structure of
the source code. Figure :numref:`ch04/ch04-AST_DAG` depicts the AST for the expression
$a*5+a*5*b$. It is worth noting that the AST contains two identical
copies of the subtree for $a*5$, which introduces redundancy. To address
this redundancy, the DAG offers a simplified representation where
identical subtrees can be shared by multiple parent nodes. By reusing
subtrees, the DAG reduces the cost of the evaluation process, especially
when the compiler can verify that the value of $a$ remains constant.

![AST and DAG](../img/ch04/IR-ASTDAG.png)
:label: `ch04/ch04-AST_DAG`
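
To make the sharing concrete, here is a minimal sketch in Python: it prints the AST of the expression using the standard `ast` module and then builds a DAG by hash-consing structurally identical subtrees. The `build_dag` helper is hypothetical, written only for illustration, not a real compiler pass.

```python
import ast

# Parse the expression and print its AST: the subtree for a*5 appears
# twice, once under each operand of the outer addition.
tree = ast.parse("a*5 + a*5*b", mode="eval")
print(ast.dump(tree.body, indent=2))  # indent= requires Python 3.9+

def build_dag(node, cache):
    """Turn an AST into a DAG by sharing structurally identical subtrees."""
    if isinstance(node, ast.BinOp):
        key = (type(node.op).__name__,
               build_dag(node.left, cache),
               build_dag(node.right, cache))
    elif isinstance(node, ast.Name):
        key = ("name", node.id)
    elif isinstance(node, ast.Constant):
        key = ("const", node.value)
    else:
        raise NotImplementedError(type(node))
    # Identical keys map to the same DAG node, so a*5 is built only once.
    return cache.setdefault(key, key)

cache = {}
build_dag(tree.body, cache)
print(len(cache), "unique DAG nodes")  # fewer than the 9 nodes of the AST
```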

### Hybrid Intermediate Representation

Hybrid IRs combine both linear IR and graphical IR elements. An example
of a hybrid IR is LLVM IR, which is illustrated in Figure
:numref:`ch04/ch04-LLVM_IR`. LLVM is an open-source compiler
framework whose goal is to provide unified IRs for different frontends
and backends.

In LLVM IR, linear IRs are used to construct basic blocks, while
graphical IRs represent the control flow between these blocks. Each
instruction within a basic block is expressed in static single
assignment (SSA) form. SSA requires each variable to be defined before
use and assigned a value only once. Multiple SSA instructions form a
linear list within a basic block.

In the control-flow graph (CFG), each node represents a basic block, and
control transfer between these blocks is implemented through edges. This
combination of linear IR for basic blocks and graphical IR for control
flow allows for a flexible and efficient representation in LLVM IR.

![LLVM IR](../img/ch04/IR-LLVMIR.png)
:label: `ch04/ch04-LLVM_IR`
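
The following is a minimal sketch of this hybrid structure in plain Python (the `Instruction` and `BasicBlock` classes are hypothetical data structures for illustration, not LLVM's actual classes): instructions inside a block form a linear, SSA-style list, while the successor edges between blocks form the CFG.

```python
from dataclasses import dataclass, field

@dataclass
class Instruction:
    result: str       # each SSA value is assigned exactly once
    op: str
    operands: tuple

@dataclass
class BasicBlock:
    name: str
    instructions: list = field(default_factory=list)
    successors: list = field(default_factory=list)  # CFG edges

entry, then_bb, merge = BasicBlock("entry"), BasicBlock("then"), BasicBlock("merge")

# Linear part: a sequential list of SSA instructions inside one block.
entry.instructions += [
    Instruction("%t1", "mul", ("%b", 5)),
    Instruction("%t2", "sub", ("%a", "%t1")),
    Instruction("%cond", "icmp_sgt", ("%t2", 0)),
]

# Graphical part: control transfers are edges between basic blocks.
entry.successors += [then_bb, merge]
then_bb.successors.append(merge)

print([i.result for i in entry.instructions])
print([b.name for b in entry.successors])
```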

## Intermediate Representation in Machine Learning Frameworks

Classical IRs (such as LLVM IR) primarily target programming languages
for general-purpose computation tasks and therefore fall short of
satisfying the unique requirements of machine-learning-related
computation. When designing IRs tailored for machine learning
frameworks, several vital factors warrant attention:

- **Tensor Representation**. Given the predominance of tensor data in
  machine learning frameworks, it is imperative that the IRs can
  effectively handle tensor representation.

- **Automatic Differentiation**. A core aspect of machine learning
  involves evaluating derivatives of neural networks and optimizers
  through automatic differentiation. Accordingly, IRs must prioritize
  simplicity, performance, and scalability of higher-order
  differentials for automatic differentiation.

- **Computational Graph Mode**. Machine learning frameworks like
  TensorFlow, PyTorch, and MindSpore operate on two computational
  graph modes: static and dynamic. The static mode, with pre-defined
  computational graphs, enhances optimization but compromises on
  flexibility. Conversely, the dynamic mode trades running speed for
  flexibility and easier debugging by executing operators immediately
  in the computational graph. IRs should therefore support both modes,
  enabling users to choose the one best suited for their tasks while
  building algorithm models.

- **Support for Higher-order Functions and Closures**. Essential in
  functional programming, higher-order functions take or return
  functions, while closures bundle code blocks with references to the
  surrounding environment, facilitating access to an outer function's
  scope from an inner function (see the sketch after this list). Such
  support reduces redundant code, improves abstraction, and enhances
  the flexibility and simplicity of framework representations.

- **Compilation Optimization**. Machine learning frameworks lean on
  compilation optimizations, including hardware-agnostic,
  hardware-specific, and deployment- or inference-related
  optimizations. These optimizations rely heavily on the design and
  implementation of the IR.

- **Just-in-Time (JIT) Compilation**. For expedited compilation and
  execution in machine learning frameworks, JIT compilation is
  frequently utilized. JIT optimizations, including loop unrolling,
  fusion, and inlining, play a crucial role in optimizing parts of the
  data-flow graphs in IRs. A flawed IR design can hamper JIT
  compilation performance in machine learning frameworks, thereby
  impacting the program's running capabilities.
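
As a minimal illustration of the higher-order-function and closure point (a toy numerical sketch, not how any framework actually implements automatic differentiation), the hypothetical `grad` below takes a function and returns a new function, a closure over `f` and `eps`, that approximates its derivative by central finite differences:

```python
def grad(f, eps=1e-6):
    # Returns a closure over `f` and `eps`; real frameworks build such
    # higher-order operations on their IR and automatic differentiation
    # rather than on numerical approximation.
    def df(x):
        return (f(x + eps) - f(x - eps)) / (2 * eps)
    return df

square = lambda x: x * x
dsquare = grad(square)           # a closure capturing `square` and `eps`
print(dsquare(3.0))              # approximately 6.0
ddsquare = grad(grad(square))    # higher-order use: second derivative
print(ddsquare(3.0))             # approximately 2.0
```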

Considering these factors, developers persistently refine classical IRs
and introduce new IRs specifically tailored for machine learning
frameworks. In the following section, we will delve into the IRs
employed by various machine learning frameworks.

### Intermediate Representation in PyTorch

PyTorch is a dynamic, Python-oriented machine learning framework.
Renowned for its usability and flexibility, PyTorch simplifies the
process of writing and debugging machine learning programs. It
introduces TorchScript, a method for constructing serializable and
optimizable models during the saving and loading of neural networks.

In particular, TorchScript employs JIT compilation to convert Python
code into target model files. All TorchScript programs can be saved
within a Python process and later loaded into processes that have no
Python dependencies.
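
A minimal sketch of this workflow (assuming PyTorch is installed; `TinyModel` is a hypothetical module defined purely for illustration) scripts a module, serializes it, and loads it back:

```python
import torch

class TinyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = torch.nn.Linear(4, 2)

    def forward(self, x):
        return torch.relu(self.fc(x))

# Scripting compiles the module to TorchScript and serializes it; the
# resulting archive can later be loaded in another process without the
# original Python source.
scripted = torch.jit.script(TinyModel())
scripted.save("tiny_model.pt")

restored = torch.jit.load("tiny_model.pt")
print(restored(torch.randn(1, 4)))
```

The saved archive bundles the TorchScript IR together with the parameters, which is what allows it to be executed without the original Python code, for example from the libtorch C++ runtime.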

Aligning with the imperative programming paradigm, PyTorch incorporates
the TorchScript IR, composed primarily of static single assignment
(SSA)-based linear IRs, to represent Python code. This representation
can be produced through either the Tracing or the Scripting method of
JIT compilation. TorchScript IR not only amplifies model deployment
capabilities but also bolsters compilation performance. Additionally,
TorchScript IR greatly improves model visualization within the
PyTorch framework.

Code `lst:torchscript` illustrates the use of the Scripting method
to print a TorchScript IR graph.