# Hierarchical Predictive Coding for Reconstruction (Rao & Ballard, 1999)

In this exhibit, we create, simulate, and visualize the internally acquired receptive fields of the predictive coding
model originally proposed in (Rao & Ballard, 1999) [1].
The model code for this exhibit can be found
[here](https://github.com/NACLab/ngc-museum/tree/main/exhibits/pc_recon).


## Setting Up Hierarchical Predictive Coding (HPC) with NGC-Learn


### The HPC Model for Reconstruction Tasks

To build an HPC model, you will first need to define all of the components inside the model.
After doing this, you will next wire those components together under a specific configuration, depending
on the task.
This setup process involves doing the following:
1. **Create neural component**: instantiating neuronal unit (with dynamics) components.
2. **Create synaptic component**: instantiating synaptic connection components.
3. **Wire components**: defining how the components connect and interact with each other.

<!-- ################################################################################ -->

### 1: Create the Neural Component(s):

<!-- ################################################################################ -->


**Representation (Response) Neuronal Layers**
<br>

If we want to build an HPC model, which is a hierarchical neural network, we will need to set up a few neural layers. For predictive coding with real-valued (graded) dynamics, we will want to use the library's in-built `RateCell` components ([RateCell tutorial](https://ngc-learn.readthedocs.io/en/latest/tutorials/neurocog/rate_cell.html)).
Since we want a 3-layer network (i.e., an HPC model with three hidden, or "representation", layers), we need to define three components, each with an `n_units` size for their respective hidden representations. This is done as follows:

```python
with Context("Circuit") as circuit:  ## set up a (simulation) context for the HPC model w/ 3 hidden layers
    z3 = RateCell("z3", n_units=h3_dim, tau_m=tau_m, act_fx=act_fx, prior=(prior_type, lmbda))
    z2 = RateCell("z2", n_units=h2_dim, tau_m=tau_m, act_fx=act_fx, prior=(prior_type, lmbda))
    z1 = RateCell("z1", n_units=h1_dim, tau_m=tau_m, act_fx=act_fx, prior=(prior_type, lmbda))
```
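Note that the snippets throughout this walkthrough assume that hyperparameter names such as `in_dim`, `h1_dim`, `h2_dim`, `h3_dim`, `tau_m`, `act_fx`, `prior_type`, and `lmbda` have already been defined. The values below are purely illustrative placeholders (a sketch of one plausible configuration, not the exhibit's actual settings, which live in the linked repository):

```python
## Illustrative hyperparameter values only -- the exhibit's own scripts define the real ones
in_dim = 256                            ## input dimensionality (e.g., a flattened image patch)
h1_dim, h2_dim, h3_dim = 128, 64, 32    ## hidden/representation layer sizes
tau_m = 20.                             ## membrane/rate time constant
act_fx = "tanh"                         ## activation (phi) applied by each RateCell
prior_type, lmbda = "laplacian", 0.001  ## prior placed over each layer's activity, and its strength
T, dt = 50, 1.                          ## number of E-steps and integration step size
```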

<!-- ################################################################################ -->

<br>
<br>

<img src="../images/museum/hgpc/GEC.png" width="120" align="right"/>

**Error Neuronal Layers**
<br>


For each (`RateCell`) layer's activation, we will also want to set up an additional set of neuronal
layers -- with the same size as the representation layers -- to measure the prediction error(s)
for the sets of individual `RateCell` components. The error values that these layers emit will
later be used to calculate the (free) **energy** for each layer as well as for the whole model. This is
specified like so:

```python
e2 = GaussianErrorCell("e2", n_units=h2_dim)   ## e2_size == z2_size
e1 = GaussianErrorCell("e1", n_units=h1_dim)   ## e1_size == z1_size
e0 = GaussianErrorCell("e0", n_units=in_dim)   ## e0_size == z0_size (x size) (stimulus layer)
```
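Conceptually, each Gaussian error cell compares whatever is routed into its `target` compartment against the incoming prediction `mu`, emitting the mismatch signals and a scalar energy value. The rough NumPy sketch below is illustrative only (it is not the component's actual implementation, and the library's sign conventions may differ):

```python
import numpy as np

def gaussian_error_cell(target, mu):
    dmu = target - mu                             ## mismatch routed out of `dmu` (drives learning/bottom-up flow)
    dtarget = -(target - mu)                      ## mismatch routed out of `dtarget` (drives top-down correction)
    L = -0.5 * float(np.sum((target - mu) ** 2))  ## Gaussian (free) energy / log-likelihood-style term `L`
    return dmu, dtarget, L
```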

<br>
<br>

<!-- ################################################################################ -->

### 2: Create the Synaptic Component(s):

<!-- ################################################################################ -->


<!-- <img src="images/GEC.png" width="120" align="right"/> -->

**Forward Synaptic Connections**
<br>

To connect the layers of our model to each other, we will need to create synaptic components
(which will project/propagate information across the layers); ultimately, this means we need
to construct the message-passing scheme of our HPC model. In order to send information in a
"forward pass" (from the stimulus/input layer into deeper hidden layers, in a bottom-up stream),
we make use of `ForwardSynapse` components. Please check out
[Brain's Information Flow](https://github.com/Faezehabibi/pc_tutorial/blob/main/information_flow.md#---information-flow-in-the-brain--) for a more detailed explanation of the flow of information that we use in the context
of brain modeling.
Setting up the forward projections/pathway is done like so:

```python
E3 = ForwardSynapse("E3", shape=(h2_dim, h3_dim))   ## pre-layer size (h2) => (h3) post-layer size
E2 = ForwardSynapse("E2", shape=(h1_dim, h2_dim))   ## pre-layer size (h1) => (h2) post-layer size
E1 = ForwardSynapse("E1", shape=(in_dim, h1_dim))   ## pre-layer size (x)  => (h1) post-layer size
```
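In effect, each forward synapse acts as a linear map that carries a pre-layer-sized error signal up to a post-layer-sized current (its weights will later be tied to the transpose of the corresponding backward synapse in the process code). A tiny shape-checking sketch, using illustrative sizes and plain NumPy rather than the component itself:

```python
import numpy as np

in_dim, h1_dim = 256, 128                ## illustrative sizes only
e0_signal = np.zeros((1, in_dim))        ## a bottom-level error signal (batch of 1)
E1_weights = np.zeros((in_dim, h1_dim))  ## E1's shape convention: (pre-layer, post-layer)
j1 = e0_signal @ E1_weights              ## the bottom-up current delivered to layer z1
assert j1.shape == (1, h1_dim)
```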

<!-- ################################################################################ -->

<!-- <img src="images/GEC.png" width="120" align="right"/> -->

**Backward Synaptic Connections**
<br>

For each `ForwardSynapse` component that sends information upward (i.e., along the "bottom-up" stream),
there exists a `BackwardSynapse` component that reverses the flow of information by sending
signals back downwards (i.e., along the "top-down" stream -- from the top layer to the bottom/input ones).
Again, we refer you to this resource, [Information Flow](https://github.com/Faezehabibi/pc_tutorial/blob/19b0692fa307f2b06676ca93b9b93ba3ba854766/information_flow.md), for more information.
To set up the backwards message-passing connections, you will write the following:

```python
W3 = BackwardSynapse("W3",
                     shape=(h3_dim, h2_dim),  ## pre-layer size (h3) => (h2) post-layer size
                     optim_type=opt_type,     ## optimization method (sgd, adam, ...)
                     weight_init=w3_init,     ## W3[t0]: initial values before training, at time t0
                     w_bound=w_bound,         ## -1 deactivates the bounding of synaptic values
                     sign_value=-1.,          ## -1 means the M-step solves a minimization problem
                     eta=eta,                 ## learning rate (lr)
                     )
W2 = BackwardSynapse("W2",
                     shape=(h2_dim, h1_dim),  ## pre-layer size (h2) => (h1) post-layer size
                     optim_type=opt_type,     ## optimizer
                     weight_init=w2_init,     ## W2[t0]
                     w_bound=w_bound,         ## -1: deactivate the bounding
                     sign_value=-1.,          ## minimization
                     eta=eta,                 ## lr
                     )
W1 = BackwardSynapse("W1",
                     shape=(h1_dim, in_dim),  ## pre-layer size (h1) => (x) post-layer size
                     optim_type=opt_type,     ## optimizer
                     weight_init=w1_init,     ## W1[t0]
                     w_bound=w_bound,         ## -1: deactivate the bounding
                     sign_value=-1.,          ## minimization
                     eta=eta,                 ## lr
                     )
```

<br>
<br>
<!-- ----------------------------------------------------------------------------------------------------- -->

### Wiring the Component(s) Together:


The signaling pathway that we will create is in accordance with <b>[1]</b> (Rao and Ballard's classical model).
Error (mismatch) signals are the information that flows from the bottom (layer) of the model to its top (layer) in
the forward pass(es).
Corrected prediction information comes back from the top (layer) to the bottom (layer) in the backward
pass(es).

The following code block will set up the top-down projection (message-passing) pathway:

```python
######### Feedback pathways (Top-down) #########
### Actual neural activations
z2.z >> e2.target    ## Layer 2's target is z2's rate-value `z`
z1.z >> e1.target    ## Layer 1's target is z1's rate-value `z`
## Note: e0.target will be clamped to the input data `x`

### Top-down predictions
z3.zF >> W3.inputs   ## pass phi(z3) down through W3
W3.outputs >> e2.mu  ## prediction `mu` for (layer 2) z2's `z`
z2.zF >> W2.inputs   ## pass phi(z2) down through W2
W2.outputs >> e1.mu  ## prediction `mu` for (layer 1) z1's `z`
z1.zF >> W1.inputs   ## pass phi(z1) down through W1
W1.outputs >> e0.mu  ## prediction `mu` for (input layer) z0 = x

### Top-down prediction errors
e1.dtarget >> z1.j_td
e2.dtarget >> z2.j_td
```

The following code block will set up the error-feedback, bottom-up message-passing pathway:

```python
######### Forward propagation (Bottom-up) #########
## Feed the errors forward (upward) via the forward synapses
e2.dmu >> E3.inputs
e1.dmu >> E2.inputs
e0.dmu >> E1.inputs

## Bottom-up modulated errors
E3.outputs >> z3.j
E2.outputs >> z2.j
E1.outputs >> z1.j
```
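Taken together, the two wiring blocks above realize the message-passing of the Rao & Ballard scheme: predictions flow down, mismatches flow up. The following self-contained NumPy sketch of a single exchange of signals is illustrative only -- it uses placeholder shapes, a stand-in `phi` for the `RateCell` activation, and assumes each `E` matrix is tied to the transpose of its `W` (as is done later in the process code):

```python
import numpy as np

## Placeholder sizes and states (illustrative only)
d, h1, h2, h3 = 256, 128, 64, 32
rng = np.random.default_rng(0)
x = rng.standard_normal((1, d))                       ## clamped input/stimulus
z1, z2, z3 = (rng.standard_normal((1, h)) for h in (h1, h2, h3))
W1 = rng.standard_normal((h1, d))                     ## backward/prediction synapses
W2 = rng.standard_normal((h2, h1))
W3 = rng.standard_normal((h3, h2))
phi = np.tanh                                         ## stand-in for the RateCell activation (zF)

## Top-down predictions and the errors registered by the Gaussian error cells
mu2, mu1, mu0 = phi(z3) @ W3, phi(z2) @ W2, phi(z1) @ W1
e2, e1, e0 = z2 - mu2, z1 - mu1, x - mu0

## Bottom-up currents delivered to each RateCell (E_l tied to W_l^T)
j3, j2, j1 = e2 @ W3.T, e1 @ W2.T, e0 @ W1.T
```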

Finally, to enable learning, we will need to set up simple 2-term/factor Hebbian rules like so:

```python
########### Hebbian learning ############
### Set up the terms for the 2-term Hebbian rules
## Pre-synaptic activation (terms)
z3.zF >> W3.pre
z2.zF >> W2.pre
z1.zF >> W1.pre

## Post-synaptic residual error (terms)
e2.dmu >> W3.post
e1.dmu >> W2.post
e0.dmu >> W1.post
```
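In scalar terms, each of these 2-factor rules forms an outer product between the pre-synaptic activity routed into `pre` and the residual error routed into `post`. A rough sketch of the resulting update is shown below; it is illustrative only, since the actual direction, scaling, and scheduling of the update are governed by `sign_value`, `eta`, and the optimizer configured on each `BackwardSynapse`:

```python
import numpy as np

def hebbian_update(W, pre, post, eta=1e-2):
    """One 2-factor Hebbian step. W: (pre_dim, post_dim) synaptic matrix,
    pre: (batch, pre_dim) activity, post: (batch, post_dim) residual error."""
    return W + eta * (pre.T @ post)
```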

<br>
<br>
<!-- ----------------------------------------------------------------------------------------------------- -->

#### Specifying the HPC Model's Process Dynamics:


```python
######### Process #########

########## Reset/set all components to their resting values / initial conditions
circuit.reset()

circuit.clamp_input(obs)   ## clamp the signal to the lowest layer's activation
z0.z.set(obs)              ## or, directly place obs via e0.target.set(obs)

########## Pin/tie the feedback synapses to the transpose of the forward ones
E1.weights.set(jnp.transpose(W1.weights.value))
E2.weights.set(jnp.transpose(W2.weights.value))
E3.weights.set(jnp.transpose(W3.weights.value))

circuit.process(jnp.array([[dt * i, dt] for i in range(T)]))  ## Perform several E-steps
circuit.evolve(t=T, dt=1.)  ## Perform the M-step (scheduled synaptic updates)

obs_mu = e0.mu.value   ## get the reconstructed signal
L0 = e0.L.value        ## get the reconstruction loss
```
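The calls above process a single (batch of) observation(s) `obs`. A minimal sketch of how they might be composed into an outer training loop follows; the names `n_epochs` and `dataset` are illustrative and not defined in this exhibit, so treat the training script in the linked repository as the reference:

```python
import jax.numpy as jnp

for epoch in range(n_epochs):                ## n_epochs: illustrative/assumed name
    for obs in dataset:                      ## dataset: assumed to yield (batch_size, in_dim) arrays
        circuit.reset()                      ## wipe neural state before each batch
        circuit.clamp_input(obs)             ## clamp the stimulus at the bottom layer
        E1.weights.set(jnp.transpose(W1.weights.value))  ## re-tie feedback synapses to W^T
        E2.weights.set(jnp.transpose(W2.weights.value))
        E3.weights.set(jnp.transpose(W3.weights.value))
        circuit.process(jnp.array([[dt * i, dt] for i in range(T)]))  ## E-steps (settle the states)
        circuit.evolve(t=T, dt=1.)           ## M-step (apply the scheduled Hebbian updates)
```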

<br>
<br>
<br>
```


<!-- -------------------------------------------------------------------------------------
### Train PC model for reconstructing the full image
