
Commit 3f848a8

Rao1999 hpc (#135)
* Enhance documentation for predictive coding model: expanded the documentation for the predictive coding model, detailing the construction of neural and synaptic components, process dynamics, and training procedures.
* Add files via upload
* Update image path for GEC in documentation
* Revise PC model documentation formatting: updated headings and formatting for clarity in PC model documentation.
* Change header levels for PC model training sections
* Add files via upload
* Update PC model training section with input image details: added explanation about the input image for the PC model training.
* Update pc_rao_ballard1999.md
* Delete docs/images/museum/hgpc/Patch_input.png
* Add files via upload
* Fix image source and enhance PC model description: updated image source and adjusted description for clarity.
* Refactor PC model training sections in documentation: removed the section for training the PC model on the full image and added a reference to it in the patched image section.
1 parent 2e2dd5e commit 3f848a8

File tree

3 files changed: +279 −1 lines changed

docs/images/museum/hgpc/GEC.png (340 KB)
docs/images/museum/hgpc/patch_input.png (101 KB)

docs/museum/pc_rao_ballard1999.md

Lines changed: 279 additions & 1 deletion
@@ -6,7 +6,285 @@ model originally proposed in (Rao & Ballard, 1999) [1].
The model code for this exhibit can be found
[here](https://github.com/NACLab/ngc-museum/tree/main/exhibits/pc_recon).

## Predictive Coding with NGC-Learn
-----------------------------------------------------------------------------------------------------

### PC model for Reconstruction Task

To build a PC model, you first need to define all of the components inside the model.
Then you wire those components together, with the specific configuration depending
on the task:

1. **Create neural components**
2. **Create synaptic components**
3. **Wire components** – define how the components connect and interact with one another (a skeleton sketch follows this list).

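The wiring statements in the next subsections operate on component objects and, later in Process Dynamics, on a model object named `circuit`. Here is a minimal skeleton of how the three steps fit together, assuming (as in the linked exhibit code) that components are created inside an ngc-learn `Context`; the exact import path is an assumption here:

```python
from ngclearn import Context  ## assumption: Context is the container the exhibit uses

with Context("circuit") as circuit:
    ## step 1: create neural components (RateCell, GaussianErrorCell), as shown below
    ## step 2: create synaptic components (ForwardSynapse, BackwardSynapse)
    ## step 3: wire the components together with the `<<` operator
    pass  ## placeholder; the concrete statements appear in the next subsections
```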

-----------------------------------------------------------------------------------------------------

<!-- ################################################################################ -->

### 1. Make the Neural Components

<!-- ################################################################################ -->

**Responding Neurons**
<br>

To build a hierarchical neural network, we need neural layers. In a predictive coding network with real-valued dynamics,
we use `RateCell` components ([RateCell tutorial](https://ngc-learn.readthedocs.io/en/latest/tutorials/neurocog/rate_cell.html)).
Here, we want a 3-layer network (3 hidden layers), so we define 3 components, each with its own `n_units` size for its hidden representation.
```python
z3 = RateCell("z3", n_units=h3_dim, tau_m=tau_m, act_fx=act_fx, prior=(prior_type, lmbda))
z2 = RateCell("z2", n_units=h2_dim, tau_m=tau_m, act_fx=act_fx, prior=(prior_type, lmbda))
z1 = RateCell("z1", n_units=h1_dim, tau_m=tau_m, act_fx=act_fx, prior=(prior_type, lmbda))
```

<!-- ################################################################################ -->

<br>
<br>

<img src="../images/museum/hgpc/GEC.png" width="120" align="right" />

**Error Neurons**
<br>

For each activation layer, we add a set of error neurons of the same size to measure the prediction error of the corresponding
`RateCell` component. The error values will later be used to calculate the **energy** of each layer (including the hidden ones) and of the whole model.

```python
e2 = GaussianErrorCell("e2", n_units=h2_dim) ## e2_size == z2_size
e1 = GaussianErrorCell("e1", n_units=h1_dim) ## e1_size == z1_size
e0 = GaussianErrorCell("e0", n_units=in_dim) ## e0_size == z0_size (x size)
```

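Concretely, each `GaussianErrorCell` measures the mismatch between the activity it targets and the top-down prediction `mu` it receives; summed over the layers, this mismatch is the energy that both inference and learning drive down. A sketch in the notation of Rao & Ballard (1999), assuming unit variances:

$$
\mathcal{E} \;=\; \sum_{\ell=0}^{2} \frac{1}{2} \left\| \mathbf{z}^{\ell} - \boldsymbol{\mu}^{\ell} \right\|_2^2,
\qquad \mathbf{z}^{0} \equiv \mathbf{x}
$$

where $\boldsymbol{\mu}^{\ell}$ is the prediction of layer $\ell$ produced by the layer above.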
<br>
<br>

<!-- ################################################################################ -->

### 2. Make the Synaptic Components

<!-- ################################################################################ -->

<br>
<br>

<!-- <img src="images/GEC.png" width="120" align="right"/> -->

**Forward Synapses**
<br>

To connect the layers to one another, we create synaptic components. To send information in the forward pass (from the input into the deeper layers, a bottom-up stream),
we use `ForwardSynapse` components. Check out [Brain's Information Flow](https://github.com/Faezehabibi/pc_tutorial/blob/main/information_flow.md#---information-flow-in-the-brain--)
for a detailed explanation of information flow in brain modeling.

```python
E3 = ForwardSynapse("E3", shape=(h2_dim, h3_dim)) ## pre-layer size (h2) => (h3) post-layer size
E2 = ForwardSynapse("E2", shape=(h1_dim, h2_dim)) ## pre-layer size (h1) => (h2) post-layer size
E1 = ForwardSynapse("E1", shape=(in_dim, h1_dim)) ## pre-layer size (x) => (h1) post-layer size
```

<!-- ################################################################################ -->

<br>
<br>

<!-- <img src="images/GEC.png" width="120" align="right"/> -->

**Backward Synapses**
<br>

For each `ForwardSynapse` component sending information upward (the bottom-up stream), there exists a `BackwardSynapse` component that reverses the information flow and
sends it back downward (a top-down stream, from the top layer to the bottom/input). If you are not convinced, check out [Information Flow](https://github.com/Faezehabibi/pc_tutorial/blob/19b0692fa307f2b06676ca93b9b93ba3ba854766/information_flow.md).

```python
W3 = BackwardSynapse("W3",
                     shape=(h3_dim, h2_dim), ## pre-layer size (h3) => (h2) post-layer size
                     optim_type=opt_type,    ## optimization method (sgd, adam, ...)
                     weight_init=w3_init,    ## W3[t0]: initial values before training, at time t0
                     w_bound=w_bound,        ## -1 deactivates the bounding of synaptic values
                     sign_value=-1.,         ## -1 means the M-step solves a minimization problem
                     eta=eta,                ## learning rate (lr)
                     )
W2 = BackwardSynapse("W2",
                     shape=(h2_dim, h1_dim), ## pre-layer size (h2) => (h1) post-layer size
                     optim_type=opt_type,    ## optimizer
                     weight_init=w2_init,    ## W2[t0]
                     w_bound=w_bound,        ## -1: deactivate the bounding
                     sign_value=-1.,         ## minimization
                     eta=eta,                ## lr
                     )
W1 = BackwardSynapse("W1",
                     shape=(h1_dim, in_dim), ## pre-layer size (h1) => (x) post-layer size
                     optim_type=opt_type,    ## optimizer
                     weight_init=w1_init,    ## W1[t0]
                     w_bound=w_bound,        ## -1: deactivate the bounding
                     sign_value=-1.,         ## minimization
                     eta=eta,                ## lr
                     )
```

<br>
<br>
<!-- ----------------------------------------------------------------------------------------------------- -->

### 3. Wire the Components

The signal pathway follows Rao & Ballard (1999).
Error information travels from bottom to top in the forward pass.
The corrected prediction comes back from top to bottom in the backward pass.

```python
######### feedback (Top-down) #########
### actual neural activations
e2.target << z2.z
e1.target << z1.z

### top-down predictions
e2.mu << W3.outputs
e1.mu << W2.outputs
e0.mu << W1.outputs

### top-down prediction errors
z1.j_td << e1.dtarget
z2.j_td << e2.dtarget

### activations feeding the backward (prediction) synapses
W3.inputs << z3.zF
W2.inputs << z2.zF
W1.inputs << z1.zF
```

```python
######### forward (Bottom-up) #########
## feed the errors forward via the error synapses
E3.inputs << e2.dmu
E2.inputs << e1.dmu
E1.inputs << e0.dmu

## bottom-up modulated errors
z3.j << E3.outputs
z2.j << E2.outputs
z1.j << E1.outputs
```

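Putting the two passes together: each hidden state integrates a bottom-up current `j` (synapse-modulated errors from below) and a top-down current `j_td` (its own prediction error from above). A sketch of the resulting dynamics (the exact leak and prior terms depend on the `tau_m`, `act_fx`, and `prior` settings chosen earlier):

$$
\tau_m \,\frac{d\mathbf{z}^{\ell}}{dt} \;\approx\; \underbrace{E^{\ell}\,\mathbf{e}^{\ell-1}}_{\texttt{z.j}} \;+\; \underbrace{(-\,\mathbf{e}^{\ell})}_{\texttt{z.j\_td}} \;-\; \lambda\,\partial\Omega(\mathbf{z}^{\ell})
$$

where $\mathbf{e}^{\ell} = \mathbf{z}^{\ell} - \boldsymbol{\mu}^{\ell}$ and $\Omega$ is the prior from the `RateCell` setup.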
```python
######### Hebbian learning #########
## pre-synaptic activation
W3.pre << z3.zF
W2.pre << z2.zF
W1.pre << z1.zF

## post-synaptic residual error
W3.post << e2.dmu
W2.post << e1.dmu
W1.post << e0.dmu
```

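The learning rule that falls out of this wiring is a simple Hebbian product of the pre-synaptic activity and the post-synaptic residual error, matching the shapes used above (a sketch; $\eta$ is the learning rate `eta` and $\phi$ the activation `act_fx`):

$$
\Delta W^{\ell} \;\propto\; \eta \; \phi(\mathbf{z}^{\ell}) \,\big(\mathbf{e}^{\ell-1}\big)^{\!\top}
$$

so, for example, `W3` (shape `(h3_dim, h2_dim)`) is updated from `z3.zF` and `e2.dmu`.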
<br>
<br>
<!-- ----------------------------------------------------------------------------------------------------- -->

### Process Dynamics

```python
######### Process #########
import jax.numpy as jnp ## needed for the transpose/scheduling calls below

## reset/set all components to their resting values / initial conditions
circuit.reset()

circuit.clamp_input(obs) ## clamp the signal to the lowest-layer activation
z0.z.set(obs)            ## alternatively, clamp obs directly via e0.target.set(obs)

## pin/tie the forward (error) synapses to the transpose of the backward (prediction) ones
E1.weights.set(jnp.transpose(W1.weights.value))
E2.weights.set(jnp.transpose(W2.weights.value))
E3.weights.set(jnp.transpose(W3.weights.value))

circuit.process(jnp.array([[dt * i, dt] for i in range(T)])) ## perform several E-steps

circuit.evolve(t=T, dt=1.) ## perform the M-step (scheduled synaptic updates)

obs_mu = e0.mu.value ## get the reconstructed signal
L0 = e0.L.value      ## get the reconstruction loss
```

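Since every error cell exposes its local loss the same way `e0.L` is read above, a model-level energy can be tracked by summing over layers. A small sketch (assuming `e1.L` and `e2.L` exist, mirroring `e0.L`):

```python
## total energy across all layers (assumption: e1.L / e2.L mirror e0.L above)
L_total = e0.L.value + e1.L.value + e2.L.value
```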
<br>
<br>
<br>
<br>
<!-- ----------------------------------------------------------------------------------------------------- -->
<!-- ----------------------------------------------------------------------------------------------------- -->

### Train PC model for reconstructing the patched image

<img src="../images/museum/hgpc/patch_input.png" width="300" align="right"/>

<br>

This time, the input image is not the full scene; instead, it is split into local patches. This changes what the processing
units across the network respond to, since local features now become important. The original models in Rao & Ballard (1999)
also operate on patches, where, similar to the retina, the processing units are localized. This also results in
filters, or receptive fields, similar to those found in convolutional neural networks (CNNs).

<br>

```python
for nb in range(n_batches):
    Xb = X[nb * images_per_batch: (nb + 1) * images_per_batch, :] ## shape: (mb_size, 784)
    Xb = generate_patch_set(Xb, patch_shape, center=True) ## cut each image into local patches (sketch below)
    Xmu, Lb = model.process(Xb) ## reconstruct the patches and get the loss
```

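The `generate_patch_set` helper comes from the exhibit code linked at the top. A minimal sketch of what such a utility might look like (the random patch positions, the 28x28 image assumption, and the centering step are illustrative choices here, not the exhibit's exact implementation):

```python
import numpy as np

def generate_patch_set(Xb, patch_shape, center=True, seed=0):
    ## cut one random patch from each flattened 28x28 image in Xb
    rng = np.random.default_rng(seed)
    ph, pw = patch_shape
    patches = []
    for x in np.asarray(Xb):
        img = x.reshape(28, 28)
        r = rng.integers(0, 28 - ph + 1) ## random top-left row of the patch
        c = rng.integers(0, 28 - pw + 1) ## random top-left column
        patch = img[r:r + ph, c:c + pw].reshape(-1)
        if center: ## zero-center each patch, mirroring the center=True flag above
            patch = patch - patch.mean()
        patches.append(patch)
    return np.stack(patches) ## shape: (mb_size, ph * pw)
```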
<!-- -------------------------------------------------------------------------------------
### Train PC model for reconstructing the full image

```python
for nb in range(n_batches):
    Xb = X[nb * mb_size: (nb + 1) * mb_size, :] ## shape: (mb_size, 784)
    Xmu, Lb = model.process(Xb)
```
------------------------------------------------------------------------------------- -->

<!-- references -->
## References
<b>[1]</b> Rao, Rajesh PN, and Dana H. Ballard. "Predictive coding in the visual cortex: a functional interpretation of
some extra-classical receptive-field effects." Nature neuroscience 2.1 (1999): 79-87.
