FrozenAssassine
diff --git a/‎.gitignore‎
Lines changed: 6 additions & 72 deletions b/‎.gitignore‎
Lines changed: 6 additions & 72 deletions
diff --git a/‎README.md‎
Lines changed: 113 additions & 38 deletions b/‎README.md‎
Lines changed: 113 additions & 38 deletions
diff --git a/‎firmware/.gitignore‎
Lines changed: 5 additions & 0 deletions b/‎firmware/.gitignore‎
Lines changed: 5 additions & 0 deletions
diff --git a/‎firmware/.vscode/extensions.json‎
Lines changed: 10 additions & 0 deletions b/‎firmware/.vscode/extensions.json‎
Lines changed: 10 additions & 0 deletions
diff --git a/‎firmware/include/nn_trained.h‎
Lines changed: 17 additions & 0 deletions b/‎firmware/include/nn_trained.h‎
Lines changed: 17 additions & 0 deletions
diff --git a/‎firmware/lib/NeuralNetwork/README.md‎
Lines changed: 15 additions & 0 deletions b/‎firmware/lib/NeuralNetwork/README.md‎
Lines changed: 15 additions & 0 deletions
diff --git a/‎firmware/lib/NeuralNetwork/include/nn/layerData.h‎
Lines changed: 11 additions & 0 deletions b/‎firmware/lib/NeuralNetwork/include/nn/layerData.h‎
Lines changed: 11 additions & 0 deletions
@@ -1,73 +1,7 @@
-app/bin/
-app/pde.jar
-build/macosx/work/
-arduino-core/bin/
-arduino-core/arduino-core.jar
-hardware/arduino/bootloaders/caterina_LUFA/Descriptors.o
-hardware/arduino/bootloaders/caterina_LUFA/Descriptors.lst
-hardware/arduino/bootloaders/caterina_LUFA/Caterina.sym
-hardware/arduino/bootloaders/caterina_LUFA/Caterina.o
-hardware/arduino/bootloaders/caterina_LUFA/Caterina.map
-hardware/arduino/bootloaders/caterina_LUFA/Caterina.lst
-hardware/arduino/bootloaders/caterina_LUFA/Caterina.lss
-hardware/arduino/bootloaders/caterina_LUFA/Caterina.elf
-hardware/arduino/bootloaders/caterina_LUFA/Caterina.eep
-hardware/arduino/bootloaders/caterina_LUFA/.dep/
-build/*.zip
-build/*.tar.bz2
-build/windows/work/
-build/windows/*.zip
-build/windows/*.tgz
-build/windows/*.tar.bz2
-build/windows/libastylej*
-build/windows/liblistSerials*
-build/windows/arduino-*.zip
-build/windows/dist/*.tar.gz
-build/windows/dist/*.tar.bz2
-build/windows/launch4j-*.tgz
-build/windows/launch4j-*.zip
-build/windows/launcher/launch4j
-build/windows/WinAVR-*.zip
-build/macosx/arduino-*.zip
-build/macosx/dist/*.tar.gz
-build/macosx/dist/*.tar.bz2
-build/macosx/*.tar.bz2
-build/macosx/libastylej*
-build/macosx/appbundler*.jar
-build/macosx/appbundler*.zip
-build/macosx/appbundler
-build/macosx/appbundler-1.0ea-arduino?
-build/macosx/appbundler-1.0ea-arduino*.zip
-build/macosx/appbundler-1.0ea-upstream*.zip
-build/linux/work/
-build/linux/dist/*.tar.gz
-build/linux/dist/*.tar.bz2
-build/linux/*.tgz
-build/linux/*.tar.xz
-build/linux/*.tar.bz2
-build/linux/*.zip
-build/linux/libastylej*
-build/linux/liblistSerials*
-build/shared/arduino-examples*
-build/shared/reference*.zip
-build/shared/Edison*.zip
-build/shared/Galileo*.zip
-build/shared/WiFi101-Updater-ArduinoIDE-Plugin*.zip
-test-bin
-*.iml
-.idea
-.DS_Store
-.directory
-hardware/arduino/avr/libraries/Bridge/examples/XivelyClient/passwords.h
-avr-toolchain-*.zip
-/app/nbproject/private/
-/arduino-core/nbproject/private/
-/app/build/
-/arduino-core/build/
+.pio
+.vscode/.browse.c_cpp.db*
+.vscode/c_cpp_properties.json
+.vscode/launch.json
+.vscode/ipch
 
-manifest.mf
-nbbuild.xml
-nbproject
-main/esp32.svd
-main/debug.cfg
-main/debug_custom.json
+python/out
@@ -12,74 +12,149 @@
 
 ## 🤔 What is this project?
 
-This project is a lightweight neural network implementation designed to run on microcontrollers like the **ESP32** and **Arduino**. It demonstrates how even resource-constrained devices can train and perform simple tasks like **XOR** prediction. Maybe you’ll find a use case for simple robot projects.
+This project is a lightweight neural network implementation designed to run on microcontrollers like the **ESP32** and **Arduino**. It demonstrates how even resource-constrained devices can train and perform simple tasks like **XOR** prediction. Maybe you’ll find a use case for simple robot or sensor projects.
 
-While it takes just some **seconds** to train on the ESP32, the Arduino requires significantly more time due to limited processing power.
+The project has two supported modes, inference and training mode. Inference mode uses an existing torch model and converts it to a header file, which can be loaded to your esp or arduino.
+For fun or testing purposes, you can also run your training directly on the microchip, but for larger models, the performance gets weak pretty fast and you run into memory constraints.
 
 ## 📎 [Blog to this project](https://medium.com/@FrozenAssassine/neural-network-from-scratch-on-esp32-2a53a7b65f9f)
 
 ## 🛠️ Features
-- **On-device training**: Train your neural network directly on ESP32 or Arduino.
-- **XOR**: Predict simple numbers like in xor.
-- **Activation Functions**: Use activation functions like Sigmoid, Relu, Softmax, TanH and LeakyRelu
-- **Fast Training**: The ESP32 can train in just a few seconds, while the Arduino requires longer due to its slow processor.
+
+- **Inference only**: Use a python script to convert your pytorch models to include file for esp32 and Arduino.
+- **On-device training**: Train your neural network directly on ESP32 or Arduino (no weight saving atm).
+- **Activation Functions**: Use activation functions like Softmax, Sigmoid, Relu, TanH and LeakyRelu
 - **Xavier Initialization**: Optimizes weight distribution for faster training.
+- **Simple building structure**: The oop approach makes building the initial model really simple.
+
 ## 🔮 Future features
-- Train on PC and load weights to chip
-- Save and load weights
-- More layer types
 
-## 🚀 Performance
-- ESP32: Fast training (~seconds).
-- Arduino: Slower training (~minutes or more).
+- Save and load weights from on device training
+- More layer types
 
 ## 🫶 Code considerations
+
 I tried to keep the code as simple and easy to understand as possible. The neural network is completely built using OOP principles, which means that everything is its own class. This is useful for structuring the model later.
-For the individual layers, I used the basic principle of inheritance, where I have a BaseLayer class and each layer inherits from it. The BaseLayer also implements some functions, like Train and FeedForward, as well as pointers to the weights, values, biases, and errors. In my inherited classes, I only have to override these functions with the training logic and variable implementations. This is very useful when adding new layers.
+For the individual layers, I used the basic principle of inheritance, where there is a BaseLayer class and each layer inherits from it. The BaseLayer also implements some functions, for Training and FeedForward, as well as pointers to the weights, values, biases, and errors. In the inherited classes, those functions can be overriden with the actual training logic and variable implementations. This is very useful for adding new layers.
 
-## 🏗️ How to Use
+## 🏗️ Run the code
 
-1. Clone this repository and open the project in Arduino IDE.
-2. Upload the code to your ESP32 or Arduino using Arduino IDE
+1. Clone this repository and open the project with PlatformIO.
+2. Upload the code to your ESP32 or Arduino
 3. Monitor the predictions via Serial Monitor at 115200 baud rate.
 
-Here is an example code:
+## 1. Training mode
 
 ```cpp
-#include "Layers.h"
-#include "NeuralNetwork.h"
-
-void setup() {
-  Serial.begin(115200);
+#include "nn/layers.h"
+#include "nn/neuralNetwork.h"
+#include <nn/predictionHelper.h>
+#include <Arduino.h>
 
+void TrainAndTest()
+{
   NeuralNetwork *nn = new NeuralNetwork(3);
   nn->StackLayer(new InputLayer(2));
   nn->StackLayer(new DenseLayer(4, ActivationKind::TanH));
-  nn->StackLayer(new OutputLayer(1, ActivationKind::Sigmoid));
-  nn->Build();
+  nn->StackLayer(new OutputLayer(2, ActivationKind::Softmax));
+  nn->Build(false); // training and prediction
+
+  float inputs[4][2] = {
+      {0, 0},
+      {0, 1},
+      {1, 0},
+      {1, 1}};
+
+  float desired[4][2] = {
+      {1, 0},
+      {0, 1},
+      {0, 1},
+      {1, 0}};
+
+  nn->Train((float *)inputs, (float *)desired, 4, 2, 220, 0.1);
+
+  Serial.println("Predictions:");
+  for (uint8_t i = 0; i < 4; i++)
+  {
+    float *pred = nn->Predict(inputs[i], 2);
+    Serial.printf(
+        "Input: [%.0f, %.0f] -> Softmax: [%.4f, %.4f] -> Class: %d\n",
+        inputs[i][0], inputs[i][1], pred[0], pred[1], ArgMax(pred, 2));
+  }
+}
 
-  float inputs[4][2] = { { 0, 0 }, { 0, 1 }, { 1, 0 }, { 1, 1 } };
-  float desired[4][1] = { { 0 }, { 1 }, { 1 }, { 0 } };
+void setup()
+{
+  Serial.begin(115200);
+  delay(1000);
+
+  TrainAndTest();
+}
+void loop() { }
+```
 
-  nn->Train((float*)inputs, (float*)desired, 4, 2, 600, 0.1f);
+**Output:**
 
-  // Predict XOR results:
-  for (int i = 0; i < 4; i++) {
+```
+Training Done!
+Predictions:
+Input: [0, 0] -> Softmax: [0.9665, 0.0335] -> Class: 0
+Input: [0, 1] -> Softmax: [0.0324, 0.9676] -> Class: 1
+Input: [1, 0] -> Softmax: [0.0783, 0.9217] -> Class: 1
+Input: [1, 1] -> Softmax: [0.9355, 0.0645] -> Class: 0
+```
+
+## 2. Inference only
+
+```cpp
+#include "nn/layers.h"
+#include "nn/neuralNetwork.h"
+#include <nn/predictionHelper.h>
+#include <Arduino.h>
+
+void InferenceOnly()
+{
+  Serial.println("Testing model inference only (XOR Classification)");
+
+  NeuralNetwork *nn = new NeuralNetwork(3);
+  nn->StackLayer(new InputLayer(2));
+  nn->StackLayer(new DenseLayer(4, ActivationKind::TanH));
+  nn->StackLayer(new OutputLayer(2, ActivationKind::Softmax));
+  nn->Build(true); // inference only
+
+  float inputs[4][2] = {
+      {0, 0},
+      {0, 1},
+      {1, 0},
+      {1, 1}};
+
+  Serial.println("Predictions:");
+  for (uint8_t i = 0; i < 4; i++)
+  {
     float *pred = nn->Predict(inputs[i], 2);
-    Serial.print("PREDICTION ");
-    Serial.print(inputs[i][0]);
-    Serial.print(" ");
-    Serial.print(inputs[i][1]);
-    Serial.print(" = ");
-    Serial.println(pred[0]);
+    Serial.printf(
+        "Input: [%.0f, %.0f] -> Softmax: [%.4f, %.4f] -> Class: %d\n",
+        inputs[i][0], inputs[i][1], pred[0], pred[1], ArgMax(pred, 2));
   }
 }
 
-void loop() {
+void setup()
+{
+  Serial.begin(115200);
   delay(1000);
+
+  InferenceOnly();
 }
+void loop() { }
 ```
 
-# 📷 Images:
-![image](https://github.com/user-attachments/assets/4b32f9ee-a1e9-4b4f-b626-1c4d5d9a3861)
+**Output:**
 
+```
+Testing model inference only (XOR Classification)
+Predictions:
+Input: [0, 0] -> Softmax: [0.9523, 0.0477] -> Class: 0
+Input: [0, 1] -> Softmax: [0.0702, 0.9298] -> Class: 1
+Input: [1, 0] -> Softmax: [0.0817, 0.9183] -> Class: 1
+Input: [1, 1] -> Softmax: [0.9112, 0.0888] -> Class: 0
+```
@@ -0,0 +1,5 @@
+.pio
+.vscode/.browse.c_cpp.db*
+.vscode/c_cpp_properties.json
+.vscode/launch.json
+.vscode/ipch
@@ -0,0 +1,10 @@
+{
+    // See http://go.microsoft.com/fwlink/?LinkId=827846
+    // for the documentation about the extensions.json format
+    "recommendations": [
+        "platformio.platformio-ide"
+    ],
+    "unwantedRecommendations": [
+        "ms-vscode.cpptools-extension-pack"
+    ]
+}
@@ -0,0 +1,17 @@
+#pragma once
+
+#include "nn/layerData.h"
+
+// Example arrays (small XOR-like model)
+static float layer0_weights[8] = {1.694597f, 1.308419f, -1.314386f, -0.903650f, -1.036660f, 2.091955f, -2.517021f, 2.006923f};
+static float layer0_bias[4] = {-0.033063f, -0.264372f, 0.208891f, -1.040136f};
+
+static float layer1_weights[8] = {-1.449604f, 0.621033f, 1.519490f, -1.582430f, 0.827401f, -0.810482f, -1.936075f, 1.971920f};
+static float layer1_bias[2] = {-0.410137f, -0.222768f};
+
+static const LayerData nn_layers[] = {
+    {nullptr, nullptr, 0, 2},
+    {layer0_weights, layer0_bias, 2, 4},
+    {layer1_weights, layer1_bias, 4, 2}};
+
+static const uint8_t nn_total_layers = 3;
@@ -0,0 +1,15 @@
+# NeuralNetwork (PlatformIO/Arduino library)
+
+Small, header/source based neural network library for Arduino/PlatformIO.
+
+Usage
+
+- Copy the `NeuralNetwork` folder into your project's `lib/` directory, or add it via `lib_extra_dirs`/`lib_deps`.
+- Include public headers like:
+
+  #include <nn/neuralNetwork.h>
+  #include <nn/layers.h>
+
+Example
+
+- See `examples/BasicExample` for a minimal inference sketch using the built-in trained model.
@@ -0,0 +1,11 @@
+#pragma once
+
+#include <cstdint>
+
+struct LayerData
+{
+    float *weights;
+    float *bias;
+    uint16_t inputSize;
+    uint16_t outputSize;
+};