Convolutional Neural Networks from First Principles (CIFAR-10)

This repository studies CNNs in two complementary ways:

Analytical / educational view (NumPy, manual backprop) in src
Computational / practical view (PyTorch checkpoint inference) in notebooks/predict_cifar10.ipynb

The idea is simple: understand the math deeply, then use an optimized stack for fast experimentation.

Demo

Theoretical Motivation

A convolutional network is a function approximation pipeline that maps an image $x$ to class probabilities $p(y\mid x)$.

At a high level:

Convolution learns local pattern detectors (edges, textures, shapes)
Nonlinearity (ReLU) increases expressive power
Pooling / downsampling trades spatial resolution for invariance
Dense classifier maps learned features to logits
Softmax + cross-entropy defines the training objective
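The resolution trade-off in the list above comes down to shape arithmetic. The sketch below applies the standard output-size formula for convolution and pooling layers along one spatial axis. The layer settings (3x3 kernels with padding 1, 2x2 pooling with stride 2) are illustrative assumptions, not the architecture used in this repository, and `output_size` is a hypothetical helper name.

```python
def output_size(n: int, kernel: int, stride: int = 1, padding: int = 0) -> int:
    """Spatial output size of a conv or pooling layer along one axis."""
    return (n + 2 * padding - kernel) // stride + 1

# CIFAR-10 images are 32x32. The layer settings below are assumed for illustration.
h = 32
h = output_size(h, kernel=3, padding=1)  # 3x3 conv with padding 1 keeps 32
h = output_size(h, kernel=2, stride=2)   # 2x2 max pool halves it to 16
h = output_size(h, kernel=3, padding=1)  # second conv keeps 16
h = output_size(h, kernel=2, stride=2)   # second pool halves it to 8
print(h)
```

Each pooling step halves the spatial size, so the dense classifier sees a much smaller feature map than the input image.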

For one layer, the core operation is:

$$ z = W * x + b, \quad a = \phi(z) $$
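As a rough illustration of the layer equation, here is a minimal NumPy sketch of a stride-1, no-padding forward pass followed by ReLU. It is written with explicit loops for readability and is not the implementation in src; the function names and tensor layout (channels first) are assumptions. Like most deep learning code, it computes cross-correlation (no kernel flip) for the $*$ operator.

```python
import numpy as np

def conv2d_forward(x, W, b):
    """Valid, stride-1 cross-correlation.

    x: (C_in, H, W_in), W: (C_out, C_in, kH, kW), b: (C_out,)
    Returns z with shape (C_out, H - kH + 1, W_in - kW + 1).
    """
    c_out, _, kh, kw = W.shape
    _, h, w = x.shape
    out_h, out_w = h - kh + 1, w - kw + 1
    z = np.zeros((c_out, out_h, out_w))
    for o in range(c_out):
        for i in range(out_h):
            for j in range(out_w):
                # Multiply one local patch by one filter and sum over all axes.
                patch = x[:, i:i + kh, j:j + kw]
                z[o, i, j] = np.sum(patch * W[o]) + b[o]
    return z

def relu(z):
    """phi(z) = max(z, 0), applied elementwise."""
    return np.maximum(z, 0.0)

rng = np.random.default_rng(0)
x = rng.standard_normal((3, 32, 32))        # one CIFAR-10-sized RGB image
W = rng.standard_normal((8, 3, 3, 3)) * 0.1  # 8 filters of size 3x3 (assumed)
b = np.zeros(8)
a = relu(conv2d_forward(x, W, b))
print(a.shape)  # (8, 30, 30)
```

The triple loop is slow by design: it makes the "local pattern detector" reading of convolution visible, since each output value depends only on one small patch of the input.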

And for classification:

$$ \mathcal{L} = -\sum_{c=1}^{C} y_c \log \hat{y}_c $$

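where $\hat{y}=\text{softmax}(\text{logits})$, $y$ is the one-hot encoding of the true label, and $C$ is the number of classes ($C=10$ for CIFAR-10).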
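A small NumPy sketch of this loss, assuming one-hot labels and a batch of logits with shape (N, C). The function names are hypothetical, and the `eps` term is an added guard against `log(0)` rather than part of the formula. The last line uses the standard result that the gradient of softmax plus cross-entropy with respect to the logits is $\hat{y} - y$.

```python
import numpy as np

def softmax(logits):
    # Subtracting the row maximum leaves the result unchanged but avoids overflow.
    shifted = logits - np.max(logits, axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=-1, keepdims=True)

def cross_entropy(logits, y_onehot, eps=1e-12):
    """Per-sample loss: -sum_c y_c * log(y_hat_c)."""
    y_hat = softmax(logits)
    return -np.sum(y_onehot * np.log(y_hat + eps), axis=-1)

logits = np.array([[2.0, 1.0, 0.1]])  # made-up logits for a 3-class example
y = np.array([[1.0, 0.0, 0.0]])       # true class is index 0
loss = cross_entropy(logits, y)
grad_logits = softmax(logits) - y     # gradient of the loss w.r.t. the logits
print(loss, grad_logits)
```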

What This Repository Contains

src: CNN components built manually (Conv2D, pooling, flatten, dense, activations, loss, optimizer)
data: CIFAR-10 data files
model/cifar10_cnn.pt: trained PyTorch checkpoint used for prediction
notebooks/predict_cifar10.ipynb: inference notebook (loads checkpoint, evaluates, visualizes)
note/NOTE.md: detailed learning notes and progress log

Methodological Split

A) From-scratch path (for understanding)

Explicit forward/backward logic in NumPy
Useful for verifying gradient flow and tensor shapes
Best for learning internals, not for high-speed training
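One common way to verify a hand-written backward pass is a numerical gradient check: compare the analytic gradient against central finite differences. The sketch below does this for a toy linear layer with loss $\tfrac{1}{2}\lVert xW \rVert^2$. The toy loss and all function names are assumptions chosen for illustration, not code from src.

```python
import numpy as np

def loss_fn(W, x):
    # Toy loss: 0.5 * ||x @ W||^2
    z = x @ W
    return 0.5 * np.sum(z ** 2)

def analytic_grad(W, x):
    # dL/dW = x^T (x W), derived by hand for the toy loss above.
    return x.T @ (x @ W)

def numerical_grad(f, W, h=1e-5):
    """Central differences: (f(W + h) - f(W - h)) / (2h), one entry at a time."""
    grad = np.zeros_like(W)
    for idx in np.ndindex(*W.shape):
        old = W[idx]
        W[idx] = old + h
        f_plus = f(W)
        W[idx] = old - h
        f_minus = f(W)
        W[idx] = old  # restore the original value
        grad[idx] = (f_plus - f_minus) / (2 * h)
    return grad

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 5))
W = rng.standard_normal((5, 3))

g_analytic = analytic_grad(W, x)
g_numeric = numerical_grad(lambda W_: loss_fn(W_, x), W)
rel_err = np.max(np.abs(g_analytic - g_numeric)) / np.max(np.abs(g_analytic) + np.abs(g_numeric))
print(rel_err)
```

A small relative error suggests the backward formula matches the forward pass; a large one usually points to a shape or sign mistake. The same pattern extends to convolution and pooling layers, at the cost of one forward pass per perturbed parameter.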

B) PyTorch path (for performance)

GPU-accelerated training/inference
Practical for reaching stronger CIFAR-10 results quickly
Checkpoint currently used by notebook: model/cifar10_cnn.pt

Note: PyTorch checkpoints (model_state, optimizer_state, etc.) are not the same format as NumPy scratch checkpoints.

Experimental Snapshot

CUDA-enabled environment validated
Best reported test accuracy during training: 91.58%

How to Run Inference

Open notebooks/predict_cifar10.ipynb
Run cells from top to bottom
The notebook will:
- load model/cifar10_cnn.pt
- evaluate accuracy
- show sample predictions with ground-truth labels
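The evaluation step reduces to comparing the argmax of the logits with the ground-truth labels. The sketch below shows that computation in NumPy on a tiny made-up batch. It is not the notebook's code: the `accuracy` helper and the example logits are assumptions, and only the CIFAR-10 class names are taken from the dataset itself.

```python
import numpy as np

# Standard CIFAR-10 class names, indexed by label id.
CLASSES = ["airplane", "automobile", "bird", "cat", "deer",
           "dog", "frog", "horse", "ship", "truck"]

def accuracy(logits, labels):
    """Fraction of samples whose highest-scoring class equals the label."""
    preds = np.argmax(logits, axis=1)
    return float(np.mean(preds == labels))

# Made-up logits for 4 samples, purely for illustration.
logits = np.zeros((4, 10))
logits[0, 3] = 1.0  # predicts "cat"
logits[1, 8] = 1.0  # predicts "ship"
logits[2, 0] = 1.0  # predicts "airplane"
logits[3, 5] = 1.0  # predicts "dog"
labels = np.array([3, 8, 0, 6])  # the last sample is actually "frog"

acc = accuracy(logits, labels)
for pred, true in zip(np.argmax(logits, axis=1), labels):
    print(f"predicted={CLASSES[pred]:<10} truth={CLASSES[true]}")
print(f"accuracy = {acc:.2%}")
```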

Environment

Python 3.12
PyTorch (CUDA build)
torchvision
numpy
matplotlib

Project Goal

Build intuition for CNN mechanics from first principles, then bridge that understanding to practical, high-performance inference workflows.

License

This project is licensed under CC-BY-SA 4.0.
