This project implements a simple Multilayer Perceptron (MLP), a fundamental type of neural network commonly used in machine learning for classification tasks. Built from scratch using NumPy, it demonstrates core concepts like forward/backward propagation, gradient descent, and various activation functions—without relying on high-level frameworks like TensorFlow or PyTorch.
- Custom Neural Network: Fully-connected layers with configurable architecture
- Multiple Activation Functions: ReLU, Sigmoid, Softmax
- Optimizers: Gradient Descent, Adam
- Regularization: L1 (Lasso) and L2 (Ridge) regularization
- Training Features: Mini-batch training, early stopping, gradient clipping
- Configuration: Model setup via `.pkl` configuration files
- Visualization: Automatic metric plotting (loss, accuracy, precision, recall)
- Fast Setup: Uses `uv` for lightning-fast dependency management
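The three activation functions listed above can be sketched in NumPy as follows (a minimal illustration, not the project's actual implementation):

```python
import numpy as np

def relu(z):
    # ReLU: max(0, z), applied element-wise
    return np.maximum(0.0, z)

def sigmoid(z):
    # Logistic sigmoid: squashes inputs into (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    # Subtract the row-wise max before exponentiating for numerical stability;
    # each row of the result sums to 1 and can be read as class probabilities.
    shifted = z - np.max(z, axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=-1, keepdims=True)
```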
- Python 3.10+
- uv (recommended) or pip
```shell
git clone https://github.com/lucas-ht/multilayer-perceptron.git
cd multilayer-perceptron
uv sync
```

For detailed configuration options and advanced usage, see the complete documentation.
```shell
uv run mlp.py split data/data.csv --test-size 0.2
uv run mlp.py train data/train.csv model.test.pkl
uv run mlp.py predict data/test.csv models/model.json
```

The MLP consists of:
- Input Layer: Accepts feature vectors from the dataset
- Hidden Layers: Learns non-linear representations using activation functions
- Output Layer: Produces predictions (typically with softmax for classification)
Each layer performs:
- Linear transformation: $z = Wx + b$
- Activation: $a = \sigma(z)$
- Backpropagation: Gradients computed via chain rule
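A single fully-connected layer's forward and backward steps can be sketched in NumPy as below. The `DenseLayer` class and its names are illustrative assumptions, not the project's actual API; ReLU is used as the example activation.

```python
import numpy as np

class DenseLayer:
    """Illustrative fully-connected layer: z = x @ W + b, a = relu(z)."""

    def __init__(self, n_in, n_out, seed=0):
        rng = np.random.default_rng(seed)
        # Small random weights; biases start at zero
        self.W = rng.standard_normal((n_in, n_out)) * 0.01
        self.b = np.zeros(n_out)

    def forward(self, x):
        self.x = x                          # cache input for backprop
        self.z = x @ self.W + self.b        # linear transformation
        return np.maximum(0.0, self.z)      # ReLU activation

    def backward(self, grad_a):
        # Chain rule: dL/dz = dL/da * relu'(z)
        grad_z = grad_a * (self.z > 0)
        self.grad_W = self.x.T @ grad_z     # dL/dW, used by the optimizer
        self.grad_b = grad_z.sum(axis=0)    # dL/db
        return grad_z @ self.W.T            # dL/dx, passed to the previous layer
```

Stacking several such layers and chaining their `backward` outputs is all backpropagation requires.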
- Forward propagation: Pass inputs through layers to compute predictions
- Loss calculation: Measure error using cross-entropy loss
- Backpropagation: Compute gradients of loss w.r.t. weights
- Weight update: Apply optimizer (GD or Adam) to minimize loss
- Regularization: Optional L1/L2 penalties to prevent overfitting
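The loop above can be sketched for a single softmax layer trained with plain gradient descent. Shapes, the learning rate, and the function name are illustrative assumptions:

```python
import numpy as np

def train_step(W, b, x, y_onehot, lr=0.1):
    """One forward/backward/update pass for a softmax classifier (sketch)."""
    # Forward propagation
    logits = x @ W + b
    shifted = logits - logits.max(axis=1, keepdims=True)   # numerical stability
    probs = np.exp(shifted) / np.exp(shifted).sum(axis=1, keepdims=True)

    # Cross-entropy loss, averaged over the mini-batch
    n = x.shape[0]
    loss = -np.sum(y_onehot * np.log(probs + 1e-12)) / n

    # Backpropagation: for softmax + cross-entropy, dL/dlogits = probs - y
    grad_logits = (probs - y_onehot) / n
    grad_W = x.T @ grad_logits
    grad_b = grad_logits.sum(axis=0)

    # Weight update (vanilla gradient descent; Adam would track moments too)
    W -= lr * grad_W
    b -= lr * grad_b
    return loss
```

An L2 penalty would add `lam * W` to `grad_W` before the update; early stopping monitors a validation loss and halts when it stops improving.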
Key hyperparameters (configurable in `.pkl` files):
- Learning rate
- Batch size
- Number of epochs
- Layer sizes and activation functions
- Regularization strength
- Early stopping patience
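A configuration covering these hyperparameters might look like the following. The key names and values are assumptions for illustration, not the project's actual schema; only the `.pkl` (pickle) format comes from the source.

```python
import pickle

# Hypothetical hyperparameter configuration; key names are illustrative.
config = {
    "layers": [(30, "relu"), (16, "relu"), (2, "softmax")],  # (size, activation)
    "learning_rate": 0.01,
    "batch_size": 32,
    "epochs": 100,
    "l2_lambda": 1e-4,             # regularization strength
    "early_stopping_patience": 10,
}

# Serialize the configuration to a pickle file
with open("model.config.pkl", "wb") as f:
    pickle.dump(config, f)
```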
This project is part of the 42 School curriculum. Special thanks to the 42 School community for their support and resources.