This is a simple implementation of a GPT using Apple's new MLX library.
Ensure Poetry is installed and run:

```shell
poetry install
```
We train the model on OpenWebText, which is an open-source replication of OpenAI's WebText dataset.
Run the following command to download the dataset, tokenize it, and save it to disk:

```shell
poetry run python prepare.py
```
This will create a `data` directory with two files: `train.bin` and `validation.bin`.
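The exact binary layout depends on `prepare.py`, but nanoGPT-style pipelines typically store the token IDs as a flat array of `uint16` values that can be memory-mapped for batching. A minimal sketch under that assumption (the dtype and the helper names here are illustrative, not the repository's actual API):

```python
import numpy as np

def load_tokens(path):
    # Memory-map the flat array of uint16 token IDs (assumed layout)
    return np.memmap(path, dtype=np.uint16, mode="r")

def get_batch(tokens, batch_size, context_len, rng):
    # Sample random windows; targets are the inputs shifted by one token
    starts = rng.integers(0, len(tokens) - context_len - 1, size=batch_size)
    x = np.stack([tokens[s : s + context_len] for s in starts]).astype(np.int64)
    y = np.stack([tokens[s + 1 : s + 1 + context_len] for s in starts]).astype(np.int64)
    return x, y
```

Memory-mapping avoids loading the full multi-gigabyte token file into RAM; each batch reads only the windows it samples.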
Once complete, run the following command to train the model:

```shell
poetry run python train.py
```
Checkpoints will be saved to the `checkpoints` directory.
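How `train.py` serializes checkpoints is not shown here; one common approach, sketched below with NumPy (the filename pattern and helper names are assumptions, not the repository's actual API), is to periodically dump the parameter arrays to disk:

```python
import os
import numpy as np

def save_checkpoint(params, step, directory="checkpoints"):
    # Write all parameter arrays to one .npz file (assumed naming scheme)
    os.makedirs(directory, exist_ok=True)
    path = os.path.join(directory, f"step_{step}.npz")
    np.savez(path, **params)
    return path

def load_checkpoint(path):
    # Return the parameters as a plain dict of arrays
    with np.load(path) as data:
        return {name: data[name] for name in data.files}
```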
Run the following command to generate text using the trained model:

```shell
poetry run python generate.py
```
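The sampling strategy used by `generate.py` is not shown here, but a typical GPT decoding loop scales the logits by a temperature, optionally keeps only the top-k candidates, and samples the next token. A minimal NumPy sketch under those assumptions (the model is stubbed out as a plain function):

```python
import numpy as np

def sample_next(logits, temperature=1.0, top_k=None, rng=None):
    # Scale logits, optionally keep only the k largest, then sample
    if rng is None:
        rng = np.random.default_rng()
    logits = np.asarray(logits, dtype=np.float64) / temperature
    if top_k is not None:
        cutoff = np.sort(logits)[-top_k]
        logits = np.where(logits < cutoff, -np.inf, logits)
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return int(rng.choice(len(probs), p=probs))

def generate(model_fn, prompt, steps, **sample_kwargs):
    # Autoregressive loop: feed the growing sequence back into the model
    tokens = list(prompt)
    for _ in range(steps):
        tokens.append(sample_next(model_fn(tokens), **sample_kwargs))
    return tokens
```

Lower temperatures sharpen the distribution toward the most likely tokens, while top-k filtering discards the long tail of unlikely candidates.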
This implementation has been inspired by Andrej Karpathy's nanoGPT and minGPT repositories, which are themselves PyTorch reimplementations of GPT-2 with a few modifications.
Potential future improvements:

- Configuration improvements (e.g. a YAML config file)
- Calculate validation loss during training
- Tune hyperparameters to improve performance
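The configuration item could, for instance, replace hard-coded hyperparameters with a config object populated from a YAML file. A sketch of the idea (all field names and defaults are illustrative, not the repository's actual settings):

```python
from dataclasses import dataclass, fields

@dataclass
class TrainConfig:
    # Illustrative hyperparameters; not the repository's actual values
    n_layer: int = 12
    n_head: int = 12
    n_embd: int = 768
    learning_rate: float = 3e-4
    batch_size: int = 32

    @classmethod
    def from_dict(cls, raw):
        # Accept e.g. the dict produced by yaml.safe_load("config.yaml")
        known = {f.name for f in fields(cls)}
        return cls(**{k: v for k, v in raw.items() if k in known})
```

Ignoring unknown keys keeps old config files usable as the set of hyperparameters evolves.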