gpt2-style-transformer

This repository provides scripts and utilities for training a GPT-2 style transformer model on the FineWeb dataset (or its variants).

Installation

Install dependencies (preferably in a virtual environment):
```
pip install -r requirements.txt
```

Downloading the Dataset

You can download and preprocess the FineWeb dataset (or any of its flavors) using the CLI:

python cli.py download-dataset --local-dir <output_dir> --dataset-flavor <flavor>

--local-dir: Directory to save the processed dataset (default: data)
--dataset-flavor: Dataset flavor to download (default: fineweb10B). Example: fineweb10B-edu, fineweb10B, etc.

Example:

python cli.py download-dataset --local-dir data --dataset-flavor fineweb10B

Training the Model

You can train the model using the CLI:

python cli.py train-cli [OPTIONS]

Training Options

--dataset-location: Directory containing the processed dataset (default: data)
--epochs: Number of epochs (default: 19073)
--batch-size: Batch size (default: 4)
--block-size: Block size (default: 1024)
--total-batch-size: Total batch size (default: 524288)
--lr: Learning rate (default: 3e-4)

Example:

python cli.py train-cli --dataset-location data --epochs 10 --batch-size 8

Direct Usage

You can also run the dataset download script directly:

python fineweb.py --local_dir data --dataset_flavor fineweb10B

Notes

The dataset is sharded and saved in the specified directory.
Training logs and model weights are saved in the working directory.

Requirements

See requirements.txt for all dependencies.

License

[Add your license here]

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
README.md		README.md
cli.py		cli.py
fineweb.py		fineweb.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gpt2-style-transformer

Installation

Downloading the Dataset

Training the Model

Training Options

Direct Usage

Notes

Requirements

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

ayushdeolasee/gpt2-style-transformer

Folders and files

Latest commit

History

Repository files navigation

gpt2-style-transformer

Installation

Downloading the Dataset

Training the Model

Training Options

Direct Usage

Notes

Requirements

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages