This repository contains a minimal implementation of GPT-2 for training and inference, using PyTorch.
- Clone the repository:

  ```bash
  git clone git@github.com:TimilsinaBimal/gpt2.git
  cd gpt2
  ```
- Install dependencies: make sure you have Python and pip installed, along with `torch`, `transformers`, and `tiktoken` (example commands are shown after this list).
- Prepare your data: place your training text file (e.g., `tiny_shakespeare.txt`) in the desired location and update `data_path` in your config if needed (a sample download command is shown after this list).
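The commands below are a setup sketch, not an official install script: the package names follow the dependency list above, and the dataset URL points to the widely used copy of tiny_shakespeare in karpathy/char-rnn, which is not part of this repository.

```bash
# Install the runtime dependencies (package names assumed from the list above).
pip install torch transformers tiktoken

# Optionally fetch a copy of tiny_shakespeare to use as training data.
# This URL is the commonly used copy from karpathy/char-rnn, not this repository.
curl -L -o tiny_shakespeare.txt \
  https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt
```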
To train the model, run `python -m gpt2.train`. The training script will:
- Load and tokenize your dataset.
- Train the GPT-2 model using the specified configuration.
- Save the model weights at the end of each epoch to the path specified in your config.
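For orientation, the sketch below shows what a training loop like this typically looks like. It is not the actual contents of `gpt2/train.py`; the `GPT2` class name, the `config.device` field, and the batching details are assumptions made for illustration.

```python
# Illustrative sketch only: GPT2 and the exact Config fields are assumptions,
# not the repository's actual API.
import torch
import tiktoken

def train_sketch(config):
    enc = tiktoken.get_encoding("gpt2")                       # GPT-2 BPE tokenizer
    text = open(config.data_path, encoding="utf-8").read()
    tokens = torch.tensor(enc.encode(text), dtype=torch.long)

    # Chop the token stream into (input, target) pairs of length seq_length.
    n = (len(tokens) - 1) // config.seq_length * config.seq_length
    x = tokens[:n].view(-1, config.seq_length)
    y = tokens[1:n + 1].view(-1, config.seq_length)

    model = GPT2(config).to(config.device)                    # hypothetical model class
    optimizer = torch.optim.AdamW(model.parameters(), lr=config.learning_rate)

    for epoch in range(config.num_epochs):
        for i in range(0, len(x), config.batch_size):
            xb = x[i:i + config.batch_size].to(config.device)
            yb = y[i:i + config.batch_size].to(config.device)
            logits = model(xb)                                 # (B, T, vocab_size)
            loss = torch.nn.functional.cross_entropy(
                logits.view(-1, logits.size(-1)), yb.view(-1))
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
        torch.save(model.state_dict(), config.model_path)      # checkpoint each epoch
```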
Configuration:
Edit the `gpt2/config.py` file to adjust parameters such as `data_path`, `batch_size`, `seq_length`, `learning_rate`, `num_epochs`, and `model_path`.
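For illustration only, such a config could be a simple dataclass along these lines; the default values and the `device` field shown here are assumptions, not the repository's actual settings.

```python
# Illustrative sketch; field names follow the parameters listed above,
# while default values and the dataclass layout are assumptions.
from dataclasses import dataclass

@dataclass
class Config:
    data_path: str = "tiny_shakespeare.txt"   # training text file
    model_path: str = "gpt2_model.pt"         # where checkpoints are written
    batch_size: int = 16
    seq_length: int = 128                     # tokens per training example
    learning_rate: float = 3e-4
    num_epochs: int = 5
    device: str = "cuda"                      # or "cpu"
```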
To run inference with a pretrained model:
- Ensure you have a trained model checkpoint at the path specified in your config.
- Use the inference script: `python -m gpt2.inference`. The script will:
  - Load the pretrained GPT-2 model.
  - Generate text based on your input prompt (modify the script to set your prompt).
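As a rough sketch of what the generation step typically involves (the `GPT2` class, `Config` fields, and checkpoint format are assumptions about this repository, not its documented API):

```python
# Illustrative sketch; GPT2, the Config fields, and the checkpoint layout are assumptions.
import torch
import tiktoken

@torch.no_grad()
def generate_sketch(config, prompt, max_new_tokens=100, temperature=1.0):
    enc = tiktoken.get_encoding("gpt2")
    model = GPT2(config).to(config.device)                       # hypothetical model class
    model.load_state_dict(torch.load(config.model_path, map_location=config.device))
    model.eval()

    ids = torch.tensor([enc.encode(prompt)], device=config.device)  # shape (1, T)
    for _ in range(max_new_tokens):
        ctx = ids[:, -config.seq_length:]                        # trim to the context window
        logits = model(ctx)[:, -1, :] / temperature              # logits for the last position
        probs = torch.softmax(logits, dim=-1)
        next_id = torch.multinomial(probs, num_samples=1)        # sample the next token
        ids = torch.cat([ids, next_id], dim=1)
    return enc.decode(ids[0].tolist())
```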
- The tokenizer uses `tiktoken` with the GPT-2 vocabulary (see the example after this list).
- Model checkpoints and configuration files are saved as specified in your config.
- For custom datasets or different model sizes, update the config and scripts accordingly.
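A quick sanity check of the tokenizer, using standard `tiktoken` calls (this snippet is not taken from the repository):

```python
import tiktoken

enc = tiktoken.get_encoding("gpt2")     # GPT-2 BPE vocabulary
ids = enc.encode("To be, or not to be")
print(ids)                              # a list of integer token ids
print(enc.decode(ids))                  # round-trips back to the original text
```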
Project structure:
- `gpt2/train.py`: Training pipeline.
- `gpt2/inference.py`: Inference/generation script.
- `gpt2/model.py`: Model definition and loading.
- `gpt2/config.py`: Configuration class.
- Ensure your data file path is correct in the config.
- If CUDA is available, set `device` in the config to `"cuda"` for faster training (see the snippet below).
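One common pattern, shown here as a generic sketch rather than code from this repository, is to pick the device automatically:

```python
import torch

# Use the GPU when one is available, otherwise fall back to CPU;
# this value can then be used for the `device` setting in gpt2/config.py.
device = "cuda" if torch.cuda.is_available() else "cpu"
print(device)
```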
Apache 2.0 License
Disclaimer: This README file was generated using AI.