Chess Transformer Engine

A transformer-based chess engine trained on 2 million PGN games

Overview

Several neural-network chess engines are moving from convolutional neural networks (CNNs) to transformer-based architectures. Leela Chess Zero's adoption of transformers is a prominent example of this shift, which is driven by architectural advantages of self-attention for chess position evaluation.

Why Transformers Over CNNs?

Limitations of CNNs in Chess

CNNs process information through local convolution operations, which inherently limits their ability to capture global board patterns:

Local Pattern Recognition: Even with deep networks and residual connections, CNNs primarily identify local piece configurations
Limited Spatial Awareness: Understanding relationships between distant pieces requires information to propagate through numerous layers, creating a computational bottleneck
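Sequential Information Flow: Each convolutional layer widens a square's view by only the kernel radius, so long-range tactical patterns (a pin or battery along a full diagonal, for example) become visible only in deeper layers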
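
The propagation cost can be made concrete with a small calculation. The sketch below assumes a stack of stride-1 convolutions with an odd kernel size (3x3 by default, which is an assumption for illustration and not a statement about any particular engine) and counts the minimum number of layers before the output at one square can depend on the input at another. The function names are hypothetical.

```python
def chebyshev_distance(a: str, b: str) -> int:
    """King-move distance between two squares in algebraic notation."""
    file_gap = abs(ord(a[0]) - ord(b[0]))
    rank_gap = abs(int(a[1]) - int(b[1]))
    return max(file_gap, rank_gap)


def layers_to_connect(a: str, b: str, kernel_size: int = 3) -> int:
    """Minimum number of stride-1 conv layers before the output at `b`
    can depend on the input at `a` (odd kernel_size >= 3 assumed)."""
    reach_per_layer = (kernel_size - 1) // 2  # squares gained per layer
    distance = chebyshev_distance(a, b)
    return -(-distance // reach_per_layer)    # ceiling division


print(layers_to_connect("a1", "h8"))  # 7
print(layers_to_connect("b1", "c3"))  # 2
```

Under these assumptions, a corner-to-corner relationship such as a1 to h8 needs at least seven layers before it can influence the evaluation of either square.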

Advantages of Transformers

Transformers address these limitations through their self-attention mechanism:

Global Board Vision: Every square can directly attend to every other square in a single layer, enabling immediate awareness of the entire position
Superior Long-Range Dependencies: Chess tactics and strategies frequently involve distant piece coordination—transformers naturally excel at modeling these relationships
Parallel Processing: Multi-head attention allows simultaneous evaluation of multiple strategic patterns
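
To illustrate the "every square attends to every other square" point, here is a minimal single-head scaled dot-product attention over 64 square tokens, written in plain Python so it runs without dependencies. It is a sketch under stated assumptions and not this project's model: the embedding width, the random weights, and all names (`self_attention`, `rand_matrix`, and so on) are hypothetical, and a real model would use a tensor library and multiple heads.

```python
import math
import random

NUM_SQUARES = 64  # one token per board square
D_MODEL = 8       # toy embedding width, chosen only for this demo


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product attention over all squares."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    k_t = [list(col) for col in zip(*k)]
    scale = math.sqrt(len(q[0]))
    scores = [[s / scale for s in row] for row in matmul(q, k_t)]
    weights = [softmax(row) for row in scores]  # 64 x 64 attention map
    return matmul(weights, v), weights


rng = random.Random(0)


def rand_matrix(rows, cols):
    """Random stand-in for learned embeddings or projection weights."""
    return [[rng.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]


x = rand_matrix(NUM_SQUARES, D_MODEL)
w_q, w_k, w_v = (rand_matrix(D_MODEL, D_MODEL) for _ in range(3))
out, weights = self_attention(x, w_q, w_k, w_v)
print(len(weights), len(weights[0]))  # 64 64
```

Row `i` of `weights` holds how strongly square `i` attends to each of the 64 squares, and each row sums to 1 up to floating-point error. The whole map is produced in one layer, which is the contrast with the layer-by-layer propagation of a convolutional stack.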

The Trade-off

Transformers come with increased computational requirements:

Higher memory consumption
Greater processing power demands
Larger training datasets needed

However, for competitive chess engines, these costs are justified by the substantial gains in positional understanding and tactical accuracy.
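
As a rough guide to the memory cost, the number of attention weights grows with the square of the token count, multiplied by the number of heads and layers. The arithmetic below assumes one token per square and uses hypothetical head and layer counts; these are illustrative values, not the configuration of this model, and the function names are made up for the example.

```python
def attention_weight_count(tokens: int, heads: int, layers: int) -> int:
    """Attention weights produced for one position in one forward pass."""
    return tokens * tokens * heads * layers


def attention_weight_bytes(tokens, heads, layers, bytes_per_value=4):
    """Memory for those weights, assuming 32-bit floats by default."""
    return attention_weight_count(tokens, heads, layers) * bytes_per_value


# Hypothetical sizes for illustration only.
TOKENS, HEADS, LAYERS = 64, 8, 8
print(attention_weight_count(TOKENS, HEADS, LAYERS))          # 262144
print(attention_weight_bytes(TOKENS, HEADS, LAYERS) / 2**20)  # 1.0 (MiB)
```

This figure is per position, so it scales linearly with batch size during training, and it covers only the attention maps rather than activations or parameters.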

Model Architecture

Our model uses the following configuration:

[Figure: model configuration parameters]

Performance Test: Smothered Mate

We tested the model's tactical vision on a mate-in-one position:

The Critical Move: Black to play and deliver checkmate in one move.

The position features a classic "smothered mate" pattern, a tactical motif in which the king's own pieces occupy its escape squares. The winning move is Ne2#, a knight check that delivers immediate checkmate.
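
The defining condition of the pattern can be written down directly: a knight gives check, and every square next to the king is occupied by the king's own pieces. The sketch below checks exactly that and nothing more. It is deliberately simplified (it does not test whether the checking knight can be captured, so it detects the pattern rather than proving checkmate), and the position used is a hypothetical one constructed for the example, not the test position described here.

```python
KNIGHT_JUMPS = [(1, 2), (2, 1), (2, -1), (1, -2),
                (-1, -2), (-2, -1), (-2, 1), (-1, 2)]


def to_coords(square: str) -> tuple[int, int]:
    """Convert algebraic notation such as 'e2' to (file, rank) indices."""
    return ord(square[0]) - ord("a"), int(square[1]) - 1


def is_smothered_check(own_pieces: set[str], king: str, knight: str) -> bool:
    """True if `knight` checks `king` and every on-board neighbour of the
    king is occupied by one of the king's own pieces.

    Simplified: does not test whether the knight can be captured.
    """
    kf, kr = to_coords(king)
    nf, nr = to_coords(knight)
    if (nf - kf, nr - kr) not in KNIGHT_JUMPS:
        return False  # the knight is not giving check
    occupied = {to_coords(sq) for sq in own_pieces}
    for df in (-1, 0, 1):
        for dr in (-1, 0, 1):
            f, r = kf + df, kr + dr
            if (df, dr) == (0, 0) or not (0 <= f < 8 and 0 <= r < 8):
                continue  # skip the king's own square and off-board squares
            if (f, r) not in occupied:
                return False  # an empty neighbour: the king is not smothered
    return True


# Hypothetical position: White king on c1 boxed in by its own pieces,
# Black knight has just landed on e2.
white_blockers = {"b1", "d1", "b2", "c2", "d2"}
print(is_smothered_check(white_blockers, king="c1", knight="e2"))           # True
print(is_smothered_check(white_blockers - {"d2"}, king="c1", knight="e2"))  # False
```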

Results

Our model successfully identified the checkmate:

[Figure: model output for the test position]

Finding this move indicates that the model recognizes the smothered-mate pattern, an encouraging sign for its pattern recognition and evaluation in mating attacks.

Training Details

Dataset: 2 million chess games in PGN format
Architecture: Transformer-based neural network
Task: Move prediction and position evaluation
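
One way PGN games could be turned into move-prediction examples is sketched below. How this project actually parses and encodes its games is not described here, so everything in the block is an assumption: the helper names are hypothetical, and the parser handles only plain movetext with brace comments, not variations or annotation glyphs.

```python
import re

RESULTS = {"1-0", "0-1", "1/2-1/2", "*"}


def san_moves(movetext: str) -> list[str]:
    """Extract SAN moves from PGN movetext, dropping brace comments,
    move numbers, and the game result."""
    text = re.sub(r"\{[^}]*\}", " ", movetext)  # strip {comments}
    text = re.sub(r"\d+\.+", " ", text)         # strip "1." and "1..."
    return [tok for tok in text.split() if tok not in RESULTS]


def next_move_examples(moves: list[str]) -> list[tuple[list[str], str]]:
    """Pair each move with the moves played before it."""
    return [(moves[:i], moves[i]) for i in range(len(moves))]


movetext = "1. e4 e5 2. Nf3 Nc6 3. Bb5 {Ruy Lopez} a6 1/2-1/2"
moves = san_moves(movetext)
print(moves)  # ['e4', 'e5', 'Nf3', 'Nc6', 'Bb5', 'a6']

examples = next_move_examples(moves)
print(examples[2])  # (['e4', 'e5'], 'Nf3')
```

Each example pairs a game prefix with the move that followed it; a model would typically see the resulting board position rather than the raw move list.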

This project demonstrates the practical application of transformer architectures to chess engine development, achieving strong tactical performance through modern deep learning techniques.
