Skip to content

Commit 8b9e5d1

Browse files
authored
Blog Post about the Odds Tournament at Jerusalem Chess Club (#186)
* Directly from Hissha * Few edits from johnsp * Reorganize and add images * Add Alt Text and grammar * Update future outlook and include links --------- Co-authored-by: John Derr <20563885+john-sp@users.noreply.github.com>
1 parent c44d884 commit 8b9e5d1

File tree

3 files changed

+185
-0
lines changed

3 files changed

+185
-0
lines changed
457 KB
Loading
393 KB
Loading
Lines changed: 185 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,185 @@
1+
---
2+
author: GM Larry "Hissha" Kaufman
3+
published: "2025-12-07"
4+
title: 'Leela Odds Tournament in Jerusalem'
5+
---
6+
7+
Is a rook advantage enough for a master to beat a superhuman AI in classical chess? For decades, the answer was an obvious "yes." However, a recent event in Jerusalem suggests the answer is no longer guaranteed.
8+
9+
<!-- more -->
10+
11+
![7 Players Preparing for Rook Odds Games](587505319_25475188565455802_8905523921313172164_n.jpg)
12+
13+
On November 29, a Classical time limit (90'+30") match took place on Lichess between 19 strong human players meeting at the Jerusalem Club in Israel and the LeelaOddsBots (mostly LeelaRookOdds). There was prize money for wins and draws to ensure that the players would play seriously, while also ruling out cheating, making this a much stricter test than typical online play. A similar event was held several years ago when the computer was Komodo and the handicap was knight odds; both of these events were organized and partly funded by Uri Blass.
14+
15+
### The Setup and The Hardware
16+
17+
The match was a double-round event. This was practical because Leela was playing at blitz speed, so games rarely exceeded two hours.
18+
19+
- **Round 1:** Mostly Rook odds (Leela White with the a1 rook removed), with two lower-rated players receiving two-knight odds.
20+
- **Round 2:** Rook odds again, except for first-round winners (who moved to knight odds) and one loser (who moved to two-knight odds).
21+
22+
LeelaRookOdds, running on my home computer with a 5090 GPU, played up to 12 games at once with replacement as games finished. The remaining games ran on my older 4090 GPU computer.
23+
24+
The humans generally played on real chess boards, entering their moves online only when ready. This led to very few errors. The only significant one at rook odds was a player entering a wrong move just as he was about to force a draw by repetition; we adjudicated this to a draw for the tournament.
25+
26+
### The Human Field
27+
28+
![4 Players at the Jerusalem Chess Club tournament](588773175_25475190302122295_1343816795594061971_n.jpg)
29+
30+
There were 38 games in total, 31 of which were at rook odds. The rook-odds players ranged from 1900 FIDE up to a 2463 FIDE IM, Yannay Ben Ari. The field was notably young; 12 of the 17 rook-odds players were between the ages of 12 and 23, suggesting the field was likely underrated compared to their official FIDE metrics.
31+
32+
The roster included:
33+
34+
- 2 International Masters (IMs)
35+
- 3 FIDE Masters (FMs)
36+
- 4 National Masters (NMs)
37+
38+
### Rook Odds: Results and Performance
39+
40+
The performance of LeelaRookOdds was staggering given the handicap.
41+
42+
| Category | Games | Leela Wins | Draws | Human Wins | Leela Perf. Rating |
43+
| :--- | :---: | :---: | :---: | :---: | :---: |
44+
| All Rook Odds | 31 | 18 | 8 | 5 | 2321 |
45+
| vs Titled Players (2150+) | 17 | 9 | 5 | 3 | 2417 |
46+
47+
Based on the average FIDE rating of the field (2166), Leela achieved a performance rating of 2321. Since the FIDE requirement for the FM title is to play 30 games with a rating of at least 2300, we can unofficially say **LeelaRookOdds has "earned" the FM title** - despite playing at blitz speed, in a simul on a single GPU, and starting without a rook!
48+
49+
Interestingly, performance rose with the rating of the opposition. Against the nine titled players (NM, FM, IM), Leela performed at an IM level (2417). This phenomenon is normal for the LeelaOddsBots. Leela "assumes" the opposition is strong enough to score 50% at the odds, so she often allows early draws by repetition if the opening goes well for the human. Weaker players accept these draws, while stronger players fight on.
50+
51+
### Analysis: How Leela Wins (and Loses)
52+
53+
The five human wins were high-quality games. Leela successfully complicated positions, but the humans found clever ways to simplify into winning endgames. One featured a prolonged battle to convert a Queen-for-Rook advantage with opposite-colored bishops, which was ultimately indefensible for Leela.
54+
55+
Conversely, the 18 games won by Leela were decided by complex tactics. These were rarely obvious blunders, but rather deep complications that master-level players can usually spot given enough time. However, Leela keeps the board so chaotic that humans eventually overlook something, either due to complexity or time pressure.
56+
57+
**The Accuracy Requirement:**
58+
I checked the Lichess accuracy figures (using Stockfish 17.1) for the humans and found the results surprising:
59+
60+
- **No one won** a game with less than **97% accuracy**.
61+
- Some losses had accuracy as high as **90–92%**.
62+
- Some draws had accuracy up to **94%**.
63+
64+
It turns out that winning a game starting with an extra rook requires near-perfection. Leela gradually reduces the disadvantage until one serious error makes the game close, then usually wins via further small errors.
65+
66+
### Knight and Two-Knight Odds Results
67+
68+
The results for minor piece handicaps were mixed, partially due to sample size.
69+
70+
| Handicap | Opponent | Result |
71+
| :--- | :--- | :---: |
72+
| **Knight Odds** | IM (Top Rated) | **Leela Win** |
73+
| **Knight Odds** | Y. Gurevich (2081) | **Leela Win** |
74+
| **2-Knights Odds** | Various | **+1 =3 -1** |
75+
76+
*Note: One of the draws in the 2-Knights category was due to a computer crash in a position that was objectively drawn but likely too difficult for the human to hold.*
77+
78+
The performance rating at two knights was only 1743 (or 1813 if we count the crash as a win). This is surprisingly low compared to blitz/rapid data, likely because three of the five games were against underrated junior players (aged 12-13). I suspect against established adults, the break-even for two-knight odds is closer to 2000.
79+
80+
The win against the top IM at knight odds confirms our belief that the break-even point in classical chess for knight odds is in the strong GM range (approx. 2600 FIDE).
81+
82+
### Re-evaluating Handicap Values
83+
84+
Based on this event, I would argue that rook odds is closer to knight odds than it is to two-knight odds. The nominal 3-3-5-9 piece values are endgame values and aren't relevant to the initial position.
85+
86+
In round numbers, if removing the "f" pawn is a value of **1**, here is how I estimate the current handicaps:
87+
88+
- **Knight Odds:** ~3.0
89+
- **Bishop Odds:** ~3.5
90+
- **Rook Odds:** ~4.0 (perhaps slightly more)
91+
- **Queen Odds:** ~8.0
92+
93+
Our experience shows Leela performs much better giving Queen odds than two rooks, though this may change once we have a net specifically trained for two-rook odds.
94+
95+
### Comparison: Classical vs. Blitz
96+
97+
This event allows us to compare human performance at Classical controls against their performance at Blitz vs Leela.
98+
99+
I analyzed LeelaRookOdds' blitz results since the major upgrade in February 2025 against identified GMs in the world top 50. Most games were 3'2" (adjusted for ratings).
100+
101+
| Format | Avg Opponent Rating | Leela Score | Perf. Rating |
102+
| :--- | :---: | :---: | :---: |
103+
| **Blitz (3'2")** | 2714 | +39 =11 -9 | **2909** |
104+
| **Classical (90'30")** | 2166 | +18 =8 -5 | **2321** |
105+
106+
For rook odds at a high human level, the difference in quality of play between 3'2" blitz and 90'30" classical chess appears to be about **600 Elo**. Since standard estimates for this gap are usually around 500 Elo, it seems that odds play increases the divergence between classical and blitz performance.
107+
108+
### A Million Games and A New Announcement
109+
110+
As we analyze the results of this classical match, we are also celebrating a massive milestone for the project as a whole. As of the morning of December 6, 2025 (Lichess time), the LeelaOddsBots have officially played over one million games.
111+
112+
When this project started two years ago, I don’t think anyone imagined we would be crushing Grandmasters by significant margins in Rapid at rook odds, or that a human would need close to IM-level strength just to break even against Queen odds in blitz. Reaching a million games this quickly would have sounded just as crazy as the strength of the bots.
113+
114+
Total Games Played (as of Dec 6):
115+
116+
- LeelaQueenOdds (LQO): 697,911
117+
- LeelaPieceOdds: 105,663
118+
- LeelaRookOdds (LRO): 95,501
119+
- LeelaKnightOdds (LKO): 89,137
120+
- LeelaQueenForKnight: 13,985
121+
- Total: 1,002,197
122+
123+
To celebrate this milestone, we will be promoting a new, BT4-based net for LeelaQueenOdds very soon. This new network has already proven its superiority in testing, and I am currently finalizing the optimal settings for Search Contempt and related parameters.
124+
125+
Regarding LeelaRookOdds (which played in this Jerusalem event), we are still using the LKO4 net trained by Marcus98 one year ago. It handles Knight, Bishop, Rook, and FRC Knight odds simultaneously and has yet to be surpassed. While net progress for rook odds has been flat, we have made large and rapid progress with the search. We recently upgraded the search to look at more nodes when the position is no longer objectively lost, and we hope to gain similar strength by adjusting search contempt values.
126+
127+
All in all, I expect that progress with the odds bots will continue at a much faster pace than is possible with standard chess. I also expect more odds matches with titled players; perhaps an FRC match at knight or rook odds with a GM might be next.
128+
129+
> [!INFO]- Links to Lichess Games at the Event
130+
>
131+
> #### Rook Odds
132+
>
133+
> - https://lichess.org/cZklEcr0
134+
> - https://lichess.org/Sp1dvJx7
135+
> - https://lichess.org/LR0absor
136+
> - https://lichess.org/IFvGDOax
137+
> - https://lichess.org/DyVsk4kn
138+
> - https://lichess.org/2iBv6AYg
139+
> - https://lichess.org/xXYGF7US
140+
> - https://lichess.org/v8G8lftq
141+
> - https://lichess.org/BXBrdCuh
142+
> - https://lichess.org/abstxHfM
143+
> - https://lichess.org/ed2Cc3DE
144+
> - https://lichess.org/cMMxF0qB
145+
> - https://lichess.org/BFIEQkWa
146+
> - https://lichess.org/SxftFixP
147+
> - https://lichess.org/1K1ZS7q1
148+
> - https://lichess.org/kVNMdeFv
149+
> - https://lichess.org/2DodfoJx
150+
> - https://lichess.org/G7O1j1qI
151+
> - https://lichess.org/mpgcf9d1
152+
> - https://lichess.org/BJfNpN0K
153+
> - https://lichess.org/Z4Oc8Kc0
154+
> - https://lichess.org/3xomU6B1
155+
> - https://lichess.org/3rGyKnj4
156+
> - https://lichess.org/ynJQVQuM
157+
> - https://lichess.org/dIo3ZBt2
158+
> - https://lichess.org/r3ApF53O
159+
> - https://lichess.org/0NHkDMyk
160+
> - https://lichess.org/MyBfQIlJ
161+
> - https://lichess.org/Gj8B2LwR
162+
> - https://lichess.org/3npjdaOt
163+
> - https://lichess.org/hDalrYC4
164+
>
165+
> #### 2 Knight Odds
166+
>
167+
> - https://lichess.org/UUJOqXbQ
168+
> - https://lichess.org/ZA3doEvl
169+
> - https://lichess.org/Uva4tcd7
170+
> - https://lichess.org/v8G8lftq
171+
> - https://lichess.org/H64s8I58
172+
>
173+
> #### Knight Odds
174+
>
175+
> - https://lichess.org/obVj3veI
176+
> - https://lichess.org/PLjXMdYo
177+
178+
### Conclusion: What is a Fair Match?
179+
180+
Thanks to Uri Blass and everyone involved in organizing this event, we have clarified where the odds bots stand in Classical chess. Based on this data, here are my estimates for a fair match (50% score) at Classical time limits:
181+
182+
- **Rook Odds:** Par with an **IM (~2400 FIDE)**.
183+
- **Knight Odds:** Par with a **Strong GM (~2600 FIDE)**.
184+
- **FRC (Fischer Random):** Add ~100 Elo to the above requirements.
185+
- **Fast Classical (45'30"):** Add ~100 Elo to the above requirements.

0 commit comments

Comments
 (0)