CCRL update (14th November 2008)

Post by Graham Banks

The latest updates of the CCRL Rating Lists and Statistics are available for viewing at:
http://www.computerchess.org.uk/ccrl/4040/ (40/40)
http://computerchess.org.uk/ccrl/404/ (40/4)
The live link to the 40/4 list given below is currently the most up to date for that list.

The lists sometimes get updated during the week and these updates can be viewed here:
http://www.computerchess.org.uk/ccrl/4040.live/ (40/40)
http://computerchess.org.uk/ccrl/404.live/ (40/4)
However, no game downloads are available from these live links.

The links to the various rating lists can be found just beneath the default Best Versions list.
For example there is a 32-bit Single CPU list.

Our two time controls, 40 moves in 40 minutes repeating and 40 moves in 4 minutes repeating, are both adjusted to the AMD64 X2 4600+ (2.4GHz).

Currently active testers are:
Graham Banks, Ray Banks, Shaun Brewer, Kirill Kryukov, Dom Leste, Tom Logan, Wassim Saeed, George Speight and Gabor Szots.
Currently inactive testers are:
Sarah Bird, Andreas Schwartmann, Charles Smith, Chris Taylor, Martin Thoresen and Chuck Wilson.

Be aware that in the early stages of testing, an engine's rating can often fluctuate a lot.
It is strongly advised to also look at the many other rating lists available in order to get a more accurate overall picture of an engine's rating relative to others.


40/40 Notes

There are currently just under 150,000 games in the 40/40 database.

4CPU 64-bit Engines

We have only just started testing Deep Fritz 11, but early indications lead us to believe that it could well be the new number two. Just how far it can close Rybka's commanding lead at the top remains to be seen.
Naum 3.1 and Zappa Mexico II are very even in strength and are likely to occupy third and fourth spots.
Deep Sjeng WC2008 comes in next, ahead of a closely bunched group that includes Deep Shredder 11, Toga II 1.4.1SE, Hiarcs 12 and Bright 0.4a (private).
Glaurung 2.1 and Loop M1-T lie further back.

The relative ratings of the 2CPU engines that have been well tested are pretty much the same as their 4CPU counterparts.


Single CPU Engines

Rybka 3 is over 150 elo ahead of other engines in this category.
Naum 3.1, Zappa Mexico II and Fritz 11 are all pretty close in strength. It is expected that Deep Sjeng WC2008 would be up there with them, but we haven't tested the 64-bit version yet.
Whether or not we can spare the resources to test Deep Fritz 11 in this category is also unclear at this stage.
There is a small margin back to Shredder 11 and Toga II 1.4.1SE.
Cyclone 2.2 bridges the gap back to Hiarcs 12, which is in turn ahead of the group that includes Fruit 2.3.1, Thinker 5.3b Inert, Loop 13.6, Glaurung 2.1 and Bright 0.4a (private).
Thinker 5.3b Inert needs many more games and its present rating should be taken with a grain of salt.
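
For a rough sense of what rating gaps like these mean in actual game results, the standard logistic Elo formula can be used. This is only a sketch: the function name is made up for illustration, and the rating lists themselves may be calculated with a different model, so treat the numbers as indicative only.

```python
def expected_score(elo_diff: float) -> float:
    """Expected score (win = 1, draw = 0.5) for the higher-rated engine.

    Uses the standard logistic Elo formula. Illustrative helper only,
    not the method used to compute the rating lists.
    """
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))


# Gaps of the size mentioned in these notes.
for diff in (50, 150, 200):
    print(f"{diff:+d} elo -> expected score {expected_score(diff):.1%}")
```

Under this formula a 150 elo lead corresponds to an expected score of roughly 70%, and a 200 elo lead to roughly 76%.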


Free Single CPU Engines

Rybka 2.2 heads the field with a 50+ elo gap back to Toga II 1.4.1SE and Cyclone 2.2.
The next group not too far behind consists of Fruit 2.3.1, Thinker 5.3b Inert and Glaurung 2.1.
Grapefruit 1.0, Spike 1.2 Turin and Bright 0.3a are further back, but clearly stronger than the next group that includes Frenzee Feb08, Stockfish 1.01, Twisted Logic 20080620, BugChess2 1.6.3 and Delfi 5.4.
Thinker 5.3b Inert, Grapefruit 1.0, Stockfish 1.01 and BugChess2 1.6.3 are still in the very early stages of testing, so their ratings are not stable yet.

CCRL tests a wide range of free engines, ranging right down to the 1900 elo level. The intention is to get well over 200 games for each of these engines. This rating list is certainly our most extensive one.

Recently released engines that seem to have made big strides are Twisted Logic, Cyrano, DanaSah, Rotor, Pupsi and NanoSzachy. BugChess could well be another, but needs more games before we know with certainty.


Blitz Notes

An enormous amount of work goes into the blitz list, and with over 350,000 games in the database, it is well worth a visit.

Of special interest to some will be the best free 1CPU engines list, which is being constructed through a systematic testing approach, as described here:
http://kirill-kryukov.com/chess/discuss ... f=7&t=3271


FRC Notes

Ray tests only those engines that can play FRC through the Shredder Classic GUI.
If engine authors have a new and stable version of their engine that will run under this GUI, they should contact Ray if they wish to see it tested.

Rybka 3 has a massive 200 elo lead over the closely grouped Shredder 11, Naum 3.1 and Deep Sjeng 3.0.
Hiarcs Paderborn 2007 in fifth spot is well ahead of Fruit 051103 and Loop 10.32f (the most recent Loop version that could play FRC).

For FRC the best list to look at is the pure list.
http://www.computerchess.org.uk/ccrl/404FRC/


Stats/Presentation Notes

The LOS (likelihood of superiority) stats on the right hand side of each rating list tell you the likelihood, in percentage terms, of each engine being superior to the engine directly below it.
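
To give a feel for how an LOS figure behaves, here is a minimal sketch of a common approximation that works from the wins and losses between two engines. It is an assumption for illustration only: the LOS shown on the lists comes from the rating calculation itself and may be computed differently, and the function name is hypothetical.

```python
import math


def los(wins: int, losses: int) -> float:
    """Approximate likelihood of superiority from head-to-head results.

    Draws are ignored because they carry no information about which
    engine is stronger in this approximation. Illustrative only.
    """
    decisive = wins + losses
    if decisive == 0:
        return 0.5  # no decisive games, so no evidence either way
    return 0.5 * (1.0 + math.erf((wins - losses) / math.sqrt(2.0 * decisive)))


print(f"LOS with 60 wins and 40 losses: {los(60, 40):.1%}")
```

With 60 wins against 40 losses this approximation gives an LOS of about 97.7%, while an even score gives exactly 50%.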

All games are available for download by engine, by month or by ECO code.
ELO ratings are now saved in all game databases for those engines that have 200 games or more.

Clicking on an engine name will give details as to opponents played plus homepage links where applicable.

Custom lists of engines can be selected for comparison.

An openings report page lists the number of games played for each ECO code, along with the draw percentage and White win percentage. Clicking on a column heading will sort the list by that column.
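
For anyone wanting to build a similar report from the downloaded games, the sketch below shows one way the per-ECO figures could be tallied. The function name, the input format (ECO code plus PGN result string) and the sample games are assumptions made for illustration, not the code behind the CCRL pages.

```python
from collections import defaultdict


def openings_report(games):
    """Tally games, draw % and White win % per ECO code.

    games: iterable of (eco_code, result) pairs, where result is a PGN
    result string: "1-0", "0-1" or "1/2-1/2". Hypothetical input format.
    """
    counts = defaultdict(lambda: {"games": 0, "draws": 0, "white_wins": 0})
    for eco, result in games:
        row = counts[eco]
        row["games"] += 1
        if result == "1/2-1/2":
            row["draws"] += 1
        elif result == "1-0":
            row["white_wins"] += 1
    return [
        {
            "eco": eco,
            "games": row["games"],
            "draw_pct": 100.0 * row["draws"] / row["games"],
            "white_win_pct": 100.0 * row["white_wins"] / row["games"],
        }
        for eco, row in counts.items()
    ]


# Made-up sample data, sorted by draw percentage like clicking that column.
sample = [
    ("B90", "1-0"), ("B90", "1/2-1/2"), ("B90", "0-1"), ("B90", "1-0"),
    ("C42", "1/2-1/2"), ("C42", "1/2-1/2"),
]
rows = sorted(openings_report(sample), key=lambda r: r["draw_pct"], reverse=True)
for r in rows:
    print(f'{r["eco"]}: {r["games"]} games, '
          f'{r["draw_pct"]:.0f}% draws, {r["white_win_pct"]:.0f}% White wins')
```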