CCRL update (28th November 2008)

Designed for posting all types of tournaments and Games (e.g. Man vs. Machine, Computer vs. Computer and basement matches.)

Moderators: Harvey Williamson, Watchman

Forum rules
This textbox is used to restore diagrams posted with the [d] tag before the upgrade.
Post Reply
Graham Banks
Full Member
Posts: 709
Joined: Mon Sep 10, 2007 4:38 am

CCRL update (28th November 2008)

Post by Graham Banks »

The latest updates of the CCRL Rating Lists and Statistics are available for viewing at:
http://www.computerchess.org.uk/ccrl/4040/ (40/40)
http://computerchess.org.uk/ccrl/404/ (40/4)
The live link to the 40/4 list given below is currently the most up to date for that list.

The lists sometimes get updated during the week and these updates can be viewed here:
http://www.computerchess.org.uk/ccrl/4040.live/ (40/40)
http://computerchess.org.uk/ccrl/404.live/ (40/4)
However, no game downloads are available from these live links.

The links to the various rating lists can be found just beneath the default Best Versions list.
For example there is a 32-bit Single CPU list.

Our 40 moves in 40 minutes repeating and 40 moves in 4 minutes repeating are both adjusted to the AMD64 X2 4600+ (2.4GHz).

Currently active testers are:
Graham Banks, Ray Banks, Shaun Brewer, Kirill Kryukov, Dom Leste, Tom Logan, Wassim Saeed, Charles Smith, George Speight and Gabor Szots.
Currently inactive testers are:
Sarah Bird, Andreas Schwartmann, Chris Taylor, Martin Thoresen and Chuck Wilson.

Be aware that in the early stages of testing, an engine's rating can often fluctuate a lot.
It is strongly advised to also look at the many other rating lists available in order to get a more accurate overall picture of an engine's rating relative to others.


40/40 Notes

There are now over 150,000 games in the 40/40 database.

4CPU 64-bit Engines

Rybka 3 still holds a mammoth lead over its nearest challengers.
Although Deep Fritz 11 still needs more games, it looks likely to be the new number two, ahead of Naum 3.1 and Zappa Mexico II.
Deep Sjeng WC2008 comes in next with a small edge over Deep Shredder 11, which in turn has a small edge over Toga II 1.4.1SE, Hiarcs 12 and Bright 0.4a (private).
Glaurung 2.1 and Loop M1-T lie further back.

The relative ratings of the 2CPU engines that have been well tested are pretty much the same as their 4CPU counterparts.


Single CPU Engines

Rybka 3 is over 150 elo ahead of other engines in this category.
Naum 3.1, Zappa Mexico II and Fritz 11 are all pretty close in strength. Deep Sjeng WC2008 and Thinker 5.3b Inert could well join this group, but need more games.
Whether or not we can spare the resources to test Deep Fritz 11 in this category is unclear at this stage.
There is a small margin back to Shredder 11 and Toga II 1.4.1SE.
Grapefruit 1.0a3 requires more games and its rating is most likely to fall back a little.
Cyclone 2.2 and Hiarcs 12 come in next, ahead of the group that includes Fruit 2.3.1, Loop 13.6, Glaurung 2.1 and Bright 0.4a (private).


Free Single CPU Engines

Rybka 2.2n2 still heads the field, but it looks a strong possibility that Thinker 5.3b Inert will close the gap by cementing itself ahead of Toga II 1.4.1SE and Cyclone 2.2.
Grapefruit 1.0a3 currently comes in next, ahead of Fruit 2.3.1 and Glaurung 2.1, but it requires further testing.
Spike 1.2 Turin, Stockfish 1.01 and Bright 0.3a are further back, but clearly stronger than the next group that includes Twisted Logic 20080620, Frenzee Feb08 and Delfi 5.4.

CCRL tests a wide range of free engines, ranging right down to the 1950 elo level. The intention is to get well over 200 games for each of these engines. This rating list is certainly our most extensive one.

Recently released engines that seem to have made big strides are Twisted Logic, Cyrano, DanaSah, Rotor, Pupsi and NanoSzachy. BugChess could well be another, but needs more games before we know with certainty.


Blitz Notes

An enormous amount of work goes into the blitz list, and with over 350,000 games in the database, it is well worth a visit.

Of special interest to some will be the best free 1CPU engines list which is being constructed through a systematic testing approach as mentioned here:
http://kirill-kryukov.com/chess/discuss ... f=7&t=3271


FRC Notes

Ray tests only those engines that can play FRC through the Shredder Classic GUI.
If engine authors have a new and stable version of their engine that will run under this GUI, they should contact Ray if they wish to see it tested.

In the past few weeks, Ray has tested Deep Sjeng WC2008 and Hermann 2.4. Hamsters 0.7.1 could well be next.

Rybka 3 has a massive 200 elo lead over the evenly matched Deep Sjeng WC2008 and Shredder 11.
Naum 3.1 is next in the pecking order, with an edge over Hiarcs Paderborn 2007.
There is then a reasonable gap back to Fruit 051103 and Loop 10.32f (the most recent Loop version that could play FRC).
Ray is hoping to get the chance to test Naum 4 upon its release.

For FRC the best list to look at is the pure list.
http://www.computerchess.org.uk/ccrl/404FRC/


Stats/Presentation Notes

The LOS (likelihood of superiority) stats to the right hand side of each rating list tell you the likelihood in percentage terms of each engine being superior to the engine directly below them.

All games are available for download by engine, by month or by ECO code.
ELO ratings are now saved in all game databases for those engines that have 200 games or more.

Clicking on an engine name will give details as to opponents played plus homepage links where applicable.

Custom lists of engines can be selected for comparison.

An openings report page lists the number of games played by ECO codes with draw percentage and White win percentage. Clicking on a column heading will sort the list by that column.
Post Reply