CCRL update report (31st July 2009)

Post by Graham Banks »

The latest CCRL Rating Lists and Statistics are available for viewing from the following links:
http://computerchess.org.uk/ccrl/4040.live/ (40/40)
http://www.computerchess.org.uk/ccrl/404/ (40/4)
http://www.computerchess.org.uk/ccrl/404FRC/ (FRC 40/4)

Please note that the three lists are updated separately from each other.
Also please note that the live link will no longer necessarily give you the most up-to-date lists. The links given in each update report will be the ones to use.

The links to the various rating lists can be found just beneath the default Best Versions list.
For example there is a 32-bit Single CPU list.

Our two time controls, 40 moves in 40 minutes repeating and 40 moves in 4 minutes repeating, are both adjusted to the AMD64 X2 4600+ (2.4GHz).

Currently active testers are:
Graham Banks, Shaun Brewer, Aser Huerga, Kirill Kryukov, Dom Leste, Wassim Saeed, Andreas Schwartmann, Charles Smith and Gabor Szots.

Be aware that in the early stages of testing, an engine's rating can often fluctuate a lot.
It is strongly advised to look at the many other rating lists available in order to get a more accurate overall picture of an engine's rating relative to others.


40/40 Notes

There are currently almost 195,000 games in our 40/40 database.


4CPU 64-bit Engines

Rybka 3 is clearly stronger than second-placed Naum 4, which in turn is clearly stronger than third-placed Deep Fritz 11.
Zappa Mexico II is in fourth spot, ahead of the closely grouped Thinker 5.4c Inert, Deep Sjeng WC2008, Stockfish 1.4 and Deep Shredder 11.
Hiarcs 12.1, Toga II 1.4.1SE, Glaurung 2.2 and Bright 0.4a are the next group, not too far behind.
Other well tested latest versions (in order of strength) are Loop 13.6, Deep Junior 10, Crafty 23.0, Scorpio 2.1 and Pharaon 3.5.1.
The strong Toga derivatives, Grapefruit 1.0, Cyclone 3.4 and TheMadPrune 1.1.25 have not been tested in this category.


The relative ratings of the 2CPU engines that have been well tested are pretty much the same as their 4CPU counterparts.


Single CPU Engines

A similar situation exists at the top in this category, with Rybka 3 holding a huge lead over second-placed Naum 4, which in turn has a sizeable lead over Thinker 5.4c Inert.
Zappa Mexico II and Fritz 11 come in next and are fairly even in strength.
There is a small gap back to the evenly matched Stockfish 1.4, Deep Sjeng WC2008, Grapefruit 1.0, Shredder 11 and Toga II 1.4.1SE. However, we're already talking more than 200 elo below Rybka!
There is another small gap back to Cyclone 3.4, Onno 1.0, Hiarcs 12.1 and TheMadPrune 1.1.25.
Glaurung 2.2, Fruit 2.3.1, Loop 13.6 and Bright 0.4a are the next group of engines, ahead of Ktulu 9, Spike 1.2 Turin and Junior 10.
The remaining commercial engines, SmarThink 1.10, Chess Tiger 2007.1 and the top Chessmaster 11 settings, are in a group that includes Twisted Logic 20080620, Frenzee Feb08, Delfi 5.4 and Booot 4.15.0.
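
To put the gap of more than 200 elo mentioned above into perspective, here is a minimal sketch using the standard logistic Elo formula. This is an assumption for illustration only: the rating software behind the CCRL lists may use a different model, and the function name `expected_score` is made up for this example.

```python
def expected_score(elo_diff: float) -> float:
    """Expected score (between 0 and 1) for the side rated elo_diff points higher.

    Uses the standard logistic Elo model; this is an assumption, not
    necessarily the model used to compute the CCRL lists.
    """
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))


# Print the expected score for a few rating gaps.
for diff in (0, 100, 200):
    print(f"{diff:>3} elo ahead: expected score {expected_score(diff):.1%}")
```

Under this model, an engine rated 200 elo higher would be expected to score roughly 76% against the lower rated engine.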


Free Single CPU Engines

Rybka 2.3.2a is the top free engine, comfortably ahead of Thinker 5.4c Inert.
In third spot, there is a group of fairly evenly matched engines - Stockfish 1.4, Grapefruit 1.0, Toga II 1.4.1SE and Cyclone 3.4.
TheMadPrune 1.1.25 bridges the gap between the previous group and the next, which includes Glaurung 2.2, Fruit 2.3.1 and Bright 0.4a.
Spike 1.2 Turin is further back still, but with a clear edge over Twisted Logic 20080620, Naum 2.0, Frenzee Feb08, Delfi 5.4 and Booot 4.15.0.

CCRL tests a wide range of free engines, ranging right down to the 2000 elo level. The intention is to get well over 200 games for each of these engines.
Tournaments involving these engines can be followed in our public forum.


Blitz Notes

There are over 464,000 games in the 40/4 database and it is well worth a visit.
Shaun, Gabor, Kirill and Aser put a lot of work into this list, testing engines in a very well organised and systematic manner.
This list will be updated soon.


FRC Notes

There are currently almost 60,000 games in the database and it is completely up to date with the exception of the latest version of Tornado.

Thanks to the much appreciated efforts of Matthias Gemuh, ChessGUI can now be used to test all FRC engines, both Winboard/UCI engines and those that are Shredder/Arena specific.

Rybka 3 has a big lead over Naum 4, which in turn has a similarly big lead over the evenly matched Deep Sjeng WC2008 and Shredder 11.
Hiarcs 12.1 comes in next, comfortably ahead of Glaurung 2.2.
Fruit 051103 and Loop 10.32f lie further back, ahead of Bright 0.4a and Spike 1.2 Turin.

For FRC the best list to look at is the pure list.
http://www.computerchess.org.uk/ccrl/404FRC/


Stats/Presentation Notes

The LOS (likelihood of superiority) stats on the right-hand side of each rating list tell you the likelihood, in percentage terms, of each engine being superior to the engine directly below it.
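
As a rough illustration of what an LOS figure means, the sketch below uses a commonly quoted head-to-head approximation based only on wins and losses between two engines. This is an assumption for illustration: the LOS values shown on the rating lists come from the rating software across all games, not from this formula, and the function name `los` is hypothetical.

```python
import math


def los(wins: int, losses: int) -> float:
    """Approximate likelihood of superiority from head-to-head wins and losses.

    Draws are ignored, since they do not favour either side. This is a
    common normal approximation, not the method used by the rating lists.
    """
    decisive = wins + losses
    if decisive == 0:
        return 0.5  # no decisive games, so no evidence either way
    return 0.5 * (1.0 + math.erf((wins - losses) / math.sqrt(2.0 * decisive)))


# Example: 60 wins against 40 losses (draws not counted).
print(f"LOS: {los(60, 40):.1%}")
```

With equal wins and losses the function returns 50%, and the figure moves towards 100% as the win margin grows relative to the number of decisive games.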

All games are available for download by engine, by month or by ECO code (not from the live link though).
ELO ratings are now saved in all game databases for those engines that have 200 games or more.

Clicking on an engine name will give details as to opponents played plus homepage links where applicable.

Custom lists of engines can be selected for comparison.

An openings report page lists the number of games played by ECO codes with draw percentage and White win percentage. Clicking on a column heading will sort the list by that column.
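
For anyone wanting to build a similar openings summary from their own games, here is a minimal sketch. It assumes each game has already been reduced to an (ECO code, result) pair using the usual PGN result strings. The function names, column names and sample games are all made up for illustration and are not taken from the CCRL site.

```python
from collections import defaultdict


def openings_report(games):
    """Summarise games per ECO code: game count, draw % and White win %.

    `games` is an iterable of (eco, result) pairs, where result is one of
    "1-0", "0-1" or "1/2-1/2".
    """
    totals = defaultdict(lambda: {"games": 0, "draws": 0, "white_wins": 0})
    for eco, result in games:
        row = totals[eco]
        row["games"] += 1
        if result == "1/2-1/2":
            row["draws"] += 1
        elif result == "1-0":
            row["white_wins"] += 1

    report = []
    for eco, row in totals.items():
        n = row["games"]
        report.append({
            "eco": eco,
            "games": n,
            "draw_pct": 100.0 * row["draws"] / n,
            "white_win_pct": 100.0 * row["white_wins"] / n,
        })
    return report


def sort_report(report, column, descending=True):
    """Sort the report by one column, like clicking a column heading."""
    return sorted(report, key=lambda r: r[column], reverse=descending)


# Invented sample games, for illustration only.
sample = [
    ("B90", "1-0"), ("B90", "1/2-1/2"), ("B90", "0-1"), ("B90", "1/2-1/2"),
    ("C42", "1/2-1/2"), ("C42", "1-0"),
]

for row in sort_report(openings_report(sample), "games"):
    print(row)
```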

For any testers interested in joining our group, please read our homepage before applying.