CCRL Rating List - August 4th update

Designed for posting all types of tournaments and Games (e.g. Man vs. Machine, Computer vs. Computer and basement matches.)

Moderators: Harvey Williamson, Watchman

Forum rules
This textbox is used to restore diagrams posted with the [d] tag before the upgrade.
Post Reply
Ray

CCRL Rating List - August 4th update

Post by Ray »

The August 4th update of the CCRL Rating Lists and Statistics is now available for viewing at:
http://www.computerchess.org.uk/ccrl/4040/

The links to the various rating lists can be found just beneath the default Best Versions list.
For example there is a 32-bit Single CPU list.

Our standard testing is at 40 moves in 40 minutes repeating while our current blitz testing is at both 40 moves in 4 minutes repeating and 40 moves in 12 minutes repeating, all adjusted to the AMD64 X2 4600+ (2.4GHz).

Currently active testers in our team are:
Graham Banks, Ray Banks, Shaun Brewer, Kirill Kryukov, Dom Leste, Tom Logan, Andreas Schwartmann, Charles Smith, George Speight, Chris Taylor, Chuck Wilson, Gabor Szots and Martin Thoresen.

A big thanks to all testers as usual for their efforts this week.


40/40 Notes

There currently 68,131 games in our 40/40 database.

Many engines on our list have few games and in many cases their ratings are likely to fluctuate (markedly for some) until a lot more games are played. Therefore no conclusions should be drawn about their strength yet.
To illustrate this point, when an engine has 200 games played, the error margin is still approximately +/-40 ELO, after 500 games +-25 ELO, after 1000 games +-17 ELO and even after 2000 games there is a +-13 ELO error margin!
This of course highlights the importance of looking at other rating lists that are also available in order to draw comparisons and get a more accurate overall picture.


Multi CPU Engines

Rybka 2.3.2a 64-bit 4CPU is a small improvement over Rybka 2.2 64-bit 4CPU.
Interestingly, the improvement is greater on 2CPU.

Zap!Chess Zanzibar 64-bit 4CPU is clearly number 2 ahead of Hiarcs 11.1 4CPU.
Hiarcs 11.2 is untested at this stage.

Naum 2.2 64-bit 4CPU has made a strong start, but has only a handful of games to date.
From what we've seen so far, it looks likely to be an improvement over its well established predecessor.

The ratings for Loop M1-T 64-bit 4CPU and 2CPU are very similar, suggesting little gain from the extra two CPU.

Deep Shredder 10 64-bit 4CPU, Deep Fritz 10 4CPU and Deep Junior 10 4CPU, are off the pace.

Whilst we've not tested Glaurung 2 epsilon/5 4CPU, the performance under 2CPU shows an impressive gain over the previous version.


Single CPU Engines

Rybka 2.3.2a leads the ratings here as well, although by a slightly larger margin.

Toga II 1.3 Beta 1 (to be officially released as Toga II 1.3.1) has now taken over second spot ahead of Zap!Chess Zanzibar!
Loop M1-T could well be a threat to both as it gets more games under its belt.

We have started testing Fruit 051103 and Fruit 2.3.1, so it will be interesting to see where they stand in comparison.

Hiarcs 11.1, Fritz 10 and Shredder 10 are the next three in the ranking order.
The controversial Strelka 1.8 looks likely to be close in strength to them.

We have only just started testing Naum 2.2 and Hiarcs 11.2.

Spike 1.2 Turin, Junior 10 and Deep Sjeng 2.5 are the next group of engines and are very even in strength.
Junior 10.1 is weaker than Junior 10 according to our testing.

Ktulu 8.0 and Chess Tiger 2007.1 are further adrift.
Alaric 707 and Glaurung 2 epsilon/5 could yet turn out to be stronger than both.


Amateur News

The final release of Toga II 1.3 will rival Rybka 1.0 as the strongest free engine.
Of course Fruit 051103 and Fruit 2.3.1 could well surpass both!

Strelka 1.8 is a little stronger than Spike 1.2 Turin.

Both Alaric 707 and Glaurung 2 epsilon/5 are expected to be stronger than the next group of engines - Scorpio 1.91, Delfi 5.1 and SlowChess Blitz WV2.1.

WildCat 7 and Pro Deo 1.2 are further back.

As we make our way down the list, it should be noted that the most recent versions of DanaSah, Natwarlal and Feuerstein seem to have made good gains over previous versions.
Others to keep an eye on are the latest versions of Popochin, NanoSzachy and BugChess.

We test a very extensive range of amateur engines through our Amateur Championship divisions (32-bit 1CPU) plus other tournaments, all of which can be followed in our public forum.

Our aim is of course to ensure that all engines lower on our lists get at least 200 games.

It should be noted that while the latest version of Matacz will load in the Chessbase GUIs on Pentiums, it does not seem to do so on Athlons.
The latest version of Twisted Logic is untested yet due to reported problems with frequent losses on time.
Counter 0.2 loses on time far too frequently. We haven't tested the latest version yet to see if the problem has been addressed.


Blitz Notes

There are currently 155,999 games in our 40/4 database.

The 40/4 update is usually done separately to our 40/40 update. The most recent update can always be viewed here:
http://computerchess.org.uk/ccrl/404.live/


FRC Notes

Ray tests only those engines that can play FRC through the Shredder Classic GUI.
If engine authors have a new and stable version of their engine that will run under this GUI, they should contact Ray if they wish to see it tested.

Ray has tested Naum 2.2, Fruit 2.3 and Hermann 2.0 during the past week, and all are now included in the ratings.
Fruit 051103 will be tested next.

For FRC the best list to look at is the pure list.
http://www.computerchess.org.uk/ccrl/404FRC/


Stats/Presentation Notes

The LOS stats to the right hand side of each rating list are "likelihood of superiority" stats. They tell you the likelihood in percentage terms of each engine being superior to the engine directly below them.

A list of games played this week per engine can be found in the update thread in the CCRL public forum, accessible through the link given at the top of this post.

All games are available for download through the link given at the top of this post. They can be downloaded by engine or by month.
ELO ratings are now saved in all game databases for those engines that have 200 games or more.

Clicking on an engine name will give details as to opponents played plus homepage links where applicable.

Custom list selections now have the option of including or excluding betas, private engines, settings and others.

An openings report page (link at bottom of index page) lists the number of games played by ECO codes with draw percentage and White win percentage. Clicking on a column heading will sort the list by that column.
Games can now be downloaded by ECO code.
User avatar
Harvey Williamson
Site Admin
Posts: 6079
Joined: Sun Jul 29, 2007 6:57 am
Location: Media City, UK
Contact:

Post by Harvey Williamson »

Hi Ray,

Thanks. It is nice to see CCRL posting their results here. CEGT have also offered to post results regularly. We also hope to add regular SSDF results.

Thanks Again,

Harvey
Ray

Post by Ray »

Unfortunately I think the web site just went down - must have been the hundreds of people rushing to look at it :D

Back-up site ccrl.org.uk will be updated in 2-3 hrs
Ray

Post by Ray »

Seems to be fine again now :)
Post Reply