GRADING ANOMALIES

General discussions about ratings.
User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Post by Robert Jurjevic » Mon May 21, 2007 4:12 pm

What do GS and CGS do about Condition 2...

We already mentioned Condition 2...

Condition 2: Let 'a' and 'b' are grades of two players, A and B, and let a >= b ('a' must be greater than or equal to 'b'). If the players play a match and then afterwards a rematch and if performance of player A in both, the match and rematch, is 'p2', not necessarily equal to the expected performance, p2 /= p = 50+a-b ('/=' means 'not equal'), if the players were graded (twice) after each match, and (once) after the match and rematch (taken as one match), one should obtain the same final grades (for both players).

...and an example...

Player A and B with grades a=130 and b=100 play a match and then afterwards a rematch. In both, the match and rematch, player A scores 70%. Let the grades after the match are 'a2' and 'b2' and after the rematch 'a3' and 'b3' (we use 'a2' and 'b2' in calculation of 'a3' and 'b3'). GS and CGS give: a2=120 and b2=110, a3=130 and b3=100. AGS gives: a2=125 and b2=105, a3=125 and b3=105. Let the grades after the match and rematch (taken as one match) are 'a4' and 'b4'. GS and CGS give: a4=120 and b4=110. AGS gives: a4=125 and b4=105. Condition 2 requires that the grading system gives a4=a3 and b4=b3.

Please note that if in the above calculation we haven't bothered to calculate 'b2', 'b3' and 'b4', and in the calculation of 'a3' we have taken b2=b=100 (which is wrong), both GS and CGS would give a3=120 and a4=120, giving an illusion that Condition 2 is satisfied for GS and CGS (as a3=a4). This may indicate that the designers of the original grading system (GS) intended to satisfy Condition 2 but they have forgotten to take into account grades of both players.

This should go in favour to AGS, which (I believe) satisfies Condition 2 (I have no mathematical proof yet).

Numerical examples...

Now follow some examples for which AGS satisfies Condition 2 (you can check that for yourself)...

The examples are show in the format...

Code: Select all

a b
1 1 1 1 1 1 2 2 2 2
1 1 1 2 1 0 1 0 2 2
a3 b3
a4 b4
...where 'a', 'b', 'a3', 'b3', 'a4' and 'b4' are as described in the above (Condition 2) example ('a3', 'b3', 'a4' and 'b4' calculated by AGS), '1 1 1 1 1 1 2 2 2 2' means that player A (with grade 'a') scored 6 wins in a raw and then 4 loses in a raw in the match, '1 1 1 2 1 0 1 0 2 2' means that player A (with grade 'a') scored 3 wins in a raw, 1 loss, 1 win, 1 draw, 1 win, 1 draw and 2 loses in a raw in the rematch...

Code: Select all

130 100
1 1 1 1 1 1 1 2 2 2 
1 1 1 1 1 1 1 2 2 2 
125 105
125 105

Code: Select all

130 100
1 1 1 1 1 1 1 2 2 2
1 1 1 1 1 0 1 0 2 2
125 105
125 105

Code: Select all

130 100
1 1 1 1 1 1 2 2 2 2
1 1 1 2 1 0 1 0 2 2
120 110
120 110

Code: Select all

130 100
1 1 1 1 1 1 2 2 2 2 
1 1 1 0 1 0 0 0 2 2
120 110
120 110

Code: Select all

150 100
1 1 1 1 1 1 1 1 1 2 
1 1 1 1 1 1 1 1 1 2 
145 105
145 105

Code: Select all

150 100
1 1 1 1 1 1 1 1 1 2 
1 1 0 0 1 1 1 1 1 1
145 105
145 105

Code: Select all

180 100
1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 
180 100
180 100

Code: Select all

140 120
1 1 1 1 1 0 1 1 0 1 
1 1 1 0 1 1 1 0 1 1 
150 110
180 100

Code: Select all

140 120
1 1 1 1 1 1 1 1 2 1 
1 1 1 0 1 1 1 0 1 1 
150 110
180 100

Code: Select all

150 110
1 1 1 1 1 1 1 1 2 1 
1 1 1 0 1 1 1 0 1 1
150 110
180 100
What next...

I'll try to prove mathematically that AGS satisfies Condition 2.
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Post by Robert Jurjevic » Mon May 21, 2007 4:31 pm

A note on decimal grades...

If one has started with grades as whole numbers, neither GS nor CGS algorithm would generate rational (decimal) grades, but AGS can (as average grade need not to be a whole number). My suggestion would be to either round the grades to zero (i.e. a grade of 99.25 would be 99) or one (i.e. a grade of 99.25 would be 99.3) decimal places. Rounding the grades on one decimal place would give granularity that is similar to that of Elo System (I think).
Robert Jurjevic
Vafra

Howard Grist
Posts: 84
Joined: Tue Dec 12, 2006 1:14 pm
Location: Southend-on-Sea

Post by Howard Grist » Mon May 21, 2007 8:14 pm

Robert Jurjevic wrote:To summarize (from my side)... in order to "fix" the grading system on should (in my opinion)...

change the grading rules to read... For a win you score your opponent's grade plus 50; for a draw, your opponent's grade; and for a loss, your opponent's grade minus 50. Note that, if your opponent's grade differs from yours by more than 50 (not 40) points, it is taken to be exactly 50 (not 40) points above (or below) yours.

allow for zero and negative grades (all grades may be artificially increased for the same amount in hope that after the change no grade will ever fall to zero or below, but cutoffs on the lower end should not be made, even if they fall below zero)
The reason the '40 point' rule is not a '50 point' rule is because a '50 point' rule would provide no grading benefit to the stronger player and the weaker player would win or draw the occasional game. Statistically (i.e. looking at actual game results) the stronger player only scores something like 98% against someone graded 50 points lower.

Zero and negative grades are allowed for and used in the calculations - the only difference is that negative grades are published as zero rather than the true figure.

You then go on at great length regarding problems grading players who only play each other. Your suggested improvement may solve the two-player problem but does very little for the real world case. If a player grade 100 scores 50% against 30 players with an average grade of 120, then his grade under current ECF rules would be 120. Similarly, if a player graded 140 scored 50% against the same 30 players he would also have a grade of 120. Under your suggested changes the 100 player would have a grade of 110 and the 140 player would have a grade of 130 - surely as the two players have had an identical performance they should also have an identical grade?

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Post by Robert Jurjevic » Tue May 22, 2007 8:02 am

Howard wrote:The reason the '40 point' rule is not a '50 point' rule is because a '50 point' rule would provide no grading benefit to the stronger player and the weaker player would win or draw the occasional game. Statistically (i.e. looking at actual game results) the stronger player only scores something like 98% against someone graded 50 points lower.
According to Condition 1...

Condition 1: The basis of English (ECF) ratings broadly speaking is that the difference in ratings is half the difference in percentage scores or higher. That is, if player A beats player B in a match 8 to 2 (60% difference), you would expect his grade to be about 30 points higher.

...which is equivalent to...

Condition 1: Let 'a' and 'b' are grades of two players, A and B, and let a >= b ('a' must be greater than or equal to 'b'). If the players play a match it is expected (according to their grades) that player A performance 'p' (in percent) will be p = 50+a-b, or equivalently if performance of player A is 'p', p = 50+a-b, the players performed as expected (according to their grades) and their grades would remain unchanged after the match. For example, if a=130 and b=100 and A performs 80% in the match the grades will not change after the match (A's expected performance is p = 50+a-b = 50+130-100 = 80%).

...the stronger player is expected to score 100% against someone graded 50 or more points lower (the actual percentage of 98% does not look to me far from the theoretical 100%, or is this treated as 'far').

I see no reason why would one choose to obey Condition 1 if the difference in grades is 40 points or less and break it in all other cases. The number '50' in '50 point' rule, as opposed to '40 point' rule, comes form the formula 'p = 50+a-b', which is a mathematical form of Condition 1, and in my opinion should not arbitrarily be changed into '40', if one wishes that the difference in ratings is half the difference in percentage scores.

Puttying it simple, if one uses '40 point' rule rather than '50 point' rule I think he or she breaks Condition 1. If you think I'm wrong, could you please elaborate where does '40' come from? Thanks.
Howard wrote:Zero and negative grades are allowed for and used in the calculations - the only difference is that negative grades are published as zero rather than the true figure.
I see, I think that is how it should be (negative grades used in calculation).
Howard wrote:You then go on at great length regarding problems grading players who only play each other. Your suggested improvement may solve the two-player problem but does very little for the real world case.
If there are two grading systems: the first (GS and CGS) which fails on simple examples (with two players who play only each other in the matches) and the second (AGS) which does not, one would expect the second to perform in the real world better than the first.
Howard wrote:If a player grade 100 scores 50% against 30 players with an average grade of 120, then his grade under current ECF rules would be 120.
Correct (both GS and CGS give that result, GS would give a wrong result if the difference in the grades was more than 40).
Howard wrote:Similarly, if a player graded 140 scored 50% against the same 30 players he would also have a grade of 120.
Correct (both GS and CGS give that result, GS would give a wrong result if the difference in the grades was more than 40).
Howard wrote:Under your suggested changes the 100 player would have a grade of 110 and the 140 player would have a grade of 130
Correct (AGS gives that result).
Howard wrote:surely as the two players have had an identical performance they should also have an identical grade?
In my opinion they should not have an identical grade. Let me explain...

According to AGS, after the games with the 100 player, an average grade of the 30 players would be 110 (assuming that in the other games the 30 players performed as expected according to their average grade of 120), Now, if the 100 player scores 50% against the same 30 players again, say in the next season, (assuming that in the other games the 30 players performed as expected according to their average grade which is now 110) neither the 100 player's grade (which is now 110) nor the average grade of the 30 players (which is now 110) would change (it will remain 110). That is the consequence of the fact that AGS satisfies Condition 2.

Similarly, according to AGS, after the games with the 140 player, an average grade of the 30 players would be 130 (assuming that in the other games the 30 players performed as expected according to their average grade of 140), Now, if the 140 player scores 50% against the same 30 players again, say in the next season, (assuming that in the other games the 30 players performed as expected according to their average grade which is now 130) neither the 140 player's grade (which is now 130) nor the average grade of the 30 players (which is now 130) would change (it will remain 130). That is the consequence of the fact that AGS satisfies Condition 2.

You may also look at this as follows...

The 100 player's new grades should be 110 (rather than 120) as you do not assume that the 100 player performance is only due to increase in his or her chess ability, but also due to possible decrease in chess ability of the 30 players (you in fact do not know if the 100 player improved or the 30 players worsen, that is why you take to be both). Similar reasoning holds for why the 140 player's new grade should be 130 (rather than 120).

If the 100 player had played 30 games against a 120 player scoring 50% (in a match), then I think you would agree that you do not know if the 100 player has improved or the 120 player has worsened, and, in my opinion, a new grade of 110 for both players would be fair (as if they play again and score the same, 50%, their grade of 110 will remain unchanged). I think nothing should change if, instead, the 100 player had played against 30 players with an average grade of 120, as the fact that they are 30 players should not, in my opinion, imply that they are an absolute measure and that we know that the score is solely due to increase in the 100 player's chess ability.

Code...

Code: Select all

(*Amended Grading System*)
ClearAll[a, b, d, d0, gw, gd, gl, g, a0, b0, nw, nd, nl, nt];
a = 120; b = 100;
nw = 15; nd = 0; nl = 15;
nt = nw + nd + nl;
d = a - b;
g = 50;
If[d > g, a0 = b + g, a0 = a];
If[d > g, b0 = a - g, b0 = b];
If[d > g, d0 = g, d0 = d];
gw = (g + d0)/2; gd = d0/2; gl = (g - d0)/2;
a2 = (nw*(b0 + gw) + nd*(b0 + gd) + nl*(b0 - gl))/nt;
b2 = (nw*(a0 - gw) + nd*(a0 - gd) + nl*(a0 + gl))/nt;
a2
Print[];
b2
110
110

Code: Select all

(*Amended Grading System*)
ClearAll[a, b, d, d0, gw, gd, gl, g, a0, b0, nw, nd, nl, nt];
a = 140; b = 120;
nw = 15; nd = 0; nl = 15;
nt = nw + nd + nl;
d = a - b;
g = 50;
If[d > g, a0 = b + g, a0 = a];
If[d > g, b0 = a - g, b0 = b];
If[d > g, d0 = g, d0 = d];
gw = (g + d0)/2; gd = d0/2; gl = (g - d0)/2;
a2 = (nw*(b0 + gw) + nd*(b0 + gd) + nl*(b0 - gl))/nt;
b2 = (nw*(a0 - gw) + nd*(a0 - gd) + nl*(a0 + gl))/nt;
a2
Print[];
b2
130
130
If I require for AGS to obey your condition above, which I may call Condition 3, I get...

Code: Select all

(*Amended Grading System*)
ClearAll[a, b, d, d0, g0, g, a0, b0, nw, nd, nl, nt];
a = 140; b = 120;
nw = 15; nd = 0; nl = 15;
nt = nw + nd + nl;
d = a - b;
g = 50;
If[d > g, a0 = b + g, a0 = a];
If[d > g, b0 = a - g, b0 = b];
If[d > g, d0 = g, d0 = d];
a2 = 120; b2 = 140;
Solve[{
    a2 == (nw*(b0 + gw) + nl*(b0 - gl))/nt,
    b2 == (nw*(a0 - gw) + nl*(a0 + gl))/nt}, {gw}]
{{gw -> gl}}    
...and as AGS has to obey Condition 1 too, I get...

Code: Select all

(*Amended Grading System*)
ClearAll[a, b, d, d0, g0, g, a0, b0, nw, nd, nl, nt];
a = 130; b = 100;
nw = 8; nd = 0; nl = 2;
nt = nw + nd + nl;
d = a - b;
g = 50;
If[d > g, a0 = b + g, a0 = a];
If[d > g, b0 = a - g, b0 = b];
If[d > g, d0 = g, d0 = d];
a2 = 130; b2 = 100;
Solve[{
    a2 == (nw*(b0 + g0) + nl*(b0 - g0))/nt,
    b2 == (nw*(a0 - g0) + nl*(a0 + g0))/nt}, {g0}]
{{g0 -> 50}}    
...which implies that the only possible grading system which would obey both Conditions 1 and 3 is CGS...

Code: Select all

(*Corrected Grading System*)
ClearAll[a, b, d, g, a0, b0, nw, nd, nl, nt];
a = 140; b = 120;
nw = 15; nd = 0; nl = 15;
nt = nw + nd + nl;
d = a - b;
g = 50;
If[d > g, a0 = b + g, a0 = a];
If[d > g, b0 = a - g, b0 = b];
a2 = (nw*(b0 + g) + nd*(b0) + nl*(b0 - g))/nt;
b2 = (nw*(a0 - g) + nd*(a0) + nl*(a0 + g))/nt;
a2
Print[];
b2
Nevertheless, as Condition 3 needs not (or better, must not) be satisfied by a good grading system, AGS appears to be the best of the three.
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Post by Robert Jurjevic » Tue May 22, 2007 12:17 pm

In what CGS and AGS differ...

I found a simple explanation in what CGS obeying...

Rule 1b: For a win you score your opponent's grade plus 50; for a draw, your opponent's grade; and for a loss, your opponent's grade minus 50. Note that, if your opponent's grade differs from yours by more than 50 (not 40) points, it is taken to be exactly 50 (not 40) points above (or below) yours. At the end of the season an average of points-per-game is taken, and that is your new grade.

...and AGS obeying...

Rule 2: For a win you score average grade plus 25; for a draw, average grade; and for a loss, average grade minus 25. Note that, if your opponent's grade differs from yours by more than 50 points, it is taken to be exactly 50 points above (or below) yours. Average grade is half of the sum of your and your opponent's grade. At the end of the season an average of points-per-game is taken, and that is your new grade.

...differ.

When calculating a new grade for a player... CGS assumes that the difference in expected performance is exclusively due to the change in chess ability of the player in question (i.e., the grades of the other players are taken to represent their true chess abilities) ...while... AGS assumes that the difference in expected performance is due to both, the change in chess ability of the player in question, and the change in chess ability of his or her opponents (i.e., the grades of the other players are not taken to represent their true chess abilities).

A good example is one given by Howard (in his previous post) where he considers... a player graded a 100 who scores 50% against 30 players with an average grade of 120... stating that the player's new grade is... 120 according to CGS (and GS, GS would give a different, wrong, result if the grading difference was more than 40)... and 110 according to AGS.

The grade of 110 looks to me fairer than 120 as (as I previously discussed)...

If the 100 player had played 30 games against a 120 player scoring 50% (in a match), then I think you would agree that you do not know if the 100 player has improved or the 120 player has worsened, and, in my opinion, a new grade of 110 for both players would be fair (as if they play again and score the same, 50%, their grade of 110 will remain unchanged). I think nothing should change if, instead, the 100 player had played against 30 players with an average grade of 120, as the fact that they are 30 players should not, in my opinion, imply that they are an absolute measure and that we know that the score is solely due to increase in the 100 player's chess ability.

I think, it would not be wrong to assume that... the grades of the other players represent their true chess abilities ...if the (other) players would always perform as expected (according to Condition 1)... but as this is not the case, I think that it is better to assume that... the grades of the other players do not represent their true chess abilities.

GS obeying...

Rule 1a: For a win you score your opponent's grade plus 50; for a draw, your opponent's grade; and for a loss, your opponent's grade minus 50. Note that, if your opponent's grade differs from yours by more than 40 points, it is taken to be exactly not 40 points above (or below) yours. At the end of the season an average of points-per-game is taken, and that is your new grade.

...is, in my opinion, unaccaptabe, as it even does not satisfy the very definition of ECF grade (the difference in grades is half the difference in percentage scores, or 'p = 50+a-b', where 'p' is percentage score of player A, 'a' a grade of player A, 'b' a grade of player B and 'a>=b') expressed in Condition 1.

So, of the three, in my opinion, the best is AGS.

Note that I haven't yet mathematically proven that AGS satisfies Condition 2 (although I have a plenty of numerical examples which suggest that it should), but I hope to succeed in this. I've already conducted a number of mathematical proofs for AGS (and GS and CGS), and once all mathematical proofs are finished and checked, I think, ECF could consider implementing AGS.

Please consider it, please...
Robert Jurjevic
Vafra

User avatar
Greg Breed
Posts: 723
Joined: Thu Apr 05, 2007 8:30 am
Location: Aylesbury, Bucks, UK

Post by Greg Breed » Wed May 23, 2007 11:21 am

Robert, would this system of yours work with ELO numbers?
See this thread:
http://forum.bcfservices.org.uk/viewtopic.php?p=262#262

The conversions shown below are taken from this webpage:
http://grading.bcfservices.org.uk/help.php#elo

Code: Select all

Conversion between FIDE Elo, Other National Elo ratings and ECF Gradings.

The following conversion applies to national Elo ratings and for FIDE Elo ratings greater than 2327: 
(Elo - 600) / 8 = ECF
ECF x 8 + 600 = Elo

For FIDE ratings lower than this, and for ECF Grading purposes only, a different formula is used:
(FIDE - 1250) / 5 = ECF
ECF x 5 + 1250 = FIDE

ECF 	FIDE  Other National
 70 	1600 	1160
 80 	1650 	1240
 90 	1700 	1320
100 	1750 	1400
110 	1800 	1480
120 	1850 	1560
130 	1900 	1640
140 	1950 	1720
150 	2000 	1800
160 	2050 	1880
170 	2100 	1960
180 	2150 	2040
190 	2200 	2120
200 	2250 	2200
210 	2300 	2280
215 	2325 	2320
216 	2328 	2328
220 	2360 	2360
230 	2440 	2440
240 	2520 	2520
250 	2600 	2600
260 	2680 	2680
270 	2760 	2760
Note: the current conversion assumes a minimum FIDE Elo of 1600
In the first thread mentioned above the topic is about converting to ELO. I'm unsure which it would be (FIDE or National) or why there is a difference. Probably for the same reason why ours is different.

Anyway, if we used the FIDE numbers and used Mike Gunn's suggestion of +400 for a win and -400 for a loss would that work? I'm just seeing if we can put all your hard work to use on another subject of discussion and try to 'kill two birds with one stone' - Fixing the grading system and converting to ELO.
Hatch End A Captain (Hillingdon League)
Controller (Hillingdon League)

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Post by Robert Jurjevic » Wed May 23, 2007 2:29 pm

Greg Breed wrote:Robert, would this system of yours work with ELO numbers?
If you meant to ask if the grade range could be adjusted to match that of FIDE, the answer is yes, for all GS, CGS and AGS, but one would need to modify Condition 1 accordingly. Please note that when one talks about Élo system (FIDE, USCF, etc.) there is more to it than just the fact that the grades are normally thousands rather than hundreds.
Greg Breed wrote:Anyway, if we used the FIDE numbers and used Mike Gunn's suggestion of +400 for a win and -400 for a loss would that work?
I think not.

Élo grading system...

Élo Grading System using logistically distributed random variable (rather than original normally distributed random variable) states...

Condition 1b: Let 'a' and 'b' are grades of two players, A and B, and let a >= b ('a' is assumed to be greater than or equal to 'b'). If the players play a match it is expected (according to their grades) that player A performance 'p' (in percent) will be 'p = 100/(1 + 10^(-(a-b)/g))'.

...which is an equivalent of ECF Condition 1...

Condition 1: Let 'a' and 'b' are grades of two players, A and B, and let a >= b ('a' is assumed to be greater than or equal to 'b'). If the players play a match it is expected (according to their grades) that player A performance 'p' (in percent) will be 'p = 50+a-b'.

Note: Statistical tests have shown that chess performance is almost certainly not normally distributed. Weaker players have significantly greater winning chances than Élo's model predicts. Therefore, both the USCF and FIDE have switched to formulas based on the logistic distribution. However, in deference to Élo's contribution, both organizations are still commonly said to use "the Elo system".

Élo suggested scaling ratings so that a difference of 200 rating points in chess would mean that the stronger player has an expected score of approximately 0.75.

In order to keep present ECF grade scale we could... suggest scaling ratings so that a difference of 25 (not 200) rating points in chess would mean that the stronger player has an expected score of approximately 0.75.

Now, we can determine 'g' in 'p = 100/(1 + 10^(-(a-b)/g))' from... '100/(1 + 10^(-(a-b)/g)) = p = 50+a-b', 'a-b=25' ...to be 'g=(25*Log[10])/Log[3]'.

Let us call this grading system ÉGS for short (that would be Élo grading system with logistic distribution and grades scaled to ECF standard).

Then, the formulas for calculating grades according to ÉGS are...

Code: Select all

(*Elo Grading System*)
ClearAll[a, b, a2, b2, d, g, k, p, nw, nd, nl, nt];
a = 130; b = 100;
nw = 8; nd = 0; nl = 2;
nt = nw + nd + nl;
d = a - b;
g = (25*Log[10])/Log[3];
k = 1/2;
p = 100/(1 + 10^(-d/g));
a2 = a + k*((nw + nd/2)/nt*100 - p);
b2 = b + k*((nl + nd/2)/nt*100 - (100 - p));
Round[N[a2]]
Print[];
Round[N[b2]]
The AGS formulas could be rewritten in a similar form (note that the difference is only in calculation of 'p')...

Code: Select all

(*Amended Grading System*)
ClearAll[a, b, a2, b2, d, g, k, p, nw, nd, nl, nt];
a = 130; b = 100;
nw = 8; nd = 0; nl = 2;
nt = nw + nd + nl;
g = 50;
d = a - b;
k = 1/2;
If[d > g, p = 100, p = 50 + d];
a2 = a + k*((nw + nd/2)/nt*100 - p);
b2 = b + k*((nl + nd/2)/nt*100 - (100 - p));
Round[N[a2]]
Print[];
Round[N[b2]]
Please note that ÉGS grades are already scaled to ECF standard (as shown above) and should not be confused with FIDE or national ratings. i.e., if your ECF grade was 120 your ECF grade calculated using ÉGS would be 120ish (and say not 1850).

Of course, a disadvantage of ÉGS is that the calculation of grades is not so easy and one would need a scientific calculator (logarithms and powers) or a computer in order to calculate them.

Grading systems tabular grade comparison...

Let player A with grade 'a' and player B with grade 'b' play a match in which player A scores 'p' percents. Let the calculated grades after the match are 'a2' and 'b2'. The following table shows 'a2' and 'b2' calculated by GS, CGS, AGS, and ÉGS for a number of chosen 'a's, 'b's and 'p's...

Code: Select all

------------------------------------------------  
                 GS      CGS      AGS      ÉGS
  a   b   p    a2  b2   a2  b2   a2  b2   a2  b2    
------------------------------------------------ 
220 100 100   230  90  220 100  220 100  220 100
200 100 100   210  90  200 100  200 100  201  99
180 100 100   190  90  180 100  180 100  181  99
170 100 100   180  90  170 100  170 100  172  98
160 100 100   170  90  160 100  160 100  163  97
150 100 100   160  90  150 100  150 100  155  95
140 100  90   140 100  140 100  140 100  142  98
130 100  80   130 100  130 100  130 100  131  99
120 100  70   120 100  120 100  120 100  120 100
110 100  60   110 100  110 100  110 100  110 100
110 100  50   100 100  100 100  100 100  100 100
------------------------------------------------ 
220 100  50   160 140  170 150  195 125  195 125
200 100  50   160 140  150 150  175 125  176 124
180 100  50   140 140  130 150  155 125  156 124
160 100  50   120 140  110 150  135 125  138 122
150 100  50   110 140  100 150  125 125  130 120
140 100  50   100 140  100 140  120 120  122 118
130 100  50   100 130  100 130  115 115  116 114
120 100  50   100 120  100 120  110 110  110 110
------------------------------------------------ 
160 130  80   160 130  160 130  160 130  161 129
150 120  80   150 120  150 120  150 120  151 119
140 110  80   140 100  140 110  140 110  141 109
130 100  80   130 100  130 100  130 100  131  99
------------------------------------------------  
Conclusions...

GS is very wrong and in my opinion must be abandoned.

CGS is wrong and in my opinion should be replaced with AGS or ÉGS.

AGS is acceptable, although ÉGS could be better.

ÉGS should be the best of the four.

Both, GS and CGS, are wrong. GS is very wrong as it does not obey neither Condition 1 nor Condition 2, and CGS is wrong as it does not obey Condition 2.

Both, AGS and ÉGS, obey Condition 2 (I haven't mathematically proven that but rather conjecture it) and which one is better depends on the fact which Condition, Condition 1 or Condition 1a, model reality better. Apparently, Élo system has been statistically checked against real data, therefore, it is likely that Condition 1b is better than Condition 1, and consequently that ÉGS is better than AGS.
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Post by Robert Jurjevic » Wed May 23, 2007 6:21 pm

Is ÉGS really better than AGS...

Both, ÉGS and AGS, satisfy Condition 2 (and it looks like all other necessary conditions, Condition 1 or 1b, Condition 0, etc.).

It looks like the difference between ÉGS and AGS is only in a way of how they define grade, i.e., ÉGS defines it as 'p = 100/(1 + 10^(-(a-b)/g))' (Condition 1b) and AGS defines it as 'p = 50+a-b' (Condition 1).

Can 'p = 100/(1 + 10^(-(a-b)/g))' be better than 'p = 50+a-b'? I think not! If one had chosen, say, 'p = 50+(a-b)/2', that should have led to an equally good grading system, I think (note that in such a system a 130 player would be expected to score in a match against a 100 player 65% and not 80% which is a requirement in AGS).

So, it looks like both, ÉGS and AGS, are equally good.

What may happen in practice is that in one implementation of a system an average difference between expected and actual performance is greater than in another, but, in my opinion, this is not due do the fact that one system is better than another, rather, I think, it exclusively depends on the particular circumstances such as the number of rapidly improving juniors, the number of games played, how often the grades are calculated, etc.

Conclusions...

If the above is correct...

I would suggest to go for a simpler model, which is AGS (not ÉGS).

AGS uses a simple rule...

Rule 2: For a win you score average grade plus 25; for a draw, average grade; and for a loss, average grade minus 25. Note that, if your opponent's grade differs from yours by more than 50 points, it is taken to be exactly 50 points above (or below) yours. Average grade is half of the sum of your and your opponent's grade. At the end of the season an average of points-per-game is taken, and that is your new grade.
Robert Jurjevic
Vafra

User avatar
Greg Breed
Posts: 723
Joined: Thu Apr 05, 2007 8:30 am
Location: Aylesbury, Bucks, UK

Post by Greg Breed » Thu May 24, 2007 11:12 am

This all sounds great Robert. I wasn't asking for the ELO system to be used merely their numbers.

i.e. translate all current ECF grades into National ELO by (ECF*8+600) and then use the new and improved ECF grading system but tweaked for the new number set - something like +/-400 for W/L then averaged.
Hatch End A Captain (Hillingdon League)
Controller (Hillingdon League)

User avatar
Greg Breed
Posts: 723
Joined: Thu Apr 05, 2007 8:30 am
Location: Aylesbury, Bucks, UK

Post by Greg Breed » Thu May 24, 2007 11:55 am

Robert Jurjevic wrote:Rule 2: For a win you score average grade plus 25; for a draw, average grade; and for a loss, average grade minus 25. Note that, if your opponent's grade differs from yours by more than 50 points, it is taken to be exactly 50 points above (or below) yours. Average grade is half of the sum of your and your opponent's grade. At the end of the season an average of points-per-game is taken, and that is your new grade.
I've just had a brainstorm, although I imagine some will call it a brainfart instead...
So this is how it could work. Using the method above we can translate that in the new ECF ELO system... after having corrected everyones grades and then converted them to ELO..
Greg Breed Brainiac wrote:Rule 2: For a win you score average grade plus 200; for a draw, average grade; and for a loss, average grade minus 200. Note that, if your opponent's grade differs from yours by more than 400 points, it is taken to be exactly 400 points above (or below) yours. Average grade is half of the sum of your and your opponent's grade. At the end of the season an average of points-per-game is taken, and that is your new grade.
Simple :D
Hatch End A Captain (Hillingdon League)
Controller (Hillingdon League)

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Post by Robert Jurjevic » Thu May 24, 2007 12:43 pm

Mathematical proofs that AGS is correct...

AGS uses formulas...

Code: Select all

(*Amended Grading System*)
ClearAll[a, b, d, d0, gw, gd, gl, g, a0, b0, nw, nd, nl, nt];
a = 130; b = 100;
nw = 8; nd = 0; nl = 2;
nt = nw + nd + nl;
d = a - b;
g = 50;
If[d > g, a0 = b + g, a0 = a];
If[d > g, b0 = a - g, b0 = b];
If[d > g, d0 = g, d0 = d];
gw = (g + d0)/2; gd = d0/2; gl = (g - d0)/2;
a2 = (nw*(b0 + gw) + nd*(b0 + gd) + nl*(b0 - gl))/nt;
b2 = (nw*(a0 - gw) + nd*(a0 - gd) + nl*(a0 + gl))/nt;
Round[N[a2]]
Print[];
Round[N[b2]]
...which is equivalent to...

Code: Select all

(*Amended Grading System*)
ClearAll[a, b, a2, b2, d, g, k, p, nw, nd, nl, nt];
a = 130; b = 100;
nw = 8; nd = 0; nl = 2;
nt = nw + nd + nl;
g = 50;
d = a - b;
k = 1/2;
If[d > g, p = 100, p = 50*(1 + d/g)];
a2 = a + k*((nw + nd/2)/nt*100 - p);
b2 = b + k*((nl + nd/2)/nt*100 - (100 - p));
Round[N[a2]]
Print[];
Round[N[b2]]
P1: Proof that AGS treats two draws as one win...

'(nw + nd/2)' in 'a2 = a + k*((nw + nd/2)/nt*100 - p)' and '(nl + nd/2)' in 'b2 = b + k*((nl + nd/2)/nt*100 - (100 - p))' show that a draw is treated as half a win or that two draws are counted as one win.

P2: Proof that AGS satisfies Condition 1, ie., that it satisfies 'If[d > g, p = 100, p = 50 + d]'...

If the players have performed as expected then '(nw + nd/2)/nt*100 = p' and '(nl + nd/2)/nt*100 = 100 - p', where 'p = 50 + d' if 'd>g' and 'p = 100' if 'd<=g' and consequently 'a2 = a + k*((nw + nd/2)/nt*100 - p) = a' and 'b2 = b + k*((nl + nd/2)/nt*100 - (100 - p)) = b'.

P3: Proof that AGS satisfies Condition 2, i.e., if 'a2 = a + k*(q - p)', 'b2 == b + k*((100 - q) - (100 - p))', 'p = 50*(1 + d/g)', 'a3 = a2 + k*(q - p2)', 'b3 =b2 + k*((100 - q) - (100 - p2))' and 'p2 = 50*(1 + d2/g)', where 'g = 50','k = 1/2', 'd = a - b' and 'd2 = a2 - b2', then 'a3 = a2', 'b3 = b2'and 'p2 = q'...

Code: Select all

(*Amended Grading System*)
ClearAll[a, b, a2, b2, a3, b3, d, d2, g, k, p, p2, q];
g = 50;
k = 1/2;
d = a - b;
d2 = a2 - b2;
Solve[{a2 == a + k*(q - p), b2 == b + k*((100 - q) - (100 - p)), 
    p == 50*(1 + d/g), a3 == a2 + k*(q - p2), 
    b3 == b2 + k*((100 - q) - (100 - p2)), p2 == 50*(1 + d2/g)}, {a2, b2, p, 
    a3, b3, p2}]
{{a2 -> (-50 + a + b + q)/2, b2 -> (50 + a + b - q)/2, 
  p -> 50 + a - b, a3 -> (-50 + a + b + q)/2, 
  b3 -> (50 + a + b - q)/2, p2 -> q}}    
Summary, etc....

I regard this issue closed from my side...
  • AGS is correct, i.e. it treats two draws as one win, it satisfies Condition 1 and 2, and we also allow for zero and negative grades. All mathematical proofs had been conducted.
  • AGS cannot be 'worse' than ÉGS. That is because the only difference between AGS and ÉGS is in definition of grade. There are a number of different grade definitions yielding correct grading systems. Both, AGS and ÉGS, are correct grading systems. One cannot check if a definition of grade is 'better' or 'worse' by comparing it to experimental data.
  • I suggest that ECF adopts AGS rather than ÉGS as calculations would be simpler.
AGS uses...

Rule 2: For a win you score average grade plus 25; for a draw, average grade; and for a loss, average grade minus 25. Note that, if your opponent's grade differs from yours by more than 50 points, it is taken to be exactly 50 points above (or below) yours. Average grade is half of the sum of your and your opponent's grade. At the end of the season an average of points-per-game is taken, and that is your new grade.

One should allow for zero and negative grades. The grades cloud be rounded to zero, one (or more) decimals.
Greg Breed wrote:i.e. translate all current ECF grades into National ELO by (ECF*8+600) and then use the new and improved ECF grading system but tweaked for the new number set - something like +/-400 for W/L then averaged.
That would be possible, although I think it might be simpler (and possibly better) to switch to AGS directly, assuming that present grades calculated with GS are accurate enough. Alternatively, one can recalculate the grades (using AGS) for a couple of last seasons (if the data is still available) and use newly calculated grades as starting grades when switching to AGS.

Please note that the formulas (presumably based on statistical analysis)...

The following conversion applies to national Elo ratings and for FIDE Elo ratings greater than 2327:
(Elo - 600) / 8 = ECF
ECF x 8 + 600 = Elo

For FIDE ratings lower than this, and for ECF Grading purposes only, a different formula is used:
(FIDE - 1250) / 5 = ECF
ECF x 5 + 1250 = FIDE


..are an estimate. As the number of players in the world having ECF grade is much smaller than the number of players having FIDE rating, and as most of the ECF players are more or less isolated from the rest of the world (do not play in FIDE events), the estimate may not be any better than what would one obtain, say, by recalculating the grades (using AGS) for a couple of last seasons.

Also, I am not sure if people would like to switch to FIDE rating range where top grandmasters have grade of 2760-ish rather than 270-ish.

Note: There will always be a difference between expected and real performance. There are many reasons for that and this is not due to (in my opinion wrong) assumption that one definition of grade is better than another and consequently due to assumption that one grading system is better than another (as long as both grading systems are correct).

For example, if I play only 2 games in a season my 'real' performance is not so real as one's who played say 150 games in a season, also, I am learning chess, hopefully improving, and I expect to play better in every game, this is I think particularly true for rapidly improving juniors, etc.

In my opinion these issues won't be resolved (actually there is nothing to resolve as these are the facts of life) if one say uses ÉGS instead of AGS (note that GS and CGS are incorrect and therefore they cannot be considered).
Last edited by Robert Jurjevic on Sun May 27, 2007 5:55 pm, edited 3 times in total.
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Post by Robert Jurjevic » Thu May 24, 2007 2:08 pm

Greg Breed wrote:
Greg Breed Brainiac wrote:Rule 2: For a win you score average grade plus 200; for a draw, average grade; and for a loss, average grade minus 200. Note that, if your opponent's grade differs from yours by more than 400 points, it is taken to be exactly 400 points above (or below) yours. Average grade is half of the sum of your and your opponent's grade. At the end of the season an average of points-per-game is taken, and that is your new grade.
Simple :D
Yes, that would work...

Then, Condition 1 would require... 'p = 100' if 'd>g' and 'p = 50*(1 + d/g)' if 'd<=g', where 'g = 400'... and the formulas would be...

Code: Select all

(*Amended Grading System*)
ClearAll[a, b, a2, b2, d, g, k, p, nw, nd, nl, nt];
a = 1200; b = 1000;
nw = 75; nd = 0; nl = 25;
nt = nw + nd + nl;
g = 400;
d = a - b;
k = 1/2;
If[d > g, p = 100, p = 50*(1 + d/g)];
a2 = a + k*((nw + nd/2)/nt*100 - p);
b2 = b + k*((nl + nd/2)/nt*100 - (100 - p));
Round[N[a2]]
Print[];
Round[N[b2]]
The comparison between the two AGS grades (one with 'g=50' and one with 'g=400'), assuming that the grade of 100 (AGS 'g=50') corresponds to the grade of 1400 (AGS 'g=400'), would be...

Code: Select all

--------
  AGS
    g
50   400
--------
-20  440
-10  520
  0  600
 10  680
 20  760
 30  840
 40  920
 50 1000
 60 1080
 70 1160
 80 1240
 90 1320
100 1400
110 1480
120 1560
130 1640
140 1720
150 1800
160 1880
170 1960
180 2040
190 2120
200 2200
210 2280
220 2360
230 2440
240 2520
250 2600
260 2680
270 2760
--------
Please note that converting current grades from AGS ('g=50') into AGS ('g=400') and starting with AGS ('g=400') would be equivalent to (regarding initial grade correction) keeping current grades and switching to AGS ('g=50').

In order to correct grades before switching to AGS one should, say, recalculate grades in a couple of last seasons using AGS ('g=50') and then use newly calculated grades as initial grades in the season switching to AGS (if the chosen AGS was AGS ('g=400') the initial AGS ('g=50') grades must be converted from AGS ('g=50') into AGS ('g=400') first).
Robert Jurjevic
Vafra

User avatar
Greg Breed
Posts: 723
Joined: Thu Apr 05, 2007 8:30 am
Location: Aylesbury, Bucks, UK

Post by Greg Breed » Fri May 25, 2007 9:12 am

Robert you wanted to check how your system would work with a real sample.
I took your suggestion of Rule 2:
Robert Jurjevic wrote:Rule 2: For a win you score average grade plus 25; for a draw, average grade; and for a loss, average grade minus 25. Note that, if your opponent's grade differs from yours by more than 50 points, it is taken to be exactly 50 points above (or below) yours. Average grade is half of the sum of your and your opponent's grade. At the end of the season an average of points-per-game is taken, and that is your new grade.
and applied it to my performance over this season. (My current grade is 106) I have already been keeping track of it in a spreadsheet using the current ECF method which gave me a performance this season of 124 (rounded down).
I've just re-done using your formula and i came out with a performance of 115 (rounded down). This is over 61 rated games.

Actually, to be more accurate I should say how many points I accrued.
Points
Current ECF system: 7623
Robert's system: 7075.5

divided by 61 games =

Grade

Current ECF system: 124.96
Robert's system: 115.98

I think that it would be fair to say that there will be a general decrease in the grades if Robert's system is applied. Maybe that's because at the moment our grades are over-inflated.

Does anyone else have an opinion?
Hatch End A Captain (Hillingdon League)
Controller (Hillingdon League)

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Post by Robert Jurjevic » Fri May 25, 2007 4:39 pm

Greg Breed wrote:I think that it would be fair to say that there will be a general decrease in the grades if Robert's system is applied. Maybe that's because at the moment our grades are over-inflated. Does anyone else have an opinion?
My grade for season 2006/2007 is 99. My new grade (in 2007/2008 season) based on my games (not all but most of the games I played so far in 2006/2007 season) calculated using GS, CGS, AGS, and ÉGS is as follows...

Code: Select all

------------------
 GS  CGS  AGS  ÉGS
 84   84  103   92
------------------
Detailed calculation follows...

GS...

Code: Select all

ECF_grades version 1.4
Copyright (c) M&R Research Team 2003-2007 
calculating a new GS ECF grade for chess player Jurjevic using information in all0607.pgn file... 
 
date: 2006.09.19 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
I won
summand: 148.000000 
sum: 148.000000 
 
date: 2006.10.16 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
I lost
summand: 48.000000 
sum: 196.000000 
 
date: 2006.10.17 
I was White
my grade: 99 
my opponent's grade: 125 
new opponent's grade: 125 
a draw
summand: 125.000000 
sum: 321.000000 
 
date: 2006.10.23 
I was White
my grade: 99 
my opponent's grade: 92 
new opponent's grade: 92 
I lost
summand: 42.000000 
sum: 363.000000 
 
date: 2006.10.24 
I was Black
my grade: 99 
my opponent's grade: 120 
new opponent's grade: 120 
a draw
summand: 120.000000 
sum: 483.000000 
 
date: 2006.10.26 
I was Black
my grade: 99 
my opponent's grade: 70 
new opponent's grade: 70 
I lost
summand: 20.000000 
sum: 503.000000 
 
date: 2006.10.28 
I was Black
my grade: 99 
my opponent's grade: 94 
new opponent's grade: 94 
a draw
summand: 94.000000 
sum: 597.000000 
 
date: 2006.11.28 
I was White
my grade: 99 
my opponent's grade: 114 
new opponent's grade: 114 
I won
summand: 164.000000 
sum: 761.000000 
 
date: 2007.01.09 
I was Black
my grade: 99 
my opponent's grade: 100 
new opponent's grade: 100 
I lost
summand: 50.000000 
sum: 811.000000 
 
date: 2007.01.16 
I was Black
my grade: 99 
my opponent's grade: 94 
new opponent's grade: 94 
a draw
summand: 94.000000 
sum: 905.000000 
 
date: 2007.02.20 
I was Black
my grade: 99 
my opponent's grade: 86 
new opponent's grade: 86 
I won
summand: 136.000000 
sum: 1041.000000 
 
date: 2007.03.07 
I was Black
my grade: 99 
my opponent's grade: 139 
new opponent's grade: 139 
I lost
summand: 89.000000 
sum: 1130.000000 
 
date: 2007.05.15 
I was Black
my grade: 99 
my opponent's grade: 50 
new opponent's grade: 59 
a draw
summand: 59.000000 
sum: 1189.000000 
 
date: 2007.03.03 
I was Black
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 99.000000 
sum: 1288.000000 
 
date: 2007.04.11 
I was White
my grade: 99 
my opponent's grade: 169 
new opponent's grade: 139 
I lost
summand: 89.000000 
sum: 1377.000000 
 
date: 2007.04.25 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
a draw
summand: 98.000000 
sum: 1475.000000 
 
date: 2007.05.14 
I was White
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 99.000000 
sum: 1574.000000 
 
date: 2007.05.12 
I was Black
my grade: 99 
my opponent's grade: 72 
new opponent's grade: 72 
a draw
summand: 72.000000 
sum: 1646.000000 
 
date: 2007.02.13 
I was Black
my grade: 99 
my opponent's grade: 90 
new opponent's grade: 90 
I lost
summand: 40.000000 
sum: 1686.000000 
 
date: 2007.03.06 
I was Black
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 99.000000 
sum: 1785.000000 
 
date: 2007.04.30 
I was Black
my grade: 99 
my opponent's grade: 87 
new opponent's grade: 87 
a draw
summand: 87.000000 
sum: 1872.000000 
 
date: 2007.04.06 
I was Black
my grade: 99 
my opponent's grade: 77 
new opponent's grade: 77 
I lost
summand: 27.000000 
sum: 1899.000000 
 
date: 2007.04.06 
I was White
my grade: 99 
my opponent's grade: 58 
new opponent's grade: 59 
I won
summand: 109.000000 
sum: 2008.000000 
 
date: 2007.04.06 
I was Black
my grade: 99 
my opponent's grade: 60 
new opponent's grade: 60 
I lost
summand: 10.000000 
sum: 2018.000000 
 
date: 2007.04.07 
I was White
my grade: 99 
my opponent's grade: 90 
new opponent's grade: 90 
a draw
summand: 90.000000 
sum: 2108.000000 
 
date: 2007.04.07 
I was Black
my grade: 99 
my opponent's grade: 45 
new opponent's grade: 59 
a draw
summand: 59.000000 
sum: 2167.000000 
 
date: 2007.04.07 
I was White
my grade: 99 
my opponent's grade: 51 
new opponent's grade: 59 
I won
summand: 109.000000 
sum: 2276.000000 
 
new GS ECF grade for chess player Jurjevic based on 27 games in all0607.pgn file is 84 
OK operation successful 

Fri May 25 16:24:11 2007
CGS...

Code: Select all

ECF_grades version 1.4
Copyright (c) M&R Research Team 2003-2007 
calculating a new CGS ECF grade for chess player Jurjevic using information in all0607.pgn file... 
 
date: 2006.09.19 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
I won
summand: 148.000000 
sum: 148.000000 
 
date: 2006.10.16 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
I lost
summand: 48.000000 
sum: 196.000000 
 
date: 2006.10.17 
I was White
my grade: 99 
my opponent's grade: 125 
new opponent's grade: 125 
a draw
summand: 125.000000 
sum: 321.000000 
 
date: 2006.10.23 
I was White
my grade: 99 
my opponent's grade: 92 
new opponent's grade: 92 
I lost
summand: 42.000000 
sum: 363.000000 
 
date: 2006.10.24 
I was Black
my grade: 99 
my opponent's grade: 120 
new opponent's grade: 120 
a draw
summand: 120.000000 
sum: 483.000000 
 
date: 2006.10.26 
I was Black
my grade: 99 
my opponent's grade: 70 
new opponent's grade: 70 
I lost
summand: 20.000000 
sum: 503.000000 
 
date: 2006.10.28 
I was Black
my grade: 99 
my opponent's grade: 94 
new opponent's grade: 94 
a draw
summand: 94.000000 
sum: 597.000000 
 
date: 2006.11.28 
I was White
my grade: 99 
my opponent's grade: 114 
new opponent's grade: 114 
I won
summand: 164.000000 
sum: 761.000000 
 
date: 2007.01.09 
I was Black
my grade: 99 
my opponent's grade: 100 
new opponent's grade: 100 
I lost
summand: 50.000000 
sum: 811.000000 
 
date: 2007.01.16 
I was Black
my grade: 99 
my opponent's grade: 94 
new opponent's grade: 94 
a draw
summand: 94.000000 
sum: 905.000000 
 
date: 2007.02.20 
I was Black
my grade: 99 
my opponent's grade: 86 
new opponent's grade: 86 
I won
summand: 136.000000 
sum: 1041.000000 
 
date: 2007.03.07 
I was Black
my grade: 99 
my opponent's grade: 139 
new opponent's grade: 139 
I lost
summand: 89.000000 
sum: 1130.000000 
 
date: 2007.05.15 
I was Black
my grade: 99 
my opponent's grade: 50 
new opponent's grade: 50 
a draw
summand: 50.000000 
sum: 1180.000000 
 
date: 2007.03.03 
I was Black
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 99.000000 
sum: 1279.000000 
 
date: 2007.04.11 
I was White
my grade: 99 
my opponent's grade: 169 
new opponent's grade: 149 
I lost
summand: 99.000000 
sum: 1378.000000 
 
date: 2007.04.25 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
a draw
summand: 98.000000 
sum: 1476.000000 
 
date: 2007.05.14 
I was White
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 99.000000 
sum: 1575.000000 
 
date: 2007.05.12 
I was Black
my grade: 99 
my opponent's grade: 72 
new opponent's grade: 72 
a draw
summand: 72.000000 
sum: 1647.000000 
 
date: 2007.02.13 
I was Black
my grade: 99 
my opponent's grade: 90 
new opponent's grade: 90 
I lost
summand: 40.000000 
sum: 1687.000000 
 
date: 2007.03.06 
I was Black
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 99.000000 
sum: 1786.000000 
 
date: 2007.04.30 
I was Black
my grade: 99 
my opponent's grade: 87 
new opponent's grade: 87 
a draw
summand: 87.000000 
sum: 1873.000000 
 
date: 2007.04.06 
I was Black
my grade: 99 
my opponent's grade: 77 
new opponent's grade: 77 
I lost
summand: 27.000000 
sum: 1900.000000 
 
date: 2007.04.06 
I was White
my grade: 99 
my opponent's grade: 58 
new opponent's grade: 58 
I won
summand: 108.000000 
sum: 2008.000000 
 
date: 2007.04.06 
I was Black
my grade: 99 
my opponent's grade: 60 
new opponent's grade: 60 
I lost
summand: 10.000000 
sum: 2018.000000 
 
date: 2007.04.07 
I was White
my grade: 99 
my opponent's grade: 90 
new opponent's grade: 90 
a draw
summand: 90.000000 
sum: 2108.000000 
 
date: 2007.04.07 
I was Black
my grade: 99 
my opponent's grade: 45 
new opponent's grade: 49 
a draw
summand: 49.000000 
sum: 2157.000000 
 
date: 2007.04.07 
I was White
my grade: 99 
my opponent's grade: 51 
new opponent's grade: 51 
I won
summand: 101.000000 
sum: 2258.000000 
 
new CGS ECF grade for chess player Jurjevic based on 27 games in all0607.pgn file is 84 
OK operation successful 

Fri May 25 16:25:20 2007
AGS...

Code: Select all

ECF_grades version 1.4
Copyright (c) M&R Research Team 2003-2007 
calculating a new AGS ECF grade for chess player Jurjevic using information in all0607.pgn file... 
 
date: 2006.09.19 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
I won
summand: 123.500000 
sum: 123.500000 
 
date: 2006.10.16 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
I lost
summand: 73.500000 
sum: 197.000000 
 
date: 2006.10.17 
I was White
my grade: 99 
my opponent's grade: 125 
new opponent's grade: 125 
a draw
summand: 137.000000 
sum: 334.000000 
 
date: 2006.10.23 
I was White
my grade: 99 
my opponent's grade: 92 
new opponent's grade: 92 
I lost
summand: 70.500000 
sum: 404.500000 
 
date: 2006.10.24 
I was Black
my grade: 99 
my opponent's grade: 120 
new opponent's grade: 120 
a draw
summand: 134.500000 
sum: 539.000000 
 
date: 2006.10.26 
I was Black
my grade: 99 
my opponent's grade: 70 
new opponent's grade: 70 
I lost
summand: 59.500000 
sum: 598.500000 
 
date: 2006.10.28 
I was Black
my grade: 99 
my opponent's grade: 94 
new opponent's grade: 94 
a draw
summand: 121.500000 
sum: 720.000000 
 
date: 2006.11.28 
I was White
my grade: 99 
my opponent's grade: 114 
new opponent's grade: 114 
I won
summand: 131.500000 
sum: 851.500000 
 
date: 2007.01.09 
I was Black
my grade: 99 
my opponent's grade: 100 
new opponent's grade: 100 
I lost
summand: 74.500000 
sum: 926.000000 
 
date: 2007.01.16 
I was Black
my grade: 99 
my opponent's grade: 94 
new opponent's grade: 94 
a draw
summand: 121.500000 
sum: 1047.500000 
 
date: 2007.02.20 
I was Black
my grade: 99 
my opponent's grade: 86 
new opponent's grade: 86 
I won
summand: 117.500000 
sum: 1165.000000 
 
date: 2007.03.07 
I was Black
my grade: 99 
my opponent's grade: 139 
new opponent's grade: 139 
I lost
summand: 94.000000 
sum: 1259.000000 
 
date: 2007.05.15 
I was Black
my grade: 99 
my opponent's grade: 50 
new opponent's grade: 50 
a draw
summand: 99.500000 
sum: 1358.500000 
 
date: 2007.03.03 
I was Black
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 124.000000 
sum: 1482.500000 
 
date: 2007.04.11 
I was White
my grade: 99 
my opponent's grade: 169 
new opponent's grade: 149 
I lost
summand: 99.000000 
sum: 1581.500000 
 
date: 2007.04.25 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
a draw
summand: 123.500000 
sum: 1705.000000 
 
date: 2007.05.14 
I was White
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 124.000000 
sum: 1829.000000 
 
date: 2007.05.12 
I was Black
my grade: 99 
my opponent's grade: 72 
new opponent's grade: 72 
a draw
summand: 110.500000 
sum: 1939.500000 
 
date: 2007.02.13 
I was Black
my grade: 99 
my opponent's grade: 90 
new opponent's grade: 90 
I lost
summand: 69.500000 
sum: 2009.000000 
 
date: 2007.03.06 
I was Black
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 124.000000 
sum: 2133.000000 
 
date: 2007.04.30 
I was Black
my grade: 99 
my opponent's grade: 87 
new opponent's grade: 87 
a draw
summand: 118.000000 
sum: 2251.000000 
 
date: 2007.04.06 
I was Black
my grade: 99 
my opponent's grade: 77 
new opponent's grade: 77 
I lost
summand: 63.000000 
sum: 2314.000000 
 
date: 2007.04.06 
I was White
my grade: 99 
my opponent's grade: 58 
new opponent's grade: 58 
I won
summand: 103.500000 
sum: 2417.500000 
 
date: 2007.04.06 
I was Black
my grade: 99 
my opponent's grade: 60 
new opponent's grade: 60 
I lost
summand: 54.500000 
sum: 2472.000000 
 
date: 2007.04.07 
I was White
my grade: 99 
my opponent's grade: 90 
new opponent's grade: 90 
a draw
summand: 119.500000 
sum: 2591.500000 
 
date: 2007.04.07 
I was Black
my grade: 99 
my opponent's grade: 45 
new opponent's grade: 49 
a draw
summand: 99.000000 
sum: 2690.500000 
 
date: 2007.04.07 
I was White
my grade: 99 
my opponent's grade: 51 
new opponent's grade: 51 
I won
summand: 100.000000 
sum: 2790.500000 
 
new AGS ECF grade for chess player Jurjevic based on 27 games in all0607.pgn file is 103 
OK operation successful 

Fri May 25 16:26:06 2007
ÉGS...

Code: Select all

ECF_grades version 1.4
Copyright (c) M&R Research Team 2003-2007 
calculating a new EGS ECF grade for chess player Jurjevic using information in all0607.pgn file... 
 
date: 2006.09.19 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
I won
summand: 123.450782 
sum: 123.450782 
 
date: 2006.10.16 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
I lost
summand: 73.450782 
sum: 196.901564 
 
date: 2006.10.17 
I was White
my grade: 99 
my opponent's grade: 125 
new opponent's grade: 125 
a draw
summand: 111.907438 
sum: 308.809002 
 
date: 2006.10.23 
I was White
my grade: 99 
my opponent's grade: 92 
new opponent's grade: 92 
I lost
summand: 70.184893 
sum: 378.993896 
 
date: 2006.10.24 
I was Black
my grade: 99 
my opponent's grade: 120 
new opponent's grade: 120 
a draw
summand: 109.780963 
sum: 488.774859 
 
date: 2006.10.26 
I was Black
my grade: 99 
my opponent's grade: 70 
new opponent's grade: 70 
I lost
summand: 59.925332 
sum: 548.700191 
 
date: 2006.10.28 
I was Black
my grade: 99 
my opponent's grade: 94 
new opponent's grade: 94 
a draw
summand: 96.264466 
sum: 644.964657 
 
date: 2006.11.28 
I was White
my grade: 99 
my opponent's grade: 114 
new opponent's grade: 114 
I won
summand: 131.953666 
sum: 776.918323 
 
date: 2007.01.09 
I was Black
my grade: 99 
my opponent's grade: 100 
new opponent's grade: 100 
I lost
summand: 74.549218 
sum: 851.467541 
 
date: 2007.01.16 
I was Black
my grade: 99 
my opponent's grade: 94 
new opponent's grade: 94 
a draw
summand: 96.264466 
sum: 947.732007 
 
date: 2007.02.20 
I was Black
my grade: 99 
my opponent's grade: 86 
new opponent's grade: 86 
I won
summand: 117.047095 
sum: 1064.779102 
 
date: 2007.03.07 
I was Black
my grade: 99 
my opponent's grade: 139 
new opponent's grade: 139 
I lost
summand: 91.646568 
sum: 1156.425670 
 
date: 2007.05.15 
I was Black
my grade: 99 
my opponent's grade: 50 
new opponent's grade: 50 
a draw
summand: 79.201255 
sum: 1235.626925 
 
date: 2007.03.03 
I was Black
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 99.000000 
sum: 1334.626925 
 
date: 2007.04.11 
I was White
my grade: 99 
my opponent's grade: 169 
new opponent's grade: 169 
I lost
summand: 96.794833 
sum: 1431.421758 
 
date: 2007.04.25 
I was White
my grade: 99 
my opponent's grade: 98 
new opponent's grade: 98 
a draw
summand: 98.450782 
sum: 1529.872541 
 
date: 2007.05.14 
I was White
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 99.000000 
sum: 1628.872541 
 
date: 2007.05.12 
I was Black
my grade: 99 
my opponent's grade: 72 
new opponent's grade: 72 
a draw
summand: 85.694263 
sum: 1714.566804 
 
date: 2007.02.13 
I was Black
my grade: 99 
my opponent's grade: 90 
new opponent's grade: 90 
I lost
summand: 69.119694 
sum: 1783.686498 
 
date: 2007.03.06 
I was Black
my grade: 99 
my opponent's grade: 99 
new opponent's grade: 99 
a draw
summand: 99.000000 
sum: 1882.686498 
 
date: 2007.04.30 
I was Black
my grade: 99 
my opponent's grade: 87 
new opponent's grade: 87 
a draw
summand: 92.556946 
sum: 1975.243445 
 
date: 2007.04.06 
I was Black
my grade: 99 
my opponent's grade: 77 
new opponent's grade: 77 
I lost
summand: 62.776151 
sum: 2038.019596 
 
date: 2007.04.06 
I was White
my grade: 99 
my opponent's grade: 58 
new opponent's grade: 58 
I won
summand: 106.082066 
sum: 2144.101661 
 
date: 2007.04.06 
I was Black
my grade: 99 
my opponent's grade: 60 
new opponent's grade: 60 
I lost
summand: 56.633347 
sum: 2200.735008 
 
date: 2007.04.07 
I was White
my grade: 99 
my opponent's grade: 90 
new opponent's grade: 90 
a draw
summand: 94.119694 
sum: 2294.854703 
 
date: 2007.04.07 
I was Black
my grade: 99 
my opponent's grade: 45 
new opponent's grade: 45 
a draw
summand: 78.262735 
sum: 2373.117437 
 
date: 2007.04.07 
I was White
my grade: 99 
my opponent's grade: 51 
new opponent's grade: 51 
I won
summand: 104.409638 
sum: 2477.527075 
 
new EGS ECF grade for chess player Jurjevic based on 27 games in all0607.pgn file is 92 
OK operation successful 

Fri May 25 16:26:58 2007
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Post by Robert Jurjevic » Sat May 26, 2007 1:05 am

AGS and ÉGS formulas...

AGS and ÉGS formulas with no limitation 'a>=b' are (i..e., they work for both 'a>=b' and 'a<b')...

AGS...

Code: Select all

(*Amended Grading System*)
ClearAll[a, b, a2, b2, d, g, k, p, nw, nd, nl, nt];
a = 130; b = 100;
nw = 8; nd = 0; nl = 2;
nt = nw + nd + nl;
g = 50;
d = a - b;
k = 1/2;
If[d >= 0, If[d > g, p = 100, p = 50*(1 + d/g)], 
    If[d < -50, p = 0, p = 50*(1 + d/g)]];
a2 = a + k*((nw + nd/2)/nt*100 - p);
b2 = b + k*((nl + nd/2)/nt*100 - (100 - p));
Round[N[a2]]
Print[];
Round[N[b2]]
ÉGS...

Code: Select all

(*Elo Grading System*)
ClearAll[a, b, a2, b2, d, g, k, p, nw, nd, nl, nt];
a = 130; b = 100;
nw = 8; nd = 0; nl = 2;
nt = nw + nd + nl;
d = a - b;
g = (25*Log[10])/Log[3];
k = 1/2;
p = 100/(1 + 10^(-d/g));
a2 = a + k*((nw + nd/2)/nt*100 - p);
b2 = b + k*((nl + nd/2)/nt*100 - (100 - p));
Round[N[a2]]
Print[];
Round[N[b2]]
Grade definition in AGS and ÉGS...

The following graph...

Image

...shows a relationship between expected performance 'p' (vertical axes) and grade difference 'd=a-b' (horizontal axes) as defined in AGS (blue line) and in ÉGS (red line). The grades are similarly defined, and, roughly, the difference exist, for expected performance between 80 and 100 and grade differences between 30 and 130, and for expected performance between 0 and 20 and grade differences between -30 and -130.

Statistical experiments, do they make any sense...

I understand that some people are trying to find 'flaws' in correct grading systems by performing statistical experiments with models of real players and real games, or by analysing how real players perform in real games.

In my opinion this is impossible!

Let me elaborate...

I have found that the only difference between AGS and ÉGS is in how they define grade (both, AGS and ÉGS, are correct).

Proving that either ÉGS or AGS is 'better' grading system would be, in my opinion, equivalent to 'proving' that one definition of grade is better than another. I think, one can say I like ÉGS definition more than AGS (say, as the graph does not have sharp edges), but, I think, there is no way to 'prove' that one definition is 'better' than another.

I think that one should not say that players should perform in real life as expected (according to the grading system and its grade definition), nor, if they don't, that this is because grade was 'wrongly' defined.

If you choose ÉGS rather than AGS, in my opinion, players won't start to perform as expected or any other way (they are the same players).

Besides, why would one expect that players should perform as expected? For example, if I play only few games in a season is my performance 'real', maybe it is statistically insignificant, if you have a rapidly improving junior should he play at the end of the season as expected from his grade, chess abilities of the players are constantly changing, even grades of some players may have been calculated based on too few games, maybe stronger players are playing only stronger players, etc.

If all players would always perform as expected (assuming that their performance and grades are statistically significant) that would mean that their relative chess abilities do not change (simple as that).

If one had performed experiments using computer models, all he or she could have done is either to assume that all players would perform as expected or model some deviations. If the players did not perform as expected, that was because one wanted to be that way, and, in my opinion, this has nothing to do with assessment of a 'goodness' of a correct grading system.

I think that saying that ÉGS is 'better' than AGS is like saying that red is 'better' than blue (and that you can prove that using some real experiments with light spectrums or some computer experiments modelling light spectrums).

Refuting a correct grading system...

I think, eventually, one will find a better grading system. In my opinion a grading system which would be better than AGS and ÉGS has to be 'more correct' than AGS and ÉGS, i.e. it has to satisfy some additional (clever) Condition which AGS and ÉGS do not (AGS and ÉGS satisfy Condition 0, Condition 1 or equivalent and Condition 2). That additional condition, say Condition 3, would raise the 'correctness' of the system to a higher level and a grading system which would satisfy Condition 0, Condition 1 or equivalent, Condition 2 and Condition 3, would be better than AGS and ÉGS.

Say, if one conjectures that Condition 3 should require that... if 'p=100' or 'p=0' then 'Abs[d=a-b] = Infinity', then ÉGS would be better than AGS. Does this condition deserve to be given a status of Condition 3 is another matter.
Robert Jurjevic
Vafra