GRADING ANOMALIES

General discussions about ratings.
J T Melsom
Posts: 1294
Joined: Wed Aug 12, 2009 11:12 pm

Re: GRADING ANOMALIES

Post by J T Melsom » Mon Dec 21, 2009 9:24 pm

John

There is scope to move towards automatic inputting of results as arises in the Chiltern League and in neighbouring counties, and it would save myself and the grader from re-formatting stuff, but that would not necessarily improve the receipt of results from those who don't like using the system, and fail to appreciate the extra work they can cause by not reporting. My sanctions for non reporting seem to have dropped out of the league rules. [ I did notice in passing a Berkshire match scheduled for Dec 25. Any other leagues where the players show such commitment to the game?]

Ian

You are spot on about the work grader's do to identify and diffferentiate between new or returning players. One club recently advised me of a new player by first name, the opposition provided a surname, but it was only after my game against him subsequently that we were able to establish these details were correct and find his USCF rating. Some clubs just don't help their administrators at all, even after the annual reminders.

Sean Hewitt

Re: GRADING ANOMALIES

Post by Sean Hewitt » Tue Dec 22, 2009 7:46 am

We went through similar pain in Leicester.

Firstly we brought in online result reporting but, due to opposition from some clubs, it was merely an option and we had to retain the 21 day (!) deadline for submission.

Gradually though, teams and players saw the benefits of quicker online result updates and so more teams submitted results online, and more quickly.

Now, we have changed the rules so that online result submission is the only form of reporting and the time limit is 7 days. Most actually do it same or next day.

So the carrot is up to date results and tables on the website. The stick is penalties for late submission of results (deducting match points from teams league total). It's sufficient enough that we rarely have to apply it.

An interesting by-product is that it tended to be older stuck in the muds who objected to all this on-line malarky at the outset. However, when these clubs were next getting their results updated in a timely fashion the younger players appear to have stepped forward and volunteered to take up positions - a direct result of our move to online technology I believe.

Alex Holowczak
Posts: 9085
Joined: Sat May 30, 2009 5:18 pm
Location: Oldbury, Worcestershire
Contact:

Re: GRADING ANOMALIES

Post by Alex Holowczak » Tue Dec 22, 2009 10:02 am

Sean Hewitt wrote: Firstly we brought in online result reporting but, due to opposition from some clubs, it was merely an option and we had to retain the 21 day (!) deadline for submission.
And then we wonder why newspapers don't publish the results anymore!

J T Melsom
Posts: 1294
Joined: Wed Aug 12, 2009 11:12 pm

Re: GRADING ANOMALIES

Post by J T Melsom » Tue Dec 22, 2009 10:25 am

Alex

The decision of newspapers to publish chess results will depend on space in the paper, the reliability of copy, and the timeliness of the supply, and a few others. This is indeed one of the reasons why I would like to move to more immediate reporting of results, because the sooner I get one stuff the more can be done with it. At present I aim to post some observations on the latest round of fixtures on the website at regular intervals, but opening discussions with local press may follow.

However, it is worth noting that the last ten to twenty years has seem many local papers cut down on reporting of a number of minority sports. Table tennis and netball, snooker and darts all used to get more coverage, and you no longer get as many cricket results from local clubs in the summer. This does place the onus on the chess organisation to ensure the material is printworthy.

Alex Holowczak
Posts: 9085
Joined: Sat May 30, 2009 5:18 pm
Location: Oldbury, Worcestershire
Contact:

Re: GRADING ANOMALIES

Post by Alex Holowczak » Tue Dec 22, 2009 11:11 am

J T Melsom wrote:Table tennis and netball, snooker and darts all used to get more coverage,
It's true that minority sports get less coverage now than before. Snooker has a league or two around here, and the results of that get published in a weekly column in the Express & Star. Indeed, there's often a whole page column on it once per week. If they can manage a turnover of a results in maybe 3 days, why can't chess? (Whether the results had to go in a newspaper column or not.)

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Re: GRADING ANOMALIES

Post by Robert Jurjevic » Tue Dec 22, 2009 12:43 pm

Hello Roger and others,

Could FGSl be a new ECF grading method?

May I ask what do you think about FGSl (FIDE-like Grading Sytem with live internal grades) defined below in "FIDE-like grading system with internal live grade calculation and FGSl..." section?

In order to illustrate the calculation method the lighthouse keepers and some other examples follow.

A two lighthouse keepers one graded 100 and other 120 drew 30 games in the grading period (which may be assumed to be a season), their live grades would be calculated as follows (let us denote a 100 player with 'i=1' and a 120 player with 'i=2'):

'n0 = 30'
'n1 = n2 = 30;
'k12 = k21 = 1', 'k11 = k22 = 0'
'q12 = q21 = 50', 'q11 = q22 = 0'
'p12 = 100/(1 + 10^(-(g1 - g2)/50))',
'p21 = 100/(1 + 10^(-(g2 - g1)/50))'

before game 1 (live grades before the 1st game are the grades published for the previous grading period which may be assumed to be a season):
'g1 = 100'
'g2 = 120'

after game 1:
'g1 = g1 + (k12*(q12 - p12))/n0 = 100.72'
'g2 = g2 + (k21*(q21 - p21))/n0 = 119.28'

after game 2:
'g1 = g1 + (k12*(q12 - p12))/n0 = 101.39'
'g2 = g2 + (k21*(q21 - p21))/n0 = 118.61'

after game 3:
'g1 = g1 + (k12*(q12 - p12))/n0 = 102.02'
'g2 = g2 + (k21*(q21 - p21))/n0 = 117.98'

after game 4:
'g1 = g1 + (k12*(q12 - p12))/n0 = 102.60'
'g2 = g2 + (k21*(q21 - p21))/n0 = 117.40'

after game 5:
'g1 = g1 + (k12*(q12 - p12))/n0 = 103.15'
'g2 = g2 + (k21*(q21 - p21))/n0 = 116.85'

after game 6:
'g1 = g1 + (k12*(q12 - p12))/n0 = 103.66'
'g2 = g2 + (k21*(q21 - p21))/n0 = 116.34'

after game 7:
'g1 = g1 + (k12*(q12 - p12))/n0 = 104.13'
'g2 = g2 + (k21*(q21 - p21))/n0 = 115.87'

after game 8:
'g1 = g1 + (k12*(q12 - p12))/n0 = 104.57'
'g2 = g2 + (k21*(q21 - p21))/n0 = 115.43'

after game 9:
'g1 = g1 + (k12*(q12 - p12))/n0 = 104.98'
'g2 = g2 + (k21*(q21 - p21))/n0 = 115.02'

after game 10:
'g1 = g1 + (k12*(q12 - p12))/n0 = 105.36'
'g2 = g2 + (k21*(q21 - p21))/n0 = 114.64'

after game 11:
'g1 = g1 + (k12*(q12 - p12))/n0 = 105.71'
'g2 = g2 + (k21*(q21 - p21))/n0 = 114.29'

after game 12:
'g1 = g1 + (k12*(q12 - p12))/n0 = 106.04'
'g2 = g2 + (k21*(q21 - p21))/n0 = 113.96'

after game 13:
'g1 = g1 + (k12*(q12 - p12))/n0 = 106.34'
'g2 = g2 + (k21*(q21 - p21))/n0 = 113.66'

after game 14:
'g1 = g1 + (k12*(q12 - p12))/n0 = 106.61'
'g2 = g2 + (k21*(q21 - p21))/n0 = 113.39'

after game 15:
'g1 = g1 + (k12*(q12 - p12))/n0 = 106.87'
'g2 = g2 + (k21*(q21 - p21))/n0 = 113.13'

after game 16:
'g1 = g1 + (k12*(q12 - p12))/n0 = 107.11'
'g2 = g2 + (k21*(q21 - p21))/n0 = 112.89'

after game 17:
'g1 = g1 + (k12*(q12 - p12))/n0 = 107.33'
'g2 = g2 + (k21*(q21 - p21))/n0 = 112.67'

after game 18:
'g1 = g1 + (k12*(q12 - p12))/n0 = 107.54'
'g2 = g2 + (k21*(q21 - p21))/n0 = 112.46'

after game 19:
'g1 = g1 + (k12*(q12 - p12))/n0 = 107.72'
'g2 = g2 + (k21*(q21 - p21))/n0 = 112.28'

after game 20:
'g1 = g1 + (k12*(q12 - p12))/n0 = 107.90'
'g2 = g2 + (k21*(q21 - p21))/n0 = 112.10'

after game 21:
'g1 = g1 + (k12*(q12 - p12))/n0 = 108.06'
'g2 = g2 + (k21*(q21 - p21))/n0 = 111.94'

after game 22:
'g1 = g1 + (k12*(q12 - p12))/n0 = 108.21'
'g2 = g2 + (k21*(q21 - p21))/n0 = 111.79'

after game 23:
'g1 = g1 + (k12*(q12 - p12))/n0 = 108.34'
'g2 = g2 + (k21*(q21 - p21))/n0 = 111.66'

after game 24:
'g1 = g1 + (k12*(q12 - p12))/n0 = 108.47'
'g2 = g2 + (k21*(q21 - p21))/n0 = 111.53'

after game 25:
'g1 = g1 + (k12*(q12 - p12))/n0 = 108.59'
'g2 = g2 + (k21*(q21 - p21))/n0 = 111.41'

after game 26:
'g1 = g1 + (k12*(q12 - p12))/n0 = 108.70'
'g2 = g2 + (k21*(q21 - p21))/n0 = 111.30'

after game 27:
'g1 = g1 + (k12*(q12 - p12))/n0 = 108.80'
'g2 = g2 + (k21*(q21 - p21))/n0 = 111.20'

after game 28:
'g1 = g1 + (k12*(q12 - p12))/n0 = 108.89'
'g2 = g2 + (k21*(q21 - p21))/n0 = 111.11'

after game 29:
'g1 = g1 + (k12*(q12 - p12))/n0 = 108.97'
'g2 = g2 + (k21*(q21 - p21))/n0 = 111.02'

after game 30:
'g1 = g1 + (k12*(q12 - p12))/n0 = 109.05'
'g2 = g2 + (k21*(q21 - p21))/n0 = 110.95'

The players' new published grades after the grading period (which may be assumed to be a season) would be their live grades after game 30:
'g1 = 109.05'
'g2 = 110.95'

If the grading period was chosen so that in it the players finished their 1st 15 games, then the players' new published grades after that grading period would be their live grades after game 15:
'g1 = 106.87'
'g2 = 113.13'

etc.

Note that if the 100 player played all different opponents with live grade of 120 instead, his new grade after 30 games would be:
'g1 = 113.61'
(note that it would' be 'g1 = 120.00', as we update the 100 player's grade after every draw and he is getting less and less grading points for every subsequent draw)
(note that it would be 'g1 = 119.99' if the number of games was approximately 200)
(note that it would be 'g1 = 112.77' if we used ECF linear approximation 'f(dij=gi-gj)=50*(1 + (gi-gj)/50)' if '|dij=gi-gj|<=40' and 'f(dij=gi-gj)=90' if 'dij=gi-gj>40' and 'f(dij=gi-gj)=10' if 'dij=gi-gj<-40', rather than FIDE logistic equivalent 'f(dij=gi-gj)=100/(1 + 10^(-(gi-gj)/50))')

I know that Roger is interested in the example where a 140 player scores 50% against a 160 opposition, so say if a 140 player scored 50% against a 160 opposition (i.e. the opposition's live grades were 160 on average) the 140 player would need approximately 95 games to get a grade of 159.5.
(note that if the 140 player played 250 opposition instead, he or she would need approximately 162 games to get a grade of 249.5.)

Let us consider a fast improving junior with a grade of a 100 who scored 75% against a 140 opposition (i.e. the opposition's live grades were 140 on average) in 80 games he played in the grading period, the junior's new FGSl grade for the grading period would be approximately 159.53.
(note that junior's new grade according to the current grading system would be 165 regardless of the number games he played in the grading period)
(note that if the junior played 200 games in the grading period his new FGSl grade would be approximately 163.73)

Notes on FGSl

!
In order to implement FGSl or equivalent FIDE-like grading system with internal live grade calculation one would need to enforce keeping records of date and time when the games finished (so that the games can be sorted in chronological order before grading for a desired grading period).

If you calculate using live grades internally the frequency of grade publication does not affect the grade calculation, though it may affect the things such as determination who is eligible to play in certain events, etc.

If ECF would adopt FGSl or equivalent FIDE-like grading system with internal live grade calculation, the choice of the length of the grading period (be it a season, six or two months, etc.) is more of an administrative issue (i.e. you collect the game results and run the program to calculate new grades for the new grading period instead of once a year every six or two months, etc.).

'n0' can be fine-tuned, instead of setting 'n0=30' one may find another constant to fit better the ECF situation.

'k' factors of graded players in the games where they play ungraded players may be set to zero when 'k' factors of ungraded players in the games where they play graded players would be set to 2 (we honour 'kij+kji=k=2' if 'i'th and 'j'th players played at least one game in the grading period).

Improving on FGSl

One may consider using variable 'k' factors, honouring 'kij+kji=k=2' if 'i'th and 'j'th players played at least one game in the grading period, to account for the players' grade trust estimated on the basis of the frequency of play (this would be sort of Glicko 1 idea).

Say, if a player played only a few games in the last grading period his grade should not be equally trusted as the grade of a player who played a lot of games in the last grading period, and the grade of the player whose grade is less trusted should be corrected more (his 'k' factor is greater than 1) than the grade of the player whose grade is more trusted (his 'k' factor is less than 1), honouring 'kij+kji=k=2'.

FIDE-like grading system with internal live grade calculation and FGSl...

Restrictions:
(none)

Definitions:
  • 'n0' is some constant number of games, 'n0>0', note that if 'kij=kji=1' if 'i'th and 'j'th players played at least one game in the grading period and 'kij=kji=0' if 'i=j' or if 'i'th and 'j'th player did not play each other in the grading period, then 'n0' is a number of games after which a player's new grade would equal his or her actual performance
  • 'm' is the total number of players in the system, 'm>1'
  • 'n_' is game number vector where 'ni' is a number of games 'i'th player played in the grading period
  • 'g_' is a grade vector where 'gi' is a live grade of 'i'th player, a live grade for the first game in the grading period for a player is a player's grade from the previous grading period
  • 'g2_' is a new grade vector where 'g2i' is a new grade for the grading period of 'i'th player (player's new grade for the grading period is calculated using the player's game results in the grading period and the player's and the player's opposition live grades)
  • 'Q_' 'is 'mxm' matrix where 'qij' is actual performance of 'i'th player in the game against 'j'th player ('qij' is 0 if 'i'th player lost, 50 if 'i'th player drew and 100 if 'i'th player won, note that 'qij' and 'qji' are related, i.e., if 'qij=0' then 'qji=100','if 'qij=50' then 'qji=50', if 'qij=100' then 'qji=0', etc.), note that 'qij+qji=100' if 'i'th and 'j'th players played at least one game in the grading period, note that 'qij = qji = 0' if 'i=j' or if 'i'th and 'j'th player did not play each other in the grading period, 'Q_' is actual performance matrix
  • 'P_' is 'mxm' matrix where 'pij=f(gi-gj)' is expected performance of 'i'th player in the game against 'j'th player, note that ECF is using 'f(dij=gi-gj)=50*(1 + (gi-gj)/50)' if '|dij=gi-gj|<=40' and 'f(dij=gi-gj)=90' if 'dij=gi-gj>40' and 'f(dij=gi-gj)=10' if 'dij=gi-gj<-40', FIDE logistic equivalent is 'f(dij=gi-gj)=100/(1 + 10^(-(gi-gj)/50))', note that if 'gi=gj' then 'f(gi-gj)=50', note that 'pij+pji=100', 'P_' is expected performance matrix
  • 'K_' is 'mxm' matrix where 'kij' are 'k' factors used in calculation of new grades for the grading period, 'kij=kji=0' if 'i=j' or if 'i'th and 'j'th player did not play each other in the grading period, 'kij+kji=k', where 'k>0' is a constant, if 'i'th and 'j'th players played at least one game in the grading period, 'K_' is 'k' factors matrix
!
(note that if players 'i' and 'j' played more than one game in the grading period than for each game there will be a separate set of 'gi', 'qij' and 'qji' and 'pij' and 'pji', in theory 'kij' and 'kji' should not change within one grading period, i.e., an ungraded player may be regarded as an ungraded player for the whole of grading period, live grades may be equally trusted throughout the grading period, etc.)

New grades for the grading period are calculated as follows (this is a generic formula covering a number of grading systems including the current ECF one, what makes the grading systems different is a choice of matrices 'K_' and 'P_')

Code: Select all

g2i = gi + Sum_j ((kij*(qij - pij))/n0)
where for FGSl (FIDE-like Grading System with internal live grades) 'kij=kji=1' if 'i'th and 'j'th players played at least one game in the grading period, 'kij=kji=0' if 'i=j' or if 'i'th and 'j'th player did not play each other in the grading period, 'pij=f(dij=gi-gj)' where 'f(dij=gi-gj)=100/(1 + 10^(-(gi-gj)/50))' and 'n0=30'.

Sense of ECF grade scale...

If a 120 player plays a 100 player then the expected performance of the 120 player is approximately '50+(120-100)=70%', in FIDE case it would be approximately '50+(1610-1450)/8=70%' (i.e., ECF has no scaling factor of 8 ).
:)

Merry Christmas and a Happy New Year

!
I wish you all Merry Christmas and a Happy New Year!

Kind regards,
Robert Jurjevic
Vafra

matt_ward
Posts: 418
Joined: Thu Dec 03, 2009 2:20 pm

Re: GRADING ANOMALIES

Post by matt_ward » Sat Apr 03, 2010 10:55 am

It does not seem a bad Idea although I think it's best to not to change the ECF grading around anymore as it's going to take a few years to get back to normal, due to inflation.

Matt.

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Re: GRADING ANOMALIES

Post by Robert Jurjevic » Wed Jul 28, 2010 7:21 pm

Hello all,

GS and AGS3 grades for 2009 and 2010...

Please note that I have estimated AGS3 grades for 2009 and 2010 using GS grades for 2009 published in gradeslive.csv and GS grades for 2010 published in grades2010v2.csv. (I assumed that 2008 EGS3 grades are equal to 2008 GS grades and that number of games on which 2008 GS and AGS3 grades are based is 0.)

The G30 rule below has been taken into account in the calculation.

Rule G30: The Grade is calculated by dividing the total number of points scored by the number of games played. If there are at least 30 games in the current period, then the Grade is based on these games alone. If there are not, results are brought forward from the previous period to make the total up to exactly thirty. If there are not 30 games in the two seasons together, results are taken from the season before that. Games are never taken from further back than this; the maximum is two prior grading periods.

The results can be found in ags3grade10.zip file which can be downloaded from... http://www.jurjevic.org.uk/chess/grade/. The Zip file contains ags3grade09.txt (textual tab delimited), ags3grade09.xls (Excel spreadsheet document), gradeslive.csv (Excel spreadsheet document, official 2009 grades), gs3grade10.txt (textual tab delimited), ags3grade10.xls (Excel spreadsheet document) and grades2010v2.csv (Excel spreadsheet document, official 2010 grades, version 1).

The calculation was performed by a Windows .net console (command line) application written in C# programming language interfacing with Oracle Database 10g Express Edition database. Histograms are obtained with Mathematica 7 (I saved relevant data in a database, I have no game results as they are not published, but as GS's and AGS3's formulae are identical and the only difference is in the value of 'k' factors, one can estimate AGS3 grades using GS grades and no actual game results, the C# program was written by me, Oracle Database 10g Express Edition database is free, but it has limitations imposed, some or all of which are lifted in commercial versions).

Image
Figure 1: Histogram for 2009 GS grades (consisting of 10153 grades in bins [0,1), [1,2), etc., of 1 grading point width, where minimum grade is 0.00, median grade is 133.00, maximum grade is 281.00, mean grade is 133.69 and standard deviation is 37.05).

Image
Figure 2: Histogram for 2009 AGS3 grades (consisting of 10153 grades in bins [0,1), [1,2), etc., of 1 grading point width, where minimum grade is 0.00, median grade is 133.50, maximum grade is 281.00, mean grade is 133.55 and standard deviation is 36.73).

Image
Figure 3: Histogram for 2010 GS grades (consisting of 10073 grades in bins [0,1), [1,2), etc., of 1 grading point width, where minimum grade is 2.00, median grade is 133.00, maximum grade is 276.00, mean grade is 133.33 and standard deviation is 37.46).

Image
Figure 4: Histogram for 2010 AGS3 grades (consisting of 10073 grades in bins [0,1), [1,2), etc., of 1 grading point width, where minimum grade is 1.00, median grade is 132.88, maximum grade is 276.00, mean grade is 133.09 and standard deviation is 36.99).

(I can't see any obvious behavioural histogram-related difference between GS and AGS3 grades. If anybody see anything could he or she please share his or her opinion with us. Thanks.)

Grading anomalies...

My point in a nutshell is that the 'k' factors (wrongly chosen) in the current grading system are causing the grade stretching and that the amount of stretch (due to the 'k' factors) is larger (in fact it is equal to '|p - q|') than grade fluctuations caused by other anomalies which may be corrected by using FIDE logistic relation for 'p = f(d)', Glickman idea on changing less trusted grades (based on frequency of play) faster than more trusted grades, or even a solution to the "junior problem".

In my opinion three candidates for replacing GS are AGS3, ÉGS5 and ÉGS6, with ÉGS6 as the best and ÉGS5 as the second best.

AGS3 is GS with the 'k' factors equal to '1/2' (the ECF's linear approximation is used for 'p = f(d)'; does not stretch the grades). (The only difference between GS and AGS3 is in 'k' factors in their formulae which are '1/2' in AGS3 and '1' in GS.)

ÉGS5 is sort of ECF equivalent of FIDE's Élo (logistic curve is used for 'p = f(d)', this is regarded as more accurate than the ECF's linear approximation; the grades are ECF grades, not FIDE ratings, i.e., a strong grandmaster is about 270 not 2800; grading is done every season rather than after every tournament; does not stretch the grades).

ÉGS6 has similar improvement (taken in a simple from) over ÉGS5 as Glicko has over FIDE's Élo which accounts for a grade trust (or establishment) based on frequency of play (i.e., less trusted or established grades change more rapidly than more trusted or established grades, consequently, in extreme case, ungraded players do not affect the grades of graded players; uses logistic curve for 'p = f(d)'; does not stretch the grades).

More details (including formulae, logical argument, etc.) can be found at:

http://www.jurjevic.org.uk/chess/grade/ ... malies.htm

Am I sure...

Of course not, it is only my belief.

I published AGS3 grades in case somebody would like to examine the difference in behaviour between GS (official) and AGS3 (in my opinion the simplest 'improvement' over official grading system) grades.

Maybe my definition of 'grade stretching' is not what the problem was (and why new GS grades were introduced), but at least it is a try.

Kind regards,
Last edited by Robert Jurjevic on Thu Nov 18, 2010 6:12 pm, edited 8 times in total.
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Re: GRADING ANOMALIES

Post by Robert Jurjevic » Thu Jul 29, 2010 12:29 pm

Hello all,

Peter Lalic vs Robert Jurjevic though experiment or on a possible mechanism of grade stretching...

Peter Lalic is a chess colleague and a friend of mine whose GS grade was changed from 158 in 2008 to 185 in 2010. According to AGS3 Peter's 2010 grade should have been 177 (his AGS3 grade in 2008 was taken to be the same as GS grade, i.e. 158). Robert Jurjevic (that is me) is a weaker chess player whose grade remained pretty much constant during that period at approximately 120 (both Robert's GS and AGS3 grades were 122 in 2008 and are 121 in 2010).

Let us assume that Peter and Robert decided to play a match (of sufficiently enough games) in 2011. If both players perform as expected (as their grades suggest) then according to GS Peter should score 95.1% and according to AGS3 93.0% in the match (calculated assumed logistic relationship between expected performance and grade difference, note that the grade difference is greater than 30, so one should use more accurate relationship between expected performance and grade difference). It seems that both 95.1% and 93.0% cannot be true at the same time. Which of the estimates of Peter's expected performance is more accurate?

Let us assume that Peter scored 93.0% in the match (exactly what was expected by AGS3). According to GS grades Peter under-performed in the match. This can be due to a fact that during the match Robert became relatively stronger to Peter. Another explanation could be that both Peter and Robert indeed did perform as expected (i.e. Robert's relative strength to Peter hasn't changed) but that grades were not correctly calculated. If that was the case one may have corrected Robert's grade to be 129 (it may be better to leave the grade of the stronger player unchanged) in order to match Peter's performance of 93.0%.

Let us assume now that somehow an analysis has been made which shows that all stronger players in general under-perform against weaker players. Then one may assume that this is not due to the fact that all weaker players improved in respect to stronger players (what is statistically very unlikely) but rather that something is wrong with the grade calculation. If that is the case one can make a correction to weaker player grades (increasing them) to match the observed performance of the stronger players in games against weaker players (this correction was done in 2008 and my grade went from the old 90ish to a new 120ish). My understanding is that this phenomenon was referred to as "grade stretching" (as it would seem that grade difference between stronger and weaker players is higher than what performance results suggest it should have been).

If we assumed that Peter's 2010 grade was 177 as suggested by AGS3 (rather than 185 as suggested by GS) Peter's performance of 93.0% in the match would have been exactly what one would expected and the general conclusion may have been that the stronger players when playing against weaker players do perform as expected on average (hence no grade correction would have been needed).

How AGS3 grades may be used...

My intention is to keep calculating and publishing AGS3 grades until 2013. In 2013 one may perform an analysis which establishes if all stronger players in general under-perform against weaker players or not (a similar analysis which has already been done and led to grade correction in 2008, I understand), first using GS, then AGS3 grades (2013 is chosen in order to gather 5 years of AGS3 grade data, I understand that 5 year grade data was used in the original analysis which had lead to grade corrections in 2008). If GS grades suggest that the stronger players under-perform in the games against weaker players in general and AGS3 that they do not, then it is likely that my assumption about what is causing grade stretching was correct.

If it would be shown that AGS3 does not stretch the grades and GS does then ECF may adopt AGS3 grades (which would already have been calculated by me) as official grades for 2013 and continue calculating grades in 2014, 2015, etc. either using AGS3 or ÉGS6 (which would have been better than AGS3 in cases where grade difference is greater than 30, and where players do play only a few games per season, their grade would charge more rapidly as it is less trusted as it is based on older games, also players who play more games wouldn't be so much affected by the results of games they play against players who play fewer games, their grade would change less rapidly as the grades of their opponents are less trusted, and in extreme case the games ungraded players play against graded players wouldn't affect the grades of graded players at all, actually as an extension to ÉGS6 there is a proposal in http://www.jurjevic.org.uk/chess/grade/ ... malies.htm in "Ungraded players" section for calculation of grades of ungraded players based only on the game results with no need for or reference to the estimated grades).

AGS3 and ÉGS6 (with the mentioned extension for grade calculation of ungraded players) would not require any change in gathering data or timing when the collection or calculation is done, etc., and all the procedures would remain the same. The only difference would have been in a method of calculation which would anyway be given to a computer program to do. AGS3 has a benefit of the same simplicity as GS, so a calculation by hand would still be possible if AGS3 is used.

Peter Lalic's chess videos...

Please allow me to let you know (as a digression) that Peter has done some I would dare to say great chess videos. Some of the selected videos can be found at...
http://www.ashteadchessclub.org/html/lalic.html

Peter's YouTube Channel is at...
http://www.youtube.com/user/DraganLalic

Kind regards,
Last edited by Robert Jurjevic on Wed Aug 04, 2010 9:56 am, edited 7 times in total.
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Re: GRADING ANOMALIES

Post by Robert Jurjevic » Sat Jul 31, 2010 2:01 am

Hello all,

Delta grade histograms...

Please find below delta grade histograms for GA and AGS3 grades for 2009 and 2010 (2009 GS delta grade is defined as 2009 GS grade minus 2008 GS grade, 2009 AGS3 delta grade is defined as 2009 AGS3 grade minus 2008 AGS3 grade, 2010 GS delta grade is defined as 2010 GS grade minus 2008 GS grade, 2010 AGS3 delta grade is defined as 2010 AGS3 grade minus 2008 AGS3 grade).

Delta grade is equal to a number of grading points one's 2009, 2010, etc. GS or AGS3 grade has changed in respect to his or her 2008 GS or AGS3 grade (year when new GS grades were introduced) and it is a measure or players' GS or AGS3 improvement or decline in 2009, 2010, etc. since 2008.

Image
Figure 1: Histogram for 2009 GS delta grades (consisting of 8624 delta grades in bins [0,1), [1,2), etc., of 1 delta grading point width, where minimum delta grade is -38.00, median delta grade is 0.00, maximum delta grade is 63.00, mean delta grade is 0.33 and standard deviation of delta grades is 8.13; delta grade is defined as 2009 GS grade minus 2008 GS grade).

Image
Figure 2: Histogram for 2009 AGS3 delta grades (consisting of 8624 delta grades in bins [0,1), [1,2), etc., of 1 delta grading point width, where minimum delta grade is -19.00, median delta grade is 0.00, maximum delta grade is 31.50, mean delta grade is 0.16 and standard deviation of delta grades is 4.07; delta grade is defined as 2009 AGS3 grade minus 2008 AGS3 grade).

Image
Figure 3: Histogram for 2010 GS delta grades (consisting of 7582 delta grades in bins [0,1), [1,2), etc., of 1 delta grading point width, where minimum delta grade is -39.00, median delta grade is 0.00, maximum delta grade is 80.00, mean delta grade is 0.52 and standard deviation of delta grades is 10.96; delta grade is defined as 2010 GS grade minus 2008 GS grade).

Image
Figure 4: Histogram for 2010 AGS3 delta grades (consisting of 7582 delta grades in bins [0,1), [1,2), etc., of 1 delta grading point width, where minimum delta grade is -23.39, median delta grade is -0.08, maximum delta grade is 47.75, mean delta grade is 0.40 and standard deviation of delta grades is 6.60; delta grade is defined as 2010 AGS3 grade minus 2008 AGS3 grade).

Kind regards,
Last edited by Robert Jurjevic on Thu Nov 18, 2010 2:45 am, edited 2 times in total.
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Re: GRADING ANOMALIES

Post by Robert Jurjevic » Wed Aug 11, 2010 6:43 pm

Players' list in grade improvement order for 2010 season since 2008 season...

Please find the players' list in grade improvement order for 2010 season since 2008 season (PDF document) at:
http://www.jurjevic.org.uk/chess/grade/10i08.pdf

column legend:

REF - ECF grading reference ID
NAME - full player's name
SEX - player's sex
FIDECODE - player's FIDE code
NATION - player's nationality
GAMES - number of standard graded games the player played in 2010 season
CATEGORY - 2010 season grading category based on GAMES
GS - player's GS grade for 2010 season
AGS3 - player's AGS3 grade for 2010 season
GSLAST09 - player's GS grade for 2008 season
AGS3LAST09 - player's AGS3 grade for 2008 season
GS-GSLAST09 - player's GS grade improvement in 2010 season since 2008 season
AGS3-AGS3LAST09 - player's AGS3 grade improvement in 2010 season since 2008 season

GS grade - official ECF current Grading System grade
AGS3 grade - ECF Amended Grading System three grade
(player's AGS3 grade for 2008 season was taken to be the same as player's GS grade for 2008 season)

P.S. ...

what is PDF?
http://en.wikipedia.org/wiki/Portable_Document_Format

download PDF reader
http://get.adobe.com/uk/reader/enterprise/
Last edited by Robert Jurjevic on Thu Nov 18, 2010 3:10 am, edited 1 time in total.
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Re: GRADING ANOMALIES

Post by Robert Jurjevic » Sun Aug 22, 2010 1:24 am

Error in calculation of 2010 AGS3 grades...

I said...

"Please note that I have calculated AGS3 grades for 2009 and 2010 using GS grades for 2009 published in gradeslive.csv and GS grades for 2010 published in grades2010v1.csv. (I assumed that 2008 EGS3 grades are equal to 2008 GS grades and that number of games on which 2008 GS and AGS3 grades are based is 0.)"

"The calculation was performed by a Windows .net console (command line) application written in C# programming language interfacing with Oracle Database 10g Express Edition database. Histograms are obtained with Mathematica 7 (I saved relevant data in a database, I have no game results as they are not published, but as GS's and AGS3's formulae are identical and the only difference is in the value of 'k' factors, one can calculate AGS3 grades using GS grades and no actual game results, the C# program was written by me, Oracle Database 10g Express Edition database is free, but it has limitations imposed, some or all of which are lifted in commercial versions)."

I have realized that the statement ... "one can calculate AGS3 grades using GS grades and no actual game results" ...may not be true. (Calculation of AGS3 grades may not be possible without the game results.)

The formula I used is...

Code: Select all

A2 = A1 + (a2 - a1)/2 
and the formula I should have used is...

Code: Select all

A2 = A1 + (a2 - a1)/2 + (p - P)/2
where 'A2' is new AGS3 grade, 'A1' current AGS3 grade, 'a2' new GS grade, 'a1' current GS grade, 'p' expected performance based on GS grades and 'P' expected performance based on AGS3 grades.

I overlooked that 'p - P' is not zero.

'p - P' can be calculated from the actual game results, but if I would have actual game results I would use direct AGS3 formulae (so there would be no need to calculate 'p - P'), moreover then I would be able to calculate ÉGS6 grades as well.

It is possible that 'p - P' can be approximated by...

Code: Select all

p - P = (a1 - a0)/2
where 'a0' is previous GS grade, so that the formula for calculating new AGS3 grade, 'A2', may be...

Code: Select all

A2 = A1 + (a2 - a1)/2 + (a1 - a0)/4
where 'A2' is new AGS3 grade, 'A1' current AGS3 grade, 'a2' new GS grade, 'a1' current GS grade and 'a0' previous GS grade.

(The above approximation assumes that the average GS grade of the player's opponents did not change, which may be a fair assumption, but obviously needs not to hold in every case.)

If the above approximation is good enough I will correct AGS3 grades for 2010 (AGS3 grades for 2009 are correctly calculated as for 2009 'p - P = 0' by definition) and continue to calculate AGS3 grades for 20011, 2012, etc. using the above formula.

Correction of 2010 AGS3 grades...

The formula I used was...

Code: Select all

A2 = a1 + (a2 - a1)/2 
and the formula I should have used is...

Code: Select all

A2 = A1 + (a2 - a1)/2 + (p - P)/2
where 'A2' is new AGS3 grade, 'A1' current AGS3 grade, 'a2' new GS grade, 'a1' current GS grade, 'p' expected performance based on GS grades and 'P' expected performance based on AGS3 grades.

I erroneously used 'A2 = a1 +...' instead of 'A2 = A1 +...' and I overlooked that 'p - P' is not zero.

(This affects only calculation of 2010 AGS3 grades and onwards. 2009 AGS3 grades were calculated correctly, as for 2009 AGS3 grades 'a1 = A1' and '(p - P)/2 = 0'.)

(All the relevant files mentioned in the previous posts have been updated (namely ags3grade10.zip and 10i08.pdf), histogram for 2010 AGS3 grades and histogram for 2010 AGS3 delta grades have been also updated.)

(Now all the relevant files in the previous post have been changed to include corrections given in grades2010v2.csv at http://grading.bcfservices.org.uk/downloads.php ) Edit 18-Nov-10

AGS3 grades cannot be accurately calculated using only published GS grades, but can be estimated using formula...

Code: Select all

A2 = A1 + (a2 - a1)/2 + (a1 - a0)/4
where 'A2' is new AGS3 grade, 'A1' current AGS3 grade, 'a2' new GS grade, 'a1' current GS grade and 'a0' previous GS grade.

The above approximation assumes that the average GS grade of the player's opponents did not change (between current and previous season), which may be a fair assumption, but obviously needs not to hold in every case. The assumption is equivalent to approximating 'p - P' with...

Code: Select all

p - P = (a1 - a0)/2
where 'a0' is previous GS grade and 'a1' current GS grade.
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Re: GRADING ANOMALIES

Post by Robert Jurjevic » Thu Nov 18, 2010 3:21 pm

AGS3 grade estimation with no game results...

AGS3 grades cannot be accurately calculated using only published GS grades (i.e., without actual game results), but can be estimated using formula...

Code: Select all

A2 = A1 + (a2 - a1)/2 + (a1 - a0)/4
where 'A2' is new AGS3 grade, 'A1' current AGS3 grade, 'a2' new GS grade, 'a1' current GS grade and 'a0' previous GS grade.

The above approximation assumes that the average GS grade of the player's opponents did not change between current and previous season, which may be a fair assumption, but obviously needs not to hold in every case. The assumption is equivalent to approximating 'p - P' with...

Code: Select all

p - P = (a1 - a0)/2
where 'a0' is previous GS grade and 'a1' current GS grade.

As both GS and AGS3 are preserving total system grade median of GS and AGS3 delta grades should be zero. (In probability theory and statistics, a median is described as the numeric value separating the higher half of a sample, a population, or a probability distribution, from the lower half.) This fact can be used to assess the error made by assuming that the average GS grade of the player's opponents did not change between current and previous season.

Image
Figure 1: Histogram for 2010 AGS3 delta grades (consisting of 7582 delta grades in bins [0,1), [1,2), etc., of 1 delta grading point width, where minimum delta grade is -23.39, median delta grade is -0.08, maximum delta grade is 47.75, mean delta grade is 0.40 and standard deviation of delta grades is 6.60; delta grade is defined as 2010 AGS3 grade minus 2008 AGS3 grade).

As the median for 2010 AGS3 delta grades is -0.08 (which seems to be fairly close to zero) we may assess that the error made by assuming that the average GS grade of the player's opponents did not change between current and previous season is rather small. (AGS3 delta grade median of -0.08 means that the grades are ever so slightly more decreased than increased resulting in a rather small net grade decrease on average or a small grade deflation; in other words the AGS3 grade calculation approximately preserves total system grade).

Conlusion...

AGS3 grade estimation/ calculation with no game results seems to be fairly accurate (it is likely that the error for 2010 AGS3 delta grades is about -0.08 grading points on average).

Calculation results...

Calculation results (ags3grade10.xls file) can be found at...
http://www.jurjevic.org.uk/chess/grade/ags3grade10.xls
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Re: GRADING ANOMALIES

Post by Robert Jurjevic » Fri Jan 21, 2011 12:45 am

Single game outcome probability...

Dear chess colleagues,

Please note that I think that I have found a way how to calculate chances for each outcome of a single game between two players if the grade difference is known.

The result is presented in a form of a table shown in “Single game outcome probability...” section (near the end of the document) at (the table is also reproduced below)…
http://www.jurjevic.org.uk/chess/grade/ ... malies.htm
(the grade difference may be corrected so to make White 5 grading points stronger than Black)

Example:
What is the chance of a 120 player playing White winning against a 140 player playing Black? Grade difference ‘d’ is ‘d = 140 – (120 + 5) = 140 – 125 = 15’ and from the table it follows that for grade difference of 15 the chance of the weaker player winning against the stronger player is 16.7 percent.

Code: Select all

---------------------------------------
        chance in percents           
        loss   draw   win   (stronger) 
    d   win    draw   loss  (weaker)
---------------------------------------
    0   25.0   50.0   25.0
    1   24.4   48.9   26.7
    2   23.8   47.7   28.5
    3   23.3   46.5   30.2
    4   22.7   45.4   31.9
    5   22.1   44.3   33.6
    6   21.6   43.1   35.3
    7   21.0   42.0   37.0
    8   20.4   40.9   38.7
    9   19.9   39.8   40.3
   10   19.3   38.7   42.0
   11   18.8   37.6   43.6
   12   18.3   36.5   45.2
   13   17.7   35.5   46.8
   14   17.2   34.4   48.4
   15   16.7   33.4   49.9
   16   16.2   32.4   51.4
   17   15.7   31.4   52.9
   18   15.2   30.4   54.4
   19   14.7   29.4   55.9
   20   14.2   28.5   57.3
   21   13.8   27.5   58.7
   22   13.3   26.7   60.0
   23   12.9   25.7   61.4
   24   12.4   24.9   62.7
   25   12.0   24.0   64.0
   26   11.6   23.2   65.2
   27   11.2   22.4   66.4
   28   10.8   21.6   67.6
   29   10.4   20.8   68.8
   30   10.0   20.1   69.9
   31    9.7   19.3   71.0
   32    9.3   18.7   72.0
   33    9.0   17.9   73.1
   34    8.6   17.3   74.1
   35    8.3   16.7   75.0
   36    8.0   16.0   76.0
   37    7.7   15.4   76.9
   38    7.4   14.8   77.8
   39    7.1   14.3   78.6
   40    6.8   13.7   79.5
   41    6.6   13.1   80.3
   42    6.3   12.6   81.1
   43    6.1   12.1   81.8
   44    5.8   11.7   82.5
   45    5.6   11.2   83.2
   46    5.4   10.7   83.9
   47    5.1   10.3   84.6
   48    4.9    9.9   85.2
   49    4.7    9.5   85.8
   50    4.5    9.1   86.4
   51    4.4    8.7   86.9
   52    4.2    8.3   87.5
   53    4.0    8.0   88.0
   54    3.8    7.7   88.5
   55    3.7    7.3   89.0
   56    3.5    7.1   89.4
   57    3.4    6.7   89.9
   58    3.2    6.5   90.3
   59    3.1    6.2   90.7
   60    3.0    5.9   91.1
   61    2.8    5.7   91.5
   62    2.7    5.5   91.8
   63    2.6    5.2   92.2
   64    2.5    5.0   92.5
   65    2.4    4.8   92.8
   66    2.3    4.6   93.1
   67    2.2    4.4   93.4
   68    2.1    4.2   93.7
   69    2.0    4.0   94.0
   70    1.9    3.8   94.3
   71    1.8    3.7   94.5
   72    1.8    3.5   94.7
   73    1.7    3.3   95.0
   74    1.6    3.2   95.2
   75    1.5    3.1   95.4
   76    1.5    2.9   95.6
   77    1.4    2.8   95.8
   78    1.3    2.7   96.0
   79    1.3    2.5   96.2
   80    1.2    2.5   96.3
   81    1.2    2.3   96.5
   82    1.1    2.3   96.6
   83    1.1    2.1   96.8
   84    1.0    2.1   96.9
   85    1.0    1.9   97.1
   86    0.9    1.9   97.2
   87    0.9    1.8   97.3
   88    0.9    1.7   97.4
   89    0.8    1.6   97.6
   90    0.8    1.5   97.7
   91    0.7    1.5   97.8
   92    0.7    1.4   97.9
   93    0.7    1.3   98.0
   94    0.7    1.3   98.0
   95    0.6    1.3   98.1
   96    0.6    1.2   98.2
   97    0.6    1.1   98.3
   98    0.5    1.1   98.4
   99    0.5    1.1   98.4
  100    0.5    1.0   98.5
  101    0.5    0.9   98.6
  102    0.5    0.9   98.6
  103    0.4    0.9   98.7
  104    0.4    0.8   98.8
  105    0.4    0.8   98.8
  106    0.4    0.7   98.9
  107    0.4    0.7   98.9
  108    0.3    0.7   99.0
  109    0.3    0.7   99.0
  110    0.3    0.6   99.1
  111    0.3    0.6   99.1
  112    0.3    0.6   99.1
  113    0.3    0.5   99.2
  114    0.3    0.5   99.2
  115    0.2    0.5   99.3
  116    0.2    0.5   99.3
  117    0.2    0.5   99.3
  118    0.2    0.5   99.3
  119    0.2    0.4   99.4
  120    0.2    0.4   99.4
  121    0.2    0.4   99.4
  122    0.2    0.3   99.5
  123    0.2    0.3   99.5
  124    0.2    0.3   99.5
  125    0.2    0.3   99.5
  126    0.2    0.3   99.5
  127    0.1    0.3   99.6
  128    0.1    0.3   99.6
  129    0.1    0.3   99.6
  130    0.1    0.3   99.6
  131    0.1    0.3   99.6
  132    0.1    0.2   99.7
  133    0.1    0.2   99.7
  134    0.1    0.2   99.7
  135    0.1    0.2   99.7
  136    0.1    0.2   99.7
  137    0.1    0.2   99.7
  138    0.1    0.2   99.7
  139    0.1    0.1   99.8
  140    0.1    0.1   99.8
  141    0.1    0.1   99.8
  142    0.1    0.1   99.8
  143    0.1    0.1   99.8
  144    0.1    0.1   99.8
  145    0.1    0.1   99.8
  146    0.1    0.1   99.8
  147    0.1    0.1   99.8
  148    0.1    0.1   99.8
  149    0.1    0.1   99.8
  150    0.0    0.1   99.9
  151    0.0    0.1   99.9
  152    0.0    0.1   99.9
  153    0.0    0.1   99.9
  154    0.0    0.1   99.9
  155    0.0    0.1   99.9
  156    0.0    0.1   99.9
  157    0.0    0.1   99.9
  158    0.0    0.1   99.9
  159    0.0    0.1   99.9
  160    0.0    0.1   99.9
  161    0.0    0.1   99.9
  162    0.0    0.1   99.9
  163    0.0    0.1   99.9
  164    0.0    0.1   99.9
  165    0.0    0.1   99.9
  166    0.0    0.1   99.9
  167    0.0    0.1   99.9
  168    0.0    0.1   99.9
  169    0.0    0.1   99.9
  170    0.0    0.1   99.9
  171    0.0    0.1   99.9
  172    0.0    0.1   99.9
  173    0.0    0.1   99.9
  174    0.0    0.0  100.0
  175    0.0    0.0  100.0
  176    0.0    0.0  100.0
  177    0.0    0.0  100.0
  178    0.0    0.0  100.0
  179    0.0    0.0  100.0
  180    0.0    0.0  100.0
  181    0.0    0.0  100.0
  182    0.0    0.0  100.0
  183    0.0    0.0  100.0
  184    0.0    0.0  100.0
  185    0.0    0.0  100.0
  186    0.0    0.0  100.0
  187    0.0    0.0  100.0
  188    0.0    0.0  100.0
  189    0.0    0.0  100.0
  190    0.0    0.0  100.0
  191    0.0    0.0  100.0
  192    0.0    0.0  100.0
  193    0.0    0.0  100.0
  194    0.0    0.0  100.0
  195    0.0    0.0  100.0
  196    0.0    0.0  100.0
  197    0.0    0.0  100.0
  198    0.0    0.0  100.0
  199    0.0    0.0  100.0
  200    0.0    0.0  100.0
  201    0.0    0.0  100.0
  202    0.0    0.0  100.0
  203    0.0    0.0  100.0
  204    0.0    0.0  100.0
  205    0.0    0.0  100.0
  206    0.0    0.0  100.0
  207    0.0    0.0  100.0
  208    0.0    0.0  100.0
  209    0.0    0.0  100.0
  210    0.0    0.0  100.0
  211    0.0    0.0  100.0
  212    0.0    0.0  100.0
  213    0.0    0.0  100.0
  214    0.0    0.0  100.0
  215    0.0    0.0  100.0
  216    0.0    0.0  100.0
  217    0.0    0.0  100.0
  218    0.0    0.0  100.0
  219    0.0    0.0  100.0
  220    0.0    0.0  100.0
---------------------------------------

Table 3: Chance in percents of the stronger player losing (the weaker player winning) '50 - 1/2*f(d)', the players drawing '100 - f(d)', and the stronger player winning (the weaker player losing) '3/2*f(d) - 50' as a function of grade difference 'd'. Calculation is performed assuming logistic 'p = f(d)' (yellow line in figure 2) and that the most probable number of draws is the average between the maximum '200 - 2*f(d)' and the minimum '0'. White is approximately 5 grading points stronger than Black.
Robert Jurjevic
Vafra

User avatar
Robert Jurjevic
Posts: 207
Joined: Wed May 16, 2007 1:31 pm
Location: Surrey

Re: GRADING ANOMALIES

Post by Robert Jurjevic » Mon Oct 24, 2011 11:18 am

Hello all,

GS and AGS3 grades for 2009, 2010 and 2011...

Please note that I have estimated AGS3 grades for 2009, 2010 and 2011 using GS grades for 2009 published in gradeslive.csv, GS grades for 2010 published in grades2010v2.csv and GS grades for 2011 published in grades2011v2.csv. (I assumed that 2008 AGS3 grades are equal to 2008 GS grades and that number of games on which 2008 GS and AGS3 grades are based is 0.)

(The estimation assumes that the average GS grade of the player's opponents did not change between current and previous season, which may be a fair assumption, but obviously needs not to hold. Nevertheless, AGS3 grade estimation/ calculation with no game results seems to be fairly accurate, it is likely that the error for 2011 AGS3 grades is about -0.10 grading points on average).

The G30 rule below has been taken into account in the calculation.

Rule G30: The Grade is calculated by dividing the total number of points scored by the number of games played. If there are at least 30 games in the current period, then the Grade is based on these games alone. If there are not, results are brought forward from the previous period to make the total up to exactly thirty. If there are not 30 games in the two seasons together, results are taken from the season before that. Games are never taken from further back than this; the maximum is two prior grading periods.

The results can be found in ags3grade11.zip file which can be downloaded from... http://www.jurjevic.org.uk/chess/grade/. The Zip file contains ags3grade09.txt (textual tab delimited), ags3grade09.xls (Excel spreadsheet document), gradeslive.csv (Excel spreadsheet document, official 2009 grades), ags3grade10.txt (textual tab delimited), ags3grade10.xls (Excel spreadsheet document), grades2010v2.csv (Excel spreadsheet document, official 2010 grades, version 2), ags3grade11.txt (textual tab delimited), ags3grade11.xls (Excel spreadsheet document) and grades2011v2.csv (Excel spreadsheet document, official 2011 grades, version 2).

The calculation was performed by a Windows .net console (command line) application written in C# programming language interfacing with Oracle Database 10g Express Edition database. Histograms are obtained with Mathematica 7 (I saved relevant data in a database; I have no game results, as they are not published, but as GS's and AGS3's formulae are identical, and the only difference is in the value of 'k' factors, one can estimate AGS3 grades using GS grades with no actual game results; the C# program was written by me; Oracle Database 10g Express Edition database is free, but it has limitations imposed, some or all of which are lifted in commercial versions).

Image
Figure 1a: Histogram for 2009 GS grades (consisting of 10153 grades in bins [0,1), [1,2), etc., of 1 grading point width, where minimum grade is 0.00, median grade is 133.00, maximum grade is 281.00, mean grade is 133.69 and standard deviation is 37.05).

Image
Figure 1b: Histogram for 2009 AGS3 grades (consisting of 10153 grades in bins [0,1), [1,2), etc., of 1 grading point width, where minimum grade is 0.00, median grade is 133.50, maximum grade is 281.00, mean grade is 133.55 and standard deviation is 36.73).

Image
Figure 2a: Histogram for 2010 GS grades (consisting of 10073 grades in bins [0,1), [1,2), etc., of 1 grading point width, where minimum grade is 2.00, median grade is 133.00, maximum grade is 276.00, mean grade is 133.33 and standard deviation is 37.46).

Image
Figure 2b: Histogram for 2010 AGS3 grades (consisting of 10073 grades in bins [0,1), [1,2), etc., of 1 grading point width, where minimum grade is 1.00, median grade is 132.88, maximum grade is 276.00, mean grade is 133.09 and standard deviation is 36.99).

Image
Figure 3a: Histogram for 2011 GS grades (consisting of 10104 grades in bins [0,1), [1,2), etc., of 1 grading point width, where minimum grade is 0.00, median grade is 133.00, maximum grade is 296.00, mean grade is 132.96 and standard deviation is 38.57).

Image
Figure 3b: Histogram for 2011 AGS3 grades (consisting of 10104 grades in bins [0,1), [1,2), etc., of 1 grading point width, where minimum grade is 0.00, median grade is 132.12, maximum grade is 296.00, mean grade is 132.56 and standard deviation is 38.07).

Grading anomalies...

My point in a nutshell is that the 'k' factors (wrongly chosen) in the current grading system are causing the grade stretching and that the amount of the stretch (due to the 'k' factors) is larger (in fact it is equal to '|p - q|') than grade fluctuations caused by other anomalies which may be corrected by using FIDE logistic relation for 'p = f(d)', Glickman idea on changing less trusted grades (based on frequency of play) faster than more trusted grades, or even a solution to the "junior problem".

In my opinion three candidates for replacing GS are AGS3, ÉGS5 and ÉGS6, with ÉGS6 as the best and ÉGS5 as the second best.

AGS3 is GS with the 'k' factors equal to '1/2' (the ECF's linear approximation is used for 'p = f(d)'; does not stretch the grades). (The only difference between GS and AGS3 is in 'k' factors in their formulae which are '1/2' in AGS3 and '1' in GS.)

ÉGS5 is sort of ECF equivalent of FIDE's Élo (logistic curve is used for 'p = f(d)', this is regarded as more accurate than the ECF's linear approximation; the grades are ECF grades, not FIDE ratings, i.e., a strong grandmaster is about 270 not 2800; grading is done every season rather than after every tournament; does not stretch the grades).

ÉGS6 has similar improvement (taken in a simple from) over ÉGS5 as Glicko has over FIDE's Élo which accounts for a grade trust (or establishment) based on frequency of play (i.e., less trusted or established grades change more rapidly than more trusted or established grades, consequently, in extreme case, ungraded players do not affect the grades of graded players; uses logistic curve for 'p = f(d)'; does not stretch the grades).

More details (including formulae, logical arguments, etc.) can be found at:

http://www.jurjevic.org.uk/chess/grade/ ... malies.htm

Kind regards,
Robert Jurjevic
Vafra

Post Reply